LLM – nanonomad

LLaMA-Factory with Flash Attention 2 and Unsloth

Posted on June 20, 2024 (June 20, 2024) by nanonomad

It was tough to get this working, but I think I’ve figured it out enough to share. Here’s a quick guide on how to set up LLaMA-Factory with support for Flash Attention 2 and Unsloth training on Windows. This is using a RTX3060 12GB GPU, Windows 10, and CUDA 12.1. Unsloth is an optimization library […]

Automate Image Captioning using Multimodal LLMs

Posted on November 19, 2023 (April 2, 2024) by nanonomad

Using multi-modal large language models for automated image captioning. Rich captions can be used for training Stable Diffusion Dreambooth or LoRAs. […]

Fine Tuning Mistral 7B

Posted on October 27, 2023 (April 2, 2024) by nanonomad

Can you train new or forbidden knowledge into a LLM? Let’s fine out as I throw 1 gigabyte of scraped, cleaned, plaintext KiwiFarms posts at Mistral 7B. I go over my experience fine-tuning Mistral 7B on a few large datasets of scraped text data including English language song lyrics, and a huge KiwiFarms post dataset. […]

Read More… from Fine Tuning Mistral 7B