Fine Tuning Llama Models With Qlora and Axolotl

www.animal-machine.com

Fine Tuning Llama Models With Qlora and Axolotl

www.animal-machine.com

InattentiveRaccoon@lemmy.animal-machine.comM to

Animal House@lemmy.animal-machine.comEnglish · 1 year ago

Fine Tuning Llama Models With Qlora and Axolotl | ANIMAL-MACHINE

www.animal-machine.com

A technology blog with a dash of art.

This is my step-by-step guide on how to replicate fine tuning of the example datasets using axolotl.

Last I checked, the bitsandbytes library copy was still needed and open-llama-3b was still problematic for quantizing, but hopefully those issues are solved at some point.

What I didn’t know when I first wrote the post was that it was possible to load the finetuned LoRA file in a frontend like text-generation-webui. I have since updated the text to account for that. There are performance side-effects of just loading the qlora adapter in the webui besides just the penalty to load time. This should show how fast text inference was with little context in tokens/p while using the transformers library and source model in f16 or quantized 8-bit & 4-bit and how fast I can run a merged q4_0 quantization.

You must log in or register to comment.

Chat

Animal House@lemmy.animal-machine.com

animal_house@lemmy.animal-machine.com

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !animal_house@lemmy.animal-machine.com

Discussion area for the main blog: animal-machine.com. Feel free to comment here to discuss any of my blog posts.

Rules:

Excessive hate speech such racism will not be tolerated.
Excessive self-promotion or advertisement will probably get modded.
Try to be kind where possible. At the very least, be respectful when disagreeing.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

0 users / day
0 users / week
1 user / month
1 user / 6 months
0 local subscribers
0 subscribers
3 Posts
0 Comments
Modlog