jordiclive
/

gpt4all-alpaca-oa-codealpaca-lora-7b

Text Generation

Model card Files Files and versions Community

gpt4all-alpaca-oa-codealpaca-lora-7b / README.md

jordiclive's picture

Update README.md

eadbb15 over 1 year ago

|

605 Bytes

	---
	license: mit
	datasets:
	- Nebulous/gpt4all_pruned
	- sahil2801/CodeAlpaca-20k
	- yahma/alpaca-cleaned
	---

	This repo contains a low-rank adapter for LLaMA-7b fit on
	- `Nebulous/gpt4all_pruned`
	- `sahil2801/CodeAlpaca-20k`
	- `yahma/alpaca-cleaned`
	- datasets part of the OpenAssistant project.


	This version of the weights was trained with the following hyperparameters:

	- Epochs: 2
	- Batch size: 128
	- Max Length: 2048
	- Learning rate: 4e-6
	- Lora _r_: 8
	- Lora Alpha: 32
	- Lora target modules: q_proj, k_proj, v_proj, o_proj

	The model was trained with flash attention and gradient checkpointing.