Text Generation
English
sft
File size: 515 Bytes
d9791fd
 
732a0ac
 
 
 
d9791fd
 
 
 
 
 
 
 
 
 
 
 
732a0ac
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
license: mit
datasets:
- Nebulous/gpt4all_pruned
- sahil2801/CodeAlpaca-20k
- yahma/alpaca-cleaned
---

This repo contains a low-rank adapter for LLaMA-7b fit on `Nebulous/gpt4all_pruned`, `sahil2801/CodeAlpaca-20k`, `yahma/alpaca-cleaned` and some datasets part of the OpenAssistant project.


This version of the weights was trained with the following hyperparameters:

- Epochs: 2
- Batch size: 128
- Max Length: 2048
- Learning rate: 4e-6
- Lora _r_: 16
- Lora target modules: q_proj, k_proj, v_proj, o_proj