Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
diegobit
/
llama-3-8b-ita-4k-orpo-v3
like
0
Text Generation
Transformers
Safetensors
mii-community/ultrafeedback-preferences-translated-ita
efederici/alpaca-vs-alpaca-orpo-dpo
llama
unsloth
conversational
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
License:
llama3
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
diegobit
commited on
Jun 12
Commit
13e571a
•
1 Parent(s):
94d720b
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+2
-0
README.md
CHANGED
Viewed
@@ -61,6 +61,8 @@ dtype = None
61
load_in_4bit = False
62
```
63
64
```
65
r = 64
66
lora_alpha = 64
61
load_in_4bit = False
62
```
63
64
+
- **PEFT parameters:**
65
+
66
```
67
r = 64
68
lora_alpha = 64