martimfasantos/tinyllama-1.1b-sum-dpo-full_LR2e-8_2epochs_old
Text Generation
Transformers
TensorBoard
Safetensors
openai/summarize_from_feedback
llama
alignment-handbook
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License: apache-2.0
dd98c41
tinyllama-1.1b-sum-dpo-full_LR2e-8_2epochs_old/model.safetensors
Commit History
Training in progress, step 1500 (bba9901, verified), martimfasantos committed on Jun 21
Training in progress, step 1400 (9cfe920, verified), martimfasantos committed on Jun 21
Training in progress, step 1300 (1beab80, verified), martimfasantos committed on Jun 21
Training in progress, step 1200 (bb08d2b, verified), martimfasantos committed on Jun 21
Training in progress, step 1100 (3294cd1, verified), martimfasantos committed on Jun 21
Training in progress, step 1000 (3018829, verified), martimfasantos committed on Jun 21
Training in progress, step 900 (15f825b, verified), martimfasantos committed on Jun 21
Training in progress, step 800 (798ac51, verified), martimfasantos committed on Jun 21
Training in progress, step 700 (123ee64, verified), martimfasantos committed on Jun 21
Training in progress, step 600 (72e3194, verified), martimfasantos committed on Jun 21
Training in progress, step 500 (9efb2cb, verified), martimfasantos committed on Jun 21
Training in progress, step 400 (5a740e3, verified), martimfasantos committed on Jun 21
Training in progress, step 300 (f5c44b2, verified), martimfasantos committed on Jun 21
Training in progress, step 200 (f9de7ef, verified), martimfasantos committed on Jun 21