- Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs • Paper • 2402.12030
- mistralai/Mistral-7B-Instruct-v0.2 • Text Generation • 743k • 2.58k
- meta-llama/Llama-2-7b-chat-hf • Text Generation • 697k • 4.02k
- EleutherAI/pythia-160m-deduped • Text Generation • 12.8k • 3
Nicolas-BZRD
AI & ML interests
PhD Student | NLP - LLMs - Adaptation to real-world problems - Optimization
Collections (1)
models (92)
- Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_uld_loss • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_dialogsum_Mistral-7B-Instruct-v0.2_text_teacher • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_uld_loss • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_pubmed_qa_Llama-2-7b-chat-hf_text_teacher • Text2Text Generation • 2
- Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_uld_loss • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_qed_Llama-2-7b-chat-hf_text_teacher • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_uld_loss • Text2Text Generation • 1
- Nicolas-BZRD/mt0-base_dialogsum_Llama-2-7b-chat-hf_text_teacher • Text2Text Generation • 3
- Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_uld_loss • Text Generation • 6
- Nicolas-BZRD/pythia-160m-deduped_FairytaleQA_Llama-2-7b-chat-hf_text_teacher • Text Generation • 8
datasets (33)
- Nicolas-BZRD/gsm8k-ar-Qwen2-72B-Instruct • 7.47k • 33
- Nicolas-BZRD/gsm8k-ar-Meta-Llama-3.1-70B-Instruct • 7.47k • 32
- Nicolas-BZRD/gsm8k-ar-gemma-2-27b-it • 7.47k • 39
- Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-pubmed_qa_50k • 50.5k • 50
- Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-squad • 87.6k • 39
- Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-squad • 87.6k • 35
- Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-dialogsum • 12.4k • 39
- Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-qed • 7.62k • 44
- Nicolas-BZRD/uld_loss_Mistral-7B-Instruct-v0.2-FairytaleQA • 9.57k • 39
- Nicolas-BZRD/uld_loss_Llama-2-7b-chat-hf-FairytaleQA • 9.57k • 35