MaziyarPanahi/Mistral-11B-Instruct-v0.2-Mistral-7B-Instruct-v0.2-slerp Text Generation • Updated Jan 10 • 26 • 2
Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences Paper • 2404.03715 • Published Apr 4 • 59
tomaarsen/span-marker-roberta-large-ontonotes5 Token Classification • Updated Sep 22, 2023 • 420 • 11
McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp-supervised Sentence Similarity • Updated Apr 30 • 10.8k • 41
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 Text Generation • Updated 2 days ago • 13.6k • 6
meta-llama/Llama-3.2-11B-Vision-Instruct Image-Text-to-Text • Updated 3 days ago • 139k • • 472