Stronger Models are NOT Stronger Teachers for Instruction Tuning Paper • 2411.07133 • Published 19 days ago • 32
view article Article 🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦⬛ By anakin87 • Oct 21 • 18
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19 • 135
MagpieLM Collection Aligning LMs with Fully Open Recipe (data+training configs+logs) • 9 items • Updated Sep 22 • 15
Magpie Open Recipes Collection Open-aligned models using Magpie datasets. • 11 items • Updated Sep 14 • 1
Magpie-Llama3.1 Datasets Collection Dataset built with Meta Llama 3.1 70B. • 6 items • Updated Sep 20 • 3
Zebra Logic Bench Collection ZebraLogic Bench: Testing the Limits of LLMs in Logical Reasoning • 4 items • Updated 3 days ago • 4
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates Paper • 2406.12935 • Published Jun 17 • 1
synthetic-data-generation-demos Collection A collection of demos for various approaches to synthetic data generation • 4 items • Updated Jun 25 • 13
Magpie-Qwen2 Datasets Collection Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Sep 14 • 10
view article Article Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation By davanstrien • Jun 20 • 12
FP8 LLMs for vLLM Collection Accurate FP8 quantized models by Neural Magic, ready for use with vLLM! • 44 items • Updated Oct 17 • 59
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper • 2402.11753 • Published Feb 19 • 5
Magpie-Pro Datasets (Llama-3) Collection Dataset built with Meta Llama 3 70B. Models are fine-tuned from Llama 3 8B. • 6 items • Updated Sep 20 • 16
Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing Paper • 2406.08464 • Published Jun 12 • 65
SimPO Collection This collections contains a list of SimPO and baseline models. • 49 items • Updated 23 days ago • 16
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding Paper • 2402.08983 • Published Feb 14 • 2