Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
albertvillanovaΒ 
posted an update Oct 24
Post
1217
🚨 Instruct-tuning impacts models differently across families! Qwen2.5-72B-Instruct excels on IFEval but struggles with MATH-Hard, while Llama-3.1-70B-Instruct avoids MATH performance loss! Why? Can they follow the format in examples? πŸ“Š Compare models: open-llm-leaderboard/comparator