Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance Paper • 2406.15330 • Published Jun 21, 2024
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training Paper • 2411.14318 • Published Nov 21, 2024
EpiCoder: Encompassing Diversity and Complexity in Code Generation Paper • 2501.04694 • Published 2 days ago • 7
HumanEval Pro and MBPP Pro: Evaluating Large Language Models on Self-invoking Code Generation Paper • 2412.21199 • Published 11 days ago • 11
MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction Paper • 2410.02241 • Published Oct 3, 2024 • 8
MIGA: Mixture-of-Experts with Group Aggregation for Stock Market Prediction Paper • 2410.02241 • Published Oct 3, 2024 • 8
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation Paper • 2312.14187 • Published Dec 20, 2023 • 49