Papers - Text - Training - Mixture Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Text - Datasets - Math - AMC Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Training - Eval - Out of Distribution Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Training - Overfitting - Decontamination Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Pretraining - Synthetic Data - Problem Solving Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Pretraining - Synthetic Data - Reasoning Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Papers - Fine-tuning - DPO - Pivotal Token Search Collection by matlok 10 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
TechReport Collection by pppa 11 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92
Large group of models Collection by ragius 13 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92 cognitivecomputations/dolphin-2.9.2-qwen2-72b Text Generation • Updated Oct 8 • 10.7k • 132 ByteWave/prompt-generator Text Generation • Updated Nov 10, 2023 • 260 • 18 Qwen/QwQ-32B-Preview Text Generation • Updated 28 days ago • 128k • 1.43k
LLMs Collection by hg2wzh 13 days ago - Phi-4 Technical Report Paper • 2412.08905 • Published 15 days ago • 92