LLM - a L-Hongbin Collection

L-Hongbin 's Collections

MutiModal_Paper

LLM

MutiModal_Dataset

Optimizer_Papers

LLM

updated 2 days ago

Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering

Paper • 2411.11504 • Published 13 days ago • 19
Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published 19 days ago • 17
Adaptive Decoding via Latent Preference Optimization

Paper • 2411.09661 • Published 16 days ago • 10
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training

Paper • 2411.13476 • Published 10 days ago • 14
HuggingFaceTB/smoltalk

Viewer • Updated 5 days ago • 2.2M • 2.5k • 184
Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published 10 days ago • 37
TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published 8 days ago • 54
Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 5 days ago • 41
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published 5 days ago • 28
MH-MoE:Multi-Head Mixture-of-Experts

Paper • 2411.16205 • Published 6 days ago • 21
nGPT: Normalized Transformer with Representation Learning on the Hypersphere

Paper • 2410.01131 • Published Oct 1 • 9
O1-OPEN/OpenO1-SFT

Preview • Updated 9 days ago • 357 • 51
AI-MO/NuminaMath-CoT

Viewer • Updated 6 days ago • 860k • 2.48k • 248
GAIR/o1-journey

Viewer • Updated Oct 16 • 327 • 1.09k • 92
allenai/tulu-3-sft-mixture

Viewer • Updated 9 days ago • 939k • 1.53k • 53