Jay Shin
jshin49
AI & ML interests
None yet
Organizations
Collections
7
-
Pre-training Small Base LMs with Fewer Tokens
Paper • 2404.08634 • Published • 34 -
Ziya2: Data-centric Learning is All LLMs Need
Paper • 2311.03301 • Published • 16 -
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 38 -
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Paper • 2404.06395 • Published • 21
models
None public yet
datasets
None public yet