arxiv:2412.09871
Zhou
Chunting
AI & ML interests
None yet
Recent Activity
authored
a paper
27 days ago
Byte Latent Transformer: Patches Scale Better Than Tokens
authored
a paper
9 months ago
Megalodon: Efficient LLM Pretraining and Inference with Unlimited
Context Length
authored
a paper
11 months ago
Instruction-tuned Language Models are Better Knowledge Learners
Organizations
None yet
models
None public yet
datasets
None public yet