arxiv:2408.16961
Leshem Choshen
borgr
AI & ML interests
Merging models, collaboratively improving pretraining, evaluation, understanding
Recent Activity
New activity
about 12 hours ago
chuxin-llm/Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints:Losses and checkpoints
upvoted
a
paper
about 1 month ago
LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content
upvoted
a
paper
about 2 months ago
SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image
Classification
Organizations
Papers
18
models
None public yet
datasets
None public yet