Xiaosen Zheng

xszheng2020

AI & ML interests

Data-Centric AI and AI Safety.

Recent Activity

liked a dataset 20 days ago
proj-persona/PersonaHub
liked a model 21 days ago
Qwen/Qwen2.5-3B-Instruct-AWQ
liked a model 21 days ago
Qwen/Qwen2.5-1.5B-Instruct-AWQ

xszheng2020's activity

upvoted an article about 1 month ago

SmolLM - blazingly fast and remarkably powerful

272
upvoted an article about 2 months ago

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

67
upvoted 2 articles 5 months ago

How NuminaMath Won the 1st AIMO Progress Prize

104

RegMix: Data Mixture as Regression for Language Model Pre-training

10