Anton Lozhkov's picture

Anton Lozhkov

anton-l

·

AI & ML interests

Generative Models, Distributed Training, Photo and Video Enhancement

Recent Activity

updated a Space about 12 hours ago

science/README

liked a model 1 day ago

HuggingFaceTB/SmolVLM-Instruct

liked a dataset 1 day ago

HuggingFaceTB/smol-smoltalk

View all activity

Articles

SmolLM - blazingly fast and remarkably powerful

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

StarCoder2 and The Stack v2

Organizations

anton-l's activity

upvoted a paper 3 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22 • 121

upvoted an article 4 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16

• 271

upvoted an article 5 months ago

Article

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

Jun 24

• 33

upvoted a paper 5 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

upvoted a collection 6 months ago

📀 Dataset comparison models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated Jun 12 • 31

upvoted a paper 9 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 136

upvoted a paper about 1 year ago

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122

upvoted a paper over 1 year ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 31