298682.6 TFLOPS

Luca Soldaini

soldni

https://soldaini.net

AI & ML interests

question answering, information retrieval, scientific document processing

Recent Activity

liked a dataset about 8 hours ago

RealTimeData/bbc_news_alltime

updated a dataset 1 day ago

allenai/dolmino-mix-1124

updated a model 2 days ago

allenai/dolma2-tokenizer-sigdig

Posts 1

Post

Release day, release day! OLMo 1B + 7B are out today 🥳 and 65B is coming soon...

With OLMo, we are really focused on advancing the study of LLMs. We release **everything**, from the toolkit used to create its training dataset (Dolma) to the training and inference code:

- OLMo paper: https://allenai.org/olmo/olmo-paper.pdf
- OLMo train code: https://github.com/allenai/OLMo
- OLMo eval code: https://github.com/allenai/OLMo-Eval
- OLMo 7b: allenai/OLMo-7B
- OLMo 1b: allenai/OLMo-1B
- Dolma paper: https://allenai.org/olmo/dolma-paper.pdf
- Dolma dataset v1.6: allenai/dolma
- Dolma toolkit v1.0: https://github.com/allenai/dolma

Papers 24

arxiv:2411.15124

arxiv:2411.14199

arxiv:2409.17146

arxiv:2409.02060

Spaces 1

Viz Summaries

Models 1

soldni/redpajama_v1_bpe_aa_af_16B

Updated Apr 24, 2023

Datasets 4

soldni/OLMoE-mix-0924

Updated Oct 23 • 3

soldni/jeopardy

Updated Oct 7 • 219k • 63

soldni/test2

Updated Jul 6, 2023 • 36

soldni/test

Updated Jul 6, 2023 • 35