Luca Soldaini

soldni

AI & ML interests

question answering, information retrieval, scientific document processing

Recent Activity

liked a dataset about 8 hours ago
RealTimeData/bbc_news_alltime
updated a dataset 1 day ago
allenai/dolmino-mix-1124
updated a model 2 days ago
allenai/dolma2-tokenizer-sigdig
View all activity

Organizations

Posts 1

view post
Post
release day release day! OLMo 1b + 7b out today 🥳 and 65b coming soon...

With OLMo, we are really focused on advancing the study of LLMs. We release **everything**, from toolkit to create its training dataset (dolma) to training & inference code:

- OLMo paper: https://allenai.org/olmo/olmo-paper.pdf
- OLMo train code: https://github.com/allenai/OLMo
- OLMo eval code: https://github.com/allenai/OLMo-Eval
- OLMo 7b: allenai/OLMo-7B
- OLMo 1b: allenai/OLMo-1B
- Dolma paper: https://allenai.org/olmo/dolma-paper.pdf
- Dolma dataset v1.6: allenai/dolma
- Dolma toolkit v1.0: https://github.com/allenai/dolma