Catherine Arnett

catherinearnett

AI & ML interests

multilingual NLP, tokenization

Recent Activity

Articles

Organizations

Blog-explorers's profile picture Language and Cognition Lab (UCSD)'s profile picture PleIAs's profile picture

catherinearnett's activity

upvoted an article 17 days ago
view article
Article

Releasing the largest multilingual open pretraining dataset

96
New activity in PleIAs/ToxicCommons 27 days ago
published an article 2 months ago