Lj V. Miranda's picture

Lj V. Miranda

ljvmiranda921

·

https://ljvmiranda921.github.io

AI & ML interests

NLP - multilinguality, data-centric AI

Recent Activity

upvoted a collection 1 day ago

authored a paper 1 day ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

upvoted a paper 1 day ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

View all activity

Organizations

ljvmiranda921's activity

upvoted a collection 1 day ago

OLMo 2

Artifacts for the second set of OLMo models. • 17 items • Updated about 4 hours ago • 27

upvoted a paper 1 day ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published 5 days ago • 52

upvoted a collection 7 days ago

Tulu 3 Datasets

All datasets released with Tulu 3 -- state of the art open post-training recipes. • 32 items • Updated about 4 hours ago • 44

upvoted a paper about 1 month ago

Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback

Paper • 2410.19133 • Published Oct 24 • 11

upvoted a collection about 1 month ago

Multilingual RewardBench

Multilingual Reward Model Evaluation Dataset and Results • 2 items • Updated Oct 26 • 4

upvoted a paper about 1 month ago

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Paper • 2410.15522 • Published Oct 20 • 10

upvoted a paper 3 months ago

SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages

Paper • 2407.19672 • Published Jul 29 • 55

upvoted a paper 4 months ago

Consent in Crisis: The Rapid Decline of the AI Data Commons

Paper • 2407.14933 • Published Jul 20 • 12

upvoted a collection 4 months ago

Reward Bench

Datasets, spaces, and models for the reward model benchmark! • 5 items • Updated about 3 hours ago • 7

upvoted 2 papers 5 months ago

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing

Paper • 2406.08464 • Published Jun 12 • 65

SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages

Paper • 2406.10118 • Published Jun 14 • 30

upvoted a collection about 1 year ago

State-of-the-Art NER models - Tagalog

2 items • Updated Feb 27 • 2

upvoted 2 papers about 1 year ago

Universal NER: A Gold-Standard Multilingual Named Entity Recognition Benchmark

Paper • 2311.09122 • Published Nov 15, 2023 • 7

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 28