Anthony Ivan S

anthonyivn

anthonyivn2

AI & ML interests

None yet

Recent Activity

liked a model 8 days ago

nvidia/Hymba-1.5B-Base

liked a model about 2 months ago

nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

Organizations

None yet

anthonyivn's activity

upvoted a paper 23 days ago

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Paper • 2411.03562 • Published 25 days ago • 60

upvoted an article 2 months ago

Document Similarity Search with ColPali

Article • Sep 21 • 47

upvoted 2 papers 2 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19 • 135

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 75

upvoted a paper 3 months ago

Generative Verifiers: Reward Modeling as Next-Token Prediction

Paper • 2408.15240 • Published Aug 27 • 13

upvoted an article 5 months ago

The Rise of Agentic Data Generation

Article • Jul 15 • 78

upvoted a paper 5 months ago

SpreadsheetLLM: Encoding Spreadsheets for Large Language Models

Paper • 2407.09025 • Published Jul 12 • 129

upvoted a collection 5 months ago

InternLM2.5

Collection • 14 items • Updated Sep 14 • 70

upvoted 2 papers 5 months ago

LongIns: A Challenging Long-context Instruction-based Exam for LLMs

Paper • 2406.17588 • Published Jun 25 • 20

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 86

upvoted a paper 6 months ago

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10 • 65

upvoted 2 articles 6 months ago

Uncensor any LLM with abliteration

Article • Jun 13 • 374

Putting RL back in RLHF

Article • Jun 12 • 62

upvoted a paper 6 months ago

Perplexed by Perplexity: Perplexity-Based Data Pruning With Small Reference Models

Paper • 2405.20541 • Published May 30 • 21

upvoted an article 6 months ago

Hugging Face on AMD Instinct MI300 GPU

Article • May 21 • 10

upvoted 3 papers 7 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

Better & Faster Large Language Models via Multi-token Prediction

Paper • 2404.19737 • Published Apr 30 • 73

How faithful are RAG models? Quantifying the tug-of-war between RAG and LLMs' internal prior

Paper • 2404.10198 • Published Apr 16 • 7

upvoted an article 7 months ago

Welcome Llama 3 - Meta's new open LLM

Article • Apr 18 • 279

upvoted a paper 8 months ago

LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders

Paper • 2404.05961 • Published Apr 9 • 64