Haoling Li

Ringo1110

AI & ML interests

None yet

Recent Activity

authored a paper about 8 hours ago

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

authored a paper about 8 hours ago

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

upvoted a paper 1 day ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

View all activity

Organizations

Ringo1110's activity

authored 2 papers about 8 hours ago

Gradient-Mask Tuning Elevates the Upper Limits of LLM Performance

Paper • 2406.15330 • Published Jun 21, 2024

Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training

Paper • 2411.14318 • Published Nov 21, 2024

upvoted a paper 1 day ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published 2 days ago • 7

authored a paper 1 day ago

EpiCoder: Encompassing Diversity and Complexity in Code Generation

Paper • 2501.04694 • Published 2 days ago • 7

upvoted a paper 1 day ago

URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal Mathematics

Paper • 2501.04686 • Published 2 days ago • 40

liked a dataset 4 days ago

RLHFlow/Deepseek-PRM-Data

Viewer • Updated Nov 9, 2024 • 253k • 77 • 8

liked a model 15 days ago

deepseek-ai/DeepSeek-V3-Base

Updated 12 days ago • 9.24k • 1.22k

upvoted a paper 21 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 22 days ago • 339

liked a Space 22 days ago

Running

452

📈

Scaling test-time compute

upvoted a paper about 1 month ago

Critical Tokens Matter: Token-Level Contrastive Estimation Enhence LLM's Reasoning Capability

Paper • 2411.19943 • Published Nov 29, 2024 • 56

upvoted a paper 3 months ago

Horizon-Length Prediction: Advancing Fill-in-the-Middle Capabilities for Code Generation with Lookahead Planning

Paper • 2410.03103 • Published Oct 4, 2024 • 7

liked a dataset 4 months ago

bigcode/starcoderdata

Viewer • Updated May 16, 2023 • 207M • 3.95k • 409

upvoted a paper 9 months ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 88

upvoted a collection 10 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 224