O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 2024 • 28
Power-LM Collection Dense & MoE LLMs trained with the Power learning rate scheduler. • 4 items • Updated Oct 17 • 15
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20 • 62
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions Paper • 2409.12958 • Published Sep 19 • 7
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale Paper • 2409.17115 • Published Sep 25 • 59
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization Paper • 2409.12903 • Published Sep 19 • 21
Training Language Models to Self-Correct via Reinforcement Learning Paper • 2409.12917 • Published Sep 19 • 135
Power Scheduler: A Batch Size and Token Number Agnostic Learning Rate Scheduler Paper • 2408.13359 • Published Aug 23 • 22 (a minimal scheduler sketch follows this list)
Alignment Studio: Aligning Large Language Models to Particular Contextual Regulations Paper • 2403.09704 • Published Mar 8 • 31
LongVILA: Scaling Long-Context Visual Language Models for Long Videos Paper • 2408.10188 • Published Aug 19 • 51
Turbo Sparse: Achieving LLM SOTA Performance with Minimal Activated Parameters Paper • 2406.05955 • Published Jun 10 • 22
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies Paper • 2407.13623 • Published Jul 18 • 52
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published Jun 19 • 16
SceMQA: A Scientific College Entrance Level Multimodal Question Answering Benchmark Paper • 2402.05138 • Published Feb 6 • 2
Model Merging Collection Model merging is a very popular technique in the LLM space. Here is a chronological list of papers on the topic that will help you get started with it (a minimal merging sketch follows this list)! • 30 items • Updated Jun 12 • 217
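The Power Scheduler entry above (arXiv 2408.13359) proposes a learning-rate schedule whose decay follows a power law in training progress, so it does not need retuning when batch size or token count changes. Below is a minimal sketch of that idea; the function name `power_lr`, the linear warmup, and the constants `amplitude` and `exponent` are illustrative assumptions, not the paper's published recipe.

```python
# Hypothetical sketch of a power-law learning-rate schedule in the spirit of
# the Power Scheduler paper (arXiv 2408.13359). Constants are illustrative
# assumptions; see the paper for the exact functional form.

def power_lr(step: int,
             warmup_steps: int = 1000,
             amplitude: float = 0.03,   # assumed coefficient `a` in lr = a * t^(-b)
             exponent: float = 0.5,     # assumed decay exponent `b`
             lr_max: float = 3e-4) -> float:
    """Linear warmup to lr_max, then a power-law decay lr = min(lr_max, a * t^(-b))."""
    if step < warmup_steps:
        # Linear warmup from ~0 up to lr_max.
        return lr_max * (step + 1) / warmup_steps
    # Power-law decay in the number of steps taken after warmup. The decay
    # depends only on training progress, not on batch size, which is the
    # property the paper's title refers to.
    t = step - warmup_steps + 1
    return min(lr_max, amplitude * t ** (-exponent))

if __name__ == "__main__":
    for s in (0, 500, 1000, 10_000, 100_000, 1_000_000):
        print(s, f"{power_lr(s):.2e}")
```

Because the schedule is a pure function of the step counter, it can be dropped into any training loop (e.g. via a per-step `lr` update on the optimizer) without tuning against a fixed total token budget.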
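As a companion to the Model Merging collection, here is a minimal sketch of the simplest technique in that lineage: uniform weight averaging of compatible checkpoints, often called a model soup. The helper `average_state_dicts` and the toy `torch.nn.Linear` models are illustrative; the collection's papers cover methods well beyond plain averaging.

```python
# Minimal sketch of uniform weight averaging ("model soup" style merging).
# Real merges operate on checkpoints of fine-tuned models that share a
# single architecture; the toy models below stand in for those.

import torch


def average_state_dicts(state_dicts):
    """Return the element-wise mean of several compatible state dicts."""
    keys = state_dicts[0].keys()
    assert all(sd.keys() == keys for sd in state_dicts), "architectures must match"
    return {
        k: torch.stack([sd[k].float() for sd in state_dicts]).mean(dim=0)
        for k in keys
    }


if __name__ == "__main__":
    # Two toy "fine-tuned" models with identical architecture.
    models = [torch.nn.Linear(4, 2) for _ in range(2)]
    merged = torch.nn.Linear(4, 2)
    merged.load_state_dict(average_state_dicts([m.state_dict() for m in models]))
    print(merged.weight)
```

Uniform averaging only works when the checkpoints live in a shared loss basin (e.g. fine-tunes of one base model); merging unrelated models this way generally degrades quality, which is what motivates the more careful methods collected above.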