O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper (arXiv:2411.16489), published 5 days ago.
Power-LM: collection of dense and MoE LLMs trained with a power learning-rate scheduler. 4 items, updated Oct 17.
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models. Paper (arXiv:2403.13372), published Mar 20.
MURI: High-Quality Instruction Tuning Datasets for Low-Resource Languages via Reverse Instructions. Paper (arXiv:2409.12958), published Sep 19.