ZHANG Jipeng

OldFriends

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

liked a dataset about 2 months ago

cognitivecomputations/Code-290k-ShareGPT-Vicuna

View all activity

Organizations

None yet

OldFriends's activity

upvoted a paper 5 days ago

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Paper • 2412.19723 • Published 11 days ago • 72

liked a dataset about 2 months ago

cognitivecomputations/Code-290k-ShareGPT-Vicuna

Viewer • Updated Feb 12, 2024 • 289k • 37 • 14

liked a dataset 2 months ago

Sterzhang/PVIT-3M

Viewer • Updated Nov 2, 2024 • 3M • 13.1k • 17

upvoted a collection 2 months ago

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 31

updated a model 2 months ago

OldFriends/llava-critic-7b-hf

Image-Text-to-Text • Updated Oct 30, 2024 • 4

upvoted a collection 3 months ago

LLaVA-Critic

Collection

as a general evaluator for assessing model performance • 6 items • Updated Oct 6, 2024 • 8

upvoted a paper 3 months ago

Personalized Visual Instruction Tuning

Paper • 2410.07113 • Published Oct 9, 2024 • 70

upvoted an article 5 months ago

Article

SmolLM - blazingly fast and remarkably powerful

Jul 16, 2024

• 297

upvoted a collection 6 months ago

NuminaMath

Collection

Datasets and models for training SOTA math LLMs. See our GitHub for training & inference code: https://github.com/project-numina/aimo-progress-prize • 6 items • Updated Jul 21, 2024 • 70

liked a dataset 6 months ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 3.5k • 296

upvoted an article 6 months ago

Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024

• 110

upvoted a paper 6 months ago

TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts

Paper • 2407.03203 • Published Jul 3, 2024 • 12

upvoted an article 6 months ago

Article

Large-scale Near-deduplication Behind BigCode

May 16, 2023

• 20

upvoted a paper 7 months ago

Jailbreaking as a Reward Misspecification Problem

Paper • 2406.14393 • Published Jun 20, 2024 • 12

liked a Space 7 months ago

Running

552

🍷

FineWeb: decanting the web for the finest text data at scale

upvoted a paper 8 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 66

upvoted a paper 10 months ago

LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning

Paper • 2403.17919 • Published Mar 26, 2024 • 16

liked a model over 1 year ago

meta-llama/Llama-2-7b

Text Generation • Updated Apr 17, 2024 • 4.21k

liked a Space over 1 year ago

Runtime error

🔥

ZHANG Jipeng

AI & ML interests

Recent Activity

Organizations

OldFriends's activity

SmolLM - blazingly fast and remarkably powerful

How NuminaMath Won the 1st AIMO Progress Prize

Large-scale Near-deduplication Behind BigCode

FineWeb: decanting the web for the finest text data at scale

Robin 7b