Puffy Bird

puffy310

AI & ML interests

None yet

puffy310's activity

New activity in TheBirdLegacy/DallData about 2 months ago

License is missing

#1 opened 2 months ago by

commented a paper about 2 months ago

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48

New activity in G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b about 2 months ago

G-reen/gpt5o-reflexion-q-agi-llama-3.1-8b Just SHOCKED The Entire INDUSTRY with 12000 volts

#15 opened about 2 months ago by

New activity in deepseek-ai/DeepSeek-V2.5 2 months ago

DeepSeek-Coder-V2.5-Lite

#3 opened 2 months ago by

New activity in qihoo360/FancyVideo 3 months ago

Glad to see Qihoo Using HF!

#1 opened 3 months ago by

commented a paper 4 months ago

Patch-Level Training for Large Language Models

Paper • 2407.12665 • Published Jul 17 • 16

New activity in Tencent-Hunyuan/HunyuanDiT-v1.2-Diffusers-Distilled 4 months ago

Update from HunyuanDiT v1.1

#1 opened 4 months ago by

commented 3 papers 4 months ago

LiteSearch: Efficacious Tree Search for LLM

Paper • 2407.00320 • Published Jun 29 • 37

commented 2 papers 5 months ago

Scaling Laws for Linear Complexity Language Models

Paper • 2406.16690 • Published Jun 24 • 22

New activity in hpcai-tech/open-sora 5 months ago

🚩 Report: Not working

#4 opened 5 months ago by

uraniumcrystalsmaster

New activity in puffy310/ZeroGPU-DeepSeek-V2-LiteCoder 5 months ago

Apply for community grant: Academic project (gpu)

#1 opened 5 months ago by

commented 2 papers 5 months ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17 • 57

Exploring the Role of Large Language Models in Prompt Encoding for Diffusion Models

Paper • 2406.11831 • Published Jun 17 • 20

New activity in IndexTeam/Index-1.9B-Chat 5 months ago

Model Scaling

#1 opened 5 months ago by

commented 3 papers 5 months ago

PowerInfer-2: Fast Large Language Model Inference on a Smartphone

Paper • 2406.06282 • Published Jun 10 • 36

Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

Paper • 2406.06563 • Published Jun 3 • 17
