2 30 9

Junjie Chen

coderchen01

https://junjie-chen.info

AI & ML interests

Efficient AI, Multimodal AI, Generative AI

Recent Activity

liked a model 1 day ago

nvidia/Hymba-1.5B-Instruct

liked a Space 2 days ago

opencompass/open_vlm_leaderboard

liked a model 3 days ago

HuggingFaceTB/SmolVLM-Instruct

View all activity

Organizations

None yet

coderchen01's activity

upvoted a paper 10 days ago

RedPajama: an Open Dataset for Training Large Language Models

Paper • 2411.12372 • Published 14 days ago • 47

upvoted an article 11 days ago

Article

Decoding GPT-4'o': In-Depth Exploration of Its Mechanisms and Creating Similar AI.

•

May 21

• 34

upvoted a paper 13 days ago

SlimLM: An Efficient Small Language Model for On-Device Document Assistance

Paper • 2411.09944 • Published 19 days ago • 12

upvoted a paper 14 days ago

Top-nσ: Not All Logits Are You Need

Paper • 2411.07641 • Published 21 days ago • 18

upvoted a paper 26 days ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 29 days ago • 46

upvoted an article about 1 month ago

Article

🕳️ Attention Sinks in LLMs for endless fluency

•

Oct 9, 2023

• 7

upvoted a paper about 2 months ago

Addition is All You Need for Energy-efficient Language Models

Paper • 2410.00907 • Published Oct 1 • 144

upvoted 2 articles about 2 months ago

Article

Scaling AI-based Data Processing with Hugging Face + Dask

Oct 9

• 24

Article

How 🤗 Accelerate runs very large models thanks to PyTorch

Sep 27, 2022

• 10

upvoted a paper about 2 months ago

MLP-KAN: Unifying Deep Representation and Function Learning

Paper • 2410.03027 • Published Oct 3 • 28

upvoted 2 papers 2 months ago

LEOPARD : A Vision Language Model For Text-Rich Multi-Image Tasks

Paper • 2410.01744 • Published Oct 2 • 25

YesBut: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models

Paper • 2409.13592 • Published Sep 20 • 48

upvoted a paper 3 months ago

LLaMA-Omni: Seamless Speech Interaction with Large Language Models

Paper • 2409.06666 • Published Sep 10 • 55

upvoted 2 papers 4 months ago

POA: Pre-training Once for Models of All Sizes

Paper • 2408.01031 • Published Aug 2 • 26

Gemma 2: Improving Open Language Models at a Practical Size

Paper • 2408.00118 • Published Jul 31 • 75

upvoted a paper 5 months ago

OmniNOCS: A unified NOCS dataset and model for 3D lifting of 2D objects

Paper • 2407.08711 • Published Jul 11 • 6

upvoted a collection 5 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 217

upvoted 3 papers 5 months ago

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4 • 35

OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation

Paper • 2407.02371 • Published Jul 2 • 49

MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation

Paper • 2407.00468 • Published Jun 29 • 34