Charles I Niswander II

charlesniswander

dhar174

AI & ML interests

None yet

Organizations

None yet

charlesniswander's activity

upvoted a paper about 13 hours ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published 2 days ago • 32

liked a model 1 day ago

neuralmagic/Sparse-Llama-3.1-8B-2of4

Text Generation • Updated 7 days ago • 576 • 21

upvoted 2 papers 16 days ago

Small Language Models are Equation Reasoners

Paper • 2409.12393 • Published Sep 19 • 1

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published 21 days ago • 30

upvoted a paper 19 days ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 20 days ago • 48

upvoted a paper 26 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 50

upvoted 2 papers about 1 month ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5 • 26

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11 • 16

upvoted a collection about 2 months ago

Reasoning

Collection • 151 items • Updated Apr 6 • 27

upvoted 2 papers about 2 months ago

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 8

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

Paper • 2409.20537 • Published Sep 30 • 12

upvoted 3 papers 2 months ago

upvoted an article 2 months ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • Aug 12 • 103

upvoted a paper 3 months ago

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

Paper • 2408.15914 • Published Aug 28 • 22

liked a model 3 months ago

THUDM/CogVideoX-5b

Text-to-Video • Updated 5 days ago • 172k • 517

upvoted a paper 3 months ago

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22 • 25

liked 2 models 3 months ago

microsoft/Phi-3.5-mini-instruct

Text Generation • Updated Sep 18 • 584k • 657

nvidia/Llama-3.1-Minitron-4B-Width-Base

Updated Aug 22 • 16 • 187