Ethan Maxwell's picture

11 7

Ethan Maxwell

EthanMaxwell

·

AI & ML interests

None yet

Recent Activity

liked a model 7 days ago

NexaAIDev/Qwen2-Audio-7B-GGUF

liked a model 18 days ago

NexaAIDev/omnivision-968M

liked a model 24 days ago

genmo/mochi-1-preview

View all activity

Organizations

None yet

EthanMaxwell's activity

liked a model 7 days ago

NexaAIDev/Qwen2-Audio-7B-GGUF

Audio-Text-to-Text • Updated 7 days ago • 10k • 95

liked a model 18 days ago

NexaAIDev/omnivision-968M

Updated 4 days ago • 10.3k • 442

liked 4 models 24 days ago

genmo/mochi-1-preview

Text-to-Video • Updated 11 days ago • 45.4k • 1.06k

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 6 days ago • 92.5k • 399

microsoft/OmniParser

Image-Text-to-Text • Updated about 7 hours ago • 10.6k • 1.4k

Etched/oasis-500m

Updated 28 days ago • 5.13k • 421

upvoted 11 papers 24 days ago

SVDQunat: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Paper • 2411.05007 • Published 25 days ago • 16

GazeGen: Gaze-Driven User Interaction for Visual Content Generation

Paper • 2411.04335 • Published 26 days ago • 14

Needle Threading: Can LLMs Follow Threads through Near-Million-Scale Haystacks?

Paper • 2411.05000 • Published 25 days ago • 21

VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos

Paper • 2411.04923 • Published 25 days ago • 20

Thanos: Enhancing Conversational Agents with Skill-of-Mind-Infused Large Language Model

Paper • 2411.04496 • Published 26 days ago • 22

DimensionX: Create Any 3D and 4D Scenes from a Single Image with Controllable Video Diffusion

Paper • 2411.04928 • Published 25 days ago • 48

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 25 days ago • 49

TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Paper • 2411.04709 • Published 27 days ago • 25

BitNet a4.8: 4-bit Activations for 1-bit LLMs

Paper • 2411.04965 • Published 25 days ago • 63

ReCapture: Generative Video Camera Controls for User-Provided Videos using Masked Video Fine-Tuning

Paper • 2411.05003 • Published 25 days ago • 70

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

Paper • 2411.04905 • Published 25 days ago • 109

liked a model 24 days ago

tencent/Tencent-Hunyuan-Large

Text Generation • Updated 9 days ago • 283 • 476