136 665 565

Adam Molnar

lunarflu

AI & ML interests

join the Hugging Face discord! hf.co/discord/join

Recent Activity

New activity about 6 hours ago

alpindale/two-million-bluesky-posts:🚩 Report: Legal issue(s)

updated a Space about 7 hours ago

lunarflu/verification-bot

updated a Space about 9 hours ago

discord-community/LevelBot

View all activity

Organizations

lunarflu's activity

New activity in alpindale/two-million-bluesky-posts about 6 hours ago

🚩 Report: Legal issue(s)

#3 opened about 6 hours ago by

terenceeden

updated a Space about 7 hours ago

Running on CPU Upgrade

🔔

Discord to Slack bot

updated a Space about 9 hours ago

Running on CPU Upgrade

🥇

openai-community/gpt2

Text Generation • Updated Feb 19 • 17.7M • • 2.38k

upvoted an article 1 day ago

Article

Let’s make a generation of amazing image generation models

•

1 day ago

• 29

liked a Space 1 day ago

Running

🔥

China AI policy research 🤗

New activity in discord-community/LevelBot 1 day ago

Suggestion Discussion

#25 opened 27 days ago by

nroggendorff

liked a model 3 days ago

rwitz/cat1.0

Text Generation • Updated 18 days ago • 127 • 7

liked a Space 3 days ago

Running on Zero

📉

OS ATLAS

A Foundation Action Model For Generalist GUI Agents

Reacted to sequelbox's post with 👍 9 days ago

Post

1153

next version of sequelbox/Celestia will be microsoft/orca-agentinstruct-1M-v1 style. coming soon

1 reply

Reacted to reach-vb's post with 🚀🤗👍🔥 9 days ago

Post

4083

What a brilliant week for Open Source AI!

Qwen 2.5 Coder by Alibaba - 0.5B / 1.5B / 3B / 7B / 14B/ 32B (Base + Instruct) Code generation LLMs, with 32B tackling giants like Gemnini 1.5 Pro, Claude Sonnet
Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f

LLM2CLIP from Microsoft - Leverage LLMs to train ultra-powerful CLIP models! Boosts performance over the previous SOTA by ~17%
microsoft/llm2clip-672323a266173cfa40b32d4c

Athene v2 Chat & Agent by NexusFlow - SoTA general LLM fine-tuned from Qwen 2.5 72B excels at Chat + Function Calling/ JSON/ Agents
Nexusflow/athene-v2-6735b85e505981a794fb02cc

Orca Agent Instruct by Microsoft - 1 million instruct pairs covering text editing, creative writing, coding, reading comprehension, etc - permissively licensed
microsoft/orca-agentinstruct-1M-v1

Ultravox by FixieAI - 70B/ 8B model approaching GPT4o level, pick any LLM, train an adapter with Whisper as Audio Encoder
reach-vb/ultravox-audio-language-model-release-67373b602af0a52b2a88ae71

JanusFlow 1.3 by DeepSeek - Next iteration of their Unified MultiModal LLM Janus with RectifiedFlow
deepseek-ai/JanusFlow-1.3B

Common Corpus by Pleais - 2,003,039,184,047 multilingual, commercially permissive and high quality tokens!
PleIAs/common_corpus

I'm sure I missed a lot, can't wait for the next week!

Put down in comments what I missed! 🤗

Reacted to TuringsSolutions's post with 👀 9 days ago

Post

733

If I am correct and the LLM model changes the 'shape' of the data as it learns, then I should be able to track and utilize those shape changes as a backpropagation training mechanism, right? Well guess what, I can do that! Entropy, Sparsity, and Density, this is how I can measure the shape of the data the LLM model is creating. Nodes, Clusters, and Edges, these are the mechanisms within the neural network the LLM model updates as it learns these concepts. I measure the effects of these updates, via Entropy, Sparsity, and Density. Check out more in this video: https://youtu.be/jADTt5HHtiw

2 replies

Reacted to erikkaum's post with 👀🔥 9 days ago

Post

1668

A while ago I started experimenting with compiling the Python interpreter to WASM.

To build a secure, fast, and lightweight sandbox for code execution — ideal for running LLM-generated Python code.

- Send code simply as a POST request
- 1-2ms startup times

Hack away:
https://github.com/ErikKaum/runner

Reacted to AdinaY's post with 👀 9 days ago

Post

1437

LLaMA Mesh 🔥 Unifying 3D Mesh Generation with Language Models

Model: Zhengyi/LLaMA-Mesh
Demo: Zhengyi/LLaMA-Mesh
Paper: LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models (2411.09595)

✨ Unified 3D generation & text understanding.
✨ 3D meshes as plain text for seamless LLM integration.
✨ High-quality 3D outputs rivaling specialized models.

Reacted to sayakpaul's post with 🚀❤️ 9 days ago

Post

2340

It's been a while we shipped native quantization support in diffusers 🧨

We currently support bistandbytes as the official backend but using others like torchao is already very simple.

This post is just a reminder of what's possible:

1. Loading a model with a quantization config
2. Saving a model with quantization config
3. Loading a pre-quantized model
4. enable_model_cpu_offload()
5. Training and loading LoRAs into quantized checkpoints

Docs:
https://huggingface.co/docs/diffusers/main/en/quantization/bitsandbytes

1 reply