91 29 324

Florian Zimmermeister

flozi00

AI & ML interests

ASR, German LLM

Recent Activity

updated a dataset about 9 hours ago

flozi00/asr-german-mixed-evals

liked a dataset 7 days ago

AI-MO/NuminaMath-CoT

liked a dataset 8 days ago

HuggingFaceTB/smoltalk

View all activity

Organizations

$A\\Ware's profile picture$

flozi00's activity

updated a dataset about 9 hours ago

flozi00/asr-german-mixed-evals

Viewer • Updated about 9 hours ago • 9.02k • 65 • 1

liked a dataset 7 days ago

AI-MO/NuminaMath-CoT

Viewer • Updated 8 days ago • 860k • 2.8k • 252

liked a dataset 8 days ago

HuggingFaceTB/smoltalk

Viewer • Updated 7 days ago • 2.2M • 3.52k • 199

liked a dataset 11 days ago

mlabonne/smoltalk-flat

Viewer • Updated 11 days ago • 1.1M • 194 • 3

liked a model 12 days ago

parler-tts/parler-tts-mini-multilingual

Text-to-Speech • Updated about 7 hours ago • 6.62k • 15

liked a model 14 days ago

mistralai/Mistral-Large-Instruct-2411

Updated 13 days ago • 2.75k • 158

liked a dataset 16 days ago

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1 • 1.05M • 4.16k • 381

New activity in primeline/whisper-large-v3-turbo-german 19 days ago

Convert to .bin?

#4 opened about 1 month ago by

Artmart23

liked a dataset 19 days ago

PleIAs/common_corpus

Viewer • Updated 10 days ago • 397M • 55.9k • 167

upvoted an article 19 days ago

Article

Releasing the largest multilingual open pretraining dataset

•

19 days ago

• 97

liked a model 21 days ago

Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • Updated 14 days ago • 157k • • 1.13k

upvoted an article 24 days ago

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

•

24 days ago

• 9

liked a dataset 25 days ago

mlabonne/open-perfectblend

Viewer • Updated Oct 18 • 1.42M • 188 • 44

liked a model 25 days ago

VAGOsolutions/SauerkrautLM-v2-14b-DPO

Updated 25 days ago • 897 • 14

upvoted a paper 27 days ago

"Give Me BF16 or Give Me Death"? Accuracy-Performance Trade-Offs in LLM Quantization

Paper • 2411.02355 • Published 28 days ago • 46

liked 2 models 28 days ago

juampahc/bge-m3-m2v-1024

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16 • 1.37M • • 6.87k

reacted to DavidGF's post with 👍 28 days ago

Post

2991

🎉 Celebrating One Year of #SauerkrautLM with Two Groundbreaking Releases!

We're thrilled to announce the release of SauerkrautLM-v2-14b in two specialized versions: VAGOsolutions/SauerkrautLM-v2-14b-SFT and VAGOsolutions/SauerkrautLM-v2-14b-DPO. Built on the robust Qwen2.5-14B foundation, these models represent a significant leap forward in multilingual AI capabilities.

🔬 Technical Breakthroughs:
💠 Innovative three-phase Fine-Tuning approach
💠 Two-step Spectrum SFT + one-step Spectrum DPO optimization phase for enhanced performance
💠 Balance of German and English language capabilities
💠 Advanced function calling - almost on par with Claude-3.5-Sonnet-20240620

🇩🇪 German Language Excellence:
What sets this release apart is our unique achievement in simultaneously improving both German and English capabilities. Through our specialized training approach with over 1.2B tokens across two phases, we've managed to:
💠 Enhance German language understanding and generation (SFT Version > DPO Version)
💠 Maintain authentic German linguistic nuances
💠 Improve cross-lingual capabilities
💠 Preserve cultural context awareness

📊 Training Innovation:
Our three-phase approach targeted specific layer percentages (15%, 20% and 25%) with carefully curated datasets, including:
💠 Mathematics-focused content (proprietary classifier-selected)
💠 High-quality German training data
💠 Specialized function calling datasets
💠 Premium multilingual content

🎁 Community Contribution:
We're also releasing two new datasets in a few days:
1️⃣ SauerkrautLM-Fermented-GER-DPO: 3,300 high-quality German training samples
2️⃣ SauerkrautLM-Fermented-Irrelevance-GER-DPO: 2,000 specialized samples for optimized function call irrelevance handling

Thank you to our incredible community and partners who have supported us throughout this journey. Here's to another year of AI innovation! 🚀

reacted to qq8933's post with 👍 29 days ago

Post

5768

LLaMA-O1: Open Large Reasoning Model Frameworks For Training, Inference and Evaluation With PyTorch and HuggingFace
Large Reasoning Models powered by Monte Carlo Tree Search (MCTS), Self-Play Reinforcement Learning, PPO, AlphaGo Zero's dua policy paradigm and Large Language Models!
https://github.com/SimpleBerry/LLaMA-O1/

What will happen when you compound MCTS ❤ LLM ❤ Self-Play ❤RLHF?
Just a little bite of strawberry!🍓

Past related works:
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning (2410.02884)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B (2406.07394)

2 replies

New activity in primeline/whisper-large-v3-turbo-german about 1 month ago

german or swiss-german

#5 opened about 1 month ago by

jschoene