Nyaribari Reuben

foscraft
Ā·

AI & ML interests

LLMs, VLMs , Vision

Recent Activity

Reacted to DavidGF's post with šŸ”„ 26 days ago
šŸŽ‰ Celebrating One Year of #SauerkrautLM with Two Groundbreaking Releases! We're thrilled to announce the release of SauerkrautLM-v2-14b in two specialized versions: https://huggingface.co/VAGOsolutions/SauerkrautLM-v2-14b-SFT and https://huggingface.co/VAGOsolutions/SauerkrautLM-v2-14b-DPO. Built on the robust Qwen2.5-14B foundation, these models represent a significant leap forward in multilingual AI capabilities. šŸ”¬ Technical Breakthroughs: šŸ’  Innovative three-phase Fine-Tuning approach šŸ’  Two-step Spectrum SFT + one-step Spectrum DPO optimization phase for enhanced performance šŸ’  Balance of German and English language capabilities šŸ’  Advanced function calling - almost on par with Claude-3.5-Sonnet-20240620 šŸ‡©šŸ‡Ŗ German Language Excellence: What sets this release apart is our unique achievement in simultaneously improving both German and English capabilities. Through our specialized training approach with over 1.2B tokens across two phases, we've managed to: šŸ’  Enhance German language understanding and generation (SFT Version > DPO Version) šŸ’  Maintain authentic German linguistic nuances šŸ’  Improve cross-lingual capabilities šŸ’  Preserve cultural context awareness šŸ“Š Training Innovation: Our three-phase approach targeted specific layer percentages (15%, 20% and 25%) with carefully curated datasets, including: šŸ’  Mathematics-focused content (proprietary classifier-selected) šŸ’  High-quality German training data šŸ’  Specialized function calling datasets šŸ’  Premium multilingual content šŸŽ Community Contribution: We're also releasing two new datasets in a few days: 1ļøāƒ£ SauerkrautLM-Fermented-GER-DPO: 3,300 high-quality German training samples 2ļøāƒ£ SauerkrautLM-Fermented-Irrelevance-GER-DPO: 2,000 specialized samples for optimized function call irrelevance handling Thank you to our incredible community and partners who have supported us throughout this journey. Here's to another year of AI innovation!Ā šŸš€
View all activity

Organizations

AI-CREATION's profile picture scikit-learn's profile picture Kornia AI's profile picture Keras Dreambooth Event's profile picture MLX Community's profile picture Paris AI Running Club's profile picture Interactive Media Services Kenya's profile picture

foscraft's activity

Reacted to DavidGF's post with šŸ”„ 26 days ago
view post
Post
2986
šŸŽ‰ Celebrating One Year of #SauerkrautLM with Two Groundbreaking Releases!

We're thrilled to announce the release of SauerkrautLM-v2-14b in two specialized versions: VAGOsolutions/SauerkrautLM-v2-14b-SFT and VAGOsolutions/SauerkrautLM-v2-14b-DPO. Built on the robust Qwen2.5-14B foundation, these models represent a significant leap forward in multilingual AI capabilities.

šŸ”¬ Technical Breakthroughs:
šŸ’  Innovative three-phase Fine-Tuning approach
šŸ’  Two-step Spectrum SFT + one-step Spectrum DPO optimization phase for enhanced performance
šŸ’  Balance of German and English language capabilities
šŸ’  Advanced function calling - almost on par with Claude-3.5-Sonnet-20240620

šŸ‡©šŸ‡Ŗ German Language Excellence:
What sets this release apart is our unique achievement in simultaneously improving both German and English capabilities. Through our specialized training approach with over 1.2B tokens across two phases, we've managed to:
šŸ’  Enhance German language understanding and generation (SFT Version > DPO Version)
šŸ’  Maintain authentic German linguistic nuances
šŸ’  Improve cross-lingual capabilities
šŸ’  Preserve cultural context awareness

šŸ“Š Training Innovation:
Our three-phase approach targeted specific layer percentages (15%, 20% and 25%) with carefully curated datasets, including:
šŸ’  Mathematics-focused content (proprietary classifier-selected)
šŸ’  High-quality German training data
šŸ’  Specialized function calling datasets
šŸ’  Premium multilingual content

šŸŽ Community Contribution:
We're also releasing two new datasets in a few days:
1ļøāƒ£ SauerkrautLM-Fermented-GER-DPO: 3,300 high-quality German training samples
2ļøāƒ£ SauerkrautLM-Fermented-Irrelevance-GER-DPO: 2,000 specialized samples for optimized function call irrelevance handling

Thank you to our incredible community and partners who have supported us throughout this journey. Here's to another year of AI innovation!Ā šŸš€
replied to automatedstockminingorg's post 27 days ago
view reply

Try google colab.
You can run it on the free tier.

updated a Space 4 months ago