Victor Gallego's picture

Victor Gallego

vicgalle

·

https://github.com/vicgalle

AI & ML interests

Preference fine-tuning, alignment & synthetic data. Building LLMs in general!

Organizations

Posts 1

Post

Can you merge models of different sizes? ⚗️

Well, yes, if the models are somewhat compatible. Here is an experiment I did. I wanted to merge two of the best performing models: mlabonne/NeuralBeagle14-7B and jeonsworld/CarbonVillain-en-10.7B-v4

Here is my recipe:
1. Expand the layers of NeuralBeagle to 10.7B ala frankenmerge.
2. DPO-tune the previous model with a high-quality preference dataset, argilla/distilabel-intel-orca-dpo-pairs
3. Merge the previous model with CarbonVillain (needs —allow-crimes in mergekit! 🔪)

And here is the resulting model, CarbonBeagle-11B, which ranked top in the leaderboard for its size class:
vicgalle/CarbonBeagle-11B

Collections 3

Papers 7

arxiv:2406.07188

arxiv:2404.00495

arxiv:2402.08005

arxiv:2312.01957

models 60

vicgalle/Configurable-Hermes-3-Llama-3.1-8B

Text Generation • Updated Aug 30 • 2.45k • 4

vicgalle/Roleplay-Hermes-3-Llama-3.1-8B

Text Generation • Updated Aug 15 • 248 • 7

vicgalle/Merge-Mixtral-Prometheus-8x7B

Text Generation • Updated Aug 13 • 27 • 2

vicgalle/Merge-Mistral-Prometheus-7B

Text Generation • Updated Aug 5 • 16 • 1

vicgalle/Humanish-Roleplay-Llama-3.1-8B

Text Generation • Updated Aug 3 • 284 • 4

vicgalle/Configurable-Hermes-2-Pro-Llama-3-8B

Text Generation • Updated Jul 31 • 9.33k • 6

vicgalle/ConfigurableSOLAR-10.7B

Text Generation • Updated Jul 29 • 3.73k • 2

vicgalle/ConfigurableHermes-7B

Text Generation • Updated Jul 25 • 4.37k • 3

vicgalle/CarbonBeagle-11B-truthy

Text Generation • Updated Jul 25 • 13.3k • 9

vicgalle/CarbonBeagle-11B

Text Generation • Updated Jul 25 • 7.12k • 9

datasets 6

vicgalle/configurable-system-prompt-multitask

Viewer • Updated Apr 23 • 1.95k • 165 • 19

vicgalle/Synthetic-RP

Viewer • Updated Apr 21 • 8 • 39 • 3

vicgalle/worldsim-claude-opus

Viewer • Updated Mar 24 • 552 • 55 • 9

vicgalle/OpenHermesPreferences-roleplay

Viewer • Updated Feb 29 • 3.06k • 43 • 3

vicgalle/OpenHermesPreferences-1k

Viewer • Updated Feb 29 • 1.11k • 40 • 3

vicgalle/alpaca-gpt4

Viewer • Updated Feb 10 • 52k • 4.76k • 238