Reason/COT
Collection
12 items
•
Updated
•
3
Developed by: Daemontatox
License: Apache 2.0
Base Model: Daemontatox/RA_Reasoner
This model is fine-tuned from the Falcon-10B-Instruct model, leveraging advanced training optimizations to enhance reasoning and instruction-following capabilities. It was trained 2x faster using Unsloth and Hugging Face's TRL library.
Further details on hyperparameters and fine-tuning methodology will be added in future updates.
This model is intended for research and development in text generation, reasoning tasks, and instruction-following applications.
---# Open LLM Leaderboard Evaluation Results Detailed results can be found here! Summarized results can be found here!
Metric | Value (%) |
---|---|
Average | 29.00 |
IFEval (0-Shot) | 53.66 |
BBH (3-Shot) | 43.07 |
MATH Lvl 5 (4-Shot) | 22.89 |
GPQA (0-shot) | 9.96 |
MuSR (0-shot) | 7.18 |
MMLU-PRO (5-shot) | 37.26 |
Base model
tiiuae/Falcon3-10B-Base