This model started out of curiosity about what would be the result if four RP models were combined. Was this not suitable for the MoE's design? A problem occurred during the quantization process