Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference status
Reset Inference status
Warm
Cold
Frozen
Misc
Reset Misc
MoSMamba
custom_code
AutoTrain Compatible
Misc with no match
Inference Endpoints
text-generation-inference
Eval Results
Merge
4-bit precision
text-embeddings-inference
8-bit precision
Carbon Emissions
Mixture of Experts
Apply filters
Models
13
Full-text search
Edit filters
Sort: Trending
Active filters:
MoSMamba
Clear all
jonathanjordan21/mos-mamba-6x130m-hf
Text Generation
•
Updated
Aug 30
•
32
jonathanjordan21/mos-mamba-6x130m-train
Text Generation
•
Updated
Jun 26
•
11
jonathanjordan21/mixba
Text Generation
•
Updated
Jun 27
•
192
jonathanjordan21/mos-mamba-6x130m-trainer
Text Generation
•
Updated
Jul 17
•
33
jonathanjordan21/mos-mamba-18x130m-trainer-dgx
Text Generation
•
Updated
Jul 17
•
9
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-pile
Text Generation
•
Updated
Aug 16
•
9
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-pile-lora-sft
Updated
Aug 23
•
5
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-lora-sft-merged
Text Generation
•
Updated
Aug 23
•
7
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-pile-sft
Updated
Sep 1
•
5
jonathanjordan21/mos-mamba-6x130m-trainer-sft
Text Generation
•
Updated
Sep 15
•
132
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-pile-sft-reinforcement
Updated
Sep 10
•
2
jonathanjordan21/mos-mamba-18x130m-trainer-dgx-pile-sft-dpo
Updated
Sep 12
•
6
jonathanjordan21/mos-mamba-6x130m-trainer-dgx-pile-sft-2
Updated
Sep 10
•
6