Hugging Face
Models
85
Sort: Trending
Active filters:
reward model
nvidia/Llama-3.1-Nemotron-70B-Reward • Updated 25 days ago • 3.54k • 58
Qwen/Qwen2.5-Math-RM-72B • Text Classification • Updated 9 days ago • 7.79k • 48
nvidia/Llama-3.1-Nemotron-70B-Reward-HF • Updated 25 days ago • 2.94k • 64
fnlp/moss-rlhf-reward-model-7B-en • Updated Jul 13, 2023 • 9
berkeley-nest/Starling-LM-7B-alpha • Text Generation • Updated Mar 20 • 55.5k • 553
berkeley-nest/Starling-RM-7B-alpha • Updated Jul 30 • 114 • 100
nvidia/Llama2-13B-SteerLM-RM • Text Generation • Updated Feb 22 • 32 • 8
Nexusflow/Starling-LM-7B-beta • Text Generation • Updated Apr 3 • 6.22k • 341
johnsnowlabs/JSL-MedMNX-7B • Text Generation • Updated Apr 18 • 2.59k • 4
johnsnowlabs/JSL-MedMNX-7B-SFT • Text Generation • Updated Apr 18 • 2.63k • 2
johnsnowlabs/JSL-MedMNX-7B-v2.0 • Text Generation • Updated Apr 22 • 2.64k • 3
nvidia/Nemotron-4-340B-Reward • Updated Jun 19 • 360 • 109
internlm/internlm2-7b-reward • Text Classification • Updated Jul 15 • 463 • 15
second-state/Llama-3.1-Nemotron-70B-Reward-HF-GGUF • Text Generation • Updated 22 days ago • 1.29k • 1
gaianet/Llama-3.1-Nemotron-70B-Reward-HF-GGUF • Text Generation • Updated 22 days ago • 629 • 1
yale-nlp/MDCureRM • Updated 6 days ago • 17 • 1
mradermacher/Starling-LM-7B-alpha-GGUF • Updated 5 days ago • 176 • 1
nicholasKluge/RewardModelPT • Text Classification • Updated Jun 18 • 65
nicholasKluge/RewardModel • Text Classification • Updated Jun 18 • 17
Ablustrund/moss-rlhf-reward-model-7B-zh • Updated Jul 13, 2023 • 5 • 23
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 14
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 14 • 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 15 • 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2 • Text Generation • Updated Nov 27, 2023 • 14 • 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2 • Text Generation • Updated Nov 27, 2023 • 15 • 2
TheBloke/Starling-LM-7B-alpha-GGUF • Updated Nov 28, 2023 • 1.57k • 95
TheBloke/Starling-LM-7B-alpha-AWQ • Text Generation • Updated Nov 28, 2023 • 56 • 9
second-state/Starling-LM-7B-alpha-GGUF • Text Generation • Updated Mar 20 • 254 • 3
TheBloke/Starling-LM-7B-alpha-GPTQ • Text Generation • Updated Nov 28, 2023 • 35 • 9
bartowski/Starling-LM-7B-alpha-old-exl2 • Text Generation • Updated Nov 28, 2023