Edit Models filters

Inference status

Misc

Inference Endpoints

AutoTrain Compatible

text-generation-inference

Misc with no match

4-bit precision

text-embeddings-inference

8-bit precision

Carbon Emissions

Mixture of Experts

Models

10

Full-text search

Active filters: PPO

fb700/chatglm-fitness-RLHF

Updated Mar 6 • 268

fb700/Bofan-chatglm-Best-lora

Updated Aug 24, 2023 • 11 • 10

sehyun66/Tiny-lama-1.3B-chat-ppo

Question Answering • Updated Jan 13

Lichang-Chen/ODIN-ppo-L230-best

Text Generation • Updated Feb 14 • 8

vibhorg/rl4llm_uofm_nlpo_super_t5_arxiv

Text2Text Generation • Updated Mar 20 • 5

vibhorg/rl4llm_uofm_nlpo_unsuper_t5_arxiv

Text2Text Generation • Updated Mar 20 • 3

Fizzarolli/sapphia-410m-RM

pt-sk/GPT2-IMDB-Sentiment-FineTuned-with-PPO

Text Generation • Updated Jun 25 • 14

pt-sk/GPT2_NonToxic

Text Generation • Updated Jul 15 • 9

Kwaai/GPT2_NonToxic

Text Generation • Updated Jul 20 • 8