hendrydong
/

Mistral-RM-for-RAFT-GSHF-v0

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

hendrydong commited on Mar 23

Commit

e5e7aaf

•

1 Parent(s): 739cb2d

Update README.md

Files changed (1) hide show

README.md +10 -0

README.md CHANGED Viewed

@@ -1,3 +1,13 @@
 To use this model, you need to load by `AutoModelForSequenceClassification`,
 ```python
 model = AutoModelForSequenceClassification.from_pretrained(

+# Training
+The base model is `mistralai/Mistral-7B-Instruct-v0.2`.
+We also merge the training script at https://github.com/WeiXiongUST/RLHF-Reward-Modeling.
+Thanks Wei (https://huggingface.co/weqweasdas) for his help and contribution to the community.
+# Usage
 To use this model, you need to load by `AutoModelForSequenceClassification`,
 ```python
 model = AutoModelForSequenceClassification.from_pretrained(