Update README.md
Browse files
README.md
CHANGED
@@ -29,7 +29,7 @@ Full details on simulation and training can be found [here](https://github.com/a
|
|
29 |
|
30 |
# Training Procedure
|
31 |
|
32 |
-
Trained with [Stable Alignment](https://github.com/agi-templar/Stable-Alignment) on 8xA100s for 3H. The start checkpoint is the [
|
33 |
|
34 |
We have also released the [better-base model](https://huggingface.co/agi-css/better-base) which is the start checkpoint of SFT.
|
35 |
|
|
|
29 |
|
30 |
# Training Procedure
|
31 |
|
32 |
+
Trained with [Stable Alignment](https://github.com/agi-templar/Stable-Alignment) on 8xA100s for 3H. The start checkpoint is the [hh-rlhf-sft model](https://huggingface.co/agi-css/hh-rlhf-sft).
|
33 |
|
34 |
We have also released the [better-base model](https://huggingface.co/agi-css/better-base) which is the start checkpoint of SFT.
|
35 |
|