Chen Zheng
commited on
Commit
•
97666a0
1
Parent(s):
9509278
update readme
Browse files
README.md
CHANGED
@@ -13,7 +13,7 @@ ICE-GRT is a chat assistant trained by Reinforcement Learning from Human Feedbac
|
|
13 |
|
14 |
Paper 1 (SFT: ICE-Instruct): https://arxiv.org/abs/2310.04945
|
15 |
|
16 |
-
|
17 |
|
18 |
## Uses
|
19 |
|
|
|
13 |
|
14 |
Paper 1 (SFT: ICE-Instruct): https://arxiv.org/abs/2310.04945
|
15 |
|
16 |
+
__Paper 2 (RLHF: ICE-GRT):__ https://arxiv.org/abs/2401.02072
|
17 |
|
18 |
## Uses
|
19 |
|