nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

okuchaiev committed on 29 days ago

Commit dfe497a (1 parent: 250db5c)

Update README.md

Files changed (1)

README.md +5 -8


@@ -98,11 +98,6 @@ print(generated_text)
-## Contact
-E-Mail: [Zhilin Wang](mailto:zhilinw@nvidia.com)
 ## Citation
 If you find this model useful, please cite the following works
@@ -129,14 +124,13 @@ If you find this model useful, please cite the following works
 ## References(s):
+* [NeMo Aligner](https://arxiv.org/abs/2405.01481)
 * [HelpSteer2-Preference](https://arxiv.org/abs/2410.01257)
-* [SteerLM method](https://arxiv.org/abs/2310.05344)
-* [HelpSteer](https://arxiv.org/abs/2311.09528)
 * [HelpSteer2](https://arxiv.org/abs/2406.08673)
 * [Introducing Llama 3.1: Our most capable models to date](https://ai.meta.com/blog/meta-llama-3-1/)
 * [Meta's Llama 3.1 Webpage](https://www.llama.com/docs/model-cards-and-prompt-formats/llama3_1)
 * [Meta's Llama 3.1 Model Card](https://github.com/meta-llama/llama-models/blob/main/models/llama3_1/MODEL_CARD.md)
 ## Model Architecture:
 **Architecture Type:** Transformer <br>
@@ -167,6 +161,9 @@ v1.0
 # Training & Evaluation:
+## Alignment methodology
+* REINFORCE implemented in NeMo Aligner
 ## Datasets:
 **Data Collection Method by dataset** <br>