sarvamai
/

sarvam-2b-v0.5

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

rahular commited on Aug 15

Commit

d223b8f

•

1 Parent(s): cd638eb

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -11,6 +11,8 @@ The final checkpoint of `sarvam-2b` will be released soon, and it will be traine
 The current checkpoint has not undergone any post-training. You can see the capabilities of the current checkpoint in [this video](https://www.youtube.com/watch?v=DFtAS1BCKvk).
 Getting started:
 ```
 from transformers import pipeline
@@ -39,7 +41,4 @@ Here is a comparison of fertility scores between `sarvam-2b` and other popular m
 |tel_Telu|2.14  |13.3     |4.57   |3.06  |
 |**Average** |**2.08**  |**9.34**     |**4.01**   |**3.00**  |
-This model is trained on the NeMo stack on Nvidia H100s on the Yotta data center.
 More technical details like evaluations and benchmarking will be posted soon.

 The current checkpoint has not undergone any post-training. You can see the capabilities of the current checkpoint in [this video](https://www.youtube.com/watch?v=DFtAS1BCKvk).
+The model was trained with NVIDIA NeMo™ Framework on the Yotta Shakti Cloud using HGX H100 systems.
 Getting started:
 ```
 from transformers import pipeline
 |tel_Telu|2.14  |13.3     |4.57   |3.06  |
 |**Average** |**2.08**  |**9.34**     |**4.01**   |**3.00**  |
 More technical details like evaluations and benchmarking will be posted soon.