digo-prayudha committed ed00d68 (parent: c26162c): Update README.md

README.md, as updated by this commit:

model-index:
- name: Llama-3.2-1B-Indonesian
  results: []
language:
- id
pipeline_tag: text-generation
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# Llama-3.2-1B-Indonesian

This model is a fine-tuned version of [meta-llama/Llama-3.2-1B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) that has been optimized for Indonesian language understanding and generation.

## Training and evaluation data

[Ichsan2895/alpaca-gpt4-indonesian](https://huggingface.co/datasets/Ichsan2895/alpaca-gpt4-indonesian)
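
To get a feel for the data, the dataset can be loaded with the `datasets` library. This is a quick inspection sketch; the `instruction`/`input`/`output` column names follow the usual Alpaca layout and are an assumption to verify against the dataset card.

```python
# Sketch: inspect the fine-tuning data. The column names are assumed to
# follow the standard Alpaca schema (instruction, input, output).
from datasets import load_dataset

ds = load_dataset("Ichsan2895/alpaca-gpt4-indonesian", split="train")
print(ds)     # row count and column names
print(ds[0])  # one example record
```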

### Use With Transformers

```python
import torch
from transformers import pipeline

# Assumed repo id for this checkpoint (committer + model name); the
# original snippet pointed at meta-llama/Llama-3.2-3B-Instruct, which
# is a different, larger model.
model_id = "digo-prayudha/Llama-3.2-1B-Indonesian"
pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]
outputs = pipe(
    messages,
    max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])
```
52 |
|
53 |
### Training hyperparameters
|
54 |
|
|
|
66 |
- mixed_precision_training: Native AMP
|
67 |
|
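
As a rough reproduction guide, a minimal `TrainingArguments` sketch follows. Only `fp16=True` reflects the "Native AMP" entry above; every other value is an illustrative assumption, not the configuration actually used for this checkpoint.

```python
# Hypothetical configuration sketch: only fp16=True ("Native AMP") comes
# from the hyperparameter list above; all other values are placeholders.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="Llama-3.2-1B-Indonesian",
    per_device_train_batch_size=4,   # assumption
    gradient_accumulation_steps=4,   # assumption
    learning_rate=2e-5,              # assumption
    num_train_epochs=1,              # assumption
    fp16=True,                       # mixed_precision_training: Native AMP
    logging_steps=10,                # assumption
)
```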

### Training results

<img src="./train_loss.svg" alt="Train Loss" width="300" height="200">

### Framework versions