legolasyiu committed on
Commit
3a29eab
1 Parent(s): e05f96f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -26
README.md CHANGED
@@ -18,32 +18,6 @@ finetuning with
18
 
19
 
20
  # code
21
-
22
- ```python
23
- from unsloth import FastLanguageModel
24
-
25
- model, tokenizer = FastLanguageModel.from_pretrained(
26
- model_name = "EpistemeAI/EpistemeAI-codegemma-2-9b", # YOUR MODEL YOU USED FOR TRAINING
27
- max_seq_length = max_seq_length,
28
- dtype = dtype,
29
- load_in_4bit = load_in_4bit,
30
- )
31
- FastLanguageModel.for_inference(model) # Enable native 2x faster inference
32
-
33
- # alpaca_prompt = You MUST copy from above!
34
-
35
- inputs = tokenizer(
36
- [
37
- alpaca_prompt.format(
38
- "Create a function to calculate the sum of a sequence of integers.", # instruction
39
- "", # input
40
- "", # output - leave this blank for generation!
41
- )
42
- ], return_tensors = "pt").to("cuda")
43
-
44
- outputs = model.generate(**inputs, max_new_tokens = 64, use_cache = True)
45
- tokenizer.batch_decode(outputs)
46
-
47
  ```
48
 
49
  Formatted text
@@ -71,6 +45,33 @@ Formated text
71
 
72
  ''
73
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
74
  # Uploaded model
75
 
76
  - **Developed by:** EpistemeAI
 
18
 
19
 
20
  # code
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
21
  ```
22
 
23
  Formatted text
 
45
 
46
  ''
47
 
48
+
49
+ ```python
50
+ from unsloth import FastLanguageModel
51
+
52
+ model, tokenizer = FastLanguageModel.from_pretrained(
53
+ model_name = "EpistemeAI/EpistemeAI-codegemma-2-9b", # YOUR MODEL YOU USED FOR TRAINING
54
+ max_seq_length = max_seq_length,
55
+ dtype = dtype,
56
+ load_in_4bit = load_in_4bit,
57
+ )
58
+ FastLanguageModel.for_inference(model) # Enable native 2x faster inference
59
+
60
+ # alpaca_prompt = You MUST copy from above!
61
+
62
+ inputs = tokenizer(
63
+ [
64
+ alpaca_prompt.format(
65
+ "Create a function to calculate the sum of a sequence of integers.", # instruction
66
+ "", # input
67
+ "", # output - leave this blank for generation!
68
+ )
69
+ ], return_tensors = "pt").to("cuda")
70
+
71
+ outputs = model.generate(**inputs, max_new_tokens = 64, use_cache = True)
72
+ tokenizer.batch_decode(outputs)
73
+
74
+
75
  # Uploaded model
76
 
77
  - **Developed by:** EpistemeAI