jan-hq
/

TinyJensen-1.1B-Chat

@@ -6,9 +6,6 @@ tags:
 - trl
 - sft
 - generated_from_trainer
-- trl
-- sft
-- generated_from_trainer
 datasets:
 - jan-hq/systemchat_binarized
 - jan-hq/youtube_transcripts_qa
@@ -16,32 +13,68 @@ datasets:
 model-index:
 - name: TinyJensen-1.1B-Chat
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
-# TinyJensen-1.1B-Chat
-This model is a fine-tuned version of [jan-hq/LlamaCorn-1.1B-Chat](https://huggingface.co/jan-hq/LlamaCorn-1.1B-Chat) on the jan-hq/systemchat_binarized, the jan-hq/youtube_transcripts_qa and the jan-hq/youtube_transcripts_qa_ext datasets.
-It achieves the following results on the evaluation set:
-- Loss: 0.8771
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
-More information needed
-## Training procedure
-### Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
@@ -56,7 +89,7 @@ The following hyperparameters were used during training:
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 5
-### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
@@ -64,10 +97,10 @@ The following hyperparameters were used during training:
 | 0.6608        | 2.0   | 414  | 0.7941          |
 | 0.526         | 3.0   | 621  | 0.8186          |
 | 0.4388        | 4.0   | 829  | 0.8643          |
-| 0.3888        | 4.99  | 1035 | 0.8771          |
-### Framework versions
 - Transformers 4.37.2
 - Pytorch 2.1.2+cu121

 - trl
 - sft
 - generated_from_trainer
 datasets:
 - jan-hq/systemchat_binarized
 - jan-hq/youtube_transcripts_qa
 model-index:
 - name: TinyJensen-1.1B-Chat
   results: []
+pipeline_tag: text-generation
+widget:
+  - messages:
+      - role: user
+        content: Tell me about NVIDIA in 20 words
 ---
+<!-- header start -->
+<!-- 200823 -->
+<div style="width: auto; margin-left: auto; margin-right: auto"
+>
+<img src="https://github.com/janhq/jan/assets/89722390/35daac7d-b895-487c-a6ac-6663daaad78e" alt="Jan banner"
+ style="width: 100%; min-width: 400px; display: block; margin: auto;">
+</div>
+<p align="center">
+    <a href="https://jan.ai/">Jan</a
+>
+    - <a
+ href="https://discord.gg/AsJ8krTT3N">Discord</a>
+</p>
+<!-- header end -->
+# Model description
+- Finetuned [LlamaCorn-1.1B-Chat](https://huggingface.co/jan-hq/LlamaCorn-1.1B-Chat) further to act like Jensen Huang - CEO of NVIDIA.
+- Use this model with caution because it can make you laugh.
+# Prompt template
+ChatML
+```
+<|im_start|>system
+{system_message}<|im_end|>
+<|im_start|>user
+{prompt}<|im_end|>
+<|im_start|>assistant
+```
+# Run this model
+You can run this model using [Jan Desktop](https://jan.ai/) on Mac, Windows, or Linux.
+Jan is an open source, ChatGPT alternative that is:
+- 💻  **100% offline on your machine**: Your conversations remain confidential, and visible only to you.
+- 🗂️ **
+An Open File Format**: Conversations and model settings stay on your computer and can be exported or deleted at any time.
+- 🌐 **OpenAI Compatible**: Local server on port `1337` with OpenAI compatible endpoints
+- 🌍 **Open Source & Free**: We build in public; check out our [Github](https://github.com/janhq)
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/65713d70f56f9538679e5a56/r7VmEBLGXpPLTu2MImM7S.png)
+# About Jan
+Jan believes in the need for an open-source AI ecosystem and is building the infra and tooling to allow open-source AIs to compete on a level playing field with proprietary ones.
+Jan's long-term vision is to build a cognitive framework for future robots, who are practical, useful assistants for humans and businesses in everyday life.
+# Training hyperparameters
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
 - lr_scheduler_warmup_ratio: 0.1
 - num_epochs: 5
+# Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
 | 0.6608        | 2.0   | 414  | 0.7941          |
 | 0.526         | 3.0   | 621  | 0.8186          |
 | 0.4388        | 4.0   | 829  | 0.8643          |
+| 0.3888        | 5.0  | 1035 | 0.8771          |
+# Framework versions
 - Transformers 4.37.2
 - Pytorch 2.1.2+cu121