Aratako
/

LightChatAssistant-4x7B

@@ -13,13 +13,101 @@ tags:
 [GGUF版はこちら/Click here for the GGUF version](https://huggingface.co/Aratako/LightChatAssistant-4x7B-GGUF)
 ## 概要
-[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)と全く同じ手法を用いつつ、対象を4モデルに拡張してそれらをMoEしたモデルです。
-素晴らしい手法を提案いただいた[@Sdff-Ltba](https://huggingface.co/Sdff-Ltba)さんに感謝します。
-以下の4モデルをChatVectorの加算によって対話強化して素材に使用しています。
 - [NTQAI/chatntq-ja-7b-v1.0](https://huggingface.co/NTQAI/chatntq-ja-7b-v1.0)
 - [stabilityai/japanese-stablelm-instruct-gamma-7b](https://huggingface.co/stabilityai/japanese-stablelm-instruct-gamma-7b)
 - [Elizezen/Antler-7B](https://huggingface.co/Elizezen/Antler-7B)
 - [Elizezen/Hameln-japanese-mistral-7B](https://huggingface.co/Elizezen/Hameln-japanese-mistral-7B)
-※詳細は後ほど更新

 [GGUF版はこちら/Click here for the GGUF version](https://huggingface.co/Aratako/LightChatAssistant-4x7B-GGUF)
 ## 概要
+[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)と全く同じ手法を用いつつ、対象を4モデルに拡張してそれらをMoEしたモデルです。素晴らしい手法を提案いただいた[@Sdff-Ltba](https://huggingface.co/Sdff-Ltba)さんに感謝します。
+モデルサイズは大きくなっていますが、推論に同時に使われるエキスパートは2つなので、GPUにオフロードさえできれば元の[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)とほぼ同じような速度で動きます。
+以下の4モデルをChat Vectorの加算によって対話強化して素材に使用しています。
 - [NTQAI/chatntq-ja-7b-v1.0](https://huggingface.co/NTQAI/chatntq-ja-7b-v1.0)
 - [stabilityai/japanese-stablelm-instruct-gamma-7b](https://huggingface.co/stabilityai/japanese-stablelm-instruct-gamma-7b)
 - [Elizezen/Antler-7B](https://huggingface.co/Elizezen/Antler-7B)
 - [Elizezen/Hameln-japanese-mistral-7B](https://huggingface.co/Elizezen/Hameln-japanese-mistral-7B)
+Chat Vectorの適用方法は[Sdff-Ltba/LightChatAssistant-2x7B](https://huggingface.co/Sdff-Ltba/LightChatAssistant-2x7B)と同じで、Mistral-7B-Instruct-v0.2とMistral-7B-v0.1の差分を0.8倍してそれぞれのモデルに足しています。
+その後、[mergekit](https://github.com/cg123/mergekit)を使い、4モデルをMoEしました。MoEのconfigは以下の通りです。
+```yaml
+base_model: ./chatntq-chatvector
+gate_mode: hidden
+dtype: bfloat16
+experts:
+  - source_model: ./chatntq-chatvector
+    positive_prompts:
+    - "roleplay"
+    - "question"
+    - "answer"
+    - "chat"
+    - "companion"
+    - "character"
+    - "math"
+    - "code"
+    - "[Mode: Roleplay]"
+    - "[Mode: Chat]"
+    negative_prompts:
+    - "storywriting"
+    - "book"
+    - "story"
+    - "chapter"
+    - "tale"
+    - "history"
+    - "write"
+    - "novel"
+    - "[Mode: Writing]"
+  - source_model: ./japanese-stablelm-instruct-gamma-chatvector
+    positive_prompts:
+    - "roleplay"
+    - "question"
+    - "answer"
+    - "chat"
+    - "companion"
+    - "character"
+    - "math"
+    - "code"
+    - "[Mode: Roleplay]"
+    - "[Mode: Chat]"
+    negative_prompts:
+    - "storywriting"
+    - "book"
+    - "story"
+    - "chapter"
+    - "tale"
+    - "history"
+    - "write"
+    - "novel"
+    - "[Mode: Writing]"
+  - source_model: ./Hameln-japanese-mistral-chatvector
+    positive_prompts:
+    - "sex"
+    - "storywriting"
+    - "erotic"
+    - "fuck"
+    - "orgasm"
+    - "uncensored"
+    - "book"
+    - "story"
+    - "chapter"
+    - "tale"
+    - "history"
+    - "write"
+    - "novel"
+    - "[Mode: Writing]"
+  - source_model: ./Antler-chatvector
+    positive_prompts:
+    - "sex"
+    - "storywriting"
+    - "erotic"
+    - "fuck"
+    - "orgasm"
+    - "uncensored"
+    - "book"
+    - "story"
+    - "chapter"
+    - "tale"
+    - "history"
+    - "write"
+    - "novel"
+    - "[Mode: Writing]"
+tokenizer_source: union
+```