ohtaman
/

falcon-7b-kokkai2022-lora

Text Generation

Model card Files Files and versions Community

ohtaman commited on Jul 16, 2023

Commit

2f33103

•

1 Parent(s): da1bd1d

Update README.md

Files changed (1) hide show

README.md +5 -3

README.md CHANGED Viewed

@@ -16,8 +16,10 @@ license: apache-2.0
 This model learned the proceedings of the Japanese parliament in 2022.
 The [dataset](https://huggingface.co/datasets/ohtaman/kokkai2022) is collected using
 [National Diet Library's Search API](https://kokkai.ndl.go.jp/api.html).
-example input:
 ```
 # question
@@ -28,7 +30,7 @@ example input:
 鈴木　俊一
 ```
-output:
 ```
 「財政民主主義」のためには、国庫負担を引き下げるならば、企業の賃上げを実現するためにも、消費者物価の高騰対策等を含めて、経済対策を行い、成長と分配の好循環を実珉化することが重要でございます。
@@ -82,7 +84,7 @@ base_model = transformers.AutoModelForCausalLM.from_pretrained(base_model_name,
 peft_model = peft.PeftModelForCausalLM.from_pretrained(base_model, peft_model_name, torch_dtype=torch.bfloat16)
-prompt = "# question\n麻生太郎\n\n増税すべきとお考えか？\n# answer\n岸田文雄\n\n〔内閣総理大臣岸田文雄君登壇〕"
 input_tokens = tokenizer(prompt, return_tensors="pt").to(peft_model.device)
 input_length = input_tokens.input_ids.shape[1]

 This model learned the proceedings of the Japanese parliament in 2022.
 The [dataset](https://huggingface.co/datasets/ohtaman/kokkai2022) is collected using
 [National Diet Library's Search API](https://kokkai.ndl.go.jp/api.html).
+This model was build for a hackerthon event,  [
+第1回大規模言語モデル分散学習ハッカソン](https://abci.ai/event/2023/06/13/ja_event.html) ([#ABCILLM](https://twitter.com/hashtag/ABCILLM)), as an example of training which used multiple GPUs or multiple nodes.
+An example input is as follows:
 ```
 # question
 鈴木　俊一
 ```
+and the respons is:
 ```
 「財政民主主義」のためには、国庫負担を引き下げるならば、企業の賃上げを実現するためにも、消費者物価の高騰対策等を含めて、経済対策を行い、成長と分配の好循環を実珉化することが重要でございます。
 peft_model = peft.PeftModelForCausalLM.from_pretrained(base_model, peft_model_name, torch_dtype=torch.bfloat16)
+prompt = "# question\n麻生太郎\n\n増税が必要とお考えでしょうか？\n# answer\n鈴木　俊一\n\n"
 input_tokens = tokenizer(prompt, return_tensors="pt").to(peft_model.device)
 input_length = input_tokens.input_ids.shape[1]