mrsteyk
/

openchatgpt-neo-125m

Text Generation

Generated from Trainer

text generation

Inference Endpoints

Model card Files Files and versions Community

mrsteyk commited on Dec 21, 2022

Commit

6868d07

•

1 Parent(s): c038dba

Update README.md

Files changed (1) hide show

README.md +16 -6

README.md CHANGED Viewed

@@ -1,7 +1,12 @@
 ---
 license: mit
 tags:
 - generated_from_trainer
 metrics:
 - accuracy
 model-index:
@@ -9,9 +14,6 @@ model-index:
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # openchatgpt-neo-r1
 This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the openchatgpt safe-r1 dataset.
@@ -21,18 +23,26 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
 ### Training hyperparameters
 The following hyperparameters were used during training:

 ---
 license: mit
+language:
+- en
 tags:
 - generated_from_trainer
+- text generation
+- pytorch
+- casual-lm
 metrics:
 - accuracy
 model-index:
   results: []
 ---
 # openchatgpt-neo-r1
 This model is a fine-tuned version of [EleutherAI/gpt-neo-125M](https://huggingface.co/EleutherAI/gpt-neo-125M) on the openchatgpt safe-r1 dataset.
 ## Model description
+Finetune based on the inner workings of ChatGPT. I won't elaborate on that. You must have a faint idea of how prompt is made for it to spit anything that's not garbled mess.
+This is effectively a schizophrenic idea that met the light of day. Practically a collab of 3 students in a virtual shed.
 ## Intended uses & limitations
+Intended uses & limitations fall in line with OpenAI's. Dataset used consists of safe texts (i.e. not highly sexual/erotica type stuff). NSFW version of the dataset is not planned to exist at the moment.
+Keep in mind that this is a 125m version of GPT-Neo. My 1050Ti Mobile couldn't even handle that without gradient thingmabobs. If anyone knows how to effectively finetune larger models on free colabs - feel free to let me know. Pile tokenizer also has one downside compared to native GPT-2/3 - `Assistant`.
 ## Training and evaluation data
+Data was split in ratio of 95%/5%. Preproccess included removing mentions of OpenAI wherever it was not deemed appropriete (GPT-2 has one of the appropriete mentions). Whole dataset consists of just shy off 3k input-output pairs. One input has multiple outputs (read as: one message has multiple variants of an answer). <<<1% (3 total) are curated lines (i.e. a huge mistake was spotted that needed corrections).
+Heavy bias on IT.
 ## Training procedure
+Input and output were straight up concatenated due to the nature of how ChatGPT works. Padding chosen was the same as the separator token, if that's not effective - please let me know as I am new to this stuff.
 ### Training hyperparameters
 The following hyperparameters were used during training: