This model was developed by the LLM research consortium of (주)미디어그룹사람과숲 and (주)마커.
The license is cc-by-nc-sa-4.0.

COLA3-7B: a Llama 2 7B base model fine-tuned with the IA3 method.

**Details on the IA3 method: K(G)OAT**
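For readers unfamiliar with IA3, the following is a minimal sketch of the general idea, not the training code used for this model. IA3 keeps the pretrained weights frozen and learns small vectors that rescale activations element-wise (in the original method: keys, values, and the intermediate feed-forward activations). The helper names and the toy weight values below are hypothetical and chosen only for illustration.

```python
# Minimal sketch of the IA3 idea (not the training code used for COLA3-7B).
# The pretrained weights stay frozen; only a small rescaling vector is learned.
# The vector starts at 1.0, so the adapted layer initially matches the base layer.

def matvec(weight, x):
    """Multiply a matrix (given as a list of rows) by a vector."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weight]

def ia3_linear(weight, x, scale):
    """Frozen linear layer whose output is rescaled element-wise by an IA3 vector."""
    return [s * y for s, y in zip(scale, matvec(weight, x))]

# Toy frozen key projection: 2 outputs, 3 inputs (hypothetical values).
W_k = [[1.0, 0.0, 2.0],
       [0.5, 1.0, 0.0]]
x = [1.0, 2.0, 3.0]

l_k_init = [1.0, 1.0]     # IA3 initialisation: identity rescaling
l_k_trained = [0.5, 2.0]  # hypothetical values after fine-tuning

base_out = matvec(W_k, x)                    # [7.0, 2.5]
init_out = ia3_linear(W_k, x, l_k_init)      # same as base_out
tuned_out = ia3_linear(W_k, x, l_k_trained)  # [3.5, 5.0]
```

Because only the vectors are updated, the number of trained values per adapted layer equals the size of that layer's activation rather than the size of its weight matrix.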

Model Details

Model Developers: Seungyoo-Lee (DopeorNope)

Input: Models input text only.

Output: Models generate text only.
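Since the model takes text in and returns text, a typical way to try it is through the Hugging Face `transformers` library. The sketch below assumes that the `DopeorNope/COLA3-7B` repository hosts weights loadable with `AutoModelForCausalLM`, and that `torch`, `transformers`, and `accelerate` are installed; neither assumption is confirmed by this card. The `generate` helper and its default `max_new_tokens` value are illustrative, and no particular prompt template is implied.

```python
MODEL_ID = "DopeorNope/COLA3-7B"  # repo id as it appears on the model page

def generate(prompt, max_new_tokens=128):
    """Return the model's text continuation of `prompt` (illustrative helper)."""
    # Imported lazily so the heavy dependencies are needed only at call time.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.float16,  # assumption: half precision to reduce memory use
        device_map="auto",          # requires the `accelerate` package
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("...")` with a Korean or English prompt downloads the weights on first use, so expect a long initial load.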

Model Architecture

COLA3-7B inherits the architecture of its base model, KO-Platypus2-7B-ex, which is an auto-regressive language model based on the LLaMA2 transformer architecture.

Base Model: kyujinpy/KO-Platypus2-7B-ex

Training Dataset

Eng_Kor_COT_combined was used for finetuning.

Training was done on a desktop with two A5000 GPUs (24 GB each).
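One thing that helps a setup of this size is that IA3 trains very few parameters. As a rough back-of-the-envelope sketch, the count below assumes the standard Llama 2 7B dimensions and IA3 vectors on the key, value, and feed-forward down projections of every layer. The card does not state which modules were actually adapted, so treat the result as an estimate under those assumptions.

```python
# Assumptions (not stated in this card): standard Llama 2 7B dimensions, and
# one IA3 vector each for the key projection output, the value projection
# output, and the input of the feed-forward down projection in every layer.
HIDDEN_SIZE = 4096         # width of key/value projection outputs
INTERMEDIATE_SIZE = 11008  # width of the feed-forward intermediate activation
NUM_LAYERS = 32

def ia3_trainable_params(hidden, intermediate, layers):
    """Count learned IA3 scaling values across all transformer layers."""
    per_layer = hidden + hidden + intermediate  # key + value + feed-forward
    return per_layer * layers

ia3_params = ia3_trainable_params(HIDDEN_SIZE, INTERMEDIATE_SIZE, NUM_LAYERS)
print(ia3_params)  # 614400
```

Under these assumptions the trained vectors amount to roughly 0.01% of a model with about 7 billion parameters. The frozen base weights still have to fit in GPU memory during training.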

Limitations and bias

Llama 2 and fine-tuned variants are a new technology that carries risks with use. Testing conducted to date has been in English, and has not covered, nor could it cover all scenarios. For these reasons, as with all LLMs, Llama 2 and any fine-tuned variant's potential outputs cannot be predicted in advance, and the model may in some instances produce inaccurate, biased or other objectionable responses to user prompts. Therefore, before deploying any applications of Llama 2 variants, developers should perform safety testing and tuning tailored to their specific applications of the model.

Please see the Responsible Use Guide available at https://ai.meta.com/llama/responsible-use-guide/

Citations

@article{platypus2023,
    title={Platypus: Quick, Cheap, and Powerful Refinement of LLMs}, 
    author={Ariel N. Lee and Cole J. Hunter and Nataniel Ruiz},
    journal={arXiv preprint arXiv:2308.07317},
    year={2023}
}

@misc{touvron2023llama,
    title={Llama 2: Open Foundation and Fine-Tuned Chat Models}, 
    author={Hugo Touvron and Louis Martin and Kevin Stone and Peter Albert and Amjad Almahairi and Yasmine Babaei and Nikolay Bashlykov and others},
    year={2023},
    eprint={2307.09288},
    archivePrefix={arXiv},
}

@inproceedings{
    hu2022lora,
    title={Lo{RA}: Low-Rank Adaptation of Large Language Models},
    author={Edward J Hu and Yelong Shen and Phillip Wallis and Zeyuan Allen-Zhu and Yuanzhi Li and Shean Wang and Lu Wang and Weizhu Chen},
    booktitle={International Conference on Learning Representations},
    year={2022},
    url={https://openreview.net/forum?id=nZeVKeeFYf9}
}
