Kimiko-13B-fp16 / README.md

TheBloke

Update base_model formatting

39b09fe 12 months ago

preview code

raw

history blame contribute delete

No virus

7.27 kB

	---
	license: other
	model_name: Kimiko 13B
	inference: false
	model_creator: nRuaif
	model_link: https://huggingface.co/nRuaif/Kimiko_13B
	model_type: llama
	quantized_by: TheBloke
	base_model: nRuaif/Kimiko_13B
	---

	<!-- header start -->
	<div style="width: 100%;">
	<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
	</div>
	<div style="display: flex; justify-content: space-between; width: 100%;">
	<div style="display: flex; flex-direction: column; align-items: flex-start;">
	<p><a href="https://discord.gg/theblokeai">Chat & support: my new Discord server</a></p>
	</div>
	<div style="display: flex; flex-direction: column; align-items: flex-end;">
	<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
	</div>
	</div>
	<!-- header end -->

	# Kimiko 13B - FP16
	- Model creator: [nRuaif](https://huggingface.co/nRuaif)
	- Original model: [Kimiko 13B](nRuaif/Kimiko_13B)

	## Description

	This repo contains pytorch format fp16 model files for [none](nRuaif/Kimiko_13B).

	It is the result of merging and/or converting the source repository to float16.

	## Repositories available

	* [GPTQ models for GPU inference, with multiple quantisation parameter options.](https://huggingface.co/TheBloke/Kimiko-13B-GPTQ)
	* [2, 3, 4, 5, 6 and 8-bit GGML models for CPU+GPU inference](https://huggingface.co/TheBloke/Kimiko-13B-GGML)
	* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/TheBloke/Kimiko-13B-fp16)
	* [nRuaif's original LoRA adapter, which can be merged on to the base model.](https://huggingface.co/nRuaif/Kimiko_13B)

	## Prompt template: %%PROMPT_TEMPLATE_TITLE

	```
	<<HUMAN>>
	{prompt}

	<<AIBOT>>
	```

	<!-- footer start -->
	## Discord

	For further support, and discussions on these models and AI in general, join us at:

	[TheBloke AI's Discord server](https://discord.gg/theblokeai)

	## Thanks, and how to contribute.

	Thanks to the [chirper.ai](https://chirper.ai) team!

	I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

	If you're able and willing to contribute it will be most gratefully received and will help me to keep providing more models, and to start work on new AI projects.

	Donaters will get priority support on any and all AI/LLM/model questions and requests, access to a private Discord room, plus other benefits.

	* Patreon: https://patreon.com/TheBlokeAI
	* Ko-Fi: https://ko-fi.com/TheBlokeAI

	Special thanks to: Luke from CarbonQuill, Aemon Algiz.

	Patreon special mentions: Slarti, Chadd, John Detwiler, Pieter, zynix, K, Mano Prime, ReadyPlayerEmma, Ai Maven, Leonard Tan, Edmond Seymore, Joseph William Delisle, Luke @flexchar, Fred von Graf, Viktor Bowallius, Rishabh Srivastava, Nikolai Manek, Matthew Berman, Johann-Peter Hartmann, ya boyyy, Greatston Gnanesh, Femi Adebogun, Talal Aujan, Jonathan Leane, terasurfer, David Flickinger, William Sang, Ajan Kanaga, Vadim, Artur Olbinski, Raven Klaugh, Michael Levine, Oscar Rangel, Randy H, Cory Kujawski, RoA, Dave, Alex, Alexandros Triantafyllidis, Fen Risland, Eugene Pentland, vamX, Elle, Nathan LeClaire, Khalefa Al-Ahmad, Rainer Wilmers, subjectnull, Junyu Yang, Daniel P. Andersen, SuperWojo, LangChain4j, Mandus, Kalila, Illia Dulskyi, Trenton Dambrowitz, Asp the Wyvern, Derek Yates, Jeffrey Morgan, Deep Realms, Imad Khwaja, Pyrater, Preetika Verma, biorpg, Gabriel Tamborski, Stephen Murray, Spiking Neurons AB, Iucharbius, Chris Smitley, Willem Michiel, Luke Pendergrass, Sebastain Graf, senxiiz, Will Dee, Space Cruiser, Karl Bernard, Clay Pascal, Lone Striker, transmissions 11, webtim, WelcomeToTheClub, Sam, theTransient, Pierre Kircher, chris gileta, John Villwock, Sean Connelly, Willian Hasse


	Thank you to all my generous patrons and donaters!

	<!-- footer end -->

	# Original model card: none


	# Model Card for Kimiko_13B

	<!-- Provide a quick summary of what the model is/does. -->

	This is my new Kimiko models, trained with LLaMA2-13B for...purpose

	## Model Details

	### Model Description

	<!-- Provide a longer summary of what this model is. -->



	- Developed by: nRuaif
	- Model type: Decoder only
	- License: CC BY-NC-SA
	- Finetuned from model [optional]: LLaMA 2

	### Model Sources [optional]

	<!-- Provide the basic links for the model. -->

	- Repository: https://github.com/OpenAccess-AI-Collective/axolotl
	[<img src="https://raw.githubusercontent.com/OpenAccess-AI-Collective/axolotl/main/image/axolotl-badge-web.png" alt="Built with Axolotl" width="200" height="32"/>](https://github.com/OpenAccess-AI-Collective/axolotl)
	## Uses

	<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->


	### Direct Use

	<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->

	This model is trained on 3k examples of instructions dataset, high quality roleplay, for best result follow this format
	```
	<<HUMAN>>
	How to do abc

	<<AIBOT>>
	Here is how

	Or with system prompting for roleplay

	<<SYSTEM>>
	A's Persona:
	B's Persona:
	Scenario:
	Add some instruction here on how you want your RP to go.
	```


	## Bias, Risks, and Limitations

	<!-- This section is meant to convey both technical and sociotechnical limitations. -->

	All bias of this model come from LlaMA2 with an exception of NSFW bias.....




	## Training Details

	### Training Data

	<!-- This should link to a Data Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->

	3000 examples from LIMAERP, LIMA and I sample 1000 good instruction from Airboro

	### Training Procedure

	<!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->

	Model is trained with 1 L4 from GCP costing a whooping 2.5USD





	#### Training Hyperparameters

	- Training regime: [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->

	3 epochs with 0.0002 lr, full 4096 ctx token, QLoRA

	#### Speeds, Sizes, Times [optional]

	<!-- This section provides information about throughput, start/end time, checkpoint size if relevant, etc. -->

	It takes 18 hours to train this model with xformers enable

	[More Information Needed]







	[More Information Needed]

	## Environmental Impact

	<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->

	Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).

	- Hardware Type: L4 with 12CPUs 48gb ram
	- Hours used: 5
	- Cloud Provider: GCP
	- Compute Region: US
	- Carbon Emitted: 0.5KG