QuantFactory
/

Biggie-SmoLlm-0.4B-GGUF

Inference Endpoints

Model card Files Files and versions Community

Biggie-SmoLlm-0.4B-GGUF / README.md

aashish1904's picture

Upload README.md with huggingface_hub

e17769d verified 3 months ago

|

1.18 kB


	---

	base_model: HuggingFaceTB/SmolLM-360M-Instruct

	---

	![](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)

	# QuantFactory/Biggie-SmoLlm-0.4B-GGUF
	This is quantized version of [nisten/Biggie-SmoLlm-0.4B](https://huggingface.co/nisten/Biggie-SmoLlm-0.4B) created using llama.cpp

	# Original Model Card


	###Coherent Frankenstein of smolLm-0.36b upped to 0.4b

	This took about 5 hours of semi-automated continuous merging to figure out the recipe.
	Model is smarter, and UNTRAINED. Uploaded it for training. Yet it performs well as is even quantized to 8bit.
	8bit gguf included for testing.

	```bash
	wget https://huggingface.co/nisten/Biggie-SmoLlm-0.4B/resolve/main/Biggie_SmolLM_400M_q8_0.gguf
	```
	```verilog
	./llama-cli -ngl 99 -co --temp 0 -p "How to build a city on Mars via calculating Aldrin-Cycler orbits?" -m Biggie_SmolLM_400M_q8_0.gguf -cnv -fa --keep -1
	```
	![image/png](https://cdn-uploads.huggingface.co/production/uploads/6379683a81c1783a4a2ddba8/XgF2kz3Zz0Jqz7BEVZ96h.png)