Add recommendations for inference and finetuning
README.md
CHANGED
@@ -13,6 +13,9 @@ license: apache-2.0
 
 *Paper coming soon* 😊.
 
+🤗 To get started with Falcon (inference, finetuning, quantization, etc.), we recommend reading [this great blogpost from HF](https://huggingface.co/blog/falcon)!
+
+
 ## Why use Falcon-7B?
 
 * **It outperforms comparable open-source models** (e.g., [MPT-7B](https://huggingface.co/mosaicml/mpt-7b), [StableLM](https://github.com/Stability-AI/StableLM), [RedPajama](https://huggingface.co/togethercomputer/RedPajama-INCITE-Base-7B-v0.1) etc.), thanks to being trained on 1,500B tokens of [RefinedWeb](https://huggingface.co/datasets/tiiuae/falcon-refinedweb) enhanced with curated corpora. See the [OpenLLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard).
@@ -54,6 +57,9 @@ for seq in sequences:
 
 💥 **Falcon LLMs require PyTorch 2.0 for use with `transformers`!**
 
+For fast inference with Falcon, check out [Text Generation Inference](https://github.com/huggingface/text-generation-inference)! Read more in this [blogpost](https://huggingface.co/blog/falcon).
+
+You will need **at least 16GB of memory** to swiftly run inference with Falcon-7B.
 
 # Model Card for Falcon-7B
 
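The 16GB recommendation added in this change is stated without a derivation. As a rough sanity check (not the authors' own calculation), the sketch below estimates the memory taken by the model weights alone. It assumes a nominal 7e9 parameters, read off the model name, and standard per-parameter sizes for each dtype; `weight_memory_gb`, `N_PARAMS`, and `BYTES_PER_PARAM` are illustrative names, not part of any library.

```python
def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the model weights alone, in decimal gigabytes."""
    return n_params * bytes_per_param / 1e9


# Assumption: a nominal 7e9 parameters, taken from the model name "Falcon-7B".
N_PARAMS = 7e9

# Storage size of one parameter for a few common dtypes.
BYTES_PER_PARAM = {"float32": 4, "bfloat16": 2, "int8": 1}

# Weight-only estimate per dtype; activations and the KV cache come on top.
estimates = {
    dtype: weight_memory_gb(N_PARAMS, size)
    for dtype, size in BYTES_PER_PARAM.items()
}

if __name__ == "__main__":
    for dtype, gb in estimates.items():
        print(f"{dtype}: ~{gb:.0f} GB for weights")
```

Under these assumptions, 16-bit weights take about 14 GB, which would leave roughly 2 GB of a 16GB budget for activations and the attention cache. Actual usage depends on sequence length, batch size, and runtime overhead.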