Update README.md
README.md
---
datasets:
- tiiuae/falcon-refinedweb
license: apache-2.0
language:
- en
inference: false
---

<!-- header start -->
<div style="width: 100%;">
<img src="https://i.imgur.com/EBdldam.jpg" alt="TheBlokeAI" style="width: 100%; min-width: 400px; display: block; margin: auto;">
</div>
<div style="display: flex; justify-content: space-between; width: 100%;">
<div style="display: flex; flex-direction: column; align-items: flex-start;">
<p><a href="https://discord.gg/UBgz4VXf">Chat & support: my new Discord server</a></p>
</div>
<div style="display: flex; flex-direction: column; align-items: flex-end;">
<p><a href="https://www.patreon.com/TheBlokeAI">Want to contribute? TheBloke's Patreon page</a></p>
</div>
</div>
<!-- header end -->

# Falcon-7B-Instruct GPTQ

[...]

Please note this is an experimental GPTQ model. Support for it is currently quite limited.

It is also expected to be **SLOW**. This is currently unavoidable, but is being looked at.

## AutoGPTQ

AutoGPTQ is required: `pip install auto-gptq`
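Once AutoGPTQ is installed, inference can be run from Python. The sketch below is illustrative only: the repo id `TheBloke/falcon-7b-instruct-GPTQ`, the prompt, and the generation settings are assumptions, not part of this card, and a CUDA GPU is required.

```python
from transformers import AutoTokenizer
from auto_gptq import AutoGPTQForCausalLM

# Assumed repo id for this model; adjust to the actual repository
model_name = "TheBloke/falcon-7b-instruct-GPTQ"

tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
model = AutoGPTQForCausalLM.from_quantized(
    model_name,
    device="cuda:0",
    use_triton=False,
    trust_remote_code=True,  # Falcon ships custom modelling code
)

prompt = "Write a short poem about llamas."
input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to("cuda:0")
output = model.generate(input_ids=input_ids, max_new_tokens=100)
print(tokenizer.decode(output[0]))
```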

AutoGPTQ provides pre-compiled wheels for Windows and Linux, with CUDA toolkit 11.7 or 11.8.

If you are running CUDA toolkit 12.x, you will need to compile your own by following these instructions:

```
git clone https://github.com/PanQiWei/AutoGPTQ
cd AutoGPTQ
pip install .
```

These manual steps will require that you have the [Nvidia CUDA toolkit](https://developer.nvidia.com/cuda-12-0-1-download-archive) installed.
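To check which route applies on a given machine, the CUDA release can be parsed out of `nvcc --version`. This small helper is an illustrative sketch, not part of AutoGPTQ:

```python
import re
from typing import Optional

def cuda_major_version(nvcc_output: str) -> Optional[int]:
    """Extract the CUDA major release from `nvcc --version` output."""
    match = re.search(r"release (\d+)\.", nvcc_output)
    return int(match.group(1)) if match else None

# Example `nvcc --version` line; on a real machine, capture it with
# subprocess.run(["nvcc", "--version"], capture_output=True, text=True).stdout
sample = "Cuda compilation tools, release 12.0, V12.0.140"
print(cuda_major_version(sample))  # prints 12 -> compile AutoGPTQ from source
```

A major version of 11 means the pre-compiled wheels should work; 12 means compiling from source as above.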

## text-generation-webui

There is provisional AutoGPTQ support in text-generation-webui.

This requires text-generation-webui as of commit 204731952ae59d79ea3805a425c73dd171d943c3.
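Pinning a checkout to that commit might look like this (the repository URL is an assumption; the commit hash is the one given above):

```shell
git clone https://github.com/oobabooga/text-generation-webui
cd text-generation-webui
git checkout 204731952ae59d79ea3805a425c73dd171d943c3
```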

[...]

* Does not work with any version of GPTQ-for-LLaMa
* Parameters: Groupsize = 64. No act-order.
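For reference, these parameters map onto AutoGPTQ's `BaseQuantizeConfig` roughly as follows. Only `group_size` and `desc_act` come from the bullets above; `bits=4` is an assumption, as the bit width is not restated in this excerpt:

```python
from auto_gptq import BaseQuantizeConfig

quantize_config = BaseQuantizeConfig(
    bits=4,          # assumed quantisation bit width, typical for GPTQ
    group_size=64,   # "Groupsize = 64"
    desc_act=False,  # "No act-order"
)
```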

<!-- footer start -->
## Discord

For further support, and discussions on these models and AI in general, join us at: [TheBloke AI's Discord server](https://discord.gg/UBgz4VXf)

## Thanks, and how to contribute.

Thanks to the [chirper.ai](https://chirper.ai) team!

I've had a lot of people ask if they can contribute. I enjoy providing models and helping people, and would love to be able to spend even more time doing it, as well as expanding into new projects like fine tuning/training.

If you're able and willing to contribute, it'd be most gratefully received and will help me to keep providing models, and work on new AI projects.

Donaters will get priority support on any and all AI/LLM/model questions, plus other benefits.

* Patreon: https://patreon.com/TheBlokeAI
* Ko-Fi: https://ko-fi.com/TheBlokeAI

**Patreon special mentions**: Aemon Algiz; Johann-Peter Hartmann; Talal Aujan; Jonathan Leane; Illia Dulskyi; Khalefa Al-Ahmad; senxiiz; Sebastain Graf; Eugene Pentland; Nikolai Manek; Luke Pendergrass.

Thank you to all my generous patrons and donaters.
<!-- footer end -->

# ✨ Original model card: Falcon-7B-Instruct