Added link and description about Optimum support for AMD GPUs
README.md
@@ -93,6 +93,10 @@ Here are a few of the more popular ones to get you started:

Click on the 'Use in Transformers' button to see the exact code to import a specific model into your Python application.
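
For reference, the snippet that button generates typically follows the pattern below; `gpt2` is a placeholder for whichever model ID you select on the Hub.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# "gpt2" is a placeholder model ID; the 'Use in Transformers' button
# shows the exact snippet for the model you are viewing.
model_id = "gpt2"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Hello, my name is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```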

## 5. Optimum Support

For a deeper dive into using Hugging Face libraries on AMD GPUs, check out the [Optimum](https://huggingface.co/docs/optimum/main/en/amd/amdgpu/overview) page, which covers Flash Attention 2, GPTQ quantization, and ONNX Runtime integration in detail.
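
As a rough sketch of what that looks like in practice, the snippet below loads a model with Flash Attention 2 enabled through Transformers. It assumes a ROCm build of PyTorch and the ROCm-compatible `flash-attn` package are installed, and the model ID is a placeholder:

```python
import torch
from transformers import AutoModelForCausalLM

# Placeholder model ID; assumes a ROCm build of PyTorch and the
# ROCm-compatible flash-attn package are installed on the AMD GPU host.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",
    torch_dtype=torch.float16,               # FA2 requires fp16 or bf16
    attn_implementation="flash_attention_2",
    device_map="auto",
)
```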
# Serving a model with TGI
Text Generation Inference (a.k.a. “TGI”) provides an end-to-end solution to deploy large language models for inference at scale.
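
Once a TGI server is up, any HTTP client can hit its endpoint. As a minimal sketch, the example below uses `huggingface_hub.InferenceClient` and assumes a TGI container is already serving a model at `localhost:8080`:

```python
from huggingface_hub import InferenceClient

# Assumes a TGI server is already running and listening at this address.
client = InferenceClient("http://localhost:8080")

response = client.text_generation(
    "Explain what ROCm is in one sentence.",
    max_new_tokens=64,
)
print(response)
```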