passaglia committed commit 7471f47 (parent: ae550c6): Update README.md

Files changed (1): README.md (+5, -8)
README.md CHANGED
@@ -33,14 +33,13 @@ The following table shows the performance degradation due to quantization:
 
 
 ## Use with llama.cpp
-Install llama.cpp through brew (works on Mac and Linux)
+Install llama.cpp through brew (works on Mac and Linux):
 
 ```bash
 brew install llama.cpp
 ```
 
-Invoke the llama.cpp server.
-
+Invoke the llama.cpp server:
 ```bash
 $ llama-server \
 --hf-repo elyza/Llama-3-ELYZA-JP-8B-GGUF \
@@ -48,8 +47,7 @@ $ llama-server \
 --port 8080
 ```
 
-Call the API using curl.
-
+Call the API using curl:
 ```bash
 $ curl http://localhost:8080/v1/chat/completions \
 -H "Content-Type: application/json" \
@@ -64,8 +62,7 @@ $ curl http://localhost:8080/v1/chat/completions \
 }'
 ```
 
-Call the API using Python.
-
+Call the API using Python:
 ```python
 import openai
 
@@ -85,7 +82,7 @@ completion = client.chat.completions.create(
 
 ## Use with Desktop App
 
-There are various desktop applications that can handle GGUF models, but here we will introduce how to use the model in the no-code environment LM Studio.
+There are various desktop applications that can handle GGUF models, but here we will introduce how to use the model in the no-code environment [LM Studio](https://lmstudio.ai/).
 
 - **Installation**: Download and install [LM Studio](https://lmstudio.ai/).
 - **Downloading the Model**: Search for `elyza/Llama-3-ELYZA-JP-8B-GGUF` in the search bar on the home page 🏠, and download `Llama-3-ELYZA-JP-8B-q4_k_m.gguf`.
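
The curl step in the diff posts JSON to llama-server's OpenAI-compatible `/v1/chat/completions` endpoint; the request body itself is elided by the diff context. As a sketch, the same request can be built with Python's standard library — the model name and messages below are illustrative assumptions, not text from the commit:

```python
import json
import urllib.request

# Build (but do not send) the request the curl example performs.
# The URL and Content-Type header come from the diff; the payload
# fields follow the standard chat-completions schema and are
# placeholder values, not the commit's exact text.
payload = {
    "model": "elyza/Llama-3-ELYZA-JP-8B-GGUF",
    "messages": [{"role": "user", "content": "Hello"}],
}
req = urllib.request.Request(
    "http://localhost:8080/v1/chat/completions",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
)
```

Actually sending it with `urllib.request.urlopen(req)` requires the `llama-server` from the previous step to be running on port 8080.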
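
The Python step in the diff uses the `openai` client, though its `create(...)` call is truncated by the diff context. Whichever route is used, the server replies with a chat-completions JSON object in the standard OpenAI shape; a minimal sketch of extracting the reply text (the sample response here is invented for illustration, not real model output):

```python
import json

# A minimal chat-completions response in the standard OpenAI shape;
# the assistant text is a made-up sample, not real model output.
sample = json.loads("""
{
  "choices": [
    {"index": 0, "finish_reason": "stop",
     "message": {"role": "assistant", "content": "Hello!"}}
  ]
}
""")

reply = sample["choices"][0]["message"]["content"]
print(reply)  # -> Hello!
```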