File size: 634 Bytes
a1f93e9
adffeb2
2fa4e4c
a1f93e9
 
 
8988bbf
a1f93e9
 
adffeb2
2fa4e4c
 
 
a1f93e9
 
726a01e
 
dbf8ee3
726a01e
dbf8ee3
 
 
726a01e
 
8344bac
 
 
 
 
dbf8ee3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8344bac
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
---
title: Self Chat
emoji: 🤖🤖
colorFrom: yellow
colorTo: purple
sdk: gradio
sdk_version: 4.39.0
app_file: app.py
pinned: false
license: apache-2.0
tags:
  - chatbot
short_description: Generating synthetic data via self-chat
---



## ss

## Dependency

Install llama-cpp-python with the following arguments
```sh
pip install git+https://github.com/abetlen/llama-cpp-python.git -C cmake.args="-DGGML_BLAS=ON;-DGGML_BLAS_VENDOR=OpenBLAS"
```




## Local inference

```sh
python models/cpp_qwen2.py
```

## Serverless Inference


```sh
python client_gradio.py
```


For streaming inference
```sh
python client_streaming.py
```