self-chat / README.md
xu song
update
dbf8ee3
|
raw
history blame
634 Bytes
metadata
title: Self Chat
emoji: πŸ€–πŸ€–
colorFrom: yellow
colorTo: purple
sdk: gradio
sdk_version: 4.39.0
app_file: app.py
pinned: false
license: apache-2.0
tags:
  - chatbot
short_description: Generating synthetic data via self-chat

ss

Dependency

Install llama-cpp-python with the following arguments

pip install git+https://github.com/abetlen/llama-cpp-python.git -C cmake.args="-DGGML_BLAS=ON;-DGGML_BLAS_VENDOR=OpenBLAS"

Local inference

python models/cpp_qwen2.py

Serverless Inference

python client_gradio.py

For streaming inference

python client_streaming.py