self-chat / models /vllm_qwen2.py
xu song
update
dbf8ee3
raw
history blame
97 Bytes
"""
https://github.com/vllm-project/vllm/blob/main/examples/offline_inference_with_prefix.py
"""