Commit History

change port export
2a6826a

ztime committed on

change port export
0dc41c6

ztime committed on

change port export
1941b22

ztime committed on

add llama, remove
4ec3328

ztime committed on

add llama.cpp server
8fd0c06

ztime committed on

Update main.py
a9adc36
verified

ztime committed on

Update Dockerfile (#1)
6dfac1c
verified

ztime committed on

Update main.py (#2)
9d4476b
verified

ztime committed on

Update Dockerfile
f52c4bc
verified

ztime committed on

feat: updated to OpenHermes-2.5-Mistral-7B-GGUF model
060482d

limcheekin committed on

feat: updated to openchat_3.5-GGUF model
4e50e86

limcheekin committed on

feat: updated to agentlm-7B-GGUF model
0a542dc

limcheekin committed on

feat: updated to WizardCoder-Python-7B-V1.0-GGUF model
ed5be76

limcheekin committed on

feat: changed to dolphin-2.1-mistral-7B-GGUF model
c968716

limcheekin committed on

feat: updated for Mistral-7B-OpenOrca-GGUF model
94e3839

limcheekin committed on

feat: enabled the embeddings endpoint
6c3814d

limcheekin committed on

chore: removed OPENBLAS_NUM_THREADS as no performance improvement had been observed.
41122b6

limcheekin committed on

chore: updated OPENBLAS_NUM_THREADS to 2.
db6f5ea

limcheekin committed on

chore: added OPENBLAS_NUM_THREADS to specify the number of threads used by the OpenBLAS.
36e1e32

limcheekin committed on

feat: added notebook on how to use the api and updated index.html to include the link to the notebook
19485c0

limcheekin committed on

feat: added Mistral-7B-Instruct-v0.1-GGUF (Q4_K_M) model
8a4f00d

limcheekin committed on

updated for the CodeLlama-13B-oasst-sft-v10-GGUF (Q4_K_M) model
52a27ed

limcheekin committed on

Duplicate from limcheekin/WizardCoder-Python-13B-V1.0-GGUF
106db30

limcheekin committed on