# llmapi server

Version: 1.0.0

Self-host llmapi server

中文文档 (Chinese documentation)
`llmapi-server` is an abstract backend that encapsulates a variety of large language models (LLMs, such as ChatGPT, GPT-3, and GPT-4) and provides simple access to them through OpenAPI.

⭐ If it is helpful to you, please star it ⭐
## Architecture

```mermaid
graph LR
    subgraph llmapi server
        OpenAPI --> session
        OpenAPI --> pre_post
        subgraph backend
            style backend fill:#f9f
            pre_post --> chatgpt
            pre_post --> dall-e
            pre_post --> llama
            pre_post --> ...
        end
    end
    text --> OpenAPI
    image --> OpenAPI
    embedding --> OpenAPI
    others --> OpenAPI
```
## Supported backends

- `chatgpt`: OpenAI's official ChatGPT interface
- `gpt3`: OpenAI's official GPT-3 interface
- `gpt-embedding`: OpenAI's official Embedding interface
- `dall-e`: OpenAI's official DALL·E interface
- `welm`: WeChat's LLM interface
- `newbing`: New Bing search based on ChatGPT (unofficial)

## Run

Run directly:

```sh
# python >= 3.8
python3 -m pip install -r requirements.txt
python3 run_api_server.py
```

Or run with docker:

```sh
./build_docker.sh
./start_docker.sh
```

## Usage

Use the `curl` command to access the API:

```sh
# 1. Start a new session
curl -X POST -H "Content-Type: application/json" -d '{"bot_type":"mock"}' http://127.0.0.1:5050/v1/chat/start
# response sample: {"code":0,"msg":"Success","session":"123456"}
# 2. chat with LLMs
curl -X POST -H "Content-Type: application/json" -d '{"session":"123456","content":"hello"}' http://127.0.0.1:5050/v1/chat/ask
# response sample: {"code":0,"msg":"Success","reply":"Text mock reply for your prompt:hello","timestamp":1678865301.0842562}
# 3. Close the session and end chat
curl -X POST -H "Content-Type: application/json" -d '{"session":"123456"}' http://127.0.0.1:5050/v1/chat/end
# response: {"code":0,"msg":"Success"}llmapi_cli --host="http://127.0.0.1:5050" --bot=mockfrom llmapi_cli import LLMClient
client = LLMClient(host="http://127.0.0.1:5050", bot="mock")
rep = client.ask("hello")
print(rep)
```
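If you would rather not depend on `llmapi_cli`, the same three-step flow from the `curl` example works over plain HTTP. Below is a minimal sketch using the third-party `requests` library, assuming a local server and the `mock` backend as in the examples above:

```python
import requests

BASE = "http://127.0.0.1:5050/v1/chat"

# 1. Start a new session with the mock backend
session = requests.post(f"{BASE}/start", json={"bot_type": "mock"}).json()["session"]

# 2. Chat within that session
rep = requests.post(f"{BASE}/ask", json={"session": session, "content": "hello"}).json()
print(rep["reply"])

# 3. Close the session and end the chat
requests.post(f"{BASE}/end", json={"session": session})
```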
## Add a new LLM backend

- If your new backend (assume it is called `newllm`) is similar to `mock`, you can directly `cp -r mock newllm`
- Refer to `mock` and change the backend name to `newllm`
- In the `newllm` directory, add the necessary dependencies; keep all related development inside this directory
- Register `newllm` in `backend.py` (a rough, hypothetical skeleton is sketched below)
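The actual contract for a backend is whatever `backend.py` and the `mock` implementation define; the class and method names below are illustrative assumptions, not the project's real API. A hypothetical skeleton for `newllm` might look roughly like this:

```python
# newllm/newllm.py -- hypothetical sketch only; copy the real `mock` backend instead.

class NewLLMBot:
    """Illustrative backend: names and signatures are assumptions, not the real interface."""

    def __init__(self):
        # Initialize credentials, model handles, or per-session state here.
        self.history = []

    def ask(self, prompt: str) -> str:
        # Call the real model here; a canned reply keeps this sketch self-contained.
        self.history.append(prompt)
        return f"newllm reply for prompt: {prompt}"
```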