nos
v0.3.0
網站|文檔|教程|操場|博客|不和諧
NOS是在任何云或AI HW上運行的快速靈活的Pytorch推理服務器。
inf2 )運行時。我們強烈建議您訪問我們的Quickstart指南以開始。要安裝NOS客戶端,您可以運行以下命令:
conda create -n nos python=3.8 -y
conda activate nos
pip install torch-nos安裝客戶端后,您可以通過NOS serve CLI啟動NOS服務器。這將自動檢測您的本地環境,下載Docker Runtime Image並旋轉NOS服務器:
nos serve up --http --logging-level INFO現在,您可以使用NOS運行您的第一個推理請求!您可以運行以下任何命令來嘗試一下。如果您需要從服務器中獲得更多詳細信息,則可以將記錄級別設置為DEBUG 。
NOS提供了一台具有流支持的OpenAI兼容服務器,因此您可以連接您喜歡的OpenAI兼容LLM客戶端與NOS交談。

GRPC API⚡
from nos . client import Client
client = Client ()
model = client . Module ( "TinyLlama/TinyLlama-1.1B-Chat-v1.0" )
response = model . chat ( message = "Tell me a story of 1000 words with emojis" , _stream = True )REST API
curl
-X POST http://localhost:8000/v1/chat/completions
-H " Content-Type: application/json "
-d ' {
"model": "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
"messages": [{
"role": "user",
"content": "Tell me a story of 1000 words with emojis"
}],
"temperature": 0.7,
"stream": true
} '在幾秒鐘內構建Midjourney Discord機器人。

GRPC API⚡
from nos . client import Client
client = Client ()
sdxl = client . Module ( "stabilityai/stable-diffusion-xl-base-1-0" )
image , = sdxl ( prompts = [ "hippo with glasses in a library, cartoon styling" ],
width = 1024 , height = 1024 , num_images = 1 )REST API
curl
-X POST http://localhost:8000/v1/infer
-H ' Content-Type: application/json '
-d ' {
"model_id": "stabilityai/stable-diffusion-xl-base-1-0",
"inputs": {
"prompts": ["hippo with glasses in a library, cartoon styling"],
"width": 1024, "height": 1024,
"num_images": 1
}
} '在幾分鐘內構建圖像/視頻的可擴展語義搜索。

GRPC API⚡
from nos . client import Client
client = Client ()
clip = client . Module ( "openai/clip-vit-base-patch32" )
txt_vec = clip . encode_text ( texts = [ "fox jumped over the moon" ])REST API
curl
-X POST http://localhost:8000/v1/infer
-H ' Content-Type: application/json '
-d ' {
"model_id": "openai/clip-vit-base-patch32",
"method": "encode_text",
"inputs": {
"texts": ["fox jumped over the moon"]
}
} '使用耳語執行實時音頻轉錄。

GRPC API⚡
from pathlib import Path
from nos . client import Client
client = Client ()
model = client . Module ( "openai/whisper-small.en" )
with client . UploadFile ( Path ( "audio.wav" )) as remote_path :
response = model ( path = remote_path )
# {"chunks": ...}REST API
curl
-X POST http://localhost:8000/v1/infer/file
-H ' accept: application/json '
-H ' Content-Type: multipart/form-data '
-F ' model_id=openai/whisper-small.en '
-F ' [email protected] '在2行代碼中運行經典的計算機視覺任務。

GRPC API⚡
from pathlib import Path
from nos . client import Client
client = Client ()
model = client . Module ( "yolox/medium" )
response = model ( images = [ Image . open ( "image.jpg" )])REST API
curl
-X POST http://localhost:8000/v1/infer/file
-H ' accept: application/json '
-H ' Content-Type: multipart/form-data '
-F ' model_id=yolox/medium '
-F ' [email protected] '想要運行NOS不支持的模型嗎?您可以按照NOS操場上的示例輕鬆添加自己的型號。
該項目已根據APACHE-2.0許可獲得許可。
NOS使用Sentry收集匿名用法數據。這用於幫助我們了解社區如何使用NOS並幫助我們確定功能的優先級。您可以通過設置NOS_TELEMETRY_ENABLED=0來選擇遙測。
我們歡迎捐款!請參閱我們的貢獻指南以獲取更多信息。