تنزيل uni api - تنزيل رمز مصدر uni api

Uni-api

الإنجليزية | الصينية

مقدمة

للاستخدام الشخصي ، يعد واحد/New-API معقدًا للغاية مع العديد من الميزات التجارية التي لا يحتاجها الأفراد. إذا كنت لا تريد واجهة واجهة أمامية معقدة وتفضل الدعم لمزيد من النماذج ، فيمكنك تجربة UNI-API. هذا مشروع يوحد إدارة واجهات برمجة تطبيقات نموذج اللغة الكبيرة ، مما يتيح لك استدعاء خدمات الواجهة الخلفية المتعددة من خلال واجهة API موحدة واحدة ، وتحويلها جميعًا إلى Openai ، ودعم موازنة التحميل. تشمل خدمات الواجهة الخلفية المدعومة حاليًا: Openai ، Anthropic ، Gemini ، Vertex ، Cohere ، Groq ، Cloudflare ، OpenRouter ، وأكثر من ذلك.

سمات

لا يوجد ملف تكوين نقي في الواجهة الأمامية لتكوين قنوات API. يمكنك تشغيل محطة API الخاصة بك فقط عن طريق كتابة ملف ، والوثائق لها دليل تكوين مفصل ، صديق للمبتدئين.
الإدارة الموحدة لخدمات الخلفية المتعددة ، ومقدمي الدعم مثل Openai و Deepseek و OpenRouter و APIs الأخرى بتنسيق Openai. يدعم توليد الصور Openai Dalle-3.
يدعم في وقت واحد الأنثروبور ، الجوزاء ، قمة الرأس AI ، Cohere ، Groq ، CloudFlare. تدعم Vertex في وقت واحد كلود وجيميني API.
دعم Openai ، الأنثروبور ، الجوزاء ، تستخدم مكالمات دالة الأدوات الأصلية في Vertex.
دعم Openai ، أنثروبور ، الجوزاء ، API المعترف بها في Vertex.
دعم أربعة أنواع من موازنة التحميل.
1. يدعم موازنة التحميل المرجحة على مستوى القناة ، مما يسمح بتوزيع الطلبات وفقًا لأوزان القناة المختلفة. لا يتم تمكينه افتراضيًا ويتطلب تكوين أوزان القناة.
2. دعم موازنة التحميل الإقليمية في Vertex والتزامن العالي ، والتي يمكن أن تزيد من التزامن الجوزاء والكلود بما يصل إلى (عدد واجهات برمجة التطبيقات * عدد المناطق). تمكين تلقائيا دون تكوين إضافي.
3. باستثناء موازنة التحميل على مستوى المنطقة ، فإن جميع واجهات برمجة التطبيقات لدعم موازنة التحميل المتسلسل على مستوى القناة ، مما يعزز تجربة الترجمة الغامرة. لا يتم تمكينه افتراضيًا ويتطلب تكوين SCHEDULING_ALGORITHM كـ round_robin .
4. دعم موازنة التحميل التلقائي على مستوى API على مستوى مفاتيح API متعددة في قناة واحدة.
دعم إعادة المحاولة التلقائية ، عند فشل استجابة قناة API ، أعد إعادة تلقائيًا إلى قناة API التالية.
تبريد قناة الدعم: عند فشل استجابة قناة API ، سيتم استبعاد القناة وتبريدها تلقائيًا لفترة من الوقت ، وسيتم إيقاف طلبات القناة. بعد انتهاء فترة التبريد ، سيتم استعادة النموذج تلقائيًا حتى يفشل مرة أخرى ، وعند هذه النقطة سيتم تبريده مرة أخرى.
دعم إعدادات مهلة النماذج ذات الحبيبات الدقيقة ، مما يتيح فترات مهلة مختلفة لكل نموذج.
دعم التحكم في الإذن بالحبوب. الدعم باستخدام البطاقات البرية لضبط نماذج محددة متاحة لقنوات مفاتيح API.
عند الحد من معدل الدعم ، يمكنك تعيين الحد الأقصى لعدد الطلبات في الدقيقة كإجراء عدد صحيح ، مثل 2/دقيقة ، مرتين في الدقيقة ، 5/ساعة ، 5 مرات في الساعة ، 10/يوم ، 10 مرات في اليوم ، 10/شهر ، 10 مرات في الشهر ، 10/سنة ، 10 مرات في السنة. الافتراضي هو 60/دقيقة.
يدعم العديد من واجهات Openai لتنسيق Openai: /v1/chat/completions ، /v1/images/generations ، /v1/audio/transcriptions ، /v1/moderations ، /v1/models .
دعم Openai Aderation Action Review ، والتي يمكن أن تجري مراجعات أخلاقية لرسائل المستخدم. إذا تم العثور على رسائل غير مناسبة ، فسيتم إرجاع رسالة خطأ. هذا يقلل من خطر حظر API الخلفية من قبل مقدمي الخدمات.

طريقة الاستخدام

لبدء UNI-API ، يجب استخدام ملف التكوين. هناك طريقتان للبدء بملف التكوين:

الطريقة الأولى هي استخدام متغير بيئة CONFIG_URL لملء عنوان URL ملف التكوين ، والذي سيتم تنزيله تلقائيًا عند بدء تشغيل UNI-API.
الطريقة الثانية هي تثبيت ملف التكوين المسماة api.yaml في الحاوية.

الطريقة 1: قم بتركيب ملف تكوين `api.yaml` لبدء Uni-API

يجب عليك ملء ملف التكوين مقدمًا لبدء uni-api ، ويجب عليك استخدام ملف تكوين يسمى api.yaml لبدء uni-api ، يمكنك تكوين نماذج متعددة ، يمكن لكل طراز تكوين خدمات الخلفية المتعددة ، ودعم موازنة التحميل. فيما يلي مثال على ملف تكوين api.yaml الأدنى الذي يمكن تشغيله:

 providers :
  - provider : provider_name # Service provider name, such as openai, anthropic, gemini, openrouter, can be any name, required
    base_url : https://api.your.com/v1/chat/completions # Backend service API address, required
    api : sk-YgS6GTi0b4bEabc4C # Provider's API Key, required, automatically uses base_url and api to get all available models through the /v1/models endpoint.
  # Multiple providers can be configured here, each provider can configure multiple API Keys, and each API Key can configure multiple models.
api_keys :
  - api : sk-Pkj60Yf8JFWxfgRmXQFWyGtWUddGZnmi3KlvowmRWpWpQxx # API Key, user request uni-api requires API key, required
  # This API Key can use all models, that is, it can use all models in all channels set under providers, without needing to add available channels one by one.

التكوين المتقدم المفصل لـ api.yaml :

 providers :
  - provider : provider_name # Service provider name, such as openai, anthropic, gemini, openrouter, can be any name, required
    base_url : https://api.your.com/v1/chat/completions # Backend service API address, required
    api : sk-YgS6GTi0b4bEabc4C # Provider's API Key, required
    model : # Optional, if model is not configured, all available models will be automatically obtained through base_url and api via the /v1/models endpoint.
      - gpt-4o # Usable model name, required
      - claude-3-5-sonnet-20240620 : claude-3-5-sonnet # Rename model, claude-3-5-sonnet-20240620 is the provider's model name, claude-3-5-sonnet is the renamed name, you can use a simple name to replace the original complex name, optional
      - dall-e-3

  - provider : anthropic
    base_url : https://api.anthropic.com/v1/messages
    api : # Supports multiple API Keys, multiple keys automatically enable polling load balancing, at least one key, required
      - sk-ant-api03-bNnAOJyA-xQw_twAA
      - sk-ant-api02-bNnxxxx
    model :
      - claude-3-5-sonnet-20240620 : claude-3-5-sonnet # Rename model, claude-3-5-sonnet-20240620 is the provider's model name, claude-3-5-sonnet is the renamed name, you can use a simple name to replace the original complex name, optional
    tools : true # Whether to support tools, such as generating code, generating documents, etc., default is true, optional

  - provider : gemini
    base_url : https://generativelanguage.googleapis.com/v1beta # base_url supports v1beta/v1, only for Gemini model use, required
    api : # Supports multiple API Keys, multiple keys automatically enable polling load balancing, at least one key, required
      - AIzaSyAN2k6IRdgw123
      - AIzaSyAN2k6IRdgw456
      - AIzaSyAN2k6IRdgw789
    model :
      - gemini-1.5-pro
      - gemini-1.5-flash-exp-0827 : gemini-1.5-flash # After renaming, the original model name gemini-1.5-flash-exp-0827 cannot be used, if you want to use the original name, you can add the original name in the model, just add the line below to use the original name
      - gemini-1.5-flash-exp-0827 # Add this line, both gemini-1.5-flash-exp-0827 and gemini-1.5-flash can be requested
    tools : true
    preferences :
      api_key_rate_limit : 15/min # Each API Key can request up to 15 times per minute, optional. The default is 999999/min. Supports multiple frequency constraints: 15/min,10/day
      # api_key_rate_limit: # You can set different frequency limits for each model
      #   gemini-1.5-flash: 15/min,1500/day
      #   gemini-1.5-pro: 2/min,50/day
      #   default: 4/min # If the model does not set the frequency limit, use the frequency limit of default
      api_key_cooldown_period : 60 # Each API Key will be cooled down for 60 seconds after encountering a 429 error. Optional, the default is 0 seconds. When set to 0, the cooling mechanism is not enabled. When there are multiple API keys, the cooling mechanism will take effect.
      api_key_schedule_algorithm : round_robin # Set the request order of multiple API Keys, optional. The default is round_robin, and the optional values are: round_robin, random. It will take effect when there are multiple API keys. round_robin is polling load balancing, and random is random load balancing.
      model_timeout : # Model timeout, in seconds, default 100 seconds, optional
        gemini-1.5-pro : 10 # Model gemini-1.5-pro timeout is 10 seconds
        gemini-1.5-flash : 10 # Model gemini-1.5-flash timeout is 10 seconds
        default : 10 # Model does not have a timeout set, use the default timeout of 10 seconds, when requesting a model not in model_timeout, the timeout is also 10 seconds, if default is not set, uni-api will use the default timeout set by the environment variable TIMEOUT, the default timeout is 100 seconds
      proxy : socks5://[username]:[password]@[ip]:[port] # Proxy address, optional. Supports socks5 and http proxies, default is not used.

  - provider : vertex
    project_id : gen-lang-client-xxxxxxxxxxxxxx # Description: Your Google Cloud project ID. Format: String, usually composed of lowercase letters, numbers, and hyphens. How to obtain: You can find your project ID in the project selector of the Google Cloud Console.
    private_key : " -----BEGIN PRIVATE KEY----- n xxxxx n -----END PRIVATE " # Description: Private key for Google Cloud Vertex AI service account. Format: A JSON formatted string containing the private key information of the service account. How to obtain: Create a service account in Google Cloud Console, generate a JSON formatted key file, and then set its content as the value of this environment variable.
    client_email : [email protected] # Description: Email address of the Google Cloud Vertex AI service account. Format: Usually a string like "[email protected]". How to obtain: Generated when creating a service account, or you can view the service account details in the "IAM and Admin" section of the Google Cloud Console.
    model :
      - gemini-1.5-pro
      - gemini-1.5-flash
      - gemini-1.5-pro : gemini-1.5-pro-search # Only supports using the gemini-1.5-pro-search model to request uni-api when using the Vertex Gemini API, to automatically use the Google official search tool.
      - claude-3-5-sonnet@20240620 : claude-3-5-sonnet
      - claude-3-opus@20240229 : claude-3-opus
      - claude-3-sonnet@20240229 : claude-3-sonnet
      - claude-3-haiku@20240307 : claude-3-haiku
    tools : true
    notes : https://xxxxx.com/ # You can put the provider's website, notes, official documentation, optional

  - provider : cloudflare
    api : f42b3xxxxxxxxxxq4aoGAh # Cloudflare API Key, required
    cf_account_id : 8ec0xxxxxxxxxxxxe721 # Cloudflare Account ID, required
    model :
      - ' @cf/meta/llama-3.1-8b-instruct ' : llama-3.1-8b # Rename model, @cf/meta/llama-3.1-8b-instruct is the provider's original model name, must be enclosed in quotes, otherwise yaml syntax error, llama-3.1-8b is the renamed name, you can use a simple name to replace the original complex name, optional
      - ' @cf/meta/llama-3.1-8b-instruct ' # Must be enclosed in quotes, otherwise yaml syntax error

  - provider : other-provider
    base_url : https://api.xxx.com/v1/messages
    api : sk-bNnAOJyA-xQw_twAA
    model :
      - causallm-35b-beta2ep-q6k : causallm-35b
      - anthropic/claude-3-5-sonnet
    tools : false
    engine : openrouter # Force the use of a specific message format, currently supports gpt, claude, gemini, openrouter native format, optional

api_keys :
  - api : sk-KjjI60Yf0JFWxfgRmXqFWyGtWUd9GZnmi3KlvowmRWpWpQRo # API Key, required for users to use this service
    model : # Models that can be used by this API Key, required. Default channel-level polling load balancing is enabled, and each request model is requested in sequence according to the model configuration. It is not related to the original channel order in providers. Therefore, you can set different request sequences for each API key.
      - gpt-4o # Usable model name, can use all gpt-4o models provided by providers
      - claude-3-5-sonnet # Usable model name, can use all claude-3-5-sonnet models provided by providers
      - gemini/* # Usable model name, can only use all models provided by providers named gemini, where gemini is the provider name, * represents all models
    role : admin

  - api : sk-pkhf60Yf0JGyJxgRmXqFQyTgWUd9GZnmi3KlvowmRWpWqrhy
    model :
      - anthropic/claude-3-5-sonnet # Usable model name, can only use the claude-3-5-sonnet model provided by the provider named anthropic. Models with the same name from other providers cannot be used. This syntax will not match the model named anthropic/claude-3-5-sonnet provided by other-provider.
      - <anthropic/claude-3-5-sonnet> # By adding angle brackets on both sides of the model name, it will not search for the claude-3-5-sonnet model under the channel named anthropic, but will take the entire anthropic/claude-3-5-sonnet as the model name. This syntax can match the model named anthropic/claude-3-5-sonnet provided by other-provider. But it will not match the claude-3-5-sonnet model under anthropic.
      - openai-test/text-moderation-latest # When message moderation is enabled, the text-moderation-latest model under the channel named openai-test can be used for moderation.
      - sk-KjjI60Yd0JFWtxxxxxxxxxxxxxxwmRWpWpQRo/* # Support using other API keys as channels
    preferences :
      SCHEDULING_ALGORITHM : fixed_priority # When SCHEDULING_ALGORITHM is fixed_priority, use fixed priority scheduling, always execute the channel of the first model with a request. Default is enabled, SCHEDULING_ALGORITHM default value is fixed_priority. SCHEDULING_ALGORITHM optional values are: fixed_priority, round_robin, weighted_round_robin, lottery, random.
      # When SCHEDULING_ALGORITHM is random, use random polling load balancing, randomly request the channel of the model with a request.
      # When SCHEDULING_ALGORITHM is round_robin, use polling load balancing, request the channel of the model used by the user in order.
      AUTO_RETRY : true # Whether to automatically retry, automatically retry the next provider, true for automatic retry, false for no automatic retry, default is true. Also supports setting a number, indicating the number of retries.
      rate_limit : 15/min # Supports rate limiting, each API Key can request up to 15 times per minute, optional. The default is 999999/min. Supports multiple frequency constraints: 15/min,10/day
      # rate_limit: # You can set different frequency limits for each model
      #   gemini-1.5-flash: 15/min,1500/day
      #   gemini-1.5-pro: 2/min,50/day
      #   default: 4/min # If the model does not set the frequency limit, use the frequency limit of default
      ENABLE_MODERATION : true # Whether to enable message moderation, true for enable, false for disable, default is false, when enabled, it will moderate the user's message, if inappropriate messages are found, an error message will be returned.

  # Channel-level weighted load balancing configuration example
  - api : sk-KjjI60Yd0JFWtxxxxxxxxxxxxxxwmRWpWpQRo
    model :
      - gcp1/* : 5 # The number after the colon is the weight, weight only supports positive integers.
      - gcp2/* : 3 # The size of the number represents the weight, the larger the number, the greater the probability of the request.
      - gcp3/* : 2 # In this example, there are a total of 10 weights for all channels, and 10 requests will have 5 requests for the gcp1/* model, 2 requests for the gcp2/* model, and 3 requests for the gcp3/* model.

    preferences :
      SCHEDULING_ALGORITHM : weighted_round_robin # Only when SCHEDULING_ALGORITHM is weighted_round_robin and the above channel has weights, it will request according to the weighted order. Use weighted polling load balancing, request the channel of the model with a request according to the weight order. When SCHEDULING_ALGORITHM is lottery, use lottery polling load balancing, request the channel of the model with a request according to the weight randomly. Channels without weights automatically fall back to round_robin polling load balancing.
      AUTO_RETRY : true

preferences : # Global configuration
  model_timeout : # Model timeout, in seconds, default 100 seconds, optional
    gpt-4o : 10 # Model gpt-4o timeout is 10 seconds, gpt-4o is the model name, when requesting models like gpt-4o-2024-08-06, the timeout is also 10 seconds
    claude-3-5-sonnet : 10 # Model claude-3-5-sonnet timeout is 10 seconds, when requesting models like claude-3-5-sonnet-20240620, the timeout is also 10 seconds
    default : 10 # Model does not have a timeout set, use the default timeout of 10 seconds, when requesting a model not in model_timeout, the default timeout is 10 seconds, if default is not set, uni-api will use the default timeout set by the environment variable TIMEOUT, the default timeout is 100 seconds
    o1-mini : 30 # Model o1-mini timeout is 30 seconds, when requesting models starting with o1-mini, the timeout is 30 seconds
    o1-preview : 100 # Model o1-preview timeout is 100 seconds, when requesting models starting with o1-preview, the timeout is 100 seconds
  cooldown_period : 300 # Channel cooldown time, in seconds, default 300 seconds, optional. When a model request fails, the channel will be automatically excluded and cooled down for a period of time, and will not request the channel again. After the cooldown time ends, the model will be automatically restored until the request fails again, and it will be cooled down again. When cooldown_period is set to 0, the cooling mechanism is not enabled.
  error_triggers : # Error triggers, when the message returned by the model contains any of the strings in the error_triggers, the channel will return an error. Optional
    - The bot's usage is covered by the developer
    - process this request due to overload or policy

قم بتركيب ملف التكوين وابدأ حاوية Uni-API Docker:

docker run --user root -p 8001:8000 --name uni-api -dit 
-v ./api.yaml:/home/api.yaml 
yym68686/uni-api:latest

الطريقة الثانية: ابدأ UNI-API باستخدام متغير بيئة `CONFIG_URL`

بعد كتابة ملف التكوين وفقًا للطريقة الأولى ، قم بتحميله على القرص السحابي ، واحصل على الرابط المباشر للملف ، ثم استخدم متغير Environment CONFIG_URL لبدء حاوية Docker Uni-API:

docker run --user root -p 8001:8000 --name uni-api -dit 
-e CONFIG_URL=http://file_url/api.yaml 
yym68686/uni-api:latest

متغير البيئة

config_url: عنوان تنزيل ملف التكوين ، والذي يمكن أن يكون ملفًا محليًا أو ملفًا عن بُعد ، اختياري
مهلة: طلب المهلة ، الافتراضي هو 100 ثانية. يمكن أن تتحكم المهلة في الوقت اللازم للتبديل إلى القناة التالية عندما لا تستجيب قناة واحدة. خياري
disable_database: سواء كان ذلك لتدوين قاعدة البيانات ، يكون الافتراضي خطأ واختياريًا

النشر عن بعد Vercel

بعد النقر فوق زر النقر بنقرة واحدة أعلاه ، قم بتعيين متغير البيئة CONFIG_URL على الرابط المباشر لملف التكوين ، و DISABLE_DATABASE إلى true ، ثم انقر فوق "إنشاء" لإنشاء المشروع. بعد النشر ، تحتاج إلى تعيين الدالة كحد أقصى يدويًا إلى 60 ثانية في لوحة مشروع Vercel ضمن الإعدادات -> الوظائف ، ثم انقر فوق قائمة Deployments وانقر فوق إعادة النشر إلى إعادة النشر ، والتي ستعطف المهلة على 60 ثانية. إذا لم تتم إعادة نشرها ، فستبقى المهلة الافتراضية في 10 ثوان أصلية. لاحظ أنه يجب ألا تحذف مشروع Vercel وإعادة إنشائها ؛ بدلاً من ذلك ، انقر فوق "إعادة النشر" في قائمة النشر داخل مشروع Vercel الذي تم نشره حاليًا لجعل وظيفة تعديل المدة Max تعمل.

نشر Ubuntu

في إصدارات المستودع ، ابحث عن أحدث إصدار من الملف الثنائي المقابل ، على سبيل المثال ، ملف يسمى UNI-API-Linux-X86_64-0.0.99.PEX. قم بتنزيل الملف الثنائي على الخادم وقم بتشغيله:

wget https://github.com/yym68686/uni-api/releases/download/v0.0.99/uni-api-linux-x86_64-0.0.99.pex
chmod +x uni-api-linux-x86_64-0.0.99.pex
./uni-api-linux-x86_64-0.0.99.pex

Serv00 النشر عن بُعد (FreeBSD 14.0)

أولاً ، قم بتسجيل الدخول إلى اللوحة ، في خدمات إضافية ، انقر فوق علامة التبويب قم بتشغيل تطبيقاتك الخاصة لتمكين خيار تشغيل برامجك الخاصة ، ثم انتقل إلى حجز منفذ اللوحة لفتح منفذ بشكل عشوائي.

إذا لم يكن لديك اسم المجال الخاص بك ، فانتقل إلى مواقع الويب الخاصة بالوحة WWW وحذف اسم المجال الافتراضي المقدم. ثم قم بإنشاء مجال جديد مع النطاق الذي تم حذفه للتو. بعد النقر فوق الإعدادات المتقدمة ، قم بتعيين نوع الموقع على مجال الوكيل ، ويجب أن يشير منفذ الوكيل إلى المنفذ الذي فتحته للتو. لا تختار استخدام HTTPS.

تسجيل الدخول SSH إلى خادم SERV00 ، قم بتنفيذ الأمر التالي:

git clone --depth 1 -b main --quiet https://github.com/yym68686/uni-api.git
cd uni-api
python -m venv uni-api
tmux new -s uni-api
source uni-api/bin/activate
export CFLAGS= " -I/usr/local/include "
export CXXFLAGS= " -I/usr/local/include "
export CC=gcc
export CXX=g++
export MAX_CONCURRENCY=1
export CPUCOUNT=1
export MAKEFLAGS= " -j1 "
CMAKE_BUILD_PARALLEL_LEVEL=1 cpuset -l 0 pip install -vv -r requirements.txt
cpuset -l 0 pip install -r -vv requirements.txt

CTRL+BD للخروج من TMUX ، انتظر بضع ساعات حتى يكتمل التثبيت ، وبعد اكتمال التثبيت ، قم بتنفيذ الأمر التالي:

tmux attach -t uni-api
source uni-api/bin/activate
export CONFIG_URL=http://file_url/api.yaml
export DISABLE_DATABASE=true
# Modify the port, xxx is the port, modify it yourself, corresponding to the port opened in the panel Port reservation
sed -i ' ' ' s/port=8000/port=xxx/ ' main.py
sed -i ' ' ' s/reload=True/reload=False/ ' main.py
python main.py

استخدم CTRL+BD للخروج من TMUX ، مما يسمح للبرنامج بالتشغيل في الخلفية. في هذه المرحلة ، يمكنك استخدام UNI-API في عملاء الدردشة الآخرين. السيناريو اختبار حليقة:

curl -X POST https://xxx.serv00.net/v1/chat/completions 
-H ' Content-Type: application/json ' 
-H ' Authorization: Bearer sk-xxx ' 
-d ' {"model": "gpt-4o","messages": [{"role": "user","content": "Hello"}]} '

وثيقة مرجعية:

https://docs.serv00.com/python/

https://linux.do/t/topic/201181

https://linux.do/t/topic/218738

Docker النشر المحلي

ابدأ الحاوية

docker run --user root -p 8001:8000 --name uni-api -dit 
-e CONFIG_URL=http://file_url/api.yaml  # If the local configuration file has already been mounted, there is no need to set CONFIG_URL
-v ./api.yaml:/home/api.yaml  # If CONFIG_URL is already set, there is no need to mount the configuration file
-v ./uniapi_db:/home/data  # If you do not want to save statistical data, there is no need to mount this folder
yym68686/uni-api:latest

أو إذا كنت ترغب في استخدام Docker Compose ، فإليك Docker-corm.yml مثال:

 services :
  uni-api :
    container_name : uni-api
    image : yym68686/uni-api:latest
    environment :
      - CONFIG_URL=http://file_url/api.yaml # If a local configuration file is already mounted, there is no need to set CONFIG_URL
    ports :
      - 8001:8000
    volumes :
      - ./api.yaml:/home/api.yaml # If CONFIG_URL is already set, there is no need to mount the configuration file
      - ./uniapi_db:/home/data # If you do not want to save statistical data, there is no need to mount this folder

config_url هو عنوان URL لملف التكوين عن بُعد يمكن تنزيله تلقائيًا. على سبيل المثال ، إذا لم تكن مرتاحًا لتعديل ملف التكوين على منصة معينة ، فيمكنك تحميل ملف التكوين إلى خدمة استضافة وتوفير رابط مباشر إلى Uni-API للتنزيل ، وهو Config_url. إذا كنت تستخدم ملف تكوين محلي مثبت ، فلا داعي لضبط config_url. يتم استخدام config_url عندما لا يكون مريحًا لتركيب ملف التكوين.

Run Docker Compener في الخلفية

docker-compose pull
docker-compose up -d

بناء Docker

docker build --no-cache -t uni-api:latest -f Dockerfile --platform linux/amd64 .
docker tag uni-api:latest yym68686/uni-api:latest
docker push yym68686/uni-api:latest

بنقرة واحدة إعادة تشغيل صورة Docker

 set -eu
docker pull yym68686/uni-api:latest
docker rm -f uni-api
docker run --user root -p 8001:8000 -dit --name uni-api 
-e CONFIG_URL=http://file_url/api.yaml 
-v ./api.yaml:/home/api.yaml 
-v ./uniapi_db:/home/data 
yym68686/uni-api:latest
docker logs -f uni-api

اختبار حليقة مريح

curl -X POST http://127.0.0.1:8000/v1/chat/completions 
-H " Content-Type: application/json " 
-H " Authorization: Bearer ${API} " 
-d ' {"model": "gpt-4o","messages": [{"role": "user", "content": "Hello"}],"stream": true} '

PEX Linux Packaging:

VERSION= $( cat VERSION )
pex -D . -r requirements.txt 
    -c uvicorn 
    --inject-args ' main:app --host 0.0.0.0 --port 8000 ' 
    --platform linux_x86_64-cp-3.10.12-cp310 
    --interpreter-constraint ' ==3.10.* ' 
    --no-strip-pex-env 
    -o uni-api-linux-x86_64- ${VERSION} .pex

عبوة MacOS:

VERSION= $( cat VERSION )
pex -r requirements.txt 
    -c uvicorn 
    --inject-args ' main:app --host 0.0.0.0 --port 8000 ' 
    -o uni-api-macos-arm64- ${VERSION} .pex

الرعاة

نشكر الرعاة التاليين على دعمهم:

powerhunter: ¥ 2000
@IOI ： ¥ 50

كيف ترعا لنا

إذا كنت ترغب في دعم مشروعنا ، فيمكنك رعايتنا بالطرق التالية:

PayPal
USDT-TRC20 ، عنوان محفظة USDT-TRC20: TLFbqSv5pDu5he43mVmK1dNx7yBMFeN7d8
WeChat
alipay

شكرا لك على دعمك!

التعليمات

لماذا Error processing request or performing moral check: 404: No matching model found دائمًا؟

سيؤدي تحديد Enable_Moderation إلى FALSE إلى إصلاح هذه المشكلة. عندما يكون enable_moderation صحيحًا ، يجب أن تكون واجهة برمجة التطبيقات (API) قادرة على استخدام نموذج تعديل النص ، وإذا لم تكن قد قدمت تعديل النص في إعدادات نموذج المزود ، فسيحدث خطأ يشير إلى أنه لا يمكن العثور على النموذج.

كيفية إعطاء الأولوية لطلبات قناة معينة ، كيفية تعيين أولوية القناة؟

قم بتعيين ترتيب القناة مباشرة في API_KEYS. لا يلزم وجود إعدادات أخرى. نموذج ملف التكوين:

 providers :
  - provider : ai1
    base_url : https://xxx/v1/chat/completions
    api : sk-xxx

  - provider : ai2
    base_url : https://xxx/v1/chat/completions
    api : sk-xxx

api_keys :
  - api : sk-1234
    model :
      - ai2/*
      - ai1/*

وبهذه الطريقة ، اطلب AI2 أولاً ، وإذا فشل ، فاطلب AI1.

ما هو السلوك وراء خوارزميات الجدولة المختلفة؟ على سبيل المثال ، exped_priority ، insided_round_robin ، اليانصيب ، عشوائي ، round_robin؟

يجب تمكين جميع خوارزميات الجدولة عن طريق تعيين API_KEYS. (API) .Preferences.Scheduling_algorithm في ملف التكوين إلى أي من القيم: ثابتة _priority ، وزن _Round_robin ، Lottery ، RONDAND_ROBIN.

ثابت _priority: جدولة الأولوية الثابتة. يتم تنفيذ جميع الطلبات دائمًا بواسطة قناة النموذج التي لديها طلب المستخدم أولاً. في حالة وجود خطأ ، سيتم تحويله إلى القناة التالية. هذه هي خوارزمية الجدولة الافتراضية.
Owned_Round_robin: موازنة تحميل روبن مرجحة ، تطلب القنوات مع النموذج المطلوب للمستخدم وفقًا لترتيب الوزن المحدد في ملف التكوين API_KEYS. (API) .MODEL.
اليانصيب: ارسم موازنة التحميل الدائري ، اطلب بشكل عشوائي قناة النموذج مع طلبات المستخدم وفقًا للوزن المحدد في ملف التكوين API_KEYS. (API) .MODEL.
Round_robin: توازن تحميل Round-Robin ، يطلب القناة التي تمتلك النموذج المطلوب من قبل المستخدم وفقًا لترتيب التكوين في ملف التكوين API_KEYS. (API) .MOREEL. يمكنك التحقق من السؤال السابق حول كيفية تعيين أولوية القنوات.

كيف ينبغي ملء base_url بشكل صحيح؟

باستثناء بعض القنوات الخاصة الموضحة في التكوين المتقدم ، يحتاج جميع مزودي تنسيق Openai إلى ملء Base_url تمامًا ، مما يعني أن BASE_URL يجب أن ينتهي بـ/V1/Chat/Explive. إذا كنت تستخدم نماذج github ، فيجب ملء base_url في https://models.inference.ai.azure.com/chat/completions ، وليس عنوان URL الخاص بـ Azure.

كيف يعمل وقت مهلة النموذج؟ ما هي أولوية إعداد المهلة على مستوى القناة وإعداد مهلة النموذج العالمي؟

يعد إعداد مهلة على مستوى القناة أولوية أعلى من إعداد مهلة النموذج العالمي. ترتيب الأولوية هو: إعداد مهلة النموذج على مستوى القناة> إعداد المهلة الافتراضية على مستوى القناة> إعداد مهلة النموذج العالمي> إعداد المهلة الافتراضية العالمية> مهلة متغير البيئة.

عن طريق ضبط وقت مهلة النموذج ، يمكنك تجنب خطأ بعض القنوات. إذا واجهت الخطأ {'error': '500', 'details': 'fetch_response_stream Read Response Timeout'} ، يرجى محاولة زيادة وقت مهلة النموذج.

كيف يعمل api_key_rate_limit؟ كيف يمكنني تعيين حد نفس المعدل لنماذج متعددة؟

إذا كنت ترغب في تعيين حد التردد نفسه للنماذج الأربعة Gemini-1.5-Pro-Latest ، Gemini-1.5-Pro ، Gemini-1.5-Pro-001 ، Gemini-1.5-Pro-002 في وقت واحد ، يمكنك تعيينه مثل هذا:

 api_key_rate_limit :
  gemini-1.5-pro : 1000/min

سيطابق هذا جميع النماذج التي تحتوي على سلسلة Gemini-1.5-Pro. سيتم تعيين حد التردد لهذه النماذج الأربعة ، Gemini-1.5-Pro-Latest ، Gemini-1.5-Pro ، Gemini-1.5-Pro-001 ، Gemini-1.5-Pro-002 ، على 1000/دقيقة. المنطق لتكوين حقل API_KEY_RATE_LIMIT كما يلي ، إليك ملف تكوين عينة:

 api_key_rate_limit :
  gemini-1.5-pro : 1000/min
  gemini-1.5-pro-002 : 500/min

في هذا الوقت ، إذا كان هناك طلب باستخدام النموذج Gemini-1.5-Pro-002.

أولاً ، سيحاول Uni-API مطابقة النموذج بدقة في API_KEY_RATE_LIMIT. إذا تم تعيين حد معدل Gemini-1.5-Pro-002 ، فإن حد معدل Gemini-1.5-Pro-002 هو 500/دقيقة. إذا لم يكن النموذج المطلوب في هذا الوقت ليس Gemini-1.5-Pro-002 ، ولكن Gemini-1.5-Pro-latest ، لأن API_KEY_RATE_LIMIT لا يحتوي على حد كبير في Gemini-1.5-Pro-lat ، وبالتالي فإن الحد الأدنى للمعدل -5 --5-سيحظى ببالة Gemini-1.5-pro-late ، وبالتالي فإن الحد الأدنى للمعدل. يتم ضبطها على 1000/دقيقة.

تاريخ النجوم

يوسع

uni api

Uni-api

مقدمة

سمات

طريقة الاستخدام

الطريقة 1: قم بتركيب ملف تكوين `api.yaml` لبدء Uni-API

الطريقة الثانية: ابدأ UNI-API باستخدام متغير بيئة `CONFIG_URL`

متغير البيئة

النشر عن بعد Vercel

نشر Ubuntu

Serv00 النشر عن بُعد (FreeBSD 14.0)

Docker النشر المحلي

الرعاة

كيف ترعا لنا

التعليمات

تاريخ النجوم

evolution api

أوروبا يوني

Spellcaster Uni

يوني

يوني المكسورة

تانك يوني

chat.petals.dev

GPT Prompt Templates

GPTyped

Google Dorks

shepherd

mongo express

Google Dorks

shepherd

mongo express

uni api

Uni-api

مقدمة

سمات

طريقة الاستخدام

الطريقة 1: قم بتركيب ملف تكوين api.yaml لبدء Uni-API

الطريقة الثانية: ابدأ UNI-API باستخدام متغير بيئة CONFIG_URL

متغير البيئة

النشر عن بعد Vercel

نشر Ubuntu

Serv00 النشر عن بُعد (FreeBSD 14.0)

Docker النشر المحلي

الرعاة

كيف ترعا لنا

التعليمات

تاريخ النجوم

الطريقة 1: قم بتركيب ملف تكوين `api.yaml` لبدء Uni-API

الطريقة الثانية: ابدأ UNI-API باستخدام متغير بيئة `CONFIG_URL`