nexa sdk 다운로드 -Nexa nexa sdk 소스 코드 다운로드

nexa sdk

AI 소스 코드

v0.0.9.7

다운로드

nexa-sdk-demo.mp4

NEXA SDK- 로컬 디바이스 추론 프레임 워크

기기 모델 허브 | 문서 | 불화 | 블로그 | X (트위터)

NEXA SDK 는 ONNX 및 GGML 모델을위한 로컬 오피스의 추론 프레임 워크로 텍스트 생성, 이미지 생성, VLM (Vision-Language Models), 오디오 언어 모델, ASR (Speech-to-Text) 및 텍스트 음주 (TTS) 기능을 지원합니다. Python 패키지 또는 실행 파일 설치 프로그램을 통해 설치할 수 있습니다.

특징

장치 지원 : CPU, GPU (Cuda, Metal, ROCM), iOS
서버 : OpenAi 호환 API, 기능 호출 및 스트리밍 지원을위한 JSON 스키마
로컬 UI : 대화식 모델 배포 및 테스트를위한 간소화

설치 옵션 1 : 실행 파일 설치 프로그램

MACOS 설치 프로그램

Windows 설치 프로그램

Linux 설치 프로그램

curl -fsSL https://public-storage.nexa4ai.com/install.sh | sh

FAQ : 이미 설치된 Nexaai Python 패키지와 함께 실행 파일을 사용할 수 없습니다.

대신 nexa-exe 사용해보십시오.

nexa-exe < command >

옵션 2 : 파이썬 패키지를 설치하십시오

인덱스 페이지에서 편리한 설치를 위해 다양한 Python 버전, 플랫폼 및 백엔드 용 미리 제작 된 휠을 출시했습니다.

CPU

pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cpu --extra-index-url https://pypi.org/simple --no-cache-dir

Apple GPU (금속)

GPU 버전 지원 금속 (MACOS) 의 경우 :

CMAKE_ARGS= " -DGGML_METAL=ON -DSD_METAL=ON " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/metal --extra-index-url https://pypi.org/simple --no-cache-dir

FAQ : M1에서 금속/GPU를 사용할 수 없습니다

다음 명령을 시도하십시오.

wget https://github.com/conda-forge/miniforge/releases/latest/download/Miniforge3-MacOSX-arm64.sh
bash Miniforge3-MacOSX-arm64.sh
conda create -n nexasdk python=3.10
conda activate nexasdk
CMAKE_ARGS= " -DGGML_METAL=ON -DSD_METAL=ON " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/metal --extra-index-url https://pypi.org/simple --no-cache-dir

NVIDIA GPU (CUDA)

CUDA 지원으로 설치하려면 CUDA 툴킷이 12.0 이상 설치되어 있는지 확인하십시오.

Linux 용 :

CMAKE_ARGS= " -DGGML_CUDA=ON -DSD_CUBLAS=ON " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cu124 --extra-index-url https://pypi.org/simple --no-cache-dir

Windows PowerShell 의 경우 :

 $env :CMAKE_ARGS= " -DGGML_CUDA=ON -DSD_CUBLAS=ON " ; pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cu124 --extra-index-url https://pypi.org/simple --no-cache-dir

Windows 명령 프롬프트 의 경우 :

 set CMAKE_ARGS= " -DGGML_CUDA=ON -DSD_CUBLAS=ON " & pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cu124 --extra-index-url https://pypi.org/simple --no-cache-dir

Windows Git Bash 의 경우 :

CMAKE_ARGS= " -DGGML_CUDA=ON -DSD_CUBLAS=ON " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/cu124 --extra-index-url https://pypi.org/simple --no-cache-dir

FAQ : Llava의 건물 문제

구축 중에 다음 문제가 발생하면 다음과 같습니다.

다음 명령을 시도하십시오.

CMAKE_ARGS= " -DCMAKE_CXX_FLAGS=-fopenmp " pip install nexaai

AMD GPU (ROCM)

ROCM 지원으로 설치하려면 ROCM 6.2.1 이상이 설치되어 있는지 확인하십시오.

Linux 용 :

CMAKE_ARGS= " -DGGML_HIPBLAS=on " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/rocm621 --extra-index-url https://pypi.org/simple --no-cache-dir

GPU (Vulkan)

Vulkan 지원으로 설치하려면 Vulkan SDK 1.3.261.1 이상이 설치되어 있는지 확인하십시오.

Windows PowerShell 의 경우 :

 $env :CMAKE_ARGS= " -DGGML_VULKAN=on " ; pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/vulkan --extra-index-url https://pypi.org/simple --no-cache-dir

Windows 명령 프롬프트 의 경우 :

 set CMAKE_ARGS= " -DGGML_VULKAN=on " & pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/vulkan --extra-index-url https://pypi.org/simple --no-cache-dir

Windows Git Bash 의 경우 :

CMAKE_ARGS= " -DGGML_VULKAN=on " pip install nexaai --prefer-binary --index-url https://github.nexa.ai/whl/vulkan --extra-index-url https://pypi.org/simple --no-cache-dir

로컬 빌드

이 저장소를 복제하는 방법

git clone --recursive https://github.com/NexaAI/nexa-sdk

사용하는 것을 잊어 버린 경우 --recursive 아래 명령을 사용하여 하위 모드를 추가 할 수 있습니다.

git submodule update --init --recursive

그런 다음 패키지를 빌드하고 설치할 수 있습니다

pip install -e .

분화

아래는 다른 유사한 도구와의 차별화입니다.

특징	Nexa SDK	올라마	최적	LM 스튜디오
GGML 지원	✅	✅		✅
Onnx 지원	✅		✅
텍스트 생성	✅	✅	✅	✅
이미지 생성	✅
비전 언어 모델	✅	✅	✅	✅
오디오 언어 모델	✅
텍스트 음성	✅		✅
서버 기능	✅	✅	✅	✅
사용자 인터페이스	✅			✅
실행 가능한 설치	✅	✅		✅

지원되는 모델 및 모델 허브

ON-DEVICE 모델 허브는 RAM, 파일 크기, 작업 등을위한 필터가있는 모든 유형의 양자화 된 모델 (텍스트, 이미지, 오디오, 멀티 모달)을 제공하여 UI로 모델을 쉽게 탐색 할 수 있도록 도와줍니다. 오전 기기 모델 허브에서 기기 모델을 탐색하십시오

지원되는 모델 예제 (모델 허브의 전체 목록) :

모델	유형	체재	명령
Omniaudio	오디 올름	GGUF	`nexa run omniaudio`
Qwen2audio	오디 올름	GGUF	`nexa run qwen2audio`
낙지 -V2	기능 호출	GGUF	`nexa run octopus-v2`
Octo-net	텍스트	GGUF	`nexa run octo-net`
Omnivlm	멀티 모달	GGUF	`nexa run omniVLM`
나노 라바	멀티 모달	GGUF	`nexa run nanollava`
llava-phi3	멀티 모달	GGUF	`nexa run llava-phi3`
llava-llama3	멀티 모달	GGUF	`nexa run llava-llama3`
llava1.6-mistral	멀티 모달	GGUF	`nexa run llava1.6-mistral`
llava1.6-Vicuna	멀티 모달	GGUF	`nexa run llava1.6-vicuna`
llama3.2	텍스트	GGUF	`nexa run llama3.2`
llama3-incensored	텍스트	GGUF	`nexa run llama3-uncensored`
젬마 2	텍스트	GGUF	`nexa run gemma2`
qwen2.5	텍스트	GGUF	`nexa run qwen2.5`
MathQwen	텍스트	GGUF	`nexa run mathqwen`
CodeQwen	텍스트	GGUF	`nexa run codeqwen`
미스트랄	텍스트	GGUF/ONNX	`nexa run mistral`
Deepseek 코더	텍스트	GGUF	`nexa run deepseek-coder`
ph.3.5	텍스트	GGUF	`nexa run phi3.5`
OpenElm	텍스트	GGUF	`nexa run openelm`
안정된 확산 -V2-1	이미지 생성	GGUF	`nexa run sd2-1`
안정적인 분해 -3- 메디움	이미지 생성	GGUF	`nexa run sd3`
플럭스 1-Schnell	이미지 생성	GGUF	`nexa run flux`
LCM-DREAMSHAPER	이미지 생성	GGUF/ONNX	`nexa run lcm-dreamshaper`
Whisper-Large-V3-Turbo	음성-텍스트	큰 상자	`nexa run faster-whisper-large-turbo`
Whisper-Tiny.en	음성-텍스트	onx	`nexa run whisper-tiny.en`
MXBAI- 엠 베드-래지 -V1	임베딩	GGUF	`nexa embed mxbai`
nomic-embed-text-v1.5	임베딩	GGUF	`nexa embed nomic`
모든 미닐름 L12-V2	임베딩	GGUF	`nexa embed all-MiniLM-L12-v2:fp16`
짖는 소리	텍스트 음성	GGUF	`nexa run bark-small:fp16`

모델 실행? 포옹 또는? ModelsCope

NEXA SDK를 사용하여 HF 또는 MS에서 지원되는 텍스트 생성 모델을 당기고 (.gguf)로 변환하고 (.gguf), 양자화 및 실행할 수 있습니다.

.gguf 파일을 실행하십시오

nexa run -hf <hf-model-id> 또는 nexa run -ms <ms-model-id> 사용하여 제공된 .gguf 파일과 함께 모델을 실행하십시오.

nexa run -hf Qwen/Qwen2.5-Coder-7B-Instruct-GGUF

nexa run -ms Qwen/Qwen2.5-Coder-7B-Instruct-GGUF

참고 : 단일 .gguf 파일을 선택하라는 메시지가 표시됩니다. 원하는 양자화 버전에 여러 분할 파일 (FP16-00001-of-00004)이있는 경우 NEXA의 변환 도구 (아래 참조)를 사용하여 모델을 로컬로 변환하고 정량화하십시오.

.safetensors 파일을 변환합니다

Nexa Python 패키지를 설치하고 pip install "nexaai[convert]" 사용하여 Nexa 변환 도구를 설치 한 다음 nexa convert <hf-model-id> : HuggingFace에서 모델을 변환하십시오.

nexa convert HuggingFaceTB/SmolLM2-135M-Instruct

또는 nexa convert -ms <ms-model-id> 로 ModelsCope에서 모델을 변환 할 수 있습니다.

nexa convert -ms Qwen/Qwen2.5-7B-Instruct

참고 : 주류 언어 모델의 다양한 양자화 된 버전의 성능 벤치 마크와 양자화 옵션에 대해 알아 보려면 리더 보드를 확인하십시오.

? nexa list 으로 다운로드 및 변환 된 모델을 볼 수 있습니다.

선적 서류 비치

메모

ONNX 모델을 사용하려면 pip install nexaai pip install "nexaai[onnx]" 교체하십시오.
벤치 마크 평가를 실행 하려면 제공된 명령에서 pip install nexaai pip install "nexaai[eval]" 로 교체하십시오.
GGUF 모델로 포옹 페이스 모델을 변환하고 양자화 하려면 pip install nexaai pip install "nexaai[convert]" 로 바꾸십시오.
중국 개발자의 경우 Tsinghua 오픈 소스 미러를 추가 색인 URL로 사용하는 것이 좋습니다. --extra-index-url https://pypi.org/simple --extra-index-url https://pypi.tuna.tsinghua.edu.cn/simple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple In Insimple을 대체하는 것이 좋습니다.