Install Stable Install Latest
LLMFarm is an iOS and MacOS app to work with large language models (LLM). It allows you to load different LLMs with certain parameters.With LLMFarm, you can test the performance of different LLMs on iOS and macOS and find the most suitable model for your project.
Based on ggml and llama.cpp by Georgi Gerganov.
LLaMA
Gemma
Phi
GPT2 + Cerebras
Starcoder(Santacoder)
Falcon
MPT
Bloom
StableLM-3b-4e1t
Qwen
Yi models
Deepseek models
Mixtral MoE
PLaMo-13B
Mamba
RWKV
GPTNeoXSee full list here.
Note: For Falcon, Alpaca, GPT4All, Chinese LLaMA / Alpaca and Chinese LLaMA-2 / Alpaca-2, Vigogne (French), Vicuna, Koala, OpenBuddy (Multilingual), Pygmalion/Metharme, WizardLM, Baichuan 1 & 2 + derivations, Aquila 1 & 2, Mistral AI v0.1, Refact, Persimmon 8B, MPT, Bloom select llama inference in model settings.
You can find answers to some questions in the FAQ section.
When creating a chat, a JSON file is generated in which you can specify additional inference options. The chat files are located in the "chats" directory. You can see all inference options here.
You can find some of the supported models here.
llmfarm_core has been moved to a separate repository. To build llmfarm, you need to clone this repository recursively:
git clone --recurse-submodules https://github.com/guinmoon/LLMFarm