
This project is a tutorial on open-source large language models, aimed at beginners in China and based on the Linux platform. It provides full-process guidance for a wide range of open-source models, covering environment configuration, local deployment, efficient fine-tuning, and more. By simplifying how open-source models are deployed, used, and applied, it helps more ordinary students and researchers make good use of them, and helps open, free models integrate into the lives of ordinary learners faster.
The main content of this project is tutorials, so that more students and future practitioners can understand and become familiar with how to use open-source large models. Anyone can open an issue or submit a PR to help build and maintain this project.
Students who want to get deeply involved can contact us, and we will add you as a project maintainer.
Learning suggestions: first learn environment configuration, then model deployment and usage, and finally fine-tuning. Environment configuration is the prerequisite, deployment and usage are the basics, and fine-tuning is the advanced stage. Beginners may want to start with models such as Qwen1.5, InternLM2, and MiniCPM.
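To make the deployment step concrete, here is a minimal local-inference sketch using the Hugging Face transformers library. The model ID "Qwen/Qwen1.5-7B-Chat" and the generation settings are illustrative assumptions; the project's own tutorials walk through each model's deployment in detail.

```python
# Minimal local deployment sketch (assumes torch, transformers, and
# accelerate are installed; "Qwen/Qwen1.5-7B-Chat" is an illustrative choice).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Qwen/Qwen1.5-7B-Chat"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, device_map="auto", torch_dtype="auto"
)

# Build a chat prompt with the model's chat template, then generate a reply.
messages = [{"role": "user", "content": "Hello, please introduce yourself."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output_ids = model.generate(input_ids, max_new_tokens=256)

# Decode only the newly generated tokens.
print(tokenizer.decode(output_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))
```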
Note: Students who want to understand the internal structure of large models and hand-write tasks such as RAG, Agent, and Eval from scratch can study another Datawhale project. Large models are currently a hot topic in deep learning, but most existing tutorials only teach you how to call APIs to build large-model applications; few explain the model structure, RAG, Agent, and Eval at the level of principles. That repository therefore implements the RAG, Agent, and Eval tasks of large models entirely by hand, without calling any APIs.
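As a flavor of what "hand-written, no API" means, the toy sketch below implements the retrieval step of RAG from scratch with bag-of-words cosine similarity. It is purely illustrative and is not code from that project.

```python
# Toy hand-written RAG retrieval: bag-of-words cosine similarity,
# no external libraries or LLM APIs (illustrative only).
import math
from collections import Counter

def cosine_sim(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * \
           math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Return the k documents most similar to the query."""
    q = Counter(query.lower().split())
    ranked = sorted(docs, key=lambda d: cosine_sim(q, Counter(d.lower().split())),
                    reverse=True)
    return ranked[:k]

docs = [
    "LoRA fine-tunes a model by training small low-rank adapter matrices.",
    "RAG retrieves relevant documents and feeds them to the model as context.",
]
print(retrieve("how does RAG retrieve context", docs))
```

In a full RAG pipeline, the retrieved passages would then be concatenated into the prompt given to the language model.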
Note: Some students may want to learn the theory of large models before starting this project. Those who wish to build a theoretical foundation for LLMs, and to understand and apply LLMs on top of that foundation, can refer to Datawhale's so-large-lm course.
Note: Students who want to develop their own large-model applications after finishing this course can refer to Datawhale's hands-on large-model application development course. It is an application development tutorial for novices, and aims to present the full large-model application development process through a personal knowledge-base assistant project built on an Alibaba Cloud server.
What is a large model?
A large language model (LLM), in the narrow sense, is a natural language processing (NLP) model trained with deep learning algorithms and used mainly for natural language understanding and generation. In the broad sense, large models also include computer vision (CV) large models, multimodal large models, and scientific computing large models.
The "war of a hundred models" is in full swing, and open-source LLMs keep emerging. Many excellent open-source LLMs have appeared both abroad, such as LLaMA and Alpaca, and at home, such as ChatGLM, Baichuan, and InternLM. Open-source LLMs support local deployment and private-domain fine-tuning, so everyone can build their own unique large model on top of them.
However, ordinary students and users need a certain level of technical skill to deploy and use these models, and with new open-source LLMs emerging one after another, quickly mastering how to apply them is a real challenge.
This project aims first to provide deployment, usage, and fine-tuning tutorials for mainstream open-source LLMs at home and abroad, based on the experience of the core contributors. After covering the mainstream models, we hope to gather more co-creators to enrich the open-source LLM world and build ever more comprehensive tutorials for specialized LLMs. Scattered sparks converge into a sea.
We hope to be a ladder between LLMs and the general public, embracing the vaster and more magnificent LLM world with the open-source spirit of freedom and equality.
This project is suitable for students and researchers who want to deploy, use, and fine-tune open-source LLMs on their own, beyond simply calling APIs.
This project organizes the full workflow of open-source LLM application, including environment configuration, deployment and application, and fine-tuning (a minimal fine-tuning sketch follows the model list below). Each part covers both mainstream and specialized open-source LLMs:
Chat-Huanhuan: a chat language model that imitates Zhen Huan's tone, built by fine-tuning an LLM on all of Zhen Huan's lines from the script of "The Legend of Zhen Huan".
Tianji: a large-model system for social scenarios involving human relationships and worldly wisdom, covering the full process of prompt engineering, agent building, data acquisition, model fine-tuning, and RAG data cleaning and usage.
Qwen2.5-Coder
Qwen2-VL
Qwen2.5
Apple OpenELM
Llama3_1-8B-Instruct
Gemma-2-9b-it
Yuan2.0
Yuan2.0-M32
DeepSeek-Coder-V2
Bilibili Index-1.9B
Qwen2
GLM-4
Qwen1.5
Google Gemma
Phi-3
CharacterGLM-6B
LLaMA3-8B-Instruct
XVERSE-7B-Chat
TransNormerLLM
BlueLM (vivo's BlueHeart model)
InternLM2
DeepSeek
MiniCPM
Qwen-Audio
Qwen
Yi (01.AI)
Baichuan
InternLM
Atom (llama2)
ChatGLM3
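For the fine-tuning part mentioned above, a common parameter-efficient approach is LoRA. The sketch below shows a minimal LoRA setup with the Hugging Face peft library; the model ID and target modules are illustrative assumptions (they vary by architecture, and this is not necessarily the exact recipe each tutorial uses) and the per-model tutorials provide complete training scripts.

```python
# Minimal LoRA fine-tuning setup sketch using peft (illustrative;
# "Qwen/Qwen1.5-7B-Chat" and the target modules are assumptions).
from peft import LoraConfig, TaskType, get_peft_model
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen1.5-7B-Chat")
lora_config = LoraConfig(
    task_type=TaskType.CAUSAL_LM,
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=32,                        # scaling factor for the update
    lora_dropout=0.1,
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small adapter weights are trainable
```

Because only the low-rank adapters are updated, the number of trainable parameters drops dramatically, which is what makes fine-tuning feasible for ordinary learners.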
pip and conda source switching @不要葱姜蒜
AutoDL port opening @不要葱姜蒜
Model download
Issue && PR
Note: Rankings are sorted by contribution level
