LLM with RAG Download - LLM with RAG Source code download

Download

Stage One

RAG File
- Large Language Model:
  - Language Model: "databricks/dbrx-instruct": https://huggingface.co/databricks/dbrx-instruct
  - Nvidia Client: https://build.nvidia.com/databricks/dbrx-instruct
- Vector Database:
  - Milvus: https://milvus.io/
  - Embedding model: https://huggingface.co/thenlper/gte-base
  - Support OS: Linux
    - Currently Does not support Windows OS because Milvus_lite does not support Windows OS
    - Will choose different database in the future in order to fix this issue
pdf_to_txt File
- Current Handle:
  - pdf(text) to txt
  - Need to improve preprocessing inorder to feed to RAG model
Progress(10/01/2024): Simplified version works on Linux, with one query ability
- (10/02/2024): Able to reuse collection for query

Creating pdf reader using OCR
- accept uploaded pdf
- read using EasyOCR
- store results in files, preferably one file for each pdf
RAG File supports recursive question and answer
Able to store historical QA in corresponding files