
Welcome to the RAG Console Chat Application, a command-line tool for querying and summarizing large document repositories. This project implements document ingestion, embedding generation, and retrieval-augmented generation (RAG), so you can chat with your own data or summarize complex topics drawn from it.
RAG combines the strengths of information retrieval and natural language generation. It improves the quality of generated responses by supplying the model with relevant context retrieved from a large corpus of documents. The RAG Pipeline implements this through document ingestion, embedding generation, and retrieval of relevant context at query time.
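To make the "augmented" part concrete, here is a minimal sketch of how retrieved passages and a question could be merged into a single prompt before it is sent to a language model. The function name `build_prompt`, the prompt wording, and the sample passages are all hypothetical illustrations, not code taken from this project.

```python
from typing import List


def build_prompt(question: str, passages: List[str]) -> str:
    """Combine retrieved passages and the user's question into one prompt."""
    # Number each passage so an answer can refer back to its source.
    context = "\n".join(f"[{i}] {text}" for i, text in enumerate(passages, start=1))
    return (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\n"
        "Answer:"
    )


# Hypothetical passages that a retrieval step might have returned.
retrieved = [
    "ChromaDB stores embeddings and supports similarity queries.",
    "Embeddings are numerical representations of text.",
]
prompt = build_prompt("What does a vector database store?", retrieved)
print(prompt)
```

The generation step would then pass `prompt` to a model of your choice; that call is omitted here because it depends on the provider and on how this project wires it up.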
A vector database like ChromaDB is essential for storing and querying embeddings. It allows for fast retrieval of similar documents based on vector similarity, which is crucial for the RAG process.
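The idea behind similarity retrieval can be shown with a tiny in-memory stand-in. This is only a sketch of the concept, not ChromaDB's API: the class `TinyVectorStore`, its methods, and the three-dimensional sample vectors are hypothetical, and it scans every stored vector using Euclidean distance, whereas a real vector database relies on indexes to stay fast at scale and typically offers several distance metrics.

```python
import math
from typing import Dict, List, Sequence, Tuple


class TinyVectorStore:
    """Hypothetical in-memory stand-in for a vector database."""

    def __init__(self) -> None:
        self._vectors: Dict[str, Tuple[float, ...]] = {}

    def add(self, doc_id: str, vector: Sequence[float]) -> None:
        self._vectors[doc_id] = tuple(vector)

    def query(self, vector: Sequence[float], k: int = 2) -> List[Tuple[str, float]]:
        # Exhaustive scan: compute the distance to every stored vector.
        scored = [
            (doc_id, math.dist(vector, stored))
            for doc_id, stored in self._vectors.items()
        ]
        # A smaller distance means the document is more similar.
        scored.sort(key=lambda pair: pair[1])
        return scored[:k]


store = TinyVectorStore()
store.add("cats", [0.9, 0.1, 0.0])
store.add("dogs", [0.8, 0.2, 0.1])
store.add("taxes", [0.0, 0.1, 0.9])

# The two stored vectors closest to the query vector.
nearest = store.query([1.0, 0.0, 0.0], k=2)
print(nearest)
```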
Embeddings are numerical representations of text that capture semantic information. They enable the comparison of text data in a meaningful way, facilitating tasks like document similarity and clustering.
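As a rough illustration of comparing texts as vectors, the sketch below uses word counts as a toy embedding and scores pairs with cosine similarity. Word counts only capture shared vocabulary, so this is a deliberately simplified stand-in: a real embedding model produces dense vectors that also place differently worded but related texts close together. The function names and sample sentences are hypothetical.

```python
import math
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding: word counts. Real models produce dense float vectors."""
    return Counter(text.lower().split())


def cosine_similarity(a: Counter, b: Counter) -> float:
    # Dot product over the words the two texts share.
    dot = sum(a[word] * b[word] for word in a.keys() & b.keys())
    norm_a = math.sqrt(sum(count * count for count in a.values()))
    norm_b = math.sqrt(sum(count * count for count in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # An empty text has no direction to compare.
    return dot / (norm_a * norm_b)


query = embed("how do vector databases work")
related = embed("vector databases store embeddings")
unrelated = embed("the weather is sunny today")

print(cosine_similarity(query, related))
print(cosine_similarity(query, unrelated))
```

The related sentence scores higher than the unrelated one, which is the property retrieval depends on.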
To set up the RAG Pipeline on your local machine, follow these steps:
Install Python 3.10+: Download and install from here
Install Microsoft Visual Studio C++ Build Tools: This is necessary for compiling some of the dependencies. Download and install from here.
Clone the Repository:
git clone https://github.com/olifarhaan/rag-console-chat.git
cd rag-console-chat
Install Python Dependencies: Ensure you have Python 3.10+ installed. Then, create a virtual environment and install the required packages:
python -m venv venv
source venv/bin/activate # On Windows, use `venv\Scripts\activate`
pip install -r requirements.txt
Set Up Environment Variables:
Create a .env file in the root directory and add your OpenAI API key:
OPENAI_API_KEY=your_openai_api_key
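For readers unfamiliar with .env files, the following sketch shows how KEY=VALUE lines of this kind can be read and checked before the application starts. It is an assumption-laden illustration using only the standard library: `parse_env` is a hypothetical helper, and how this project actually loads the file is not stated here (many Python projects use the python-dotenv package's `load_dotenv()` for this).

```python
from typing import Dict


def parse_env(text: str) -> Dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    values: Dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional matching quotes.
        value = value.strip()
        if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
            value = value[1:-1]
        values[key.strip()] = value
    return values


# Sample file contents; in practice you would read the .env file from disk.
sample = "# local secrets\nOPENAI_API_KEY=your_openai_api_key\n"
config = parse_env(sample)

# Fail early with a clear message if the key is absent or empty.
if not config.get("OPENAI_API_KEY"):
    raise SystemExit("OPENAI_API_KEY is missing from .env")
```

Keep the .env file out of version control so the key is not committed by accident.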
Run the Application: Start the RAG Pipeline by executing:
python app.py
Interact with the Application: Use the command-line interface to choose between chat and summarization modes, and explore the capabilities of the RAG Pipeline.
The RAG Console Chat Application is a simple yet powerful tool designed to enhance your document processing and information retrieval capabilities. With its advanced features and user-friendly interface, it stands as a testament to the power of modern AI technologies. Dive into the world of RAG and experience the future of intelligent information systems today.
Licensing: This project is licensed under the MIT License. You are free to use, modify, and distribute the software, provided that the original license and copyright notice are included in all copies or substantial portions of the software.
Diagram: The diagram was created using Mermaid.
Contact: For any questions or feedback, please contact [email protected] or message me on LinkedIn @olifarhaan.