The Efficient Document Search and Summarization Engine is a powerful tool designed to enhance research efficiency and clarity by integrating advanced Large Language Models (LLMs) such as ChatGPT and LLAMA. This project leverages cutting-edge technologies to provide a seamless and efficient document search and summarization experience.
To set up the project locally, follow these steps:
Clone the Repository:
git clone https://github.com/mananjain02/efficient-document-search-and-summarization-engine.git
cd efficient-document-search-and-summarization-engineCreate a Virtual Environment:
python -m venv venv
source venv/bin/activateInstall Dependencies:
pip install -r requirements.txtSet Up Environment Variables:
Create a .env file in the root directory and add your configuration settings.
MONGODB_URL=<mongo-db-uri>
SECRET_KEY=<bcrypt-key>
ALGORITHM="HS256"
DATABASE=<database-name>
EMBEDDINGS_MODEL="BAAI/bge-large-en-v1.5"
VECTOR_DATABASES_FOLDER="vector_databases"
OPENAI_API_KEY=<open-ai-key-if-want-to-use-chatgpt>
TOKENIZERS_PARALLELISM="False"Run the Application:
uvicorn main:app --reloadAPI documentation and further details can be accessed using Swagger. Once the application is running, navigate to http://localhost:8000/docs to explore and interact with the API endpoints.