Efficient Document Search and Summarization Engine

Overview

The Efficient Document Search and Summarization Engine is designed to make research faster and clearer by searching a document collection and summarizing the results with Large Language Models (LLMs) such as ChatGPT and Llama. It exposes search through a RESTful API and uses vector databases for storage and retrieval.

Features

LLM Integration: Uses LLMs to summarize research findings, so results are quicker to read and easier to follow.
RESTful API: Exposes endpoints for document and query retrieval.
Vector Databases: Uses vector databases for efficient storage and retrieval, improving search performance.
Advanced Algorithms: Applies cosine similarity and k-means clustering for text extraction and query matching.
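
To illustrate the two algorithms named above, here is a minimal, dependency-free sketch. It is not the project's implementation: the function names (`cosine_similarity`, `rank_chunks`, `kmeans`) are hypothetical, the vectors stand in for embeddings produced elsewhere, and the k-means initialization (first `k` points) is a simplification chosen to keep the example deterministic.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # a zero vector has no direction; treat as dissimilar
    return dot / (norm_a * norm_b)


def rank_chunks(query_vec, chunk_vecs, top_k=3):
    """Return indices of the top_k chunks most similar to the query."""
    scored = [(cosine_similarity(query_vec, v), i) for i, v in enumerate(chunk_vecs)]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [i for _, i in scored[:top_k]]


def kmeans(points, k, iterations=10):
    """Plain k-means; assumption: the first k points seed the centroids."""
    centroids = [list(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        for i, p in enumerate(points):
            labels[i] = min(range(k), key=lambda c: math.dist(p, centroids[c]))
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                centroids[c] = [sum(dim) / len(members) for dim in zip(*members)]
    return labels, centroids
```

In a pipeline like the one described, `rank_chunks` would match a query embedding against stored chunk embeddings, and `kmeans` could group similar chunks before summarization. A production setup would more likely rely on a vector database or a numerical library for both steps.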

Technologies Used

Python: Core programming language for development.
FastAPI: Framework for building the RESTful API.
LangChain: Framework for integrating language models.
MongoDB: Database for storing documents and metadata.
Pinecone: Vector database for optimized query storage and retrieval.

Installation

To set up the project locally, follow these steps:

Clone the Repository:

```
git clone https://github.com/mananjain02/efficient-document-search-and-summarization-engine.git
cd efficient-document-search-and-summarization-engine
```

Create a Virtual Environment:

```
python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
```

Install Dependencies:
```
pip install -r requirements.txt
```

Set Up Environment Variables: Create a .env file in the root directory and add your configuration settings.

```
MONGODB_URL=<mongo-db-uri>
SECRET_KEY=<bcrypt-key>
ALGORITHM="HS256"
DATABASE=<database-name>
EMBEDDINGS_MODEL="BAAI/bge-large-en-v1.5"
VECTOR_DATABASES_FOLDER="vector_databases"
OPENAI_API_KEY=<open-ai-key-if-want-to-use-chatgpt>
TOKENIZERS_PARALLELISM="False"
```
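
As a rough sketch of how an application might read these settings at startup, the snippet below validates them from the process environment. The `Settings` class, the `load_settings` function, and the choice of which variables are required are assumptions made for illustration, not taken from the project's code. It also assumes the values are already present in the environment, for example exported in the shell or loaded from the .env file by a tool such as python-dotenv.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

# Assumption: these three have no sensible default and must be provided.
REQUIRED = ("MONGODB_URL", "SECRET_KEY", "DATABASE")


@dataclass(frozen=True)
class Settings:
    mongodb_url: str
    secret_key: str
    database: str
    algorithm: str
    embeddings_model: str
    vector_databases_folder: str
    openai_api_key: Optional[str]  # only needed when using ChatGPT


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Build Settings from a mapping (defaults to os.environ)."""
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED if not env.get(name)]
    if missing:
        raise RuntimeError("Missing required settings: " + ", ".join(missing))
    return Settings(
        mongodb_url=env["MONGODB_URL"],
        secret_key=env["SECRET_KEY"],
        database=env["DATABASE"],
        # Defaults mirror the example values shown in the .env listing.
        algorithm=env.get("ALGORITHM", "HS256"),
        embeddings_model=env.get("EMBEDDINGS_MODEL", "BAAI/bge-large-en-v1.5"),
        vector_databases_folder=env.get("VECTOR_DATABASES_FOLDER", "vector_databases"),
        openai_api_key=env.get("OPENAI_API_KEY") or None,
    )
```

Failing fast on a missing value gives a clearer error than a connection failure later in the request path.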

Run the Application:
```
uvicorn main:app --reload
```

Usage

API Documentation

API documentation and further details can be accessed using Swagger. Once the application is running, navigate to http://localhost:8000/docs to explore and interact with the API endpoints.
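
The same information is available programmatically, since FastAPI serves the OpenAPI schema at /openapi.json by default. The sketch below lists the available endpoints from that schema. The helper names `fetch_schema` and `list_endpoints` are hypothetical, and the base URL assumes the default uvicorn host and port shown above.

```python
import json
from urllib.request import urlopen

HTTP_METHODS = {"get", "post", "put", "patch", "delete", "head", "options"}


def fetch_schema(base_url="http://localhost:8000"):
    """Download the OpenAPI schema from a running instance."""
    with urlopen(f"{base_url}/openapi.json") as resp:
        return json.load(resp)


def list_endpoints(schema):
    """Return sorted (METHOD, path) pairs found in an OpenAPI schema dict."""
    endpoints = []
    for path, operations in sorted(schema.get("paths", {}).items()):
        for method in sorted(operations):
            # Path items can also hold non-method keys such as "parameters".
            if method in HTTP_METHODS:
                endpoints.append((method.upper(), path))
    return endpoints
```

With the server running, `list_endpoints(fetch_schema())` returns the routes the application actually exposes, which is a quick way to check them without opening a browser.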

Additional Information

Version: 1.0.0
Type: Other source code
Update Time: 2025-05-29
Size: 13.7KB
Source: GitHub
