pdf_extractor Download - pdf_extractor Source code download

pdf_extractor

AI Source Code

1.0.0

Download

PDF Data Extractor

This is a Streamlit application designed for extracting data from PDF files. It utilizes Langchain technology for efficient data extraction and provides a user-friendly interface to upload PDF files, extract information, and convert the extracted data into CSV and JSON formats.

Features

Upload PDF files for data extraction.
Extracted data is displayed in a structured manner.
Convert extracted data to CSV and JSON formats.
Download the extracted data in CSV or JSON formats.

Usage

Install the required libraries: streamlit, pandas.
Run the Streamlit application using streamlit run main.py.
Upload your PDF files and click "Extract your data" to start the extraction process.
Download the extracted data in CSV or JSON formats using the provided buttons.

How to Run

To get started, ensure that Python is installed and follow these steps:

Install the necessary dependencies by running the command:
```
pip install -r requirements.txt
```
Run the Streamlit application by executing:
```
streamlit run main.py
```