pdf_extractor
1.0.0
This is a Streamlit application designed for extracting data from PDF files. It utilizes Langchain technology for efficient data extraction and provides a user-friendly interface to upload PDF files, extract information, and convert the extracted data into CSV and JSON formats.
streamlit, pandas.streamlit run main.py.To get started, ensure that Python is installed and follow these steps:
Install the necessary dependencies by running the command:
pip install -r requirements.txtRun the Streamlit application by executing:
streamlit run main.pyThe application leverages Langchain technology for efficient data extraction and processing.
The project consists of the following main files:
main.py: Contains the Streamlit application code.functions.py: Includes functions specifically designed for data extraction from PDF files.