Access the application here: Multi-Language Invoice Extractor
This project aims to extract and understand content from multi-language invoices using Google Generative AI (Gemini Model). This reduces the need for traditional OCR methods such as Tesseract and OpenCV.
git clone https://github.com/yourusername/invoice-extractor.git
cd invoice-extractorCreate a new Conda environment with Python 3.10.
conda create -p venv python==3.10 -yActivate the Conda environment.
conda activate "/Users/vallirajasekar/Desktop/gemini/Invoice Extractor/venv"Install the required dependencies.
pip install -r requirements.txtSet Up API Key:
Make sure to set up your Google Generative AI API key. You can store it in a .env file in the root directory of your project:
GOOGLE_API_KEY=your_google_api_key
Run the Streamlit Application: Start the Streamlit application by running:
streamlit run app.pyUpload an Invoice:
.jpg, .jpeg, .png formats).The Multi-Language Invoice Extractor project is designed to simplify the extraction of information from invoices in various languages. Traditional methods like Tesseract OCR and OpenCV require extensive preprocessing and are limited by the quality of the image and language support. By leveraging the power of Google Generative AI (Gemini Model), this project aims to directly understand and extract information from invoice images.
This project uses the following technologies:
!!! Thank You !!! Visit Us Again :)