Convert images into captivating audio stories using a combination of image-to-text, language models, and text-to-speech technologies.

This project allows you to turn images into audio stories. It employs image-to-text conversion, language models, and text-to-speech synthesis to create an engaging experience. Extract text from uploaded images, generate short stories based on the extracted text, and listen to the generated stories as audio clips.
Clone the Repository:
git clone https://github.com/fshnkarimi/Image2AudioStoryConverter.git
cd Image2AudioStoryConverterInstall Dependencies:
pip install -r requirements.txtSet Up Environment Variables:
Create a .env file in the project directory and add your Hugging Face API token:
HUGGINGFACEHUB_API_TOKEN=your_token_here
Run the Streamlit App:
streamlit run app.pyUpload an Image:
Experience the Story:
Contributions are welcome! If you'd like to contribute to this project, please follow these steps:
Enjoy turning your images into captivating audio stories! Feel free to customize and enhance this project as you see fit. If you have any questions or ideas for improvement, please don't hesitate to get in touch.