A Python GUI application that uses the Google Cloud Vision API to extract text data from images (jpg, jpeg, png) and export it into CSV files. It supports batch processing and automatically extracts details such as names and IDs from scanned images.
This program requires you to have a Google API key to access Cloud Vision.
- Install Node.js
- Create an API key on console.cloud.google.com:
  - Search for "API Library" in the search bar
  - Search for "Cloud Vision API" and press Enable
  - Next, search for "Service accounts" in the search bar
  - Press "Create Service Account" at the top
  - Give your service account a name and press Done
  - Under the Actions column, press the three dots and press "Manage keys"
  - Press "Add key" and create a new key
- Install Node.js dependencies:
    cd backend
    npm install
- Run the GUI:
    python -m frontend.gui
- Select your API key:
  - Go to API in the top menu and press Select API
- Import image files:
  - Go to Files in the top menu and press Import
- Export CSV:
  - Press Export to CSV file to extract details from all images
  - The CSV will be saved in the results/ folder
- Clear the batch:
  - Press Clear to reset imported images
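
For readers curious what the export step amounts to, here is a minimal sketch of turning OCR text into CSV rows. It is not the application's actual code: the `Name:` and `ID:` label patterns, the function names `extract_details` and `export_batch`, the column names, and the output file name are all assumptions made for illustration. It also skips the Cloud Vision call and starts from text that has already been recognized.

```python
import csv
import re
from pathlib import Path

# Hypothetical label patterns; the real application may use different rules.
NAME_RE = re.compile(r"^[ \t]*Name[ \t]*[:\-][ \t]*(.+?)[ \t]*$",
                     re.IGNORECASE | re.MULTILINE)
ID_RE = re.compile(r"^[ \t]*ID[ \t]*[:\-][ \t]*([A-Za-z0-9\-]+)",
                   re.IGNORECASE | re.MULTILINE)


def extract_details(ocr_text):
    """Pull a name and an ID out of OCR text; missing fields become ''."""
    name = NAME_RE.search(ocr_text)
    id_ = ID_RE.search(ocr_text)
    return {
        "name": name.group(1) if name else "",
        "id": id_.group(1) if id_ else "",
    }


def export_batch(ocr_results, out_path):
    """Write one CSV row per image; ocr_results maps file name -> OCR text."""
    out_path = Path(out_path)
    # Create the output folder (for example results/) if it does not exist yet.
    out_path.parent.mkdir(parents=True, exist_ok=True)
    with out_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=["file", "name", "id"])
        writer.writeheader()
        for filename, text in ocr_results.items():
            writer.writerow({"file": filename, **extract_details(text)})
    return out_path
```

Called as `export_batch({"card1.png": "Name: Jane Doe\nID: A-1042"}, "results/output.csv")`, this would write a header row plus one row per image. Fields that cannot be found are left empty rather than guessed, so gaps stay visible in the CSV.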