Scanbot SDK Document Quality Analyzer (DoQA) Configurator

Introduction

This tool allows you to create a customized config for the Document Quality Analyzer from the Scanbot SDK.

The goal of the Document Quality Analyzer is to decide if a user-provided image of a document is of good enough quality to proceed, or if the user should be asked to provide an image of better quality. This distinction between good and bad quality images is difficult and largely depends on the use-case:

What type of documents will be scanned? E.g. receipts, invoices, contracts
What part of the document carries the important information? E.g. is it important that the fine-print on the image is readable? Or is it sufficient if the larger text is readable?
What are the capabilities of the next processing passes? E.g. text with poor contrast might be fine for OCR but is difficult to read for humans.

By providing examples of images that have sufficient or insufficient quality for your use-case, this tool will create a fine-tuned configuration file to optimize the DoQA performance for you.

Prerequisites

To run this tool, you need to have the following:

Docker with Docker Compose (version 2.34.0 or later)
A special license key that is only valid for the DoQA Configurator. This license key will be different from the license key you use in your app. Please contact customer support to obtain it.
You need to know the version of the ScanbotSDK Core used in your release. Please also contact customer support to obtain it.
Images of documents that are of sufficient (good) or insufficient (bad) quality for your use-case:
- Per class "good" and "bad" you should provide at least 100 samples (200 in total) to achieve optimal results. A configuration built with fewer than 100 samples per class will still work, but accuracy will be lower. If you find that the accuracy is not sufficient with 100 samples per class, you can add more samples; however, expect diminishing returns for every additional sample.
- Sometimes, it can be difficult to decide if a given sample should belong to the "good" or "bad" class. For such ambiguous samples, it is better not to include them at all, because they can reduce the accuracy of your configuration.
- The samples should be in JPG or PNG format. If you have PDF files at hand, we provide a script to convert PDF -> PNG (see below).

Usage

Clone or download the repository

git clone https://github.com/doo/scanbot-sdk-doqa-configurator

Modify the .env file and put the version of the ScanbotSDK Core and your license key there.
Place the training images into the folders data/bad & data/good. Images should be in JPG or PNG format. If you only have PDF files available, please convert them to PNG as described below.

Run the following command to produce the custom configuration:

docker compose run --env-from-file=.env --build --rm sbsdk-doqa-configurator

Your config will be created in data/DoQA_config.txt. Please provide the contents of this file during the configuration of the Scanbot SDK.
A report will be generated in data/training_report.html that shows what performance you can expect from your new configuration.
A debug file will data/DoQA_config_debug.pkl generated (see usage below).

Debugging

If you find that after creating a custom DoQA configuration, the output of the DoQA is not satisfactory, we provide some tools to understand what might be going wrong.

Usage:

The folders data/bad & data/good need to contain the same images as they did when you created the DoQA configuration.
The file data/DoQA_config_debug.pkl from your training needs to be present.
Place the images that yield unexpected DoQA results and you would like to examine in the folder data/explain (JPEG or PNG).
Run the following command
```
docker compose run --env-from-file=.env --build --rm --entrypoint python sbsdk-doqa-configurator /app/explain.py
```
The command will generate one report HTML in data/explain for every image in that folder. These reports can help you understand how the DoQA operates and what you can do to improve its performance.

PDF to PNG

If you only have PDF files at hand, we provide a convenience script to extract images from PDF files and store them as PNG so that they can be used in the training. To use this PDF -> PNG conversion, please:

Follow the basic setup instructions in the Usage section above
Place your PDF files in the data/bad & data/good folders

Then run the following script:

docker compose run --env-from-file=.env --build --rm --entrypoint python sbsdk-doqa-configurator /app/pdf_to_png.py

This will add appropriate PNG files in the respective folders.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
data		data
src		src
.env		.env
.gitignore		.gitignore
.isort.cfg		.isort.cfg
.pre-commit-config.yaml		.pre-commit-config.yaml
Readme.md		Readme.md
docker-compose.yaml		docker-compose.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scanbot SDK Document Quality Analyzer (DoQA) Configurator

Introduction

Prerequisites

Usage

Debugging

PDF to PNG

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Scanbot SDK Document Quality Analyzer (DoQA) Configurator

Introduction

Prerequisites

Usage

Debugging

PDF to PNG

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages