Evaltex
From LRDE
EvaLTex (Evaluating Text Localization) is an evaluation tool used to measure the performance of text detection algorithms. It takes as input text detection results that can be represented either by coordinates or by masks and outputs performance scores.
XML format
Input
The framework takes as input XML files containing the coordinates of the bounding boxes surrounding the text objects and compares it to another XML file representing the ground truth (GT). The GT XML format differs slightly from the result format. Its attributes are:
- name : the image name
- size : image size
- region : 1st level components
Mask representation
In addition to bounding boxes, text objects can also be represented using masks. EvaLTex takes as input binary images (white corresponds to text). TODO add images
Output
The output consists in a local valuation (for each image), as well as a global evaluation (one XML file for a whole database).
TODO add local XML file and global XML file
Performance measurements
Local evaluation
For each matched GT object we assign two quality measures: Coverage (Cov) and Accuracy (Acc);
- Cov computes the rate of the matched area with respect to the GT object area
- Acc computes the rate of the matched area with respect to the detection area
Recall
The Recall () computes the amount of detected text. We provide 3 measures: a global , a quantitative that measures the amount detected objects (regardless of the matched area) and a qualitative that corresponds to the rate of the detected text area with respect to the number of true positives ().
Precision
The Precision () computes the rate of detections that have a match in the GT. Similarly to , we compute 3 measures: a quantitative that measures the amount of valid detections (regardless of the matched area) and a qualitative that corresponds to the rate of the detected text area with respect to the number of total detections, computed as the sum of and
How to compute the measurements
Parameters to run the tool