Difference between revisions of "Publications/carlinet.15.itip.inc"
From LRDE
Line 44: | Line 44: | ||
=== Grain filters for document layout extraction === |
=== Grain filters for document layout extraction === |
||
− | <gallery> |
+ | <gallery perrow=5> |
+ | File:pscomp-mp00032c_20p.png |
||
− | File:Example.jpg|Caption1 |
||
+ | File:pscomp-mp00215c_20p.png |
||
− | File:Example.jpg|Caption2 |
||
+ | File:pscomp-mp00234c_20p.png |
||
+ | File:pscomp-mp00248c_20p.png |
||
+ | File:pscomp-mp00252c_20p.png |
||
+ | |||
+ | File:carlinet.15.tip-mp00032c_20p.png |
||
+ | File:carlinet.15.tip-mp00215c_20p.png |
||
+ | File:carlinet.15.tip-mp00234c_20p.png |
||
+ | File:carlinet.15.tip-mp00248c_20p.png |
||
+ | File:carlinet.15.tip-mp00252c_20p.png |
||
</gallery> |
</gallery> |
||
Revision as of 11:26, 4 May 2015
Materials
Multivariate Tree of Shapes Computation Binaries
You can download the x86_64 binaries to compute the Multivariate Tree of Shapes Here. This application outputs 16-bits image where each pixel stores the depth of the node it belongs to. To recover the MToS from this image, one just has to compute its max-tree. Note that the image is twice has big has the original one and has a border for topogical and algorithmic purposes. Thus, any pixel with coordinates (x,y) in the original image is now at coordinates (2*(x+1), 2*(y+1)) in the depth image. The application also outputs a 8bits grayscale version of the depth image that can be used to vizualise the shapes by thresholding this image.
Usage: ./compute_ctos-demo [options] input depth16.tiff depth8.png
Mumford-Shah Simplification with the MToS
You can download the x86_64 binaries to compute the Mumford-Shah simplification running on the MToS (as described in the paper) Here.
Usage: ./mumford_shah_on_tree_full input[rgb] α₀ α₁ λ output α₀ Grain filter size before merging trees (0 to disable) α₁ Grain filter size on the color ToS (0 to disable) λ Mumford-shah regularisation weight (e.g. 5000)
Illustrations
Object detections in videos
In the scope of the ICDAR competition on Smartphone Document Capture and OCR (SmartDoc-2015), we aim at automatically detecting documents in video captured by smartphones. The dataset covers different document layout (textual and/or having graphical content) and realistic scene analysis problems (change of illumination, motion blur, change of perspectives, partial occlusions...).