Difference between revisions of "Publications/carlinet.15.itip.inc"
From LRDE
Line 63: | Line 63: | ||
File:carlinet.15.tip-mp00252c_20p.png |
File:carlinet.15.tip-mp00252c_20p.png |
||
</gallery> |
</gallery> |
||
+ | |||
+ | More examples are available in this [https://lrde.epita.fr/~carlinet/thesis/publis/material/carlinet.15.tip/carlinet.15.tip-grainfilters.tar.bz2 archive]. |
||
=== Interactive object segmentation === |
=== Interactive object segmentation === |
Revision as of 10:36, 4 May 2015
Materials
Multivariate Tree of Shapes Computation Binaries
You can download the x86_64 binaries to compute the Multivariate Tree of Shapes Here. This application outputs 16-bits image where each pixel stores the depth of the node it belongs to. To recover the MToS from this image, one just has to compute its max-tree. Note that the image is twice has big has the original one and has a border for topogical and algorithmic purposes. Thus, any pixel with coordinates (x,y) in the original image is now at coordinates (2*(x+1), 2*(y+1)) in the depth image. The application also outputs a 8bits grayscale version of the depth image that can be used to vizualise the shapes by thresholding this image.
Usage: ./compute_ctos-demo [options] input depth16.tiff depth8.png
Mumford-Shah Simplification with the MToS
You can download the x86_64 binaries to compute the Mumford-Shah simplification running on the MToS (as described in the paper) Here.
Usage: ./mumford_shah_on_tree_full input[rgb] α₀ α₁ λ output α₀ Grain filter size before merging trees (0 to disable) α₁ Grain filter size on the color ToS (0 to disable) λ Mumford-shah regularisation weight (e.g. 5000)
Illustrations
Object detections in videos
In the scope of the ICDAR competition on Smartphone Document Capture and OCR (SmartDoc-2015), we aim at automatically detecting documents in video captured by smartphones. The dataset covers different document layout (textual and/or having graphical content) and realistic scene analysis problems (change of illumination, motion blur, change of perspectives, partial occlusions...).
Grain filters for document layout extraction
We use a grain filter to extract text boxes and graphical parts of documents. Indeed, text parts are composed of letters which are supposed to be small components if the MToS is well-formed. On the contrary, text boxes and grahical contents are large components that should remain after the filtering.
Examples:
More examples are available in this archive.