Document type recognition using evidence theory

From LRDE

Abstract

This paper presents a method to recognize the type of a document when a database of models (document types) is given. For instance, when all documents are forms and when we know all the different types of forms, we want to be able to assign to an input document its type of form. To that aim, we define each model by a set of characteristics whose nature can vary from one to another. For instance, a characteristic can be having a flower-shaped logo at the top left as well as having about 12pt fonts. This paper does not intend to explain how to extract such knowledge from documents, but it describes how to use such information to decide what the type of a given document is when different document types are described by characteristics.
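The abstract does not detail the decision procedure, so the following is only a minimal sketch of how evidence theory (Dempster-Shafer) could fuse characteristics into a decision about the document type. It uses the generic Dempster rule of combination, which is not necessarily the exact formulation used in the paper. The type names (`invoice`, `tax_form`, `order_form`), the two characteristics, and all mass values are hypothetical and chosen for illustration only.

```python
from itertools import product


def combine(m1, m2):
    """Combine two mass functions with Dempster's rule.

    A mass function maps a frozenset of candidate document types
    to a mass in [0, 1]; the masses of one function sum to 1.
    """
    combined = {}
    conflict = 0.0
    for (a, mass_a), (b, mass_b) in product(m1.items(), m2.items()):
        common = a & b
        if common:
            combined[common] = combined.get(common, 0.0) + mass_a * mass_b
        else:
            # Mass assigned to incompatible hypotheses counts as conflict.
            conflict += mass_a * mass_b
    if conflict >= 1.0:
        raise ValueError("sources are in total conflict")
    # Normalize by the non-conflicting mass.
    return {s: v / (1.0 - conflict) for s, v in combined.items()}


# Hypothetical database of models (document types).
TYPES = frozenset({"invoice", "tax_form", "order_form"})

# Hypothetical evidence from a "flower-shaped logo at the top left" test:
# it supports two types; the rest of the mass stays on ignorance (TYPES).
logo = {frozenset({"invoice", "order_form"}): 0.7, TYPES: 0.3}

# Hypothetical evidence from an "about 12pt fonts" test.
font = {frozenset({"invoice"}): 0.6, TYPES: 0.4}

fused = combine(logo, font)

# Decide among single types by picking the one with the largest fused mass.
singletons = [s for s in fused if len(s) == 1]
best = max(singletons, key=fused.get)
print(sorted(best), round(fused[best], 2))
```

Keeping part of each mass on the full set `TYPES` is what lets a characteristic express partial ignorance instead of forcing a probability onto every type, which is one reason evidence theory suits characteristics of heterogeneous nature.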


Bibtex (lrde.bib)

@InProceedings{	  geraud.03.grec,
  author	= {Thierry G\'eraud and Geoffroy Fouquier and Quoc Peyrot and
		  Nicolas Lucas and Franck Signorile},
  title		= {Document type recognition using evidence theory},
  booktitle	= {Proceedings of the 5th IAPR International Workshop on
		  Graphics Recognition (GREC)},
  year		= 2003,
  pages		= {212--221},
  editor	= {Josep Llad\'os},
  address	= {Computer Vision Center, UAB, Barcelona, Spain},
  month		= jul,
  abstract	= {This paper presents a method to recognize the type of a
		  document when a database of models (document types) is
		  given. For instance, when all documents are forms and
		  when we know all the different types of forms, we want to be
		  able to assign to an input document its type of form. To
		  that aim, we define each model by a set of characteristics
		  whose nature can vary from one to another. For instance, a
		  characteristic can be having a flower-shaped logo on
		  top-left as well as having about 12pt fonts. This paper
		  does not intend to explain how to extract such knowledge
		  from documents but it describes how to use such information
		  to decide what the type of a given document is when
		  different document types are described by
		  characteristics.}
}