![]() |
Algorithm used: Shyamosree Pal, Partha Bhowmick, Arindam Biswas, and Bhargab B. Bhattacharya. Understanding Digital Documents Using Gestalt Properties of Isothetic Components, International Journal of Digital Library Systems, 2010 (accepted). Left: input Below: output (pink isothetic polygons are recognized as graphics; bluish isothetic polygons as text.) |
We have shown how Gestalt properties
can be used for identifying
various components in a document image.
The idea that our mind makes a holistic approach to vision rather than a disintegrated approach
has been shown to be effective for document analysis also.
Since the major constituent components (textual or non-textual) in a document page
are arranged in a rectilinear fashion, we first make an
isothetic decomposition
of different components in a document page. |
![]() |
![]() (a) Input document page. |
![]() (b) Set of isothetic polygons {Pj(12): j=1,...,12}. |
![]() (c) The geometric feature set corresponding to the polygons in P(12). |
![]() (d) Input subset. |
![]() (e) Set of isothetic polygons for g=6. |
![]() (f) Set of isothetic polygons for g=2. |
![]() (g) Set of isothetic polygons for g=1. |
(d-g): Isothetic covers of the components lying inside the polygon P3(12) corresponding to the component of flowchart and their sub-components for different grid sizes. The vertex centers of polygons are shown as red-colored '+'. |
![]() |
![]() |
![]() |
![]() |
![]() |
|
![]() |
|
![]() |
|
![]() |
![]() | |
![]() | |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |