This repo is for developing a Docstrum algorithm presented by O’Gorman (1993).
This source code is built on top of the work by Chadoliver. Please find the original code from here (https://github.com/chadoliver/cosc428-structor).
This project aims at segmenting a document image into meaningful components. The domain of image is specified on historical machine-printed/hand-written document image.
- python 2.7
- Packages:
numpycv2
- Pre-processing Optional for vertical-line removal
- Blurring Bilateral Filtering
- Otsu's thresholding
- Morphological erosion & dilation
- Smoothing (Averaging)
- Static thresholding
- Nearest-Neighbor Clustering and Docstrum Plot
- Spacing and Orientation Estimation
- Determination of Text-lines
- Structural Block Determination
- Post-processing
- TBD
- TBD
O'Gorman, L., 1993. The document spectrum for page layout analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence, 15(11), pp.1162-1173. pdf.
@article{o1993document, title={The document spectrum for page layout analysis}, author={O'Gorman, Lawrence}, journal={IEEE Transactions on Pattern Analysis and Machine Intelligence}, volume={15}, number={11}, pages={1162--1173}, year={1993}, publisher={IEEE} } find . -name '.DS_Store' -type f -delete 