Share Email Print

Proceedings Paper

Document boundary determination using structural and lexical analysis
Author(s): Kazem Taghva; Marc-Allen Cartright
Format Member Price Non-Member Price
PDF $17.00 $21.00

Paper Abstract

The document boundary determination problem is the process of identifying individual documents in a stack of papers. In this paper, we report on a classification system for automation of this process. The system employs features based on document structure and lexical content. We also report on experimental results to support the effectiveness of this system.

Paper Details

Date Published: 19 January 2009
PDF: 5 pages
Proc. SPIE 7247, Document Recognition and Retrieval XVI, 724704 (19 January 2009); doi: 10.1117/12.805384
Show Author Affiliations
Kazem Taghva, Univ. of Nevada, Las Vegas (United States)
Marc-Allen Cartright, Univ. of Nevada, Las Vegas (United States)

Published in SPIE Proceedings Vol. 7247:
Document Recognition and Retrieval XVI
Kathrin Berkner; Laurence Likforman-Sulem, Editor(s)

© SPIE. Terms of Use
Back to Top
Sign in to read the full article
Create a free SPIE account to get access to
premium articles and original research
Forgot your username?