Statistical Pattern and Image Analysis Area / ISTL / PARC

Extracting content from document images
Recognition, compression, & retrieval
Page layout and printed text
Model-directed recognition
Formal probabilistic models
Automatic inference of models
 
Recent publications by SPIA members

Reverse chronological order, then alphabetical by first author.

Availability and distribution are subject to the terms of the respective copyright agreements. In particular, some items may not be available until their date of publication, and others may not be available for general download at all. If you are interested in an item for which no download link appears, please contact the author; copyright agreements often allow limited personal distribution.

E. Gaussier, C. Goutte, K. Popat, and F. Chen, ``A hierarchical model for clustering and categorising documents,'' in Proceedings of the 24th BCS-IRSG European Colloquium on IR Research, March 2002. To appear.
BibTeX entry
P. Sarkar, ``An iterative algorithm for optimal style-conscious field classification,'' in Proceedings of the sixteenth ICPR, (Quebec City), IEEE Computer Society Press, 2002. Submitted for review.
BibTeX entry, Available here
P. Sarkar, H. S. Baird, and J. Henderson, ``Triage of OCR output using 'confidence' scores,'' in Proceedings of SPIE/IS&T 2002 Document Recognition & Retrieval IX Conf. (DR&R IX), (San Jose, California, USA), January 20-25, 2002. Accepted for publication.
BibTeX entry, Available here
G. E. Kopec, M. R. Said, and K. Popat, ``N-gram language models for document image decoding,'' in Proceedings of IS&T/SPIE Electronic Imaging 2002: Document Recognition and Retrieval IX, January 2002.
BibTeX entry, PDF, PS
K. Toutanova, F. Chen, K. Popat, and T. Hofmann, ``Text classification in a hierarchical mixture model for small training sets,'' in Proceedings of the ACM Conference on Information and Knowledge Management (CIKM), November 2001.
BibTeX entry, PDF, PS
D. S. Bloomberg, T. P. Minka, and K. Popat, ``Document image decoding using iterated complete path search with subsampled heuristic scoring,'' in Proceedings of the IAPR 2001 International Conference Document Analysis and Recognition (ICDAR 2001), September 2001.
BibTeX entry, PDF, PS
A. L. Coates, H. S. Baird, and R. J. Fateman, ``Pessimal print: a reverse Turing test,'' in Proceedings of the IAPR 2001 International Conference Document Analysis and Recognition (ICDAR 2001), September 2001.
BibTeX entry
P. Sarkar and G. Nagy, ``Style consistency in isogenous patterns,'' in Proceedings of the Sixth ICDAR, (Seattle, USA), pp. 1169-1174, September 2001.
BibTeX entry, Available here
T. M. Breuel, ``Implicit manipulation of constraint sets for geometric matching under translation and rotation,'' in Scandinavian Conference on Image Analysis (SCIA 2001), (Bergen, Norway), June 2001.
BibTeX entry
T. M. Breuel, ``Classification by probabilistic clustering,'' in Proceedings of the 2001 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), (Salt Lake City, Utah), IEEE, May 2001.
BibTeX entry
K. Popat, ``Decoding of text lines in grayscale document images,'' in Proceedings of the 2001 International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), (Salt Lake City, Utah), IEEE, May 2001.
BibTeX entry, PDF, PS
T. M. Breuel and K. Popat, ``Recent work in the document image decoding group at Xerox PARC,'' in Proceedings of the DOD-sponsored Symposium on Document Image Understanding Technology (SDIUT 2001), April 2001.
BibTeX entry
T. M. Breuel, ``Modeling the sample distribution for clustering by OCR,'' in Proceedings of IS&T/SPIE Electronic Imaging 2001: Document Recognition and Retrieval VIII, January 2001.
BibTeX entry, PDF, PS
T. P. Minka, D. S. Bloomberg, and K. Popat, ``Document image decoding using the iterated complete path heuristic,'' in Proceedings of IS&T/SPIE Electronic Imaging 2001: Document Recognition and Retrieval VIII, January 2001.
BibTeX entry, PDF, PS
K. Popat, ``Document image compression by adaptive-offset quantization,'' in Proceedings of IS&T/SPIE Electronic Imaging 2001: Document Recognition and Retrieval VIII, January 2001.
BibTeX entry, PDF, PS
K. Popat, D. Greene, J. Romberg, and D. S. Bloomberg, ``Adding linguistic constraints to document image decoding: Comparing the iterated complete path and stack algorithms,'' in Proceedings of IS&T/SPIE Electronic Imaging 2001: Document Recognition and Retrieval VIII, January 2001.
BibTeX entry, PDF, PS
H. S. Baird, ``State of the art of document image degradation modeling,'' in Proceedings of the 4th IAPR Workshop on Document Analysis Systems (DAS 2000), (Rio de Janeiro), December 2000. Invited plenary talk.
BibTeX entry, PDF, PS
T. M. Breuel, ``Layout analysis by exploring the space of segmentation parameters,'' in Proceedings of the 4th IAPR Workshop on Document Analysis Systems (DAS 2000), December 2000.
BibTeX entry, PDF, PS
K. Popat, D. Bloomberg, and D. Greene, ``Adding linguistic constraints to document image decoding,'' in Proceedings of the 4th IAPR Workshop on Document Analysis Systems (DAS 2000), December 2000.
BibTeX entry, PDF, PS
T. Kanungo, R. M. Haralick, H. S. Baird, W. Stuetzle, and D. Madigan, ``A statistical, nonparametric methodology for document degradation model validation,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 22, pp. 1209-1223, November 2000.
BibTeX entry
T. M. Breuel, ``Handwriting recognition on US census forms,'' in Mathematical Morphology and its applications to image and signal processing: Proceedings of the Fifth International Symposium on Mathematical Morphology (ISMM 2000), June 2000. Invited Plenary Talk.
BibTeX entry
K. Popat and D. S. Bloomberg, ``Two-stage lossy/lossless compression of grayscale document images,'' in Mathematical Morphology and its applications to image and signal processing: Proceedings of the Fifth International Symposium on Mathematical Morphology (ISMM 2000), June 2000.
BibTeX entry, PDF, PS
H. S. Baird and F. Chen, ``Document image retrieval.'' Special Issue of Information Retrieval journal, Vol. 2, Nos. 2/3, May 2000.
BibTeX entry
P. Sarkar and G. Nagy, ``Classification of style-constrained pattern-fields,'' in Proceedings of the fifteenth ICPR, (Barcelona), pp. 859-862, IEEE Computer Society Press, 2000.
BibTeX entry, Available here
P. Sarkar, Style consistency in pattern fields. PhD thesis, Rensselaer Polytechnic Institute, Troy, NY, 2000.
BibTeX entry, Available here
T. Berger, P. Chou, M. Effros, N. Farvardin, T. Fischer, W. R. Gardner, R. M. Gray, N. S. Jayant, R. Laroia, U. Madhow, M. W. Marcellin, J. W. Modestino, D. L. Neuhoff, A. Orlitsky, K. Popat, K. Ramchandran, J. A. Storer, V. Vaishampayan, K. Zeger, and Z. Zhang, ``Workshop report: NSF sponsored workshop on joint source-channel coding,'' tech. rep., California Institute of Technology, October 1999.
BibTeX entry, PDF, PS
H. S. Baird, ``Document image quality: Making fine discriminations,'' in Proceedings of the IAPR 1999 International Conference on Document Analysis and Recognition (ICDAR 1999), (Bangalore, India), pp. 459-462, September 1999.
BibTeX entry, PDF, PS
P. Sarkar and G. Nagy, ``Heeding more than the top template,'' in Proceedings of the Fifth International Conference on Document Analysis and Recognition, (Bangalore, India), September 1999.
BibTeX entry, Available here
H. S. Baird, ``Model-directed document image analysis,'' in Proceedings of the DOD-sponsored Symposium on Document Image Understanding Technology (SDIUT 1999), (Annapolis, MD), April 1999. Invited published talk.
BibTeX entry, PDF, PS
G. Nagy and P. Sarkar, ``Modeling statistical dependence in pattern classification,'' in Proceedings of the IAPR Workshop on Statistical Methods for Image Processing, (Uppsala), 1999.
BibTeX entry
J. Kanai and H. S. Baird, ``Document image understanding and retrieval.'' Special Issue of Computer Vision and Image Understanding journal, Vol. 70, No. 3, June 1998.
BibTeX entry
T. K. Ho and H. S. Baird, ``Pattern classification with compact distribution maps,'' Computer Vision and Image Understanding, vol. 70, pp. 101-110, March 1998.
BibTeX entry
P. Sarkar, G. Nagy, J. Zhou, and D. Lopresti, ``Spatial sampling of printed patterns,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, pp. 344-351, March 1998.
BibTeX entry, Available here
G. Nagy, A. Samal, S. Seth, T. Fisher, E. Guthman, K. Kalafala, L. Li, P. Sarkar, and Y. Xu, ``A prototype for adaptive association of street names with streets on maps,'' in Graphics Recognition: Algorithms and Systems (K. Tombre and A. K. Chhabra, eds.), vol. 1389 of Springer Lecture Notes in Computer Science, pp. 302-313, 1998.
BibTeX entry
T. K. Ho and H. S. Baird, ``Large-scale simulation studies in image pattern recognition,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, pp. 1067-1079, October 1997.
BibTeX entry
J. Y. Zhou, D. Lopresti, P. Sarkar, and G. Nagy, ``Spatial sampling effects on scanned 2-D patterns,'' in Advances in Visual Forms Analysis (C. Arcelli, L. P. Cordella, and G. S. di Baja, eds.), Singapore: World Scientific, 1997.
BibTeX entry, Available here
D. Lopresti, J. Zhou, G. Nagy, and P. Sarkar, ``Spatial sampling effects in optical character recognition,'' in Proceedings of the Third International Conference on Document Analysis and Recognition, pp. 309-314, 1995.
BibTeX entry, Available here
G. E. Kopec and P. A. Chou, ``Document image decoding using Markov source models,'' IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 16, pp. 602-617, June 1994.
BibTeX entry, PDF, PS
P. Sarkar, ``Random phase spatial sampling effects in digitized patterns,'' Master's thesis, Rensselaer Polytechnic Institute, 1994.
BibTeX entry
T. Kanungo, H. S. Baird, and R. M. Haralick, ``Performance evaluation: Theory, practice, and impact.'' Special Issue of Int'l J. on Document Analysis and Recognition. In Press, November 2001.
BibTeX entry