By Topic

Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Date 23-26 Sept. 2007

Go

Filter Results

Displaying Results 1 - 25 of 154
  • Ninth International Conference on Document Analysis and Recognition - Cover

    Publication Year: 2007, Page(s): c1
    Request permission for commercial reuse | PDF file iconPDF (290 KB)
    Freely Available from IEEE
  • Ninth International Conference on Document Analysis and Recognition - Title page

    Publication Year: 2007, Page(s):i - iii
    Request permission for commercial reuse | PDF file iconPDF (80 KB)
    Freely Available from IEEE
  • Ninth International Conference on Document Analysis and Recognition - Copyright

    Publication Year: 2007, Page(s): iv
    Request permission for commercial reuse | PDF file iconPDF (46 KB)
    Freely Available from IEEE
  • Ninth International Conference on Document Analysis and Recognition - Table of contents

    Publication Year: 2007, Page(s):v - ix
    Request permission for commercial reuse | PDF file iconPDF (186 KB)
    Freely Available from IEEE
  • Welcome from the Honourary Chair

    Publication Year: 2007, Page(s): x
    Request permission for commercial reuse | PDF file iconPDF (21 KB) | HTML iconHTML
    Freely Available from IEEE
  • Welcome from the General Chairs

    Publication Year: 2007, Page(s): xi
    Request permission for commercial reuse | PDF file iconPDF (21 KB) | HTML iconHTML
    Freely Available from IEEE
  • Message from the PC Chairs

    Publication Year: 2007, Page(s): xii
    Request permission for commercial reuse | PDF file iconPDF (21 KB) | HTML iconHTML
    Freely Available from IEEE
  • Conference Committees

    Publication Year: 2007, Page(s): xiii
    Request permission for commercial reuse | PDF file iconPDF (22 KB)
    Freely Available from IEEE
  • list-reviewer

    Publication Year: 2007, Page(s): xiv
    Request permission for commercial reuse | PDF file iconPDF (28 KB)
    Freely Available from IEEE
  • Assessing and Improving the Quality of Document Images Acquired with Portable Digital Cameras

    Publication Year: 2007, Page(s):569 - 573
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (618 KB) | HTML iconHTML

    Professionals and students of many different areas start to use portable digital cameras to take photos of documents, instead of photocopying them. This article analyses the quality of such documents for optical character recognition and proposes ways of improving their transcription and readability. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Multi-Stage Strategy to Perspective Rectification for Mobile Phone Camera-Based Document Images

    Publication Year: 2007, Page(s):574 - 578
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2132 KB) | HTML iconHTML

    Document images captured by a mobile phone camera often have perspective distortions. Efficiency and accuracy are two important issues in designing a rectification system for such perspective documents. In this paper, we propose a new perspective rectification system based on vanishing point detection. This system achieves both the desired efficiency and accuracy using a multi-stage strategy: at t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Degraded Character Recognition by Complementary Classifiers Combination

    Publication Year: 2007, Page(s):579 - 583
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (222 KB) | HTML iconHTML

    Character degradation is a big problem for machine printed character recognition. Two main reasons for degradation are extrinsic image degradation such as blurring and low image dimension, and intrinsic degradation caused by font variations. A recognition method that combines two complementary classifiers is proposed in this paper. The local feature based classifier extracts the local contour dire... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Middle Zone Component Extraction and Recognition of Telugu Document Image

    Publication Year: 2007, Page(s):584 - 588
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (375 KB) | HTML iconHTML

    Telugu is one of the ancient languages of South India. It has a complex orthography with a large number of distinct character shapes composed of simple and compound characters. The work reported in literature till the recent period is based on the connected component approach. Less attention is observed on the generalized character model and its application in the OCR development. Script syllable ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An SVM-Based High-accurate Recognition Approach for Handwritten Numerals by Using Difference Features

    Publication Year: 2007, Page(s):589 - 593
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (225 KB) | HTML iconHTML

    Handwritten numeral recognition is an important pattern recognition task. It can be widely used in various domains, e.g., bank money recognition, which requires a very high recognition rate. As a state-of-the-art classifier, support vector machine (SVM), has been extensively used in this area. Typically, SVM is trained in a batch model, i.e., all data points are simultaneously input for training t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Retrieval of Handwritten Lines in Historical Documents

    Publication Year: 2007, Page(s):594 - 598
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (183 KB) | HTML iconHTML

    This study describes methods for the retrieval of handwritten lines of text in a historical administrative collection. The goal is to develop generic methods for bootstrapping the retrieval system from a tabula rasa starting condition, i.e., the virtual absence of labeled samples. By exploiting the currently available computing power and the fact that computation takes place off line, it should be... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Text Line Detection in Unconstrained Handwritten Documents Using a Block-Based Hough Transform Approach

    Publication Year: 2007, Page(s):599 - 603
    Cited by:  Papers (15)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (508 KB) | HTML iconHTML

    In this paper we present a new text line detection method for unconstrained handwritten documents. The proposed technique is based on a strategy that consists of three distinct steps. The first step includes image binarization and enhancement, connected component extraction and average character height estimation. In the second step, a block-based Hough transform is used for the detection of poten... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Systematic Multi-Path HMM Topology Design for Online Handwriting Recognition of East Asian Characters

    Publication Year: 2007, Page(s):604 - 608
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (768 KB) | HTML iconHTML

    This paper presents a systematic multi-path HMM topology design algorithm to better model online handwriting of East Asian characters. This data-driven algorithm solves three key problems in HMM topology design. First, HMM path number determination is formalized as a clustering problem using subsequence direction histogram vector (SDHV) as feature of both writing order and style. Second, curvature... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Bibliographic Meta-Data Extraction Using Probabilistic Finite State Transducers

    Publication Year: 2007, Page(s):609 - 613
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (153 KB) | HTML iconHTML

    We present the application of probabilistic finite state transducers to the task of bibliographic meta-data extraction from scientific references. By using the transducer approach, which is often applied successfully in computational linguistics, we obtain a trainable and modular framework. This results in simplicity, flexibility, and easy adaptability to changing requirements. An evaluation on th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Perceptive Vision for Headline Localisation in Bangla Handwritten Text Recognition

    Publication Year: 2007, Page(s):614 - 618
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (484 KB) | HTML iconHTML

    In this paper, we propose to give tools for Bangla handwriting recognition. We present a mechanism to segment documents into text lines and words, and more specifically to detect headline position in each word. Indeed, this headline is an horizontal line on the upper part of most of characters, which is characteristic of Bangla writing. Its localisation is a new approach that can improve text reco... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Segmentation of Text and Graphics from Document Images

    Publication Year: 2007, Page(s):619 - 623
    Cited by:  Papers (7)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (250 KB) | HTML iconHTML

    Text, graphics and half-tones are the major constituents of any document page. While half-tone can be characterised by its inherent intensity variation, text and graphics share common characteristics except difference in spatial distribution. The success of document image analysis systems depends on the proper segmentation. The success of document image analysis systems depends on the proper segme... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Layout Based Information Extraction from HTML Documents

    Publication Year: 2007, Page(s):624 - 628
    Cited by:  Papers (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (320 KB) | HTML iconHTML

    We propose a method of information extraction from HTML documents based on modelling the visual information in the document. A page segmentation algorithm is used for detecting the document layout and subsequently, the extraction process is based on the analysis of mutual positions of the detected blocks and their visual features. This approach is more robust that the traditional DOM-based methods... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An Overview of the Tesseract OCR Engine

    Publication Year: 2007, Page(s):629 - 633
    Cited by:  Papers (80)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (163 KB) | HTML iconHTML

    The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Symbol Recognition Using a 2-class Hierarchical Model of Choquet Integrals

    Publication Year: 2007, Page(s):634 - 638
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (197 KB) | HTML iconHTML

    We present an approach allowing to automatically extract a suitable set of soft output classifiers and to aggregate them to provide a global decision using the Choquet integral. This approach relies on two key points. A learning algorithm based on a 2-class model is performed to define a new set of decisions rules assuming to be experts dedicated to recognize one class from another one. All the as... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Deriving Symbol Dependent Edit Weights for Text Correction_The Use of Error Dictionaries

    Publication Year: 2007, Page(s):639 - 643
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (94 KB) | HTML iconHTML

    Most systems for correcting errors in texts make use of specific word distance measures such as the Levenshtein distance. In many experiments it has been shown that correction accuracy is improved when using edit weights that depend on the particular symbols of the edit operation. However, most proposed approaches so far rely on high amounts of training data where errors and their corrections are ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Arabic Handwriting Recognition Using Variable Duration HMM

    Publication Year: 2007, Page(s):644 - 648
    Cited by:  Papers (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (229 KB) | HTML iconHTML

    The present paper describes a complete system for the recognition of unconstrained handwritten Arabic words using over-segmentation of characters and variable duration hidden Markov model (VDHMM). First, a segmentation algorithm is used to translate the 2-D image into 1-D sequence of sub-character symbols. This sequence of symbols is modeled by the VDHMM. The shape information of character and sub... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.