By Topic

Document Analysis Systems, 2008. DAS '08. The Eighth IAPR International Workshop on

Date 16-19 Sept. 2008

Filter Results

Displaying Results 1 - 25 of 93
  • [Front cover]

    Publication Year: 2008, Page(s): C1
    Request permission for commercial reuse | PDF file iconPDF (280 KB)
    Freely Available from IEEE
  • [Title page i]

    Publication Year: 2008, Page(s): i
    Request permission for commercial reuse | PDF file iconPDF (28 KB)
    Freely Available from IEEE
  • [Title page iii]

    Publication Year: 2008, Page(s): iii
    Request permission for commercial reuse | PDF file iconPDF (64 KB)
    Freely Available from IEEE
  • [Copyright notice]

    Publication Year: 2008, Page(s): iv
    Request permission for commercial reuse | PDF file iconPDF (44 KB)
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2008, Page(s):v - xi
    Request permission for commercial reuse | PDF file iconPDF (122 KB)
    Freely Available from IEEE
  • Foreword

    Publication Year: 2008, Page(s):xii - xiii
    Request permission for commercial reuse | PDF file iconPDF (94 KB) | HTML iconHTML
    Freely Available from IEEE
  • Conference organization

    Publication Year: 2008, Page(s): xiv
    Request permission for commercial reuse | PDF file iconPDF (121 KB)
    Freely Available from IEEE
  • list-reviewer

    Publication Year: 2008, Page(s):xv - xvi
    Request permission for commercial reuse | PDF file iconPDF (113 KB)
    Freely Available from IEEE
  • Sponsors

    Publication Year: 2008, Page(s): xvii
    Request permission for commercial reuse | PDF file iconPDF (75 KB)
    Freely Available from IEEE
  • Extraction of Text Objects in Video Documents: Recent Progress

    Publication Year: 2008, Page(s):5 - 17
    Cited by:  Papers (74)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (449 KB) | HTML iconHTML

    Text extraction in video documents, as an important research field of content-based information indexing and retrieval, has been developing rapidly since 1990s. This has led to much progress in text extraction, performance evaluation, and related applications. By reviewing the approaches proposed during the past five years, this paper introduces the progress made in this area and discusses promisi... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Hilbert Warping Algorithm for Recognizing Characters from Moving Camera

    Publication Year: 2008, Page(s):21 - 27
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (488 KB) | HTML iconHTML

    We present a method for recognizing characters from image sequences captured by moving camera. In the proposed method, the sequence of the captured images is compared with those of reference character patterns using the concept of analytic signal. Since the captured image sequence can be nonlinearly warped along the time axis due to the movement of a hand-held camera, phase synchronization of two ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Writer Verification of Arabic Handwriting

    Publication Year: 2008, Page(s):28 - 34
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (951 KB) | HTML iconHTML

    Expanding on an earlier study to objectively validate the hypothesis that handwriting is individualistic, we extend the study to include handwriting in the Arabic script. Handwriting samples from twelve native speakers of Arabic were obtained. Analyzing differences in handwriting was done by using computer algorithms for extracting features from scanned images of handwriting. Attributes characteri... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Robust System to Detect and Localize Texts in Natural Scene Images

    Publication Year: 2008, Page(s):35 - 42
    Cited by:  Papers (23)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (848 KB) | HTML iconHTML

    In this paper, we present a robust system to accurately detect and localize texts in natural scene images. For text detection, a region-based method utilizing multiple features and cascade AdaBoost classifier is adopted. For text localization, a window grouping method integrating text line competition analysis is used to generate text lines. Then within each text line, local binarization is used t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An image based watermark string detection system for document security checking

    Publication Year: 2008, Page(s):43 - 50
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (8 KB)

    Document security is a very important topic in information management. In this paper, an image based watermark string detection system is proposed to detect the documents that include printed keyword strings as the watermark in the background. Therefore, the disclosure of the sensitive documents can be monitored automatically. Since the documents are represented in image format, the watermark stri... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Feature Extraction for Document Image Segmentation by pLSA Model

    Publication Year: 2008, Page(s):53 - 60
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2939 KB) | HTML iconHTML

    In this paper, we propose a method for document image segmentation based on pLSA (probabilistic latent semantic analysis) model. The pLSA model is originally developed for topic discovery in text analysis using "bag-of-words" document representation. The model is useful for image analysis by "bag-of-visual words" image representation. The performance of the method depends on the visual vocabulary ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Grouping Text Lines in Online Handwritten Japanese Documents by Combining Temporal and Spatial Information

    Publication Year: 2008, Page(s):61 - 68
    Cited by:  Papers (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (8 KB)

    We present an effective approach for grouping text lines in online handwritten Japanese documents by combining temporal and spatial information. Initially, strokes are grouped into text line strings according to off-stroke distances. Each text line string is segmented into text lines by dynamic programming (DP) optimizing a cost function trained by the minimum classification error (MCE) method. Ov... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Accurate Alignment of Double-Sided Manuscripts for Bleed-Through Removal

    Publication Year: 2008, Page(s):69 - 75
    Cited by:  Papers (23)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2536 KB) | HTML iconHTML

    Double-sided manuscripts are often degraded by bleed-through interference. Such degradation must be corrected to facilitate human perception and machine recognition. Most approaches to bleed-through removal rely on perfect alignment between the recto and verso images of a document. This paper presents a two-stage hierarchical alignment technique that can efficiently and accurately align the two si... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Super-Resolution of Text Images Using Edge-Directed Tangent Field

    Publication Year: 2008, Page(s):76 - 83
    Cited by:  Papers (9)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1269 KB) | HTML iconHTML

    This paper presents an edge-directed super-resolution algorithm for document images without using any training set. This technique creates an image with smooth regions in both the foreground and the background, while allowing sharp discontinuities across and smoothness along the edges. Our method preserves sharp corners in text images by using the local edge direction, which is computed first by e... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Attention-Based Document Classifier Learning

    Publication Year: 2008, Page(s):87 - 94
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1271 KB) | HTML iconHTML

    We describe an approach for creating precise personalized document classifiers based on the user's attention. The general idea is to observe which parts of a document the user was interested in just before he or she comes to a classification decision. Having information about this manual classification decision and the document parts the decision was based on, we can learn precise classifiers. For... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Categorization of On-Line Handwritten Documents

    Publication Year: 2008, Page(s):95 - 102
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (588 KB) | HTML iconHTML

    With the growth of on-line handwriting technologies, managing facilities for handwritten documents, such as retrieval of documents by topic, are required. These documents can contain graphics, equations or text for instance. This work reports experiments on categorization of on-line handwritten documents based on their textual contents. We assume that handwritten text blocks have been extracted fr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Combining Multiple Methods for Book Indexing

    Publication Year: 2008, Page(s):103 - 110
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (469 KB) | HTML iconHTML

    In this paper we are interested in the problem of book splitting or more generally of indexing the logical parts of a document. This involves determining the boundaries of these parts as well as their label. We report here on the combined use of generic methods published in previous papers. We discuss the effect of combining several methods, also from a quality assurance perspective. Our experimen... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Automated OCR Ground Truth Generation

    Publication Year: 2008, Page(s):111 - 117
    Cited by:  Papers (5)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (1830 KB) | HTML iconHTML

    Most optical character recognition (OCR) systems need to be trained and tested on the symbols that are to be recognized. Therefore, ground truth data is needed. This data consists of character images together with their ASCII code. Among the approaches for generating ground truth of real world data, one promising technique is to use electronic version of the scanned documents. Using an alignment m... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Digital Renaissance Making Archives, Sharing Wisdoms and Creating Values

    Publication Year: 2008, Page(s):121 - 132
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (60509 KB) | HTML iconHTML

    This paper highlights the DIS (digital image system) technology and its application projects. The "Digital Ambassadorship project" between Italy and Japan brought the exhibition of the "Mind of Leonardo" to Japan for the first time and realized 3rd Italy-Japan real-time symposium for "Primavera Italiana 2007 in Japan," resulting in a great success. This will show us new directions of th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • State: A Multimodal Assisted Text-Transcription System for Ancient Documents

    Publication Year: 2008, Page(s):135 - 142
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (2106 KB) | HTML iconHTML

    We present a complete assisted transcription system for ancient documents: State. The system consists of two applications: a pen-based, interactive application to assist humans in transcribing ancient documents and a recognition engine which offers automatic transcriptions via a web service. The interaction model and the recognition algorithm employed in the current version of State are presented.... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Authorship Identification of Ukiyoe by Using Rakkan Image

    Publication Year: 2008, Page(s):143 - 150
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (826 KB) | HTML iconHTML

    This paper describes a method of identifying authorship of Ukiyoe prints by using Rakkan images found in the prints. A weighted direction index histogram method has been used to create the feature vector for Rakkan character analysis. Also the Pseudo Mahalanobis distances were used to judge distances between dictionary templates and test data. The method includes binarization of Rakkan images whic... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.