By Topic

Proceedings IEEE Advances in Digital Libraries 2000

24-24 May 2000

Filter Results

Displaying Results 1 - 23 of 23
  • Finding themes in Medline documents - probabilistic similarity search

    Publication Year: 2000, Page(s):183 - 192
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (276 KB)

    Large on-line document databases, such as Medine, pose a major challenge of retrieving the few documents most relevant to the user's needs, while multimizing the return rate of nonrelevant documents. Retrieval of documents similar to a user provided example document is a promising query paradigm towards meeting this goal. We present a new theme-based probabilistic approach for finding documents re... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 2000, Page(s): 241
    Request permission for commercial reuse | PDF file iconPDF (57 KB)
    Freely Available from IEEE
  • Proceedings IEEE Advances in Digital Libraries 2000

    Publication Year: 2000
    Request permission for commercial reuse | PDF file iconPDF (178 KB)
    Freely Available from IEEE
  • Building a digital library of newspaper clippings: the LAURIN project

    Publication Year: 2000, Page(s):15 - 26
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (280 KB)

    The field of digital libraries has attracted a lot of research during the last years. However no project has specifically taken into account the problem of building a digital library of newspaper clippings. It is well known that a great deal of cultural knowledge is stored in the newspapers of yesterday. Since newspapers are not always easily accessible, special clipping archives were created in t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Clustering and identifying temporal trends in document databases

    Publication Year: 2000, Page(s):173 - 182
    Cited by:  Papers (15)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (220 KB)

    We introduce a simple and efficient method for clustering and identifying temporal trends in hyper-linked document databases. Our method can scale to large datasets because it exploits the underlying regularity often found in hyper-linked document databases. Because of this scalability, we can use our method to study the temporal trends of individual clusters in a statistically meaningful manner. ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Searching and browsing collections of structural information

    Publication Year: 2000, Page(s):141 - 150
    Cited by:  Papers (5)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (312 KB)

    This paper proposes a new approach to querying collections of structured textual information such as SGML/XML documents. Knowledge about the structure of documents is an additional resource that should be exploited during retrieval since the semantics of the different textual objects can be used to specify an information need much more precisely. However the traditional probabilistic retrieval mod... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Electronic books in digital libraries

    Publication Year: 2000, Page(s):5 - 14
    Cited by:  Papers (1)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (200 KB)

    An electronic book is an application with a multimedia database of instructional resources, which include hyperlinked text, instructor's audio/video clips, slides, animation, still images, etc. As well as content-based information about these data, and metadata such as annotations, tags, and cross-referencing information. Electronic books in the Internet or on CDs today are not easy to learn from.... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Morphing towards interoperable catalogues

    Publication Year: 2000, Page(s):105 - 115
    Cited by:  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (248 KB)

    The World Wide Web (WWW) provides geo-spatial data custodians with an environment in which to advertise their datasets via on-line catalogue systems. The geo-spatial community is diverse and its members are spread around the world. The user's task of searching for and locating datasets of interest, would be greatly simplified if a single point of entry could search for datasets from multiple, inde... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Effects of word recognition errors in spoken query processing

    Publication Year: 2000, Page(s):39 - 47
    Cited by:  Papers (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (212 KB)

    The effects of word recognition errors (WRE) in spoken document retrieval have been well studied and well reported in recent information retrieval (IR) literature. Much less experimental work has been devoted to studying the effects of WRE in spoken query processing in IR. It is easy to hypothesize that given the typical length of the user query, the effects of WRE in spoken queries on the perform... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Detecting data and schema changes in scientific documents

    Publication Year: 2000, Page(s):160 - 170
    Cited by:  Papers (1)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (284 KB)

    Data stored in a data warehouse must be kept consistent and up-to-date with respect to the underlying information sources. By providing the capability to identify, categorize and detect changes in these sources, only the modified data needs to be transferred and entered into the warehouse. Another alternative, periodically reloading from scratch, is obviously inefficient. When the schema of an inf... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • On extending the XML engine with query-processing capabilities

    Publication Year: 2000, Page(s):127 - 138
    Cited by:  Papers (1)  |  Patents (18)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (248 KB)

    We study how to efficiently evaluate queries over XML documents whose representation is according to the XML specification, i.e., XML files. The software architecture is as follows: the XML engine (i.e., XML parser) makes the structure of the documents explicit. The query processor operates directly on the output of the XML engine. We see two basic alternatives of how such a query processor operat... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Using regular tree automata as XML schemas

    Publication Year: 2000, Page(s):89 - 98
    Cited by:  Papers (8)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (244 KB)

    We address the problem of tight XML schemas and propose regular tree automata to model XML data. We show that the tree automata model is more powerful than the XML DTDs and is closed under main algebraic operations. We introduce the XML query algebra based on the tree automata model, and discuss the query optimization and query pruning techniques. Finally we show the conversion of tree automata sc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • BlueView: virtual document servers for digital libraries

    Publication Year: 2000, Page(s):207 - 217
    Cited by:  Papers (1)  |  Patents (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (212 KB)

    In the BlueView project, digital library services are developed and partially implemented based on the architecture of virtual document servers. Using standard tools like full-text databases or information retrieval systems, object-relational database management systems, and replication and caching services, different heterogeneous local document servers have been integrated into one virtual serve... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Invading the fortress: how to besiege reinforced information bunkers

    Publication Year: 2000, Page(s):27 - 35
    Cited by:  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (208 KB)

    Information retrieval (IR) research has been very active over the last decades to develop approaches that allow machine indexing to significantly improve indexing practice in libraries. However due to practical limitations, this technology, is not often used in large-scale libraries. We propose a mix of existing technologies and new ideas to enable traditional libraries to adopt modern IR technolo... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • An integrated metamodel for knowledge representation in geolibraries

    Publication Year: 2000, Page(s):151 - 159
    Cited by:  Papers (1)  |  Patents (3)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (184 KB)

    Geolibraries containing large volumes of complex spatiotemporal data occur in a variety of application domains. Considerable effort has gone into the design of metadata standards to enable exchange and sharing of georeferenced data sets. Strategies for federating information stores have to deal with integration at several levels, syntax, structure as well as semantics. This paper describes a gener... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A view system for CORBA-wrapped data sources

    Publication Year: 2000, Page(s):228 - 237
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (200 KB)

    Integration of data from disparate, heterogeneous and autonomous data sources is a common problem of different domains, including molecular biology. Mediator-based architectures and views have been used to integrate data and restructure data representation. CORBA facilitates data integration by providing language, platform and network transparency. In CORBA, data itself can be modelled in IDL, ess... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Audio structuring and personalized retrieval using ontologies

    Publication Year: 2000, Page(s):116 - 126
    Cited by:  Papers (7)  |  Patents (2)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (296 KB)

    The goal of this work is to improve the accuracy (precision and recall) and communication effectiveness of a database system response to a user information request, by utilizing a domain-specific ontology. This ontology is employed along with user profile information, to automatically select and deliver appropriate information units from a database. Specifically, the focus is on multimedia audio d... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Towards supporting user-defined hypertext contexts in Web searching

    Publication Year: 2000, Page(s):48 - 57
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (276 KB)

    A hypertext context is a generic high-level hypermedia structure that groups together a set of nodes and links into a logical whole. It can be used as boundaries of searches in large hypertext systems and helps to improve the results of searches concerning specific topics or subject domains. On the current Web, there exist no hypertext contexts that are explicitly represented and can be recognized... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Boolean query processing with a result cache in mediator systems

    Publication Year: 2000, Page(s):218 - 227
    Cited by:  Patents (4)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (352 KB)

    A mediator system is a kind of meta-search engine that provides a seamlessly integrated search service for diverse search engines (collections). Since collections of a mediator system are geographically distributed, its performance is mainly influenced by data transmission time between the mediator and its collections. Existing mediator systems employ a result cache that is composed of the results... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Efficient similarity search in digital libraries

    Publication Year: 2000, Page(s):193 - 199
    Cited by:  Papers (2)  |  Patents (8)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (236 KB)

    Digital libraries are a core information technology. When the stored data is complex, e.g. high-resolution images or molecular protein structures, simple query types such as exact match query are hardly applicable. In such environments similarity queries, particularly range queries and k-nearest neighbor queries, are important query types. Numerous approaches have been proposed for the processing ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • WebSSQL-a query language for multimedia Web documents

    Publication Year: 2000, Page(s):58 - 67
    Cited by:  Papers (1)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (320 KB)

    We describe an SQL-like query language-WebSSQL-for retrieving desired Web pages. WebSSQL has several unique features. First, WebSSQL assumes that each Web page is a multimedia document consisting of structured data, text data and possibly image data. Second, WebSSQL treats each page as a node in a directed graph composed of many Web pages and links among them. Third, WebSSQL is similarity-based me... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Mobile agents and the SARA digital library

    Publication Year: 2000, Page(s):71 - 77
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (312 KB)

    Remote-sensing data about the Earth's environment is being created at an ever-increasing rate and distributed among heterogeneous remote sites. Traditional models of distributed computing are inadequate to support such complex applications, which generally involve a large quantity of data. We explore an approach based on mobile agent techniques for autonomous data processing and information discov... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Copy detection systems for digital documents

    Publication Year: 2000, Page(s):78 - 88
    Cited by:  Papers (5)  |  Patents (5)
    Request permission for commercial reuse | Click to expandAbstract | PDF file iconPDF (908 KB)

    Partial or total duplication of document content is common to large digital libraries. We present a copy detection system to automate the detection of application in digital documents. The system we present is sentence-based and makes three contributions: it proposes an intuitive definition of similarity between documents; it produces the distribution of overlap that exists between overlapping doc... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.