2017 ACM/IEEE Joint Conference on Digital Libraries (JCDL)

19-23 June 2017

Filter Results

Displaying Results 1 - 25 of 85
  • [Title page]

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | PDF file iconPDF (8028 KB)
    Freely Available from IEEE
  • [Copyright notice]

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | PDF file iconPDF (76 KB)
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2017, Page(s):1 - 6
    Request permission for reuse | PDF file iconPDF (107 KB)
    Freely Available from IEEE
  • Chairs’ Welcome

    Publication Year: 2017, Page(s):1 - 2
    Request permission for reuse | PDF file iconPDF (197 KB)
    Freely Available from IEEE
  • Conference Organization

    Publication Year: 2017, Page(s):1 - 6
    Request permission for reuse | PDF file iconPDF (154 KB)
    Freely Available from IEEE
  • Sponsors

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | PDF file iconPDF (166 KB)
    Freely Available from IEEE
  • Keynote Liz Lyons

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (243 KB)

    Provides an abstract of the keynote presentation and a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Keynote Ray Siemens

    Publication Year: 2017, Page(s):1 - 2
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (119 KB)

    Provides an abstract of the keynote presentation and a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Keynote Salvatore Mele

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (106 KB)

    Provides an abstract of the keynote presentation and a brief professional biography of the presenter. The complete presentation was not made available for publication as part of the conference proceedings. View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Keynote Bios

    Publication Year: 2017, Page(s): 1
    Request permission for reuse | PDF file iconPDF (50 KB)
    Freely Available from IEEE
  • Schedule

    Publication Year: 2017, Page(s):1 - 2
    Request permission for reuse | PDF file iconPDF (175 KB)
    Freely Available from IEEE
  • Author index

    Publication Year: 2017, Page(s):1 - 4
    Request permission for reuse | PDF file iconPDF (366 KB)
    Freely Available from IEEE
  • Technical Paper 
  • Archival Crawlers and JavaScript: Discover More Stuff but Crawl More Slowly

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (420 KB) | HTML iconHTML

    The web is today's primary publication medium, making web archiving an important activity for historical and analytical purposes. Web pages are increasingly interactive, resulting in pages that are correspondingly difficult to archive. JavaScript enables interactions that can potentially change the client-side state of a representation. We refer to representations that load embedded resources via ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Building and Querying Semantic Layers for Web Archives

    Publication Year: 2017, Page(s):1 - 10
    Cited by:  Papers (1)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (274 KB) | HTML iconHTML

    Web archiving is the process of collecting portions of the Web to ensure that the information is preserved for future exploitation. However, despite the increasing number of web archives worldwide, the absence of efficient and meaningful exploration methods still remains a major hurdle in the way of turning them into a usable and useful information source. In this paper, we focus on this problem a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • WikiM: Metapaths Based Wikification of Scientific Abstracts

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (1966 KB) | HTML iconHTML

    In order to disseminate the exponential extent of knowledge being produced in the form of scientific publications, it would be best to design mechanisms that connect it with already existing rich repository of concepts - the Wikipedia. Not only does it make scientific reading simple and easy (by connecting the involved concepts used in the scientific articles to their Wikipedia explanations) but a... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Descriptor-Invariant Fusion Architectures for Automatic Subject Indexing

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (313 KB) | HTML iconHTML

    Documents indexed with controlled vocabularies enable users of libraries to discover relevant documents, even across language barriers. Due to the rapid growth of scientific publications, digital libraries require automatic methods that index documents accurately, especially with regard to explicit or implicit concept drift, that is, with respect to new descriptor terms and new types of documents,... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Identifying Important Citations Using Contextual Information from Full Text

    Publication Year: 2017, Page(s):1 - 8
    Cited by:  Papers (1)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (1321 KB) | HTML iconHTML

    In this paper we address the problem of classifying cited work into important and non-important to the developments presented in a research publication. This task is vital for the algorithmic techniques that detect and follow emerging research topics and to qualitatively measure the impact of publications in increasingly growing scholarly big data. We consider cited work as important to a publicat... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Learning to Predict Citation-Based Impact Measures

    Publication Year: 2017, Page(s):1 - 10
    Cited by:  Papers (2)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (931 KB) | HTML iconHTML

    Citations implicitly encode a community's judgment of a paper's importance and thus provide a unique signal by which to study scientific impact. Efforts in understanding and refining this signal are reflected in the probabilistic modeling of citation networks and the proliferation of citation-based impact measures such as Hirsch's h-index. While these efforts focus on understanding the past and pr... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Understanding the Impact of Early Citers on Long-Term Scientific Impact

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (2626 KB) | HTML iconHTML

    This paper explores an interesting new dimension to the challenging problem of predicting long-term scientific impact (LTSI) usually measured by the number of citations accumulated by a paper in the long-term. It is well known that early citations (within 1-2 years after publication) acquired by a paper positively affects its LTSI. However, there is no work that investigates if the set of authors ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Matrix-Based News Aggregation: Exploring Different News Perspectives

    Publication Year: 2017, Page(s):1 - 10
    Cited by:  Papers (1)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (1612 KB) | HTML iconHTML

    News aggregators capably handle the large amount of news that is published nowadays. However, these systems focus on the presentation of important, common information in news, but do not reveal different perspectives on the same topic. Thus, current news aggregators suffer from media bias, i.e. differences in the content or presentation of news. Finding such differences is crucial to reduce the ef... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Quill: A Framework for Constructing Negotiated Texts - With a Case Study on the US Constitutional Convention of 1787

    Publication Year: 2017, Page(s):1 - 10
    Cited by:  Papers (1)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (2269 KB) | HTML iconHTML

    This paper describes a new approach to the presentation of records relating to formal negotiations and the texts that they create. It describes the architecture of a model, platform, and web-interface (https://www.quillproject.net) that can be used by domain-experts to convert the records typical of formal negotiations in to a model of decision-making (with minimal training). This model has implic... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Realising a Layered Digital Library: Exploration and Analysis of the Live Music Archive through Linked Data

    Publication Year: 2017, Page(s):1 - 10
    Cited by:  Papers (1)
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (1216 KB) | HTML iconHTML

    Building upon a collection with functionality for discovery and analysis has been described by Lynch as a `layered' approach to digital libraries. Meanwhile, as digital corpora have grown in size, their analysis is necessarily supplemented by automated application of computational methods, which can create layers of information as intricate and complex as those within the content itself. This comb... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Benchmark and Evaluation for Text Extraction from PDF

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (1730 KB) | HTML iconHTML

    Extracting the body text from a PDF document is an important but surprisingly difficult task. The reason is that PDF is a layout-based format which specifies the fonts and positions of the individual characters rather than the semantic units of the text (e.g., words or paragraphs) and their role in the document (e.g., body text or caption). There is an abundance of extraction tools, but their qual... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A Text Extraction Software Benchmark Based on a Synthesized Dataset

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (615 KB) | HTML iconHTML

    Text extraction plays an important function for data processing workflows in digital libraries. For example, it is a crucial prerequisite for evaluating the quality of migrated textual documents. Complex file formats make the extraction process error-prone and have made it very challenging to verify the correctness of extraction components. Based on digital preservation and information retrieval s... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Mathematical Document Categorization with Structure of Mathematical Expressions

    Publication Year: 2017, Page(s):1 - 10
    Request permission for reuse | Click to expandAbstract | PDF file iconPDF (547 KB) | HTML iconHTML

    A mathematical document is a document subjected to mathematical communication, for example, a math paper and discussion in online Q&A community. Mathematical document categorization (MDC) is a task to classify mathematical documents to mathematical categories, e.g. probability theory and set theory. This task is an important task for supporting user search on recent wide-spreaded digital l... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.