By Topic

Universal Communication Symposium (IUCS), 2010 4th International

Date 18-19 Oct. 2010

Filter Results

Displaying Results 1 - 25 of 84
  • [Front cover]

    Publication Year: 2010 , Page(s): c1
    Request Permissions | PDF file iconPDF (892 KB)  
    Freely Available from IEEE
  • [Front and back cover]

    Publication Year: 2010 , Page(s): c1 - c4
    Request Permissions | PDF file iconPDF (1156 KB)  
    Freely Available from IEEE
  • [Title page]

    Publication Year: 2010 , Page(s): 1
    Request Permissions | PDF file iconPDF (1803 KB)  
    Freely Available from IEEE
  • [Copyright notice]

    Publication Year: 2010 , Page(s): 1
    Request Permissions | PDF file iconPDF (23 KB)  
    Freely Available from IEEE
  • Table of contents

    Publication Year: 2010 , Page(s): xi - xix
    Request Permissions | PDF file iconPDF (85 KB)  
    Freely Available from IEEE
  • Pseudo natural language vs. controlled natural language

    Publication Year: 2010 , Page(s): K-1 - K-3
    Request Permissions | Click to expandAbstract | PDF file iconPDF (74 KB) |  | HTML iconHTML  

    Natural language is an indispensable means of communication. But it is also a serious barrier for communication, in particular between human and computers. Efforts have been made to overcome this barrier since long time ago. One of these efforts was to design controlled natural languages (CNL), which are subsets of natural languages, yet are easy to use for non-native users and can be processed by... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Information analysis technology for Universal Communication

    Publication Year: 2010 , Page(s): K-4 - K-5
    Request Permissions | Click to expandAbstract | PDF file iconPDF (47 KB) |  | HTML iconHTML  

    The Internet functions as a social infrastructure and stores a huge amount of information. People use the Internet whenever they need information. The primary role of the Internet is changing from that of digital content sharing to that of knowledge sharing. NICT Knowledge Clustered Group is researching and developing information analysis technology for “Universal Communication”. In ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • New resources trigger new technologies

    Publication Year: 2010 , Page(s): K-6 - K-7
    Request Permissions | Click to expandAbstract | PDF file iconPDF (53 KB) |  | HTML iconHTML  

    Two decades ago large-scale corpora, as new language resources brought forth a new paradigm shift marked by the revival of empiricism. However, now some researchers including the beginner of the revival began to rethink: “what should they (next generation students) do when most of the low hanging fruit has been pretty much picked over?” or to predict that the weird state of computati... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Author index

    Publication Year: 2010 , Page(s): A-1 - A-3
    Request Permissions | PDF file iconPDF (193 KB)  
    Freely Available from IEEE
  • Joint tokenization, parsing, and translation

    Publication Year: 2010 , Page(s): 1
    Cited by:  Papers (2)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (28 KB) |  | HTML iconHTML  

    Summary form only given. Natural language processing is all about ambiguities. In machine translation, tokenization and parsing mistakes due to segmentation and structural ambiguities potentially introduce translation errors. A well-known solution is to provide more alternatives by using compact representations such as lattice and forest. In this talk, I will introduce a technique that goes beyond... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Domain adaptation for statistical machine translation in development corpus selection

    Publication Year: 2010 , Page(s): 2 - 7
    Cited by:  Patents (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (205 KB) |  | HTML iconHTML  

    The performance of statistical machine translation (SMT) system is affected by model parameters (e.g. weights of feature functions), which are usually tuned on a development corpus. Most research done to date has focused on algorithms for tuning parameters. However, the selection of development corpus is lack of discussion. It is believed that the parameters trained on a proper corpus will improve... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Discriminative reranking for SMT using various global features

    Publication Year: 2010 , Page(s): 8 - 14
    Cited by:  Papers (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (182 KB) |  | HTML iconHTML  

    In this paper, we propose to use various global features for discriminative reranking in an SMT framework. We employ an online large-margin based training algorithm for the structural output support vector machines based on the margin infused relaxed algorithm. Besides the standard features used, such as decoder's scores, source and target sentences, alignments and part-of-speech tags, we include ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Head- and relation-driven tree-to-tree translation using phrases in a monolingual corpus

    Publication Year: 2010 , Page(s): 15 - 22
    Cited by:  Papers (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (271 KB) |  | HTML iconHTML  

    We propose an extension of context-based machine translation (CBMT) to deal with distant language pairs such as Japanese and English, incorporating a syntactic transfer approach. Our method uses a tree structure where a node is a head and an edge is a dependency with a relation between heads. We retrieve partial trees from a monolingual corpus using a bilingual dictionary to generate candidate tra... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Structuring and manipulating hand-drawn concept maps

    Publication Year: 2010 , Page(s): 23
    Request Permissions | Click to expandAbstract | PDF file iconPDF (29 KB) |  | HTML iconHTML  

    Concept maps are an important tool to knowledge organization, representation, and sharing. Most current concept map tools do not provide full support for hand-drawn concept map creation and manipulation, largely due to the lack of methods to recognize hand-drawn concept maps. We propose a structure recognition method. Our algorithm can extract node blocks and link blocks of a hand-drawn concept ma... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A geometric approach to approximate continuous k-median query

    Publication Year: 2010 , Page(s): 24 - 31
    Request Permissions | Click to expandAbstract | PDF file iconPDF (304 KB) |  | HTML iconHTML  

    We revisit the classic k-median problem in continuous distributed model. The rapid advance in electronic miniaturization, wireless communication and position technologies makes a significant contribution to pervasive applications of continuous distributed model. Data sets acquired in continuous distributed model are automatically and continuously updated, or even distributed over a wide area in ty... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A relation-based services management mechanism for service computing

    Publication Year: 2010 , Page(s): 32 - 39
    Request Permissions | Click to expandAbstract | PDF file iconPDF (252 KB) |  | HTML iconHTML  

    In this paper we propose a service management mechanism based on a relational model, in which all services to be managed are represented, and operations for services are relational operations using SQL. To define the model, we use Web Services Description Language (WSDL) descriptions and extract information. To use SQL for relational operations for service management, we introduce a new operator t... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Services in the Cloud Computing era: A survey

    Publication Year: 2010 , Page(s): 40 - 46
    Cited by:  Papers (19)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (161 KB) |  | HTML iconHTML  

    Cloud Computing is becoming a well-known buzzword nowadays. As a brand new infrastructure to offer services, Cloud Computing systems have many superiorities in comparing to those existed traditional service provisions, such as reduced upfront investment, expected performance, high availability, infinite scalability, tremendous fault-tolerance capability and so on and consequently chased by most of... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • HMM based speech synthesis with Global Variance Training method

    Publication Year: 2010 , Page(s): 47
    Request Permissions | Click to expandAbstract | PDF file iconPDF (32 KB) |  | HTML iconHTML  

    Although Hidden Markov Model based speech synthesis has been proved to have good performance,there are still some factors which degrade the quality of synthesized speech: vocoder,model accuracy and over-smoothing. Experimental results show that over-smoothing in frequency domain mainly affect the quality of synthesized speech whereas over-smoothing in time domain can nearly be ignored. Time domain... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • A preliminary exploration on tone error detection in Mandarin based on clustering

    Publication Year: 2010 , Page(s): 48 - 51
    Request Permissions | Click to expandAbstract | PDF file iconPDF (105 KB) |  | HTML iconHTML  

    This paper addresses the ongoing issue of tone error detection for Mandarin Computer Assisted Language Learning (CALL) systems. A novel approach based on clustering is proposed. The selection of different contextual tonal factors including Uni-tone, LBi-tone and RBi-tone are explored. Experimental results show that our proposed approach is feasible, obtaining an Equal Error Rate (EER) of 18.75% by... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Korean pronunciation variation modeling with probabilistic Bayesian networks

    Publication Year: 2010 , Page(s): 52 - 57
    Cited by:  Papers (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (139 KB) |  | HTML iconHTML  

    In Korean language, a large proportion of word units are pronounced differently from their written forms due to an agglutinative and highly inflective nature having severe phonological phenomena and coarticulation effects. This paper reports on an ongoing study of Korean pronunciation modeling, in which the mapping between phonemic and orthographic units is modeled by a Bayesian network (BN). The ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Improving spontaneous English ASR using a joint-sequence pronunciation model

    Publication Year: 2010 , Page(s): 58 - 61
    Request Permissions | Click to expandAbstract | PDF file iconPDF (131 KB) |  | HTML iconHTML  

    The performance of English automatic speech recognition systems decreases when recognizing spontaneous speech mainly due to occurring multiple pronunciation variants in the utterances. Previous approaches address the multiple pronunciation problem by modeling the alteration of the pronunciation on a phoneme to phoneme level. However, the phonetic transformation effects induced by the pronunciation... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Storage and index support for data intensive web applications

    Publication Year: 2010 , Page(s): 62 - 68
    Request Permissions | Click to expandAbstract | PDF file iconPDF (366 KB) |  | HTML iconHTML  

    In this paper, a system named as DisGR, for Distributed Graph Repository, that is designed and developed for supporting Chinese Web related research, is introduced. The system is designed based on a graph data model, TGM (for Tagged Graph Model), that is designed for representing Web data, especially forum and BBS data. DisGR supports the query language TGM-L that aims at analytical tasks for TGM ... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Materialized view maintenance in columnar storage for massive data analysis

    Publication Year: 2010 , Page(s): 69 - 76
    Request Permissions | Click to expandAbstract | PDF file iconPDF (379 KB) |  | HTML iconHTML  

    Data-intensive computing becomes a buzz word nowadays, where constant data for current operational processing and historical data for massive analysis are often separated into two systems. How to keep the historical data for analysis (often in a materialized view manner) consistent with their data sources (often in the operational databases) is the main problem to be solved imperatively. In this p... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Optimization of multi-join query processing within MapReduce

    Publication Year: 2010 , Page(s): 77 - 83
    Cited by:  Patents (1)
    Request Permissions | Click to expandAbstract | PDF file iconPDF (286 KB) |  | HTML iconHTML  

    MapReduce is a programming model which is usually applied to process large-scale data. Many tasks can be implemented under the framework, such as data processing of search engines and machine learning. However, there is no efficient support for join operation in current implementations of MapReduce. Former work has studied Map-Reduce-Merge for join operator, however, because of the time cost in th... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.
  • Searching XML data by SLCA on a MapReduce cluster

    Publication Year: 2010 , Page(s): 84 - 89
    Request Permissions | Click to expandAbstract | PDF file iconPDF (585 KB) |  | HTML iconHTML  

    XML keyword search is a popular topic in research field, and the Smallest Lowest Common Ancestor (SLCA) concept is fundamental for XML keyword search algorithms. With the rapid growth of XML data in internet, we are confronted with big data issues, it's becoming a new research direction for managing massive XML data now. Conventional centralized data management technologies are limited in the aspe... View full abstract»

    Full text access may be available. Click article title to sign in or learn about subscription options.