Cart (Loading....) | Create Account
Close category search window

Finding authoritative people from the Web

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

3 Author(s)
Harada, M. ; Network Innovation Labs., Nippon Telegraph & Telephone Corp., Tokyo, Japan ; Sato, S.-y. ; Kazama, K.

Today's Web is so huge and diverse that it arguably reflects the real world. For this reason, searching the Web is a promising approach to find things in the real world. We present NEXAS, an extension to Web search engines that attempts to find real-world entities relevant to a topic. Its basic idea is to extract proper names from the Web pages retrieved for the topic. A main advantage of this approach is that users can query any topic and learn about relevant real-world entities without dedicated databases for the topic. In particular, we focus on an application for finding authoritative people from the Web. This application is practically important because once personal names are obtained; they can lead users from the Web to managed information stored in digital libraries. To explore effective ways of finding people, we first examine the distribution of Japanese personal names by analyzing about 50 million Japanese Web pages. We observe that personal names appear frequently on the Web, but the distribution is highly influenced by automatically generated texts. To remedy the bias and find widely acknowledged people accurately, we utilize the number of Web servers containing a name instead of the number of Web pages. We show its effectiveness by an experiment covering a wide range of topics. Finally, we demonstrate several examples and suggest possible applications.

Published in:

Digital Libraries, 2004. Proceedings of the 2004 Joint ACM/IEEE Conference on

Date of Conference:

7-11 June 2004

Need Help?

IEEE Advancing Technology for Humanity About IEEE Xplore | Contact | Help | Terms of Use | Nondiscrimination Policy | Site Map | Privacy & Opting Out of Cookies

A not-for-profit organization, IEEE is the world's largest professional association for the advancement of technology.
© Copyright 2014 IEEE - All rights reserved. Use of this web site signifies your agreement to the terms and conditions.