2 Author(s)

Human beings and the computer systems they design generally operate best with information sources that are organized. Information in these sources is typically stored in a predefined format, which facilitates its search, retrieval, and analysis. In real life, however, for every source of structured information (such as a database of purchasing records), there are many sources of unstructured information (such as natural language documents, still images, and video files). It is estimated that 80 to 85 percent of all corporate information is unstructured, and with the growth of the Internet and corporate intranets, the volume and heterogeneity of this information have increased prodigiously.

Note: The Institute of Electrical and Electronics Engineers, Incorporated is distributing this Article with permission of the International Business Machines Corporation (IBM), which is the exclusive owner. The recipient of this Article may not assign, sublicense, lease, rent or otherwise transfer, reproduce, prepare derivative works, publicly display or perform, or distribute the Article.

Published in:

IBM Systems Journal (Volume: 43, Issue: 3)