By Topic

Using MEDLINE as Standard Corpus for Measuring Semantic Similarity in the Biomedical Domain

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$33 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Hisham Al-Mubaid ; University of Houston ; Hoa A. Nguyen

Finding the similarity between biomedical terms and concepts is a very important task for biomedical information extraction and knowledge discovery. We propose and investigate the feasibility of using MEDLINE as standard corpus and MeSH ontology for measuring semantic similarity between concepts in the biomedical domain within UMLS framework. We adapted information-based semantic similarity measures from general English and applied them into the biomedical domain to measure the similarity between biomedical terms. The experimental results show that, by using MEDLINE and MeSH ontology, the information-based similarity measures perform very well and produce high correlations with human ratings. The similarity measure of Jiang & Conrath achieved 82% correlation with human similarity scores, and the average correlation with human scores of three measures is approaching 78%. These results confirm that MEDLINE is an effective information source for measuring semantic similarity between biomedical terms and concepts

Published in:

Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06)

Date of Conference:

16-18 Oct. 2006