Conferences >2016 IEEE/ACM 38th Internatio...

From Word Embeddings to Document Similarities for Improved Information Retrieval in Software Engineering

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

The application of information retrieval techniques to search tasks in software engineering is made difficult by the lexical gap between search queries, usually expressed...Show More

Metadata

Abstract:

The application of information retrieval techniques to search tasks in software engineering is made difficult by the lexical gap between search queries, usually expressed in natural language (e.g. English), and retrieved documents, usually expressed in code (e.g. programming languages). This is often the case in bug and feature location, community question answering, or more generally the communication between technical personnel and non-technical stake holders in a software project. In this paper, we propose bridging the lexical gap by projecting natural language statements and code snippets as meaning vectors in a shared representation space. In the proposed architecture, word embeddings are rst trained on API documents, tutorials, and reference documents, and then aggregated in order to estimate semantic similarities between documents. Empirical evaluations show that the learned vector space embeddings lead to improvements in a previously explored bug localization task and a newly de ned task of linking API documents to computer programming questions.

Published in: 2016 IEEE/ACM 38th International Conference on Software Engineering (ICSE)

Date of Conference: 14-22 May 2016

Date Added to IEEE Xplore: 03 April 2017

ISBN Information:

Electronic ISSN: 1558-1225

DOI: 10.1145/2884781.2884862

Conference Location: Austin, TX, USA

Contents

References is not available for this document.

From Word Embeddings to Document Similarities for Improved Information Retrieval in Software Engineering

Abstract:

Metadata

Abstract:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

From Word Embeddings to Document Similarities for Improved Information Retrieval in Software Engineering

Alerts

Abstract:

Metadata

Abstract:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?