Abstract:
While researchers develop many new exciting code recommender systems, such as method-call completion, code-snippet completion, or code search, an accurate evaluation of s...Show MoreMetadata
Abstract:
While researchers develop many new exciting code recommender systems, such as method-call completion, code-snippet completion, or code search, an accurate evaluation of such systems is always a challenge. We analyzed the current literature and found that most of the current evaluations rely on artificial queries extracted from released code, which begs the question: Do such evaluations reflect real-life usages? To answer this question, we capture 6,189 fine-grained development histories from real IDE interactions. We use them as a ground truth and extract 7,157 real queries for a specific method-call recommender system. We compare the results of such real queries with different artificial evaluation strategies and check several assumptions that are repeatedly used in research, but never empirically evaluated. We find that an evolving context that is often observed in practice has a major effect on the prediction quality of recommender systems, but is not commonly reflected in artificial evaluations.
Date of Conference: 03-07 September 2016
Date Added to IEEE Xplore: 06 October 2016
ISBN Information:
Conference Location: Singapore
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Recommender Systems ,
- Evolutionary History ,
- Evaluation Of Strategies ,
- Prediction Quality ,
- Query Results ,
- Integrated Development Environment ,
- Open-source ,
- Control Experiments ,
- Source Code ,
- Contextual Information ,
- Selection Strategy ,
- Random Selection ,
- Types Of Users ,
- Random Strategy ,
- Evaluation Techniques ,
- Complete Coding ,
- End State ,
- Starting State ,
- Code Changes ,
- History Of Usage ,
- Code Snippets ,
- Ground Truth Dataset ,
- Corner Cases ,
- Final Snapshot ,
- Piece Of Code ,
- Event Stream ,
- Random Approach ,
- Maintenance Tasks ,
- Professional Development ,
- Code States
- Author Keywords
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Recommender Systems ,
- Evolutionary History ,
- Evaluation Of Strategies ,
- Prediction Quality ,
- Query Results ,
- Integrated Development Environment ,
- Open-source ,
- Control Experiments ,
- Source Code ,
- Contextual Information ,
- Selection Strategy ,
- Random Selection ,
- Types Of Users ,
- Random Strategy ,
- Evaluation Techniques ,
- Complete Coding ,
- End State ,
- Starting State ,
- Code Changes ,
- History Of Usage ,
- Code Snippets ,
- Ground Truth Dataset ,
- Corner Cases ,
- Final Snapshot ,
- Piece Of Code ,
- Event Stream ,
- Random Approach ,
- Maintenance Tasks ,
- Professional Development ,
- Code States
- Author Keywords