Test-based and metric-based evaluation of code generation models for practical question answering | IEEE Conference Publication | IEEE Xplore