Abstract:
Deep-learning algorithms are being used extensively in question–answering systems based on natural language classifiers to classify an incoming user question into a set o...Show MoreMetadata
Abstract:
Deep-learning algorithms are being used extensively in question–answering systems based on natural language classifiers to classify an incoming user question into a set of classes with the same answer. We treat a natural language classifier as a black box and study its performance with respect to the ground truth that is used to train and test the system. We have observed that maintaining ground truth is challenging; for example, 1) the number of answer classes can be large (in the several hundreds), 2) manual mapping of questions to answers can result in inconsistent mappings, leading to overlap and confusion among them, and 3) users ask questions within a context that is not apparent by examining the question standalone, leading to erroneous mappings. We propose a methodology for guided evolution of the ground truth, from its initial creation to its ongoing maintenance in the deployed production environment. We measure performance using two metrics: accuracy and confidence. Accuracy measures how many classifications are correct, based on an assessment, while confidence is a raw metric, output by the classifier, which correlates with accuracy. Confidence can further be used to effectively manage the perceived accuracy of the system from a user's perspective, appropriately trading off accuracy versus coverage.
Published in: IBM Journal of Research and Development ( Volume: 61, Issue: 4/5, 01 July-Sept. 2017)
Raimo Bakis IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (bakis@us.ibm.com). Dr. Bakis is a retired Research Staff Member, now
working as a contractor, in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a
B.A. in physics from Sterling College in 1954, and M.S. and Ph.D. degrees in physics from Kansas State University in
1956 and 1959, respective...Show More
Raimo Bakis IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (bakis@us.ibm.com). Dr. Bakis is a retired Research Staff Member, now
working as a contractor, in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a
B.A. in physics from Sterling College in 1954, and M.S. and Ph.D. degrees in physics from Kansas State University in
1956 and 1959, respective...View more
Daniel P. Connors IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dconnors@us.ibm.com). Dr. Connors is a Research Staff Member
in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a B.S.E. degree in
electrical engineering from the University of Michigan in 1982 and M.S. and Ph.D. degrees in electrical engineering
from the University of Illinois in 1...Show More
Daniel P. Connors IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dconnors@us.ibm.com). Dr. Connors is a Research Staff Member
in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a B.S.E. degree in
electrical engineering from the University of Michigan in 1982 and M.S. and Ph.D. degrees in electrical engineering
from the University of Illinois in 1...View more
Parijat Dube IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (pdube@us.ibm.com). Dr. Dube is a Research Staff Member in the Cloud
and Cognitive Platform department at the IBM T. J. Watson Research Center. He received his Ph.D. (2002) in computer
science from INRIA, France. His research interests are in performance modeling, analysis, and optimization of systems.
Parijat Dube IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (pdube@us.ibm.com). Dr. Dube is a Research Staff Member in the Cloud
and Cognitive Platform department at the IBM T. J. Watson Research Center. He received his Ph.D. (2002) in computer
science from INRIA, France. His research interests are in performance modeling, analysis, and optimization of systems.
View more
Pavan Kapanipathi IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (kapanipa@us.ibm.com). Dr. Kapanipathi is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree
from Wright State University in 2016. His research interests are in the areas of semantic web, knowledge graphs, and
machine learning.
Pavan Kapanipathi IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (kapanipa@us.ibm.com). Dr. Kapanipathi is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree
from Wright State University in 2016. His research interests are in the areas of semantic web, knowledge graphs, and
machine learning.View more
Abhishek Kumar IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (abhishk@us.ibm.com). Dr. Kumar is a Research Staff Member in
the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree from the
University of Maryland, College Park, in 2013. His interests broadly lie in the areas of machine learning and
statistics.
Abhishek Kumar IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (abhishk@us.ibm.com). Dr. Kumar is a Research Staff Member in
the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree from the
University of Maryland, College Park, in 2013. His interests broadly lie in the areas of machine learning and
statistics.View more
Dmitry Malioutov IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dmalioutov@us.ibm.com). Dr. Malioutov is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. His research interests include
optimization in machine learning, inference and learning in graphical models, message passing algorithms, sparse
signal representation, and interpret...Show More
Dmitry Malioutov IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dmalioutov@us.ibm.com). Dr. Malioutov is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. His research interests include
optimization in machine learning, inference and learning in graphical models, message passing algorithms, sparse
signal representation, and interpret...View more
Chitra Venkataramani IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA. Dr. Venkatramani is a Distinguished Engineer in the Cognitive
Computing department at the IBM T. J. Watson Research Center. Her current focus is on addressing performance and
methodology challenges in cognitive computing systems. Since joining IBM in 1997, she has also worked on large-scale
distributed systems and appl...Show More
Chitra Venkataramani IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA. Dr. Venkatramani is a Distinguished Engineer in the Cognitive
Computing department at the IBM T. J. Watson Research Center. Her current focus is on addressing performance and
methodology challenges in cognitive computing systems. Since joining IBM in 1997, she has also worked on large-scale
distributed systems and appl...View more
Raimo Bakis IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (bakis@us.ibm.com). Dr. Bakis is a retired Research Staff Member, now
working as a contractor, in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a
B.A. in physics from Sterling College in 1954, and M.S. and Ph.D. degrees in physics from Kansas State University in
1956 and 1959, respectively. He subsequently joined IBM at the T. J. Watson Research center where he worked on
automatic speech recognition, optical character recognition, satellite image processing, speech synthesis, and dialog
management. He is the author or coauthor of 20 patents and 32 technical papers.
Raimo Bakis IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (bakis@us.ibm.com). Dr. Bakis is a retired Research Staff Member, now
working as a contractor, in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a
B.A. in physics from Sterling College in 1954, and M.S. and Ph.D. degrees in physics from Kansas State University in
1956 and 1959, respectively. He subsequently joined IBM at the T. J. Watson Research center where he worked on
automatic speech recognition, optical character recognition, satellite image processing, speech synthesis, and dialog
management. He is the author or coauthor of 20 patents and 32 technical papers.View more
Daniel P. Connors IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dconnors@us.ibm.com). Dr. Connors is a Research Staff Member
in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a B.S.E. degree in
electrical engineering from the University of Michigan in 1982 and M.S. and Ph.D. degrees in electrical engineering
from the University of Illinois in 1984 and 1988, respectively. He subsequently joined IBM at the T. J. Watson
Research Center, where he has worked on manufacturing, production planning, supply chain and workforce management and
cognitive computing. Dr. Connors is a member of the Institute of Electrical and Electronics Engineers (IEEE), the
Institute for Operations Research and the Management Sciences (INFORMS), and the Society for Industrial and Applied
Mathematics (SIAM).
Daniel P. Connors IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dconnors@us.ibm.com). Dr. Connors is a Research Staff Member
in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received a B.S.E. degree in
electrical engineering from the University of Michigan in 1982 and M.S. and Ph.D. degrees in electrical engineering
from the University of Illinois in 1984 and 1988, respectively. He subsequently joined IBM at the T. J. Watson
Research Center, where he has worked on manufacturing, production planning, supply chain and workforce management and
cognitive computing. Dr. Connors is a member of the Institute of Electrical and Electronics Engineers (IEEE), the
Institute for Operations Research and the Management Sciences (INFORMS), and the Society for Industrial and Applied
Mathematics (SIAM).View more
Parijat Dube IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (pdube@us.ibm.com). Dr. Dube is a Research Staff Member in the Cloud
and Cognitive Platform department at the IBM T. J. Watson Research Center. He received his Ph.D. (2002) in computer
science from INRIA, France. His research interests are in performance modeling, analysis, and optimization of systems.
Parijat Dube IBM Research, Thomas J. Watson Research Center,
Yorktown Heights, NY 10598 USA (pdube@us.ibm.com). Dr. Dube is a Research Staff Member in the Cloud
and Cognitive Platform department at the IBM T. J. Watson Research Center. He received his Ph.D. (2002) in computer
science from INRIA, France. His research interests are in performance modeling, analysis, and optimization of systems.
View more
Pavan Kapanipathi IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (kapanipa@us.ibm.com). Dr. Kapanipathi is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree
from Wright State University in 2016. His research interests are in the areas of semantic web, knowledge graphs, and
machine learning.
Pavan Kapanipathi IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (kapanipa@us.ibm.com). Dr. Kapanipathi is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree
from Wright State University in 2016. His research interests are in the areas of semantic web, knowledge graphs, and
machine learning.View more
Abhishek Kumar IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (abhishk@us.ibm.com). Dr. Kumar is a Research Staff Member in
the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree from the
University of Maryland, College Park, in 2013. His interests broadly lie in the areas of machine learning and
statistics.
Abhishek Kumar IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (abhishk@us.ibm.com). Dr. Kumar is a Research Staff Member in
the Cognitive Computing department at the IBM T. J. Watson Research Center. He received his Ph.D. degree from the
University of Maryland, College Park, in 2013. His interests broadly lie in the areas of machine learning and
statistics.View more
Dmitry Malioutov IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dmalioutov@us.ibm.com). Dr. Malioutov is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. His research interests include
optimization in machine learning, inference and learning in graphical models, message passing algorithms, sparse
signal representation, and interpretable machine learning. Dr. Malioutov is a senior member of the Institute of
Electrical and Electronics Engineers (IEEE), serves as an associate editor of the IEEE Transactions on Signal
Processing, is a guest editor for an IEEE journal of selected topics in signal processing, and a recipient of
the IEEE signal processing society 5-year best paper award, ICASSP 2006 student paper award, and MIT presidential
fellowship.
Dmitry Malioutov IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA (dmalioutov@us.ibm.com). Dr. Malioutov is a Research Staff
Member in the Cognitive Computing department at the IBM T. J. Watson Research Center. His research interests include
optimization in machine learning, inference and learning in graphical models, message passing algorithms, sparse
signal representation, and interpretable machine learning. Dr. Malioutov is a senior member of the Institute of
Electrical and Electronics Engineers (IEEE), serves as an associate editor of the IEEE Transactions on Signal
Processing, is a guest editor for an IEEE journal of selected topics in signal processing, and a recipient of
the IEEE signal processing society 5-year best paper award, ICASSP 2006 student paper award, and MIT presidential
fellowship.View more
Chitra Venkataramani IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA. Dr. Venkatramani is a Distinguished Engineer in the Cognitive
Computing department at the IBM T. J. Watson Research Center. Her current focus is on addressing performance and
methodology challenges in cognitive computing systems. Since joining IBM in 1997, she has also worked on large-scale
distributed systems and applications, such as video streaming servers, content distribution networks, stream-computing
systems, and big data applications. She holds numerous patents and publications and is a recipient of IBM's
Outstanding Technical Achievement awards.
Chitra Venkataramani IBM Research, Thomas J. Watson Research
Center, Yorktown Heights, NY 10598 USA. Dr. Venkatramani is a Distinguished Engineer in the Cognitive
Computing department at the IBM T. J. Watson Research Center. Her current focus is on addressing performance and
methodology challenges in cognitive computing systems. Since joining IBM in 1997, she has also worked on large-scale
distributed systems and applications, such as video streaming servers, content distribution networks, stream-computing
systems, and big data applications. She holds numerous patents and publications and is a recipient of IBM's
Outstanding Technical Achievement awards.View more