TY - JOUR
T1 - Building a parsimonious model for identifying best answers using interaction history in community Q&A
AU - Shah, Chirag
N1 - Publisher Copyright:
Copyright © 2015 by Association for Information Science and Technology
PY - 2015/1
Y1 - 2015/1
N2 - Evaluating answer quality or identifying/predicting which answer would be selected as the best for a given question is an important problem in community-based Q&A services. In this article we introduce new interaction-based features depicting the amount of distinct interactions between an asker and answerer over time, in order to predict whether an answer will be selected as Best Answer or not within Yahoo! Answers. Through a series of experiments ran on a data set of 23,218 question-answer pairs, we determined that after the data was first run using a model trained on textual features, and then the failed cases re-run with a model trained on interaction features, we were able to significantly improve the performance of the original model in identifying these difficult cases. In addition, when compared to models using often five to seven times the amount of features and requiring a large amount of computational effort, our model performed at to above the same evaluative measures. This suggests that future classification models can be made more parsimonious and handle larger datasets using less computational effort by developing a two-step classifier that includes interaction history as a feature.
AB - Evaluating answer quality or identifying/predicting which answer would be selected as the best for a given question is an important problem in community-based Q&A services. In this article we introduce new interaction-based features depicting the amount of distinct interactions between an asker and answerer over time, in order to predict whether an answer will be selected as Best Answer or not within Yahoo! Answers. Through a series of experiments ran on a data set of 23,218 question-answer pairs, we determined that after the data was first run using a model trained on textual features, and then the failed cases re-run with a model trained on interaction features, we were able to significantly improve the performance of the original model in identifying these difficult cases. In addition, when compared to models using often five to seven times the amount of features and requiring a large amount of computational effort, our model performed at to above the same evaluative measures. This suggests that future classification models can be made more parsimonious and handle larger datasets using less computational effort by developing a two-step classifier that includes interaction history as a feature.
KW - Community Q&A
KW - Interaction history
KW - Model building
KW - Online communities
UR - http://www.scopus.com/inward/record.url?scp=84987704941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84987704941&partnerID=8YFLogxK
U2 - 10.1002/pra2.2015.145052010051
DO - 10.1002/pra2.2015.145052010051
M3 - Article
AN - SCOPUS:84987704941
VL - 52
SP - 1
EP - 10
JO - Proceedings of the Association for Information Science and Technology
JF - Proceedings of the Association for Information Science and Technology
SN - 2373-9231
IS - 1
ER -