TY - GEN
T1 - Combining topic specific language models
AU - Shi, Yangyang
AU - Wiggers, Pascal
AU - Jonker, Catholijn M.
PY - 2011
Y1 - 2011
N2 - In this paper we investigate whether a combination of topic specific language models can outperform a general purpose language model, using a trigram model as our baseline model. We show that in the ideal case - in which it is known beforehand which model to use - specific models perform considerably better than the baseline model. We test two methods that combine specific models and show that these combinations outperform the general purpose model, in particular if the data is diverse in terms of topics and vocabulary. Inspired by these findings, we propose to combine a decision tree and a set of dynamic Bayesian networks into a new model. The new model uses context information to dynamically select an appropriate specific model. © 2011 Springer-Verlag.
UR - https://www.scopus.com/pages/publications/80052733385
U2 - 10.1007/978-3-642-23538-2_13
DO - 10.1007/978-3-642-23538-2_13
M3 - Conference contribution
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 99
EP - 106
BT - Text, Speech and Dialogue - 14th International Conference, TSD 2011, Proceedings
T2 - 14th International Conference on Text, Speech and Dialogue, TSD 2011
Y2 - 1 September 2011 through 5 September 2011
ER -