Skip to main navigation Skip to search Skip to main content

Combining topic specific language models

Research output: Chapter in Book / Report / Conference proceedingConference contributionAcademicpeer-review

Abstract

In this paper we investigate whether a combination of topic specific language models can outperform a general purpose language model, using a trigram model as our baseline model. We show that in the ideal case - in which it is known beforehand which model to use - specific models perform considerably better than the baseline model. We test two methods that combine specific models and show that these combinations outperform the general purpose model, in particular if the data is diverse in terms of topics and vocabulary. Inspired by these findings, we propose to combine a decision tree and a set of dynamic Bayesian networks into a new model. The new model uses context information to dynamically select an appropriate specific model. © 2011 Springer-Verlag.
Original languageEnglish
Title of host publicationText, Speech and Dialogue - 14th International Conference, TSD 2011, Proceedings
Pages99-106
DOIs
Publication statusPublished - 2011
Externally publishedYes
Event14th International Conference on Text, Speech and Dialogue, TSD 2011 - , Czech Republic
Duration: 1 Sept 20115 Sept 2011

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference14th International Conference on Text, Speech and Dialogue, TSD 2011
Country/TerritoryCzech Republic
Period1/09/115/09/11

Fingerprint

Dive into the research topics of 'Combining topic specific language models'. Together they form a unique fingerprint.

Cite this