Paper Search Console

Home Search Page About Contact

Journal Title

Title of Journal:

Search In Journal Title:

Abbravation:

Search In Journal Abbravation:

Publisher

Springer, Cham

Search In Publisher:

DOI

10.1016/0014-5793(80)80783-6

Search In DOI:

ISSN

Search In ISSN:
Search In Title Of Papers:

Evaluation of Statistical POMDPBased Dialogue Sys

Authors: Steve Young Catherine Breslin Milica Gašić Matthew Henderson Dongho Kim Martin Szummer Blaise Thomson Pirros Tsiakoulis Eli Tzirkel Hancock
Publish Date: 2016
Volume: , Issue: , Pages: 3-14
PDF Link

Abstract

Compared to conventional handcrafted rulebased dialogue management systems statistical POMDPbased dialogue managers offer the promise of increased robustness reduced development and maintenance costs and scaleability to large opendomains As a consequence there has been considerable research activity in approaches to statistical spoken dialogue systems over recent years However building and deploying a realtime spoken dialogue system is expensive and even when operational it is hard to recruit sufficient users to get statistically significant results Instead researchers have tended to evaluate using user simulators or by reprocessing existing corpora both of which are unconvincing predictors of actual real world performance This paper describes the deployment of a realworld restaurant information system and its evaluation in a motor car using subjects recruited locally and by remote users recruited using Amazon Mechanical Turk The paper explores three key questions are statistical dialogue systems more robust than conventional handcrafted systems how does the performance of a system evaluated on a user simulator compare to performance with real users and can performance of a system tested over the telephone network be used to predict performance in more hostile environments such as a motor car The results show that the statistical approach is indeed more robust but results from a simulator significantly overestimate performance both absolute and relative Finally by matching WER rates performance results obtained over the telephone can provide useful predictors of performance in noisier environments such as the motor car but again they tend to overestimate performance


Keywords:

References


.
Search In Abstract Of Papers:
Other Papers In This Journal:


    Search Result: