| Now that continuous speech recognition is available commercially (and cheaply), there is an explosion of interest in applications that can make use of it. A particularly straightforward idea is to have a system capable of answering questions and taking instructions from a user, usually someone calling on the telephone.
However, there is a lot more to this than simply recognizing the words in the user's speech. Even leaving aside the complicated and still evolving matter of how users will react when they suspect or know that they are in conversation with a machine, a speech dialogue system has to:
- recognize the intent of the user's speech, which may be expressed in pieces of dialogue rather than a single sentence, and indeed in what is not said,
- construct a suitable answer (or supplementary question) out of the always limited set of data and strategies that the system commands,
- select an appropriate intonation pattern for the answer.
|