摘要 |
A method of rapidly and precisely managing a dialog turn between a user and an agent by using speech information, facial expression information, and delay time information includes generating first dialog turn information using dialog information analyzed from a speech uttered by the user, generating second dialog turn information using facial expression information analyzed from a face image of the user, and determining a final dialog turn using the first and second dialog turn information, information on a status of the spoken dialog system, information on whether the user speech is input, and information on a no-answer time of the user.
|