摘要 |
A recording medium is provided that records a separating step of separating a mixed sound signal in which a plurality of excitations are mixed into the respective excitations, and a step of performing speech detection on the plurality of separated excitation signals, judging whether or not the plurality of excitation signals are speech and generating speech section information indicating speech/non-speech information for each excitation signal. The recording medium also includes at least one of a step of calculating and analyzing an utterance overlap duration using the speech section information for combinations of the plurality of excitation signals and a step of calculating and analyzing a silence duration. The recording medium further includes a step of calculating a degree of establishment of a conversation indicating the degree of establishment of a conversation based on the extracted utterance overlap duration or the silence duration.
|