摘要 |
Telecommunication networks and methods are disclosed for providing a group session service for a plurality of participants. An application server in the telecommunication network receives a plurality of real-time media streams from the participants of the group session, and identifies voice media in the individual media streams. The voice media represents the spoken voice of the participants, and includes talking intervals separated by idle intervals (i.e., pauses in the spoken voice). The application server inputs the talking intervals as audio media elements into an audio media queue in the order received, and also outputs the audio media elements from the audio media queue in the order in which the audio media elements were inputted (i.e., in a first-in-first-out (FIFO) fashion) to generate a collective media stream for the group session. The collected audio stream is then provided to the participants of the group session.
|