发明名称 Speech segmentation
摘要 The present invention relates to the management of voice data. Voice messages left on a recipient's answerphone or delivered via a voicemail system are a popular form of person-to-person communication. Such voice messages are quick to generate for the sender but are relatively difficult to review for the recipient; speech is slow to listen to and, unlike inherently visual forms of messages such as electronic mail or handwritten notes, cannot be quickly scanned for the relevant information. The present invention aims to make it easier for users to find relevant information in voice messages, and other kinds of voice record, such as recordings of meetings and recorded dictation. According to the present invention we provide a method of speech segmentation comprising processing speech data so as to detect putative pauses and characterised by forming speech block boundaries at a selected subset of the pauses, said selection being based on a preselected target speech block length. The invention may be applied in an application where speech is represented visually.
申请公布号 US6055495(A) 申请公布日期 2000.04.25
申请号 US19970846612 申请日期 1997.04.30
申请人 HEWLETT-PACKARD COMPANY 发明人 TUCKER, ROGER CECIL FERRY;COLLINS, MICHAEL JOHN
分类号 G06F3/16;G10L11/02;G10L15/04;H04M1/65;H04M3/42;H04M3/533;(IPC1-7):G01L1/06 主分类号 G06F3/16
代理机构 代理人
主权项
地址