发明名称 CONVERTING DATA BETWEEN USERS
摘要 A method and system for converting voice data to text data between users is provided. The method includes receiving voice data from at least one user and determining phoneme data items corresponding to the voice data. Conversion candidate string representations of the phoneme data items are identified by referencing a conversion dictionary defining the conversion candidate string representations for each phoneme data item. The plurality of conversion candidate string representations are scored and a specified conversion candidate string representation is selected as text data based on the scores. The text data is transmitted to a terminal device accessed by the at least one user.
申请公布号 US2015073789(A1) 申请公布日期 2015.03.12
申请号 US201414444224 申请日期 2014.07.28
申请人 INTERNATIONAL BUSINESS MACHINES CORPORATION 发明人 Hashimoto Kensuke;Hattori Yohichi;Sanui Taroh;Shiiki Hisae
分类号 G10L15/26;G10L15/04;G10L15/30;G10L15/02 主分类号 G10L15/26
代理机构 代理人
主权项 1. A method for converting voice data to text data, the method comprising: receiving, by a computer processor of a computing system executing a receiving unit, voice data from a terminal device used by at least one user of a plurality of users; determining, by said computer processor executing a recognition unit, phoneme data items corresponding to the voice data; identifying, by said computer processor executing an identifying unit, conversion candidate string representations of the phoneme data items by referencing a conversion dictionary defining the conversion candidate string representations for each phoneme data item of the phoneme data items; scoring, by said computer processor executing a scoring unit, the plurality of conversion candidate string representations displayed on a shared screen viewed by the plurality of users during a data exchange session, said scoring comprising: assigning a first score to a first conversion candidate string representation of the plurality of conversion candidate string representations, wherein the first conversion candidate string representation is displayed within a predetermined range of a cursor on the shared screen during reception of the voice data,assigning a second score to a second conversion candidate string representation of the plurality of conversion candidate string representations, wherein the second conversion candidate string representation is displayed outside of the predetermined range of the curser on the shared screen during reception of the voice data, and wherein the second score is less than the first score, andassigning a third score to a third conversion candidate string representation of the plurality of conversion candidate string representations, wherein the third conversion candidate string representation is displayed on the shared screen prior to reception of the voice data, and wherein the third score is less than the second score; selecting as text data, by said computer processor from the plurality of conversion candidate string representations, the first conversion candidate string representation, the second conversion candidate string representation, or the third conversion candidate string representation based on the first score, the second score, and the third score; transmitting, by the computer processor, the text data to a terminal device accessed by the at least one user.
地址 ARMONK NY US