发明名称 |
SYSTEM FOR GENERATION OF A LARGE-SCALE DATABASE OF HETROGENEOUS SPEECH |
摘要 |
A system for generating a large-scale database of heterogeneous speech is provided. The system comprises a processor a plurality of independent computation cores configured to generate signatures of a plurality of speech segments; a large scale database configured to maintain a plurality of transcribed multimedia signals; a memory, the memory containing instructions that, when executed by the processor, configure the system to: randomly select a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; provide the plurality of speech segments to the plurality of independent computation cores for generation of the signatures; collect the signatures from the plurality of independent computation cores; and populate the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. |
申请公布号 |
US2016239566(A1) |
申请公布日期 |
2016.08.18 |
申请号 |
US201615140977 |
申请日期 |
2016.04.28 |
申请人 |
Cortica, Ltd. |
发明人 |
Raichelgauz Igal;Odinaev Karina;Zeevi Yehoshua Y |
分类号 |
G06F17/30;G10L25/57 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A system for generating a large-scale database of heterogeneous speech, comprising:
a processor; a plurality of independent computation cores configured to generate signatures of a plurality of speech segments; a large scale database configured to maintain a plurality of transcribed multimedia signals; a memory, the memory containing instructions that, when executed by the processor, configure the system to: randomly select a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; provide the plurality of speech segments to the plurality of independent computation cores for generation of the signatures; collect the signatures from the plurality of independent computation cores; and populate the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. |
地址 |
TEL AVIV IL |