Title of Invention |
SIGNATURE GENERATION FOR MULTIMEDIA DEEP-CONTENT-CLASSIFICATION BY A LARGE-SCALE MATCHING SYSTEM AND METHOD THEREOF |
Abstract |
A method and system for generating a large-scale database of heterogeneous speech are provided. The method includes transcribing a plurality of multimedia signals retrieved from a large text database and a speech database; randomly selecting a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; generating a plurality of signatures based on the plurality of speech segments; and populating the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. |
Publication Number |
US2015154189(A1) |
Publication Date |
2015.06.04 |
Application Number |
US201514619767 |
Filing Date |
2015.02.11 |
Applicant |
CORTICA, LTD. |
Inventors |
Raichelgauz Igal;Odinaev Karina;Zeevi Yehoshua Y. |
Classification Number |
G06F17/30 |
Main Classification Number |
G06F17/30 |
Agency |
|
Agent |
|
Main Claim |
1. A method for generating a large-scale database of heterogeneous speech, comprising:
transcribing a plurality of multimedia signals retrieved from a large text database and a speech database; randomly selecting a plurality of speech segments from the plurality of multimedia signals, wherein each speech segment of the plurality of speech segments is of a random length; generating a plurality of signatures based on the plurality of speech segments; and populating the large-scale database with the plurality of signatures respective of the plurality of multimedia signals. |
Address |
RAMAT GAN IL |
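
The selection, signature, and population steps recited in claim 1 can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names (`select_random_segments`, `toy_signature`, `populate_database`), the parameters, and the in-memory dictionary standing in for the large-scale database are all hypothetical. The record does not describe how signatures are computed, so a truncated SHA-256 digest is swapped in purely as a placeholder; it is not the applicant's signature generator and, unlike a content signature, it is not robust to noise. The transcription step is omitted, and each signal is assumed to be available already as a byte sequence.

```python
import hashlib
import random


def select_random_segments(signal, count, min_len, max_len, rng):
    """Pick `count` segments, each at a random offset and of a random length."""
    if len(signal) < min_len:
        raise ValueError("signal is shorter than the minimum segment length")
    segments = []
    for _ in range(count):
        # Random length, capped so the segment always fits inside the signal.
        length = rng.randint(min_len, min(max_len, len(signal)))
        # Random start offset such that the segment stays in bounds.
        start = rng.randint(0, len(signal) - length)
        segments.append(signal[start:start + length])
    return segments


def toy_signature(segment):
    """Placeholder signature: truncated SHA-256 digest (an assumption,
    not the signature scheme of the patent)."""
    return hashlib.sha256(bytes(segment)).hexdigest()[:16]


def populate_database(signals, count=4, min_len=8, max_len=32, seed=0):
    """Map each signal id to the signatures of its randomly chosen segments."""
    rng = random.Random(seed)  # seeded so that runs are reproducible
    database = {}
    for signal_id, signal in signals.items():
        segments = select_random_segments(signal, count, min_len, max_len, rng)
        database[signal_id] = [toy_signature(s) for s in segments]
    return database


# Hypothetical input: two short byte sequences standing in for speech signals.
signals = {"clip_a": bytes(range(64)), "clip_b": bytes(range(64, 128))}
db = populate_database(signals)
```

Seeding the random generator keeps the segment choice reproducible, which makes a sketch like this easier to test. A real system would replace `toy_signature` with a signature function that tolerates distortion of the underlying signal.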