发明名称 |
Multimedia Device Voice Control System and Method, and Computer Storage Medium |
摘要 |
A voice control system and method for a multimedia device are provided. The system includes an image sensing module configured to collect a user action image; an image recognizing module configured to determine a type or a status of a control instruction according to the user action image; a voice recognition status managing module configured to activate or wake up the voice recognition program according to a type of a current control instruction; a pickup module configured to collect voice signal; a voice recognizing module configured to recognize the collected voice data to generate a control instruction; and a multimedia function module configured to execute the control instruction to provide a corresponding multimedia function to the user. An image recognition technology, a voice recognition technology, and a storage medium of a computer are combined in the illustrated embodiment, a free and convenient voice control which is not depended on a hand-held remote control unit and not limited to a close pickup device is achieved. The interference of the sound output by the multimedia device, the environment background noise, and a non-control instruction voice signal of the user to the control instruction voice recognition can be effectively avoided, the instruction of the user can be precisely recognized. |
申请公布号 |
US2015222948(A1) |
申请公布日期 |
2015.08.06 |
申请号 |
US201314421900 |
申请日期 |
2013.09.26 |
申请人 |
Shenzhen Prtek Co. Ltd. |
发明人 |
Wang Hongzhi;Liu Leyuan;Sang Nong;Liu Guohua |
分类号 |
H04N21/422;G10L15/22;H04N21/47;H04N21/44;H04N21/4415;H04N21/442;G06F3/16;H04N21/4223 |
主分类号 |
H04N21/422 |
代理机构 |
|
代理人 |
|
主权项 |
1. A voice control system for a multimedia device, comprising: an image sensing module configured to collect a user action image; an image recognizing module configured to determine a type or a status of a control instruction according to the user action image; determine a position of a user who sends the user action image as a position of a target voice source, send the position of the target voice source, determine a target user according to the position of the target voice source, the target user being an operator;
a voice recognition status managing module configured to activate or wake up a voice recognition program according to a type of a the control instruction; sending the position of the target voice source to a sound beam forming module, controlling a multimedia function module to reduce a output volume of a multimedia device; the sound beam forming module configured to determine a pickup direction and a pickup angle according to the position of the target voice source; a pickup module configured to collect a voice signal of the target voice source according to the pickup direction and the pickup angle; a voice recognizing module configured to recognize the collected voice data to generate a control instruction; and the multimedia function module configured to execute the control instruction to provide a corresponding multimedia function to the user. |
地址 |
Shenzhen CN |