Title of Invention: Multimedia Device Voice Control System and Method, and Computer Storage Medium
Abstract: A voice control system and method for a multimedia device are provided. The system includes an image sensing module configured to collect a user action image; an image recognizing module configured to determine a type or a status of a control instruction according to the user action image; a voice recognition status managing module configured to activate or wake up a voice recognition program according to the type of the current control instruction; a pickup module configured to collect a voice signal; a voice recognizing module configured to recognize the collected voice data to generate a control instruction; and a multimedia function module configured to execute the control instruction to provide a corresponding multimedia function to the user. The illustrated embodiment combines image recognition technology, voice recognition technology, and a computer storage medium to achieve free and convenient voice control that does not depend on a hand-held remote control unit and is not limited to a close-range pickup device. Interference with control instruction voice recognition from the sound output by the multimedia device, environmental background noise, and the user's non-control-instruction voice signals can be effectively avoided, so that the user's instruction can be precisely recognized.
Publication No.: US2015222948(A1) Publication Date: 2015.08.06
Application No.: US201314421900 Filing Date: 2013.09.26
Applicant: Shenzhen Prtek Co. Ltd. Inventors: Wang Hongzhi;Liu Leyuan;Sang Nong;Liu Guohua
Classification: H04N21/422;G10L15/22;H04N21/47;H04N21/44;H04N21/4415;H04N21/442;G06F3/16;H04N21/4223 Main Classification: H04N21/422
Agency: Agent:
Main Claim: 1. A voice control system for a multimedia device, comprising: an image sensing module configured to collect a user action image; an image recognizing module configured to determine a type or a status of a control instruction according to the user action image, determine a position of a user who sends the user action image as a position of a target voice source, send the position of the target voice source, and determine a target user according to the position of the target voice source, the target user being an operator; a voice recognition status managing module configured to activate or wake up a voice recognition program according to a type of the control instruction, send the position of the target voice source to a sound beam forming module, and control a multimedia function module to reduce an output volume of the multimedia device; the sound beam forming module configured to determine a pickup direction and a pickup angle according to the position of the target voice source; a pickup module configured to collect a voice signal of the target voice source according to the pickup direction and the pickup angle; a voice recognizing module configured to recognize the collected voice data to generate a control instruction; and the multimedia function module configured to execute the control instruction to provide a corresponding multimedia function to the user.
Address: Shenzhen CN
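
Illustrative sketch (not part of the patent record): the main claim describes a status managing module that, on a gesture-derived control instruction, wakes voice recognition, lowers the device's output volume, and passes the user's position to a sound beam forming module that derives a pickup direction and pickup angle. The Python below is one possible reading of that flow. The claim does not say how the direction and angle are computed, so the geometry here is an assumption, as are all names (`Position`, `Beam`, `form_beam`, `VoiceRecognitionStatusManager`), the instruction labels `"start_voice"` and `"stop_voice"`, the 0.6 m source width, and the volume levels.

```python
import math
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Position:
    """Hypothetical user position relative to the microphone array."""
    x: float  # metres, lateral offset (positive to the right)
    z: float  # metres, distance straight ahead of the device


@dataclass(frozen=True)
class Beam:
    direction_deg: float  # pickup direction, measured from the forward axis
    angle_deg: float      # pickup angle (full beam width)


def form_beam(pos: Position, source_width_m: float = 0.6) -> Beam:
    """Assumed geometry: aim at the user and make the beam just wide
    enough to cover a source of the given width at the user's distance."""
    direction = math.degrees(math.atan2(pos.x, pos.z))
    distance = math.hypot(pos.x, pos.z)
    angle = 2.0 * math.degrees(math.atan2(source_width_m / 2.0, distance))
    return Beam(direction, angle)


class VoiceRecognitionStatusManager:
    """Wakes or stops voice recognition according to the instruction type."""

    def __init__(self, volume: int = 50, ducked_volume: int = 10) -> None:
        self.volume = volume
        self.ducked_volume = ducked_volume
        self.listening = False
        self._restore_volume = volume

    def on_instruction(self, instruction_type: str,
                       position: Position) -> Optional[Beam]:
        if instruction_type == "start_voice" and not self.listening:
            self.listening = True
            # Reduce the output volume so playback interferes less with pickup.
            self._restore_volume = self.volume
            self.volume = min(self.volume, self.ducked_volume)
            # Hand the target voice source position to beam forming.
            return form_beam(position)
        if instruction_type == "stop_voice" and self.listening:
            self.listening = False
            self.volume = self._restore_volume
        return None
```

In this sketch a user standing directly in front of the device yields a pickup direction of 0 degrees, and the pickup angle narrows as the user moves farther away; a real system would feed the returned beam to the pickup module before running voice recognition.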