Title of invention: Audio processing device, audio processing method, program and integrated circuit
Abstract: An audio processing device, including a feature calculation unit, a boundary calculation unit, and a judgment unit, detects points of change in audio features from an audio signal in AV content. The feature calculation unit calculates, for each unit section of the audio signal, section feature data expressing features of the audio signal in the unit section. The boundary calculation unit calculates, for each target unit section among the unit sections of the audio signal, a piece of boundary information relating to at least one boundary of a similarity section. The similarity section consists of consecutive unit sections, inclusive of the target unit section, which each have similar section feature data. The judgment unit calculates a priority of each boundary indicated by one or more of the pieces of boundary information and judges whether the boundary is a scene change point based on the priority.
Publication number: US8930190(B2) Publication date: 2015.01.06
Application number: US201314113481 Filing date: 2013.03.11
Applicant: Panasonic Intellectual Property Corporation Of America Inventors: Konuma Tomohiro; Uenoyama Tsutomu
Classification: G10L15/06; H04N5/60; G11B27/28; H04N5/14; H04N21/439; H04N21/845; G10L25/54; G10L25/57 Main classification: G10L15/06
Agency: Wenderoth, Lind & Ponack, L.L.P. Agent: Wenderoth, Lind & Ponack, L.L.P.
Main claim: 1. An audio processing device comprising: a non-transitory memory storing a program; and a hardware processor configured to execute the program and cause the image recognition device to operate as the following units stored in the non-transitory memory: a feature calculation unit configured to calculate, for each of a plurality of unit sections of an audio signal, section feature data expressing features of the audio signal in the unit section; a boundary calculation unit configured to calculate, for each of a plurality of target unit sections among the unit sections of the audio signal, a piece of boundary information relating to at least one boundary between a similarity section and another section of the audio signal, the similarity section consisting of a plurality of consecutive unit sections, inclusive of the target unit section, which each have similar section feature data; and a judgment unit configured to calculate a priority of each boundary that is indicated by one or more of the pieces of boundary information and judge whether the boundary is a scene change point based on the priority of the boundary, wherein each of the pieces of boundary information includes at least one out of a start time and an end time of the similarity section to which the piece of boundary information relates, and the similarity section includes sections having a section feature that represents a distance from a reference feature that is within a reference threshold, the reference feature being calculated using a section feature of the target unit section.
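The flow recited in the claim (section features, similarity sections, boundary information, priority, judgment) can be illustrated with a minimal Python sketch. It is not the patented implementation. Several choices are assumptions made only for illustration: the function names `similarity_section` and `scene_change_points`, the use of Euclidean distance, taking the target unit section's own feature as the reference feature, expressing boundaries as unit-section indices rather than times, and defining a boundary's priority as the number of pieces of boundary information that indicate it.

```python
import math
from collections import Counter


def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def similarity_section(features, target, threshold):
    """Return (start, end), inclusive indices of the run of consecutive
    unit sections around `target` whose features lie within `threshold`
    of the reference feature.

    Assumption: the reference feature is simply the target's own feature.
    """
    reference = features[target]
    start = target
    while start > 0 and euclidean(features[start - 1], reference) <= threshold:
        start -= 1
    end = target
    while (end < len(features) - 1
           and euclidean(features[end + 1], reference) <= threshold):
        end += 1
    return start, end


def scene_change_points(features, threshold, min_priority):
    """Collect one piece of boundary information per target unit section,
    then keep the boundaries whose priority reaches `min_priority`.

    Assumption: priority = number of pieces of boundary information that
    indicate the same boundary. A boundary is the index of the first unit
    section after the change.
    """
    votes = Counter()
    for target in range(len(features)):
        start, end = similarity_section(features, target, threshold)
        votes[start] += 1      # boundary at the start of the similarity section
        votes[end + 1] += 1    # boundary just after its end
    # Ignore the trivial boundaries at the very start and end of the signal.
    return sorted(b for b, p in votes.items()
                  if p >= min_priority and 0 < b < len(features))


# Hypothetical one-dimensional features: three quiet sections, then three loud ones.
features = [[0.0], [0.1], [0.05], [5.0], [5.1], [4.9]]
print(scene_change_points(features, threshold=0.5, min_priority=2))
```

With these made-up features, every target unit section reports a boundary between the third and fourth sections, so that boundary collects the highest priority and is the only one judged a scene change point.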
Address: Torrance CA US