发明名称 Method and system for providing region-of-interest video compression
摘要 Embodiments of the present invention provide for a region-of-interest compression methodology wherein a variety of encoders may be utilized to perform video compression on a plurality of filtered video frames without the need to generate specific instructions for each of the variety of encoders. Embodiments of the present invention receive a video frame and create a region-of-interest map based on the received video frame. The region-of-interest map is utilized to create a filtered video frame based on the received video frame. This process may be repeated for each video frame within a video stream, thereby creating a plurality of filtered video frames. The plurality of filtered video frames is transmitted to an encoder for video compression.
申请公布号 US9036693(B2) 申请公布日期 2015.05.19
申请号 US200912644707 申请日期 2009.12.22
申请人 SRI International 发明人 Isnardi Michael Anthony;Kopansky Arkady
分类号 H04N11/04;H04N19/80;G06T9/00;H04N19/61;H04N19/117;H04N19/17 主分类号 H04N11/04
代理机构 代理人 Taboada Moser
主权项 1. A method for compressing a video stream comprising: receiving one or more region-of-interest maps which define one or more regions of interest across corresponding video frames of the video stream; applying, for each video frame, a first spatial filter to a first set of pixels of the video frame to generate a first set of filtered pixel values, and applying a second spatial filter to a second set of pixels of the video frame to generate a second set of filtered pixel values, wherein the first spatial filter reduces an amount of high spatial frequency energy in the pixels and the second spatial filter reduces a greater amount of high spatial frequency energy in pixels than the first spatial filter; forming, for each video frame, a filtered video frame comprising a plurality of filtered pixel values, each of said filtered pixel values of the filtered video frame being derived based on: (a) a value in a corresponding location of the one or more reaction-of-interest maps corresponding to the video frame, and (b) a filtered pixel value in a corresponding location from at least one of the first set and second set of filtered pixel values; forming a spatially filtered video stream comprising each of the filtered video frames; and providing the spatially filtered video stream to a standard video encoder for encoding the spatially filtered video stream, wherein the standard video encoder automatically assigns fewer bits to regions with less high spatial frequency energy and more bits to regions with greater higher spatial frequency energy.
地址 Menlo Park CA US