Adaptive multi-modal detection and fusion in videos via classification-based-learning,申请号US201314100886-传众专利搜索

发明名称	Adaptive multi-modal detection and fusion in videos via classification-based-learning
摘要	Described is a system for object detection using classification-based learning. A fusion method is selected, then a video sequence is processed to generate detections for each frame, wherein a detection is a representation of an object candidate. The detections are fused to generate a set of fused detections for each frame. The classification module generates a classification score labeling each fused detection based on a predetermined classification threshold. Otherwise, a token indicating that the classification module has abstained from generating a classification score is generated. The scoring module produces a confidence score for each fused detection based on a set of learned parameters from the learning module and the set of fused detections. The set of fused detections are filtered by the accept-reject module based on one of the classification score or the confidence score. Finally, a set of final detections representing an object is output.
申请公布号	US8965115(B1)	申请公布日期	2015.02.24
申请号	US201314100886	申请日期	2013.12.09
申请人	HRL Laboratories, LLC	发明人	Khosla Deepak;Honda Alexander L.;Chen Yang;Cheng Shinko Y.;Kim Kyungnam;Zhang Lei;Jeong Changsoo S.
分类号	G06K9/62;G06K9/00	主分类号	G06K9/62
代理机构	Tope-McKay & Associates	代理人	Tope-McKay & Associates
主权项	1. A system for object detection for video sequences using classification-based learning, the system comprising: one or more processors and a non-transitory memory having instructions encoded thereon such that when the instructions are executed, the one or more processors perform operations of: selecting a fusion method; processing an input video sequence to generate a plurality of detections for each frame in the input video sequence, wherein a detection is a representation of an object candidate within the frame of the input video sequence; fusing the plurality of detections to generate a set of fused detections for each frame using the selected fusion method; sending the set of fused detections to a classification module, a scoring module, and an accept-reject module; wherein the classification module: a) generates a classification score for each fused detection, labeling the fused detection based on a predetermined classification threshold, and passes the classification score to a learning module and the accept-reject module; orb) generates a token indicating that the classification module has abstained from generating a classification score and passes the token to the learning module and the accept-reject module; wherein the scoring module produces a confidence score for each fused detection based on a set of learned parameters from the learning module and the set of fused detections; filtering of the set of fused detections by the accept-reject module based on one of the classification score or the confidence score; and outputting a set of final detections representing an object.
地址	Malibu CA US