发明名称 Scanbox
摘要 Embodiments are provided for content item classification. In some embodiments, an image for classification is received, a compact representation for the image having values indicative of pixel values within the received image is generated, a plurality of angle measurements for possible edges of at least one potential document within the received image are determined, and the image is classified using said compact representation and said plurality of angle measurements.
申请公布号 US9171203(B2) 申请公布日期 2015.10.27
申请号 US201314022933 申请日期 2013.09.10
申请人 DROPBOX, INC. 发明人 Chajed Tej;Welinder Peter;Babekno Boris;Simeonov Dimitar
分类号 G06K9/48;G06K9/00;G06K9/46;G06T7/00 主分类号 G06K9/48
代理机构 Keller Jolley Preece 代理人 Keller Jolley Preece
主权项 1. A method for content item classification, comprising: receiving an image for classification; generating, using at least one processor, a compact representation for the image by downsampling the received image, the compact representation having a reduced set of pixel values indicative of pixel values within the received image; identifying, using the at least one processor, a plurality of angle measurements for possible page edges of at least one potential document within the received image, wherein identifying the plurality of angle measurements for possible page edges comprises: calculating a plurality of gradient values from the reduced set of pixel values;identifying, based on the plurality of gradient values, one or more edge candidates of the at least one potential document; andcalculating the plurality of angle measurements based on a vector extending from a selected origin to a point on each of the one or more edge candidates; determining, based on the identified plurality of angle measurements, that the image contains a document; and in response to determining that the image contains a document, classifying the image as a document containing image based on the identified plurality of angle measurements for possible page edges.
地址 San Francisco CA US