发明名称 Label-embedding for text recognition
摘要 A system and method for comparing a text image and a character string are provided. The method includes embedding a character string into a vectorial space by extracting a set of features from the character string and generating a character string representation based on the extracted features, such as a spatial pyramid bag of characters (SPBOC) representation. A text image is embedded into a vectorial space by extracting a set of features from the text image and generating a text image representation based on the text image extracted features. A compatibility between the text image representation and the character string representation is computed, which includes computing a function of the text image representation and character string representation.
申请公布号 US9008429(B2) 申请公布日期 2015.04.14
申请号 US201313757014 申请日期 2013.02.01
申请人 Xerox Corporation 发明人 Rodriguez-Serrano Jose Antonio;Perronnin Florent C.
分类号 G06K9/00;G06K9/18 主分类号 G06K9/00
代理机构 Fay Sharpe LLP 代理人 Fay Sharpe LLP
主权项 1. A method for comparing a text image and a character string comprising: embedding a character string into a vectorial space, comprising extracting a set of features from the character string and generating a character string representation based on the extracted character string features; embedding a text image into a vectorial space, comprising extracting a set of features from the text image and generating a text image representation based on the extracted text image features; and computing a compatibility between the text image representation and character string representation comprising computing a function of the text image representation and character string representation, the function including an embedding parameter w which is a DE-dimensional vector or a D×E matrix W which embeds the text image representation and character string representation into a new space, where D is the dimensionality of the text image representation and E is the dimensionality of the character string representation, wherein at least one of the embedding and the computing of the compatibility is performed with a processor.
地址 Norwalk CT US