Title of invention |
Label-embedding for text recognition |
Abstract |
A system and method for comparing a text image and a character string are provided. The method includes embedding a character string into a vectorial space by extracting a set of features from the character string and generating a character string representation based on the extracted features, such as a spatial pyramid bag of characters (SPBOC) representation. A text image is embedded into a vectorial space by extracting a set of features from the text image and generating a text image representation based on the text image extracted features. A compatibility between the text image representation and the character string representation is computed, which includes computing a function of the text image representation and character string representation. |
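The abstract names the spatial pyramid bag of characters (SPBOC) representation but does not spell out its construction. The following is a minimal sketch of one plausible reading, not the patented implementation. It assumes a 36-symbol alphabet (uppercase letters and digits), a pyramid with 2^level regions per level, and that each character contributes to a region in proportion to how much of its span falls inside that region. The function name `spboc` and its parameters are hypothetical.

```python
import string

# Assumed alphabet: uppercase letters and digits (not specified in the record).
ALPHABET = string.ascii_uppercase + string.digits


def spboc(text, levels=3, alphabet=ALPHABET):
    """Return a hypothetical spatial pyramid bag-of-characters vector.

    Level l splits the string into 2**l equal regions. Each region gets a
    character histogram; a character adds the fraction of its span that
    overlaps the region. All histograms are concatenated.
    """
    index = {c: i for i, c in enumerate(alphabet)}
    n = len(text)
    features = []
    for level in range(levels):
        regions = 2 ** level
        for r in range(regions):
            lo, hi = r / regions, (r + 1) / regions
            hist = [0.0] * len(alphabet)
            for pos, ch in enumerate(text):
                # Character `pos` spans [pos/n, (pos+1)/n] of the string.
                c_lo, c_hi = pos / n, (pos + 1) / n
                overlap = max(0.0, min(hi, c_hi) - max(lo, c_lo))
                # Characters outside the alphabet are ignored.
                if overlap > 0 and ch in index:
                    hist[index[ch]] += overlap * n
            features.extend(hist)
    return features
```

With the default three levels the vector has 36 × (1 + 2 + 4) = 252 dimensions. The first 36 entries form an ordinary bag of characters, and the later entries record roughly where in the string each character occurs.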
Publication number |
US9008429(B2) |
Publication date |
2015.04.14 |
Application number |
US201313757014 |
Filing date |
2013.02.01 |
Applicant |
Xerox Corporation |
Inventors |
Rodriguez-Serrano Jose Antonio;Perronnin Florent C. |
Classification |
G06K9/00;G06K9/18 |
Main classification |
G06K9/00 |
Patent agency |
Fay Sharpe LLP |
Agent |
Fay Sharpe LLP |
Main claim |
1. A method for comparing a text image and a character string comprising:
embedding a character string into a vectorial space, comprising extracting a set of features from the character string and generating a character string representation based on the extracted character string features;
embedding a text image into a vectorial space, comprising extracting a set of features from the text image and generating a text image representation based on the extracted text image features; and
computing a compatibility between the text image representation and character string representation comprising computing a function of the text image representation and character string representation, the function including an embedding parameter w which is a DE-dimensional vector or a D×E matrix W which embeds the text image representation and character string representation into a new space, where D is the dimensionality of the text image representation and E is the dimensionality of the character string representation,
wherein at least one of the embedding and the computing of the compatibility is performed with a processor. |
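The claim states only that the compatibility function includes a D×E matrix W or, equivalently, a DE-dimensional vector w. As an illustration, the sketch below assumes the simplest function of that shape, the bilinear form x^T W y, where x is the D-dimensional text image representation and y is the E-dimensional character string representation. The function names and the row-major flattening of W into w are assumptions made for this example, not details taken from the claim.

```python
def compatibility(x, W, y):
    """Bilinear score x^T W y.

    x: text image representation, length D
    y: character string representation, length E
    W: D rows of E values (assumed form of the embedding matrix)
    """
    return sum(
        x[d] * W[d][e] * y[e]
        for d in range(len(x))
        for e in range(len(y))
    )


def compatibility_flat(x, w, y):
    """Same score with w given as a flat DE-dimensional vector.

    Assumes w is W flattened in row-major order, so w[d * E + e] == W[d][e].
    """
    E = len(y)
    return sum(
        w[d * E + e] * x[d] * y[e]
        for d in range(len(x))
        for e in range(E)
    )


# Toy example with D = 2 and E = 3 (values chosen arbitrarily).
x = [1.0, 2.0]
y = [3.0, 4.0, 5.0]
W = [[1.0, 0.0, 2.0],
     [0.0, 1.0, 0.0]]
w = [value for row in W for value in row]

score_matrix = compatibility(x, W, y)
score_vector = compatibility_flat(x, w, y)
```

Both forms give the same score, which is why the claim can describe the embedding parameter either as the vector w or as the matrix W.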
Address |
Norwalk CT US |