Title of invention: Extracting and scoring class-instance pairs
Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for extracting and scoring class-instance pairs. One method includes applying extraction patterns to document text to derive class-instance pairs, determining a frequency score and a diversity score for each distinct class-instance pair, and determining a pair score for each class-instance pair from the frequency score and the diversity score. Another method includes applying extraction patterns to document text to derive candidate class-instance pairs, determining, for each distinct candidate class-instance pair, a number of distinct phrases from which the distinct candidate class-instance pair was derived, and determining a pair score for each distinct candidate class-instance pair from the number of distinct phrases from which the candidate class-instance pair was extracted.
Publication number: US8452763(B1)  Publication date: 2013.05.28
Application number: US20100727940  Filing date: 2010.03.19
Applicant: PASCA MARIUS; GOOGLE INC.  Inventor: PASCA MARIUS
Classification: G06F17/30  Main classification: G06F17/30
Agency:  Agent:
Principal claim:
Address:
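
The first method in the abstract (apply extraction patterns, compute a frequency score and a diversity score per distinct pair, then combine them into a pair score) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the single "such as" pattern, the choice to treat the whole source sentence as the "phrase", the names `PATTERN`, `extract_pairs`, and `score_pairs`, and the use of a simple product to combine the two scores.

```python
import re
from collections import defaultdict

# Hypothetical extraction pattern: "<class> such as <Instance>".
# The record does not list the actual patterns; this one is illustrative.
PATTERN = re.compile(r"(?P<cls>[a-z]+) such as (?P<inst>[A-Z][a-z]+)")


def extract_pairs(sentences):
    """Yield (class, instance, source sentence) for every pattern match."""
    for sentence in sentences:
        for match in PATTERN.finditer(sentence):
            yield match.group("cls"), match.group("inst"), sentence


def score_pairs(sentences):
    """Return {(class, instance): pair score} for each distinct pair.

    frequency = number of times the pair was extracted
    diversity = number of distinct source sentences that produced it
    pair score = frequency * diversity (assumed combination rule)
    """
    frequency = defaultdict(int)
    sources = defaultdict(set)
    for cls, inst, sentence in extract_pairs(sentences):
        pair = (cls, inst)
        frequency[pair] += 1
        sources[pair].add(sentence)
    return {pair: frequency[pair] * len(sources[pair]) for pair in frequency}


if __name__ == "__main__":
    corpus = [
        "cities such as Paris are busy",
        "cities such as Paris are busy",
        "big cities such as Paris",
        "cities such as Oslo",
    ]
    for pair, score in sorted(score_pairs(corpus).items()):
        print(pair, score)
```

In this toy corpus the pair (cities, Paris) is extracted three times from two distinct sentences, so repeated copies of one sentence raise its frequency but not its diversity. The second method in the abstract would correspond to scoring from the distinct-phrase count alone.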