发明名称 |
TEXT MATCHING DEVICE AND METHOD, AND TEXT CLASSIFICATION DEVICE AND METHOD |
摘要 |
[Object] To provide a system for automatically and reliably collecting information belonging to a given category, and matching the information appropriately in a timely manner.;[Solution] A text classifying device 30 analyzes dependency of collected texts by a morpheme analyzing unit 52 and a dependency analyzing unit 54. A problem report collecting unit 64 specifies a core consisting of noun+predicate in a text based on dependency relation of the text, and using a combination of noun classification (trouble/non-trouble) and predicate classification (excitatory/inhibitory), classifies the text to a problem report or the rest, by a method referred to as core-based matrix. Support information collecting device 66 and request message collecting device 68 collect support information and request messages in the similar manner. A matching device 76 matches problem reports and support information collected by problem report collecting unit 64 and support information collecting device 66 by a method referred to as co-occurrence core matrix, and thus associates problem information (support information) with appropriate support information (problem information). |
申请公布号 |
US2016140217(A1) |
申请公布日期 |
2016.05.19 |
申请号 |
US201414898565 |
申请日期 |
2014.05.15 |
申请人 |
NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY |
发明人 |
SANO Motoki;VARGA Istvan;TORISAWA Kentaro;HASHIMOTO Chikara;OOTAKE Kiyonori;KAWAI Takao;OH Jonghoon;DE SAEGER Stijin |
分类号 |
G06F17/30;G06N5/04;G06N99/00 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A text matching device, matching, in a set of texts classified to a first or second category, a text in said first category with a text in said second category, wherein
a text included in said set is classified to said first or second category by a text classifying device using machine learning, using as features, one or a plurality of morphemes forming the text, dependency information of the one or a plurality of morphemes, and a combination of a noun classification and a predicate classification in a core of a sentence consisting of a combination of a noun included in said text and a predicate on which the noun depends; said text matching device comprising: storage means for storing texts of said first category and said second category distinguished from each other; text pair generating means for generating a text pair consisting of a text of said first category and a text of said second category; features-for-matching generating means for generating, from said pair, features-for-matching, including said features used when the text in said pair generated by said text pair generating means is classified by said text classifying device; and matching means for determining, using the features-for-matching generated by said features-for-matching generating means, whether two texts forming said pair match or not; wherein said matching means includes a machine learning model pre-trained using training data for matching in advance, to determine whether a pair of texts matches based on said features-for-matching. |
地址 |
Koganei-shi, Tokyo JP |