发明名称 |
CHARACTER SEQUENCE MAP GENERATING APPARATUS, INFORMATION SEARCHING APPARATUS, CHARACTER SEQUENCE MAP GENERATING METHOD, INFORMATION SEARCHING METHOD, AND COMPUTER PRODUCT |
摘要 |
A computer-readable recording medium stores therein a sequence-map generating program that causes a computer to execute extracting from files that include character strings written therein, a word having q (q≧2) characters; extracting from the word extracted at the extracting the word, consecutive characters from a character position s-th (1≦s≦q−r+1) from a head of the word to a character position determined by a number of characters r (r≦q); and generating, for each character position s-th from the head, a consecutive-character sequence map including a flag row that indicates, for each file, whether a file includes the consecutive characters extracted at the extracting the consecutive characters. |
申请公布号 |
US2016026630(A1) |
申请公布日期 |
2016.01.28 |
申请号 |
US201514835053 |
申请日期 |
2015.08.25 |
申请人 |
FUJITSU LIMITED |
发明人 |
Kataoka Masahiro;Nagase Tomoki;Tsubokura Takashi |
分类号 |
G06F17/30 |
主分类号 |
G06F17/30 |
代理机构 |
|
代理人 |
|
主权项 |
1. A searching apparatus comprising:
a word extracting unit that extracts a word that includes a plurality of characters, from a plurality of files that include character strings written therein, the character strings including keywords; a consecutive-character extracting unit that extracts consecutive characters of a given number from a given position of the word extracted by the word extracting unit; a judging unit that judges for each of the consecutive characters extracted by the consecutive-character extracting unit and based on information that correlates each of the keywords included in the files and a file that includes the keyword, whether the consecutive characters matches any of the keywords included in the information; a generating unit that generates for each of the consecutive characters judged to match the keyword by the judging unit, a consecutive-character sequence map that includes flag rows indicating whether the consecutive characters are included in each of the files; and a determining unit that determines, when a keyword for which a search is requested is searched for from among the files and based on the consecutive-character sequence map generated by the generating unit, a file that includes a keyword that matches the keyword for which the search is requested. |
地址 |
Kawasaki-shi JP |