发明名称 CHARACTER SEQUENCE MAP GENERATING APPARATUS, INFORMATION SEARCHING APPARATUS, CHARACTER SEQUENCE MAP GENERATING METHOD, INFORMATION SEARCHING METHOD, AND COMPUTER PRODUCT
摘要 A computer-readable recording medium stores therein a sequence-map generating program that causes a computer to execute extracting from files that include character strings written therein, a word having q (q≧2) characters; extracting from the word extracted at the extracting the word, consecutive characters from a character position s-th (1≦s≦q−r+1) from a head of the word to a character position determined by a number of characters r (r≦q); and generating, for each character position s-th from the head, a consecutive-character sequence map including a flag row that indicates, for each file, whether a file includes the consecutive characters extracted at the extracting the consecutive characters.
申请公布号 US2016026630(A1) 申请公布日期 2016.01.28
申请号 US201514835053 申请日期 2015.08.25
申请人 FUJITSU LIMITED 发明人 Kataoka Masahiro;Nagase Tomoki;Tsubokura Takashi
分类号 G06F17/30 主分类号 G06F17/30
代理机构 代理人
主权项 1. A searching apparatus comprising: a word extracting unit that extracts a word that includes a plurality of characters, from a plurality of files that include character strings written therein, the character strings including keywords; a consecutive-character extracting unit that extracts consecutive characters of a given number from a given position of the word extracted by the word extracting unit; a judging unit that judges for each of the consecutive characters extracted by the consecutive-character extracting unit and based on information that correlates each of the keywords included in the files and a file that includes the keyword, whether the consecutive characters matches any of the keywords included in the information; a generating unit that generates for each of the consecutive characters judged to match the keyword by the judging unit, a consecutive-character sequence map that includes flag rows indicating whether the consecutive characters are included in each of the files; and a determining unit that determines, when a keyword for which a search is requested is searched for from among the files and based on the consecutive-character sequence map generated by the generating unit, a file that includes a keyword that matches the keyword for which the search is requested.
地址 Kawasaki-shi JP