Title of invention SYSTEM AND METHOD FOR EXTRACTING DISTRIBUTED PARALLEL ENTITY BASED ON MAPREDUCE
Abstract PURPOSE: A MapReduce-based distributed parallel entity extraction system and a method thereof are provided to guarantee a shorter entity extraction response time by extracting entities on a MapReduce framework. CONSTITUTION: A master server device (100) divides an input document into target document data and distributes the target document data to slave server devices (200a-200N). Each slave server device converts its target document data into a data format that can be processed in a MapReduce framework, divides the content of the converted document into sentences, and divides the sentences into construction units. The slave server device then extracts combinations of the construction units as entity candidates and defines relationships between the extracted entities. [Reference numerals] (100) Master server; (200a) Slave server 1; (200b) Slave server 2; (200N) Slave server N
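The flow described in the abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: every function name is hypothetical, "construction units" are approximated by word tokens, entity candidates by runs of capitalized tokens, and a "relationship" by sentence-level co-occurrence counts. All of these are assumptions made only so the example can run.

```python
import re
from collections import defaultdict
from itertools import combinations


def split_document(text, n_parts):
    """Master step: divide the input document into target document data."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    size = max(1, -(-len(lines) // n_parts))  # ceiling division
    return ["\n".join(lines[i:i + size]) for i in range(0, len(lines), size)]


def to_records(chunk):
    """Slave step: convert a chunk into (key, value) records (assumed format)."""
    return list(enumerate(chunk.splitlines()))


def split_sentences(line):
    """Divide record content into sentences at ., ! or ? (naive rule)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", line) if s.strip()]


def tokenize(sentence):
    """Divide a sentence into units; plain word tokens are an assumption."""
    return re.findall(r"[A-Za-z0-9]+", sentence)


def candidates(tokens):
    """Combine consecutive capitalized tokens into entity candidates (assumed heuristic)."""
    found, run = [], []
    for tok in tokens:
        if tok[0].isupper():
            run.append(tok)
        elif run:
            found.append(" ".join(run))
            run = []
    if run:
        found.append(" ".join(run))
    return found


def map_chunk(chunk):
    """Map phase: emit ((entity_a, entity_b), 1) per co-occurring pair in a sentence."""
    for _key, line in to_records(chunk):
        for sentence in split_sentences(line):
            entities = sorted(set(candidates(tokenize(sentence))))
            for pair in combinations(entities, 2):
                yield pair, 1


def reduce_pairs(mapped):
    """Reduce phase: sum the counts emitted for each entity pair."""
    counts = defaultdict(int)
    for key, value in mapped:
        counts[key] += value
    return dict(counts)


def run(text, n_slaves=2):
    """Simulate master and slaves sequentially in one process."""
    chunks = split_document(text, n_slaves)
    mapped = [kv for chunk in chunks for kv in map_chunk(chunk)]
    return reduce_pairs(mapped)
```

Calling `run(text, n_slaves=2)` on a short text returns a dictionary that maps each sorted entity pair to the number of sentences in which both appear. In an actual cluster each chunk would be processed on a separate slave server; here the chunks are handled one after another purely for illustration.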
Publication number KR101255060(B1) Publication date 2013.04.16
Application number KR20120077379 Filing date 2012.07.16
Applicant KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION Inventors UM, JUNG HO;CHOI, SUNG PIL;CHOI, YUN SOO;JEONG, CHANG HOO;KIM, TAE HONG;SONG, SA KWANG;JUNG, HAN MIN
Classification G06F15/16;G06F9/38 Main classification G06F15/16
Agency Agent
Main claim
Address