SYSTEM AND METHOD FOR EXTRACTING DISTRIBUTED PARALLEL ENTITY BASED ON MAPREDUCE
摘要
PURPOSE: A MapReduce based dispersion parallel entity extracting system and a method thereof are provided to guarantee shortened entity extracting response time by extracting entity based on a MapReduce framework. CONSTITUTION: A master server device(100) distributes target document data to slave server devices(200a-200N) by dividing an input document into the target document data. The slave server device converts the target document data into a data format which is able to be processed in a MapReduce framework, divides the content of the converted document into sentences, and divides the divided sentences into construction units. The slave server device extracts the combination of the construction units as entity candidates and defines a relationship between the extracted entities. [Reference numerals] (100) Master server; (200a) Slave server 1; (200b) Slave server 2; (200N) Slave server N;
申请公布号
KR101255060(B1)
申请公布日期
2013.04.16
申请号
KR20120077379
申请日期
2012.07.16
申请人
KOREA INSTITUTE OF SCIENCE & TECHNOLOGY INFORMATION
发明人
UM, JUNG HO;CHOI, SUNG PIL;CHOI, YUN SOO;JEONG, CHANG HOO;KIM, TAE HONG;SONG, SA KWANG;JUNG, HAN MIN