Title of invention METHOD AND DEVICE FOR POINTS OF INTEREST DATA REDUNDANCY CHECK
Abstract The invention discloses a method and device for point of interest (POI) data redundancy detection. The method comprises the following steps: determining the geographic area of a POI according to the position information of the POI data; extracting name feature words from the name information of the POI data; assigning POI data that lie in the same geographic area and share the same name feature words to the same redundant data candidate set; and calculating the similarity of any two POI data items in the redundant data candidate set, the two items being judged to be redundant data of each other if the similarity meets a preset requirement. According to the technical scheme of the invention, POI data are first partitioned spatially into coarse-grained geographic areas, without relying on accurate longitude and latitude information, and redundancy detection is then performed in combination with other information of the POI data. The scheme keeps computational complexity under control, so the method and device can be applied effectively to large-scale POI data redundancy detection.
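The four steps of the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how areas, feature words, or similarity are computed, so every concrete choice here is an assumption made for illustration. A coarse grid cell stands in for the geographic area, `GENERIC_WORDS` is a hypothetical list of words ignored as name features, similarity is a plain string ratio from the standard library's `difflib`, and `CELL_DEG` and `THRESHOLD` are arbitrary values.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import combinations
import math

# Hypothetical parameters; none of these values come from the source.
CELL_DEG = 0.1                                      # coarse grid cell size, in degrees
GENERIC_WORDS = {"the", "store", "shop", "branch"}  # words not treated as name features
THRESHOLD = 0.8                                     # stands in for the "preset requirement"


def area_of(poi):
    """Step 1: map a POI to a coarse grid cell (assumed stand-in for 'geographic area')."""
    return (math.floor(poi["lat"] / CELL_DEG), math.floor(poi["lon"] / CELL_DEG))


def feature_words(poi):
    """Step 2: name feature words, here lowercase tokens minus generic words."""
    return {w for w in poi["name"].lower().split() if w not in GENERIC_WORDS}


def similarity(a, b):
    """Step 4 helper: illustrative string similarity over name plus address."""
    text_a = f'{a["name"]} {a.get("address", "")}'.lower()
    text_b = f'{b["name"]} {b.get("address", "")}'.lower()
    return SequenceMatcher(None, text_a, text_b).ratio()


def find_redundant(pois):
    """Return index pairs (i, j), i < j, of POIs judged redundant with each other."""
    # Step 3: one candidate set per (area, feature word) combination.
    candidates = defaultdict(list)
    for idx, poi in enumerate(pois):
        area = area_of(poi)
        for word in feature_words(poi):
            candidates[(area, word)].append(idx)

    # Step 4: pairwise comparison only inside each candidate set.
    redundant = set()
    for members in candidates.values():
        for i, j in combinations(members, 2):
            if similarity(pois[i], pois[j]) >= THRESHOLD:
                redundant.add((i, j))
    return redundant
```

Because pairs are compared only within a candidate set, the number of similarity computations depends on the size of each set rather than on the square of the whole data set, which is the complexity control the abstract refers to.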
Publication number HK1201601(A1) Publication date 2015.09.04
Application number HK20150101918 Filing date 2015.02.26
Applicant ALIBABA GROUP HOLDING LIMITED Inventor ZHANG, BUFENG
Classification G06F Main classification G06F
Agency Agent
Main claim
Address