一种基于隐含狄利克雷分配模型的并行数据处理方法,申请号CN200810126728.3-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	一种基于隐含狄利克雷分配模型的并行数据处理方法
摘要	本发明公开了一种基于隐含狄利克雷分配模型的并行数据处理方法，属于数据挖掘领域，该方法包含了多进程并行处理、多线程并行处理和复合多进程多线程处理三种方案，在这三种方案中都将要处理的数据DM分成长度为等长或不等长的数据片段，每个数据片段都有一个索引，每个计算进程/线程通过申请索引来处理对应的数据片段，进而获得每个数据项的主题信息并生成局部充分统计量；处理完整个DM后，通过归并局部充分统计量，得到全局充分统计量，即可估计得到当前模型Mi，直到该模型收敛。该方法能够充分利用单机上的多内核并行架构和多机上的机群大规模并行能力，进而实现对大规模文本集合的高速处理，并能有效降低并行处理过程中内存的使用量。
申请公布号	CN101359333B	申请公布日期	2010.06.16
申请号	CN200810126728.3	申请日期	2008.06.20
申请人	中国科学院软件研究所	发明人	李文波;孙乐
分类号	G06F17/30(2006.01)I	主分类号	G06F17/30(2006.01)I
代理机构	北京君尚知识产权代理事务所(普通合伙) 11200	代理人	余长江
主权项	一种基于隐含狄利克雷分配模型的并行数据处理方法，对于多进程并行处理，其步骤包括：1)根据节点计算机的硬件并发能力自动生成具有相应数量计算进程；2)输入要处理的数据集，随机给出初始模型M0；3)将数据集分成若干数据片段，每个数据片段包含若干文档，且数据片段的长度远小于文档数，每个数据片段有一个索引；4)每个计算进程申请一个索引处理相应数据片段，并计算生成局部充分统计量；5)归并局部充分统计量，得到全局充分统计量，进而估计得到当前模型Mi；6)判断模型Mi是否收敛，收敛则完成计算，否则从步骤3)开始循环。
地址	100190 北京市海淀区中关村南四街4号

您可能感兴趣的专利

Process for monitoring the direction of frictional drive from a vehicle transmission at near-zero vehicle speed

Method for controlling the steering feedback torque

Climate control system and method for optimizing energy consumption of a vehicle

Robot safety system and a method

Method for operating a management system of function modules

Treatment of shoulder dysfunction using a percutaneous intramuscular stimulation system

Handheld mobile communication device with moveable display/cover member

Automated communication using image capture

Radiation reducing apparatus for wireless communication device

System and method for operating a communication service

Methods and apparatus for line selection in a communication device

Cell selection method and mobile station

Using local codecs

Self-detecting electronic connection for electronic devices

Method and device for bidirectional IR data transfer between a medical treatment table and an operator control device

Image pickup apparatus with back focus adjustment mechanism

Method of automatically editing media recordings

Method of manufacturing an optical composite

Image retrieval system and method

System and methods for handling financial document returns and processing exceptions