基于大规模术语语料库对译稿自动碎片化分类的方法,申请号CN201210591759.2-传众专利搜索

首页产品黄页商标征信

会员服务注册登录

法人/股东/高管

发明名称	基于大规模术语语料库对译稿自动碎片化分类的方法
摘要	本发明提供了一种基于大规模术语语料库对译稿自动碎片化分类的方法，包括：对译稿进行分词处理，去除停用词，获得其关键词集合，提取译稿每段的各个关键词，建立每个段落与其包含的各个关键词的对应关系；将所述译稿的各个关键词逐个在术语语料库中匹配，将每个关键词匹配的术语的行业类别属性，作为该关键词在其对应的每个段所归属的行业类别属性；根据所述对应关系，确定每个段包含相同的最多的行业类别属性；将最多的行业类别属性对该段分类。由于译稿的词语数要远小于术语库的词语数；且术语库具备按字母顺序查找的功能，在其中进行关键词匹配不需要采用模式匹配算法，可以极大的减少查询时间。缩短对译稿碎片化的时间，提高碎片化效率。
申请公布号	CN103106245A	申请公布日期	2013.05.15
申请号	CN201210591759.2	申请日期	2012.12.31
申请人	武汉传神信息技术有限公司	发明人	江潮
分类号	G06F17/30(2006.01)I	主分类号	G06F17/30(2006.01)I
代理机构		代理人
主权项	一种基于大规模术语语料库对译稿自动碎片化分类的方法，其特征在于，包括：提取译稿每段的各个关键词，建立每个段落与其包含的各个关键词的对应关系；将所述译稿的各个关键词逐个在术语语料库中匹配，将每个关键词匹配的术语的行业类别属性，作为该关键词在其对应的每个段所归属的行业类别属性；根据所述对应关系，确定每个段包含相同的最多的行业类别属性；将最多的行业类别属性对该段分类。
地址	430073 湖北省武汉市东湖开发区光谷软件园一期以西、南湖南路以南光谷软件园六期2幢5层205号

您可能感兴趣的专利

Antistatic compositions based on polyamide

Aqueous fungicide dispersion

Bis-(N,N'-bis-(2-haloethyl)amino)phosphoramidates as antitumor agents

Exhaust system for watercraft

Method and apparatus for dynamically rendering components at run time

Cable joint with improved screen connection

Methods of treating cardiovascular diseases, dyslipidemia, dyslipoproteinemia, and hypertension with ether compounds

Resin composition for denture base

Aminoplast resin photochromic coating composition and photochromic articles

Gas-liquid reaction process including ejector and monolith catalyst

Process for controlling scale in the sugar process

Method for subcutaneous access to the vascular system of a patient

Pin header and a method of making same

Cage for constant-velocity joint and method for manufacturing the same

Integrally formed stamping sheet-metal blades having 3D structure

Photopolymerizable composition

Magnetically shielded conductor

Method for estimating optical flow

Antibodies to human tumor necrosis factor receptor TR10

Information recording apparatus