发明名称 METHODS AND SYSTEMS FOR EXTRACTING PHENOTYPIC INFORMATION FROM THE LITERATURE VIA NATURAL LANGUAGE PROCESSING
摘要 Systems and methods for extracting and encoding genotype-phenotype information from journal articles and other publications are provided. In some embodiments, the disclosed subject matter includes a preprocessor, boundary identifier, parser, phrase recognizer and an encoder to convert natural-language input text and parameters into structured text. The structured text can take the form of codes which account for genotype-phenotype information and are compatible with a controlled vocabulary.
申请公布号 US2010010804(A1) 申请公布日期 2010.01.14
申请号 US20090498898 申请日期 2009.07.07
申请人 THE TRUSTEES OF COLUMBIA UNIVERSITY IN THE CITY OF NEW YORK 发明人 FRIEDMAN CAROL;LUSSIER YVES A.;ENA LYUDMILA
分类号 G06F17/27 主分类号 G06F17/27
代理机构 代理人
主权项
地址