发明名称 Patient data mining
摘要 The present invention provides a data mining framework for mining high-quality structured clinical information. The data mining framework includes a data miner that mines medical information from a computerized patient record (CPR) based on domain-specific knowledge contained in a knowledge base. The data miner includes components for extracting information from the CPR, combining all available evidence in a principled fashion over time, and drawing inferences from this combination process. The mined medical information is stored in a structured CPR which can be a data warehouse.
申请公布号 US8949079(B2) 申请公布日期 2015.02.03
申请号 US200912488083 申请日期 2009.06.19
申请人 Siemens Medical Solutions USA, Inc. 发明人 Rao R. Bharat;Sandilya Sathyakama;Amies Christopher Jude;Niculescu Radu Stefan;Goel Arun Kumar;Warrick Thomas R.
分类号 G06F7/60;G06F17/10;G06F19/00;G06F17/30;G06Q10/10;G06Q50/22;G06Q50/24 主分类号 G06F7/60
代理机构 代理人 Ryan Joshua B
主权项 1. A system for producing structured clinical information from patient records, the system comprising: a patient record comprising at least two data sources having patient information, at least one of the data sources being an unstructured data source and at least one of the data sources being a structured data source; a probabilistic data miner of a computer platform configured to (a) extract multiple pieces of information related to a variable for a patient from mining structured data of the at least one structured data source and mining unstructured data of the at least one unstructured data source of the patient record, the mining of the at least one unstructured data source comprising mining free text information, and (b) combine the extracted multiple pieces of information related to the variable into a value of the variable for the patient, the value being a function of the multiple pieces related to the variable, and the data miner configured to repeat (a) and (b) for a plurality of different variables of the same patient for a same time, each repetition of extracting and combining multiple pieces of information related to the variable of the different variables being handled in the repetition such that the multiple pieces of information for one variable are different than the multiple pieces of information for other ones of the different variables, the variable and the different variables comprising characteristics of the patient at the time; wherein one or both of (a) and (b) are performed as a function of domain-specific criteria.
地址 Malvern PA US