发明名称 METHOD AND SYSTEM FOR TEXT SUMMARIZATION AND SUMMARY BASED QUERY ANSWERING
摘要 A method and system for generating answers to questions based on electronic data summary which is itself derived on context and semantics of a corpus of authoritative documents and its subsequent usage is disclosed. The method and system provides for generating a taxonomy of concepts, assigning unique-identifiers and weights to the taxonomy concepts using a given corpus of electronic data, using the taxonomy to identify the semantics of the document to be summarized, generating an ontology from a summarized authoritative text, having the ontology generation and the summary generation in a feedback loop, selecting text from a given document based on the weights of unique-identifiers in the taxonomy/ontology, sentences as a summary and pruning of the list based upon an entropy threshold, and the presence of a probability distribution, publishing of the summary in a known format on server or any other software/hardware platform with or without monetization for consumption, usage of the summary to generate answers which can be configured using an ontology and thus prevent denial of information/information overload.
申请公布号 US2010287162(A1) 申请公布日期 2010.11.11
申请号 US20090413518 申请日期 2009.03.28
申请人 SHIRWADKAR SANIKA 发明人 SHIRWADKAR SANIKA
分类号 G06F17/30;G06F17/21 主分类号 G06F17/30
代理机构 代理人
主权项
地址