发明名称 SYSTEMS, METHODS AND COMPUTER PROGRAM PRODUCTS FOR BUILDING A DATABASE ASSOCIATING N-GRAMS WITH COGNITIVE MOTIVATION ORIENTATIONS
摘要 Computer-implemented methods can transform a corpus of meaningful text sequences into a generalized computer-usable repository of neurolinguistic information that can be applied by one or more computer systems. The computer system(s) can use the neurolinguistic information to neurolinguistically analyze meaningful text sequences to derive statistical information and identify dominant cognitive motivation orientations expressed in those text sequences. The identified dominant cognitive motivation orientations can be used to improve the efficacy of both human-generated and machine-generated communications. The computer system(s) thereby transform a meaningful text sequence into actionable information about the dominant cognitive motivation orientation(s) of the author of that text sequence within the context in which the text sequence was composed. Computer systems and computer-program products for implementing the methods are also described.
申请公布号 US2016239479(A1) 申请公布日期 2016.08.18
申请号 US201615014747 申请日期 2016.02.03
申请人 Weongozi Inc. 发明人 Rose Charvet Shelle;Tschichholz Michael Horst;Busemann Stephan;Steffen Jorg;Rose Jonathan Scott
分类号 G06F17/27;G06N7/00;G06F17/30;G06N99/00 主分类号 G06F17/27
代理机构 代理人
主权项 1. A computer-implemented method for building an analysis database associating each of a plurality of n-grams with corresponding respective cognitive motivation orientations, comprising: receiving a training corpus of training documents in electronic form; each training document comprising a plurality of meaningfully arranged words; each training document having at least one annotated word sequence therein; wherein within each training document, each particular annotated word sequence is annotated with a corresponding word-sequence-level annotation identifying at least one cognitive motivation orientation that is associated with that particular annotated word sequence; for each training document: for each annotated word sequence in that particular training document: extracting n-grams overlapping that particular annotated word sequence; and associating each extracted n-gram with the at least one cognitive motivation orientation associated with that particular annotated word sequence; generating a set of indicator candidate n-grams wherein: each indicator candidate n-gram represents all instances of a particular n-gram in the training corpus for which at least one instance of that particular n-gram was extracted from any annotated word sequence in any training document;each indicator candidate n-gram being associated with every cognitive motivation orientation that is associated with at least one instance of the particular n-gram represented by that particular indicator candidate n-gram; applying at least one relevance filter to each indicator candidate n-grams in the set of indicator candidate n-grams to obtain a set of indicator n-grams, wherein: the set of indicator n-grams is a subset of the set of indicator candidate n-grams, so that each indicator n-gram corresponds to only one indicator candidate n-gram and thereby each indicator n-gram represents all instances of a corresponding particular n-gram in the training corpus for which at least one instance of that particular n-gram was extracted from any annotated word sequence in any training document;each indicator n-gram is associated with only a single cognitive motivation orientation; andeach indicator n-gram has, as its associated single cognitive motivation orientation, that single cognitive motivation orientation with which the instances of the particular n-gram represented by that particular indicator n-gram are most frequently associated.
地址 Burlington CA