Independent claim |
1. A method comprising:
computing, by a computer processor of a computing system, a term frequency-inverse document frequency (tf-idf) associated with n-grams of an n-gram model of a domain;
determining, by said computer processor based on said tf-idf, a frequently occurring group of n-grams of said n-grams;
generating, by said computer processor executing a deep parser component of said computing system with respect to said frequently occurring group of n-grams, a deep parse output comprising results of said executing said deep parser component with respect to said frequently occurring group of n-grams;
storing, by said computer processor in a database cache, said deep parse output;
indexing, by said computer processor executing said frequently occurring group of n-grams in said database cache, said deep parse output; and
verifying, by said computer processor, if a pre-computed specified text word sequence of said deep parse output is available in said database cache, wherein said verifying comprises:
retrieving from said deep parse output, a plurality of tokens of said deep parser output, wherein said plurality of tokens are associated with a portion of said pre-computed specified text word sequence, wherein said plurality of tokens comprise suffixes associated with structures of said deep parser output, and wherein said plurality of tokens comprise a version token; and
determining based on said plurality of tokens, variations associated with said pre-computed specified text word sequence. |
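
The steps recited in claim 1 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the tf-idf weighting (summed term frequency times a smoothed idf), the in-memory dictionary standing in for the database cache, the `fake_deep_parse` placeholder, the `/TOK` token format, and the `PARSER_VERSION` value are all assumptions made for illustration, since the claim does not specify any of them.

```python
import math
from collections import Counter

PARSER_VERSION = "v1"  # hypothetical version token, not taken from the claim


def ngrams(tokens, n):
    """Return the n-grams (as tuples) of a token sequence."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf_scores(docs, n=2):
    """Score each n-gram by its tf * idf summed over all documents.

    Assumed weighting: tf = count / n-grams in the document,
    idf = log(N / df) + 1, so n-grams found everywhere keep some weight.
    """
    doc_grams = [Counter(ngrams(d.lower().split(), n)) for d in docs]
    df = Counter(g for grams in doc_grams for g in grams)  # document frequency
    scores = Counter()
    for grams in doc_grams:
        total = sum(grams.values())
        for gram, count in grams.items():
            idf = math.log(len(docs) / df[gram]) + 1.0
            scores[gram] += (count / total) * idf
    return scores


def fake_deep_parse(gram):
    """Placeholder for a real deep parser; returns a toy parse structure."""
    return {
        "text": " ".join(gram),
        # The first token is the version token; the rest are toy structure tokens.
        "tokens": [PARSER_VERSION] + [word + "/TOK" for word in gram],
    }


def build_cache(docs, n=2, top_k=2):
    """Parse the top_k highest-scoring n-grams and index the output by n-gram."""
    frequent = [g for g, _ in tfidf_scores(docs, n).most_common(top_k)]
    return {gram: fake_deep_parse(gram) for gram in frequent}


def lookup(cache, text, n=2):
    """Return cached parses for n-grams of text whose version token is current."""
    hits = {}
    for gram in ngrams(text.lower().split(), n):
        entry = cache.get(gram)
        if entry is not None and entry["tokens"][0] == PARSER_VERSION:
            hits[gram] = entry
    return hits
```

A caller would build the cache once from a domain corpus and then call `lookup` on incoming text, so the expensive parser is skipped for any word sequence that is already cached. Checking the version token lets stale entries be ignored after a parser upgrade. The determination of variations recited in the final step is not modeled here.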