Title of invention: Determination of sequence frequency
Abstract: A computer-implemented sequence analysis process is provided for determining frequency in an n-gram hash sequence of a symbolic string, such as a series of characters. The method includes initializing a hash table, creating a matrix, reading the present value in the sequence, determining whether the present value is unknown, inserting a value index into the hash table if unknown and identifying the value index otherwise, and incrementing a cell within the array. The hash table has a plurality of levels from one to n. The matrix includes first and second indices corresponding to an array of cells. The first index corresponds to an end of a prior value, while the second index corresponds to a start of a present value. The cell corresponds to the first index for the prior value and the second index for the present value.
Publication number: US9087295(B1) Publication date: 2015.07.21
Application number: US201313783639 Filing date: 2013.03.04
Applicant: The United States of America as Represented by the Secretary of the Navy Inventors: Dillon Brian S.;Kelly Jeremy Gene
Classification: G06N5/02 Main classification: G06N5/02
Agency: Agent: Thielman, Esq Gerhard W.
Main claim: 1. A computer-implemented sequence analysis method for determining frequency in an n-gram sequence of a symbolic string, said method comprising: initializing a plurality of levels from one to n, such that operations for each level include: creating a hash table having first and second columns respectively assigned to a present value and a value index that corresponds to said present value; creating a matrix having first and second indices corresponding to an array of cells, said first index corresponding to an end of a prior value, said second index corresponding to a start of said present value; receiving said present value in the sequence; determining whether said present value is one of unknown and previously encountered; inserting said value index into said hash table for said present value being unknown, and identifying said value index in said hash table for said present value being encountered; and incrementing a cell within said array, said cell corresponding to said first index for said prior value and said second index for said present value.
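The steps recited in the claim can be illustrated with a minimal Python sketch. This is one possible reading, not the patented implementation, and it rests on several assumptions that do not come from the record: the function name `count_transitions` is hypothetical; a "value" at level k is taken to be a k-character substring; the "prior value" is taken to be the k-gram that ends exactly where the present k-gram starts; and the matrix is stored sparsely as a `Counter` keyed by `(prior index, present index)` rather than as a dense array.

```python
from collections import Counter


def count_transitions(text, n):
    """Return {level: (index_of, counts)} for levels 1..n.

    Hypothetical sketch. Assumptions: a value at level k is a k-character
    substring, and the prior value is the k-gram ending where the present
    k-gram starts.
    """
    levels = {}
    for k in range(1, n + 1):
        index_of = {}       # hash table: present value -> value index
        counts = Counter()  # sparse matrix: (prior index, present index) -> count
        for start in range(len(text) - k + 1):
            value = text[start:start + k]
            # Unknown value: insert a new index. Known value: look it up.
            if value not in index_of:
                index_of[value] = len(index_of)
            present = index_of[value]
            # A prior value exists once a full k-gram precedes this one.
            if start >= k:
                prior = index_of[text[start - k:start]]
                counts[prior, present] += 1
        levels[k] = (index_of, counts)
    return levels
```

Calling `count_transitions("abab", 2)` returns one hash table and one transition count per level. A dense two-dimensional array would match the claim's wording more literally; the sparse counter is used here only because the number of distinct values is not known until the string has been read.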
Address: Washington DC US