发明名称 Fast distributed database frequency summarization
摘要 A mechanism is provided for computing the frequency packets in network devices. Respective packets are associated with entities in a vector, where each of the entities is mapped to corresponding ones of the respective packets, and the entities correspond to computers. Upon a network device receiving the respective packets, a count is individually increased for the respective packets in the vector respectively mapped to the entities, and computing a matrix vector product of a matrix A and the vector. The matrix A is a product of at least a first matrix and a second matrix. The first matrix includes rows and columns where each of the rows has a single random location with a one value and remaining locations with zero values. The matrix vector product is transmitted to a centralized computer for aggregating with other matrix vector products.
申请公布号 US9218389(B2) 申请公布日期 2015.12.22
申请号 US201314022402 申请日期 2013.09.10
申请人 International Business Machines Corporation 发明人 Woodruff David P.
分类号 G06F7/00;G06F17/30 主分类号 G06F7/00
代理机构 Cantor Colburn LLP 代理人 Cantor Colburn LLP
主权项 1. An apparatus for computing a frequency of packets in network devices, comprising: memory comprising computer-executable instructions; and a processor executing the computer-executable instructions, the computer-executable instructions, when executed by the processor, cause the processor to perform operations comprising: associating respective packets with a plurality of entities in a vector, each of the plurality of entities being mapped to corresponding ones of the respective packets and corresponding to a plurality of computers; upon at least one network device receiving the respective packets, individually increasing a count for the respective packets mapped to the plurality of entities in the vector; computing a matrix vector product of a matrix A and the vector, the matrix A being a product of at least a first matrix and a second matrix, the first matrix comprising rows and columns where each of the rows has a single random location with a one value and remaining locations with zero values; and transmitting the matrix vector product to a centralized computer for aggregation with other matrix vector products; wherein coordinates of the vector and an approximation vector are identical such that each has identical ones of the plurality of entities mapped at corresponding coordinates.
地址 Armonk NY US