发明名称 装置、方法、およびシステム
摘要 A vector reduction instruction is executed by a processor to provide efficient reduction operations on an array of data elements. The processor includes vector registers. Each vector register is divided into a plurality of lanes, and each lane stores the same number of data elements. The processor also includes execution circuitry that receives the vector reduction instruction to reduce the array of data elements stored in a source operand into a result in a destination operand using a reduction operator. Each of the source operand and the destination operand is one of the vector registers. Responsive to the vector reduction instruction, the execution circuitry applies the reduction operator to two of the data elements in each lane, and shifts one or more remaining data elements when there is at least one of the data elements remaining in each lane.
申请公布号 JP6025986(B2) 申请公布日期 2016.11.16
申请号 JP20150529799 申请日期 2013.06.10
申请人 インテル・コーポレーション 发明人 カプリオリ、ポール;カンヒア、アビ、エス.;クック、ジェフェリー、ジェイ.;アル−オトーム、ムアウヤ、エム.
分类号 G06F9/305;G06F9/315;G06F9/38 主分类号 G06F9/305
代理机构 代理人
主权项
地址