发明名称 Systems, apparatuses, and methods for performing a horizontal add or subtract in response to a single instruction
摘要 Embodiments of systems, apparatuses, and methods for performing in a computer processor vector packed horizontal add or subtract of packed data elements in response to a single vector packed horizontal add or subtract instruction that includes a destination vector register operand, a source vector register operand, and an opcode are describes.
申请公布号 US9619226(B2) 申请公布日期 2017.04.11
申请号 US201113992230 申请日期 2011.12.23
申请人 Intel Corporation 发明人 Hagog Mostafa;Ould-Ahmed-Vall Elmoustapha;Valentine Robert;Gradstein Amit;Rubanovich Simon;Sperber Zeev
分类号 G06F9/302;G06F9/305;G06F15/76;G06F9/30;G06F15/80 主分类号 G06F9/302
代理机构 Nicholson De Vos Webster & Elliott LLP 代理人 Nicholson De Vos Webster & Elliott LLP
主权项 1. A method of performing in a computer processor vector packed horizontal add or subtract of packed data elements in response to a single vector packed horizontal add or subtract instruction that includes a destination vector register operand, a source vector register operand, an immediate, and an opcode, wherein the source vector register comprises a plurality of packed data elements divided into a plurality of data lanes, each data lane corresponds to a destination data element in the destination vector register, the immediate comprises at least a same number of active bits as there are packed data elements in each data lane, and each active bit of the immediate corresponds to one of the plurality of packed data elements in each data lane, the method comprising: executing the single vector packed horizontal add or subtract instruction to, for each data lane of the source vector register, read a value of each active bit position of the immediate to determine whether to negate a value of corresponding data element position of the data lane, responsively negate the values determined to be negated, and sum all negated and unchanged packed data elements in each data lane to create a data lane result; and storing each data lane result in a corresponding destination data element position of the destination register.
地址 Santa Clara CA US