发明名称 DOUBLE ROUNDED COMBINED FLOATING-POINT MULTIPLY AND ADD
摘要 Methods, apparatus, instructions and logic are disclosed providing double rounded combined floating-point multiply and add functionality as scalar or vector SIMD instructions or as fused micro-operations. Embodiments include detecting floating-point (FP) multiplication operations and subsequent FP operations specifying as source operands results of the FP multiplications. The FP multiplications and the subsequent FP operations are encoded as combined FP operations including rounding of the results of FP multiplication followed by the subsequent FP operations. The encoding of said combined FP operations may be stored and executed as part of an executable thread portion using fused-multiply-add hardware that includes overflow detection for the product of FP multipliers, first and second FP adders to add third operand addend mantissas and the products of the FP multipliers with different rounding inputs based on overflow, or no overflow, in the products of the FP multiplier. Final results are selected respectively using overflow detection.
申请公布号 US2017039033(A1) 申请公布日期 2017.02.09
申请号 US201615332721 申请日期 2016.10.24
申请人 Intel Corporation 发明人 Samudrala Sridhar;Magklis Grigorios;Lupon Marc;Ditzel David R.
分类号 G06F7/487;G06F7/499;G06F7/544;G06F7/485 主分类号 G06F7/487
代理机构 代理人
主权项 1. An apparatus comprising: a floating-point (FP) multiplier circuit to multiply a first operand multiplicand mantissa by a second operand multiplier mantissa to generate a product; a FP alignment circuit to align a third operand mantissa according to the product of the FP multiplier circuit; an overflow detection circuit to detect an overflow condition in the product of the FP multiplier circuit; a first FP adder circuit to add together the aligned third operand mantissa and the product of the FP multiplier circuit using a first rounding input to generate a first sum or difference based on an assumption that the overflow condition in the product of the FP multiplier circuit was not detected; a second FP adder circuit to add together the aligned third operand mantissa and the product of the FP multiplier circuit using a second rounding input to generate a second sum or difference based on an assumption that the overflow condition in the product of the FP multiplier circuit was detected; and a multiplexer circuit to select between the second sum or difference and the first sum or difference based on the overflow detection circuit detecting, or not detecting the overflow condition, respectively, in the product of the FP multiplier circuit.
地址 Santa Clara CA US