发明名称 Fast normalization in a mixed precision floating-point unit
摘要 A hardware circuit for returning single precision denormal results to double precision. A hardware circuit component configured to count leading zeros of an unrounded single precision denormal result. A hardware circuit component configured to pre-compute a first exponent and a second exponent for the unrounded single precision denormal result. A hardware circuit component configured to perform a second normalization of the rounded single precision denormal result back to architected format.
申请公布号 US9280316(B2) 申请公布日期 2016.03.08
申请号 US201414151006 申请日期 2014.01.09
申请人 International Business Machines Corporation 发明人 Boersma Maarten J.;Fuchs Thomas;Kaltenbach Markus;Lang David
分类号 G06F7/48;G06F5/01;G06F7/499;G06F7/483 主分类号 G06F7/48
代理机构 代理人 McCarthy Maeve
主权项 1. A method for returning scalar single precision denormal results to double precision normal results in architected format with limited latency impact on performance utilizing an improved floating-point unit architecture, the method comprising: performing a first normalization on a first unrounded single precision denormal result of a floating-point operation, wherein the first normalization includes performing fraction leading zero removal and exponent adjustment up to a minimum exponent threshold on the first unrounded single precision denormal result to yield a second unrounded single precision denormal result, wherein performing the first normalization includes utilizing a first normalizer circuit of the improved floating-point unit architecture; counting leading zeros of the second unrounded single precision denormal result; performing fraction rounding of the second unrounded single precision denormal result to yield a rounded single precision denormal result, wherein performing fraction rounding includes utilizing a floating-point unit rounder circuit of the improved floating-point unit architecture; pre-computing a first exponent and a second exponent for the second unrounded single precision denormal result in parallel with a second normalization of the rounded single precision denormal result, wherein pre-computing includes utilizing a floating-point unit subtractor circuit of the improved floating-point unit architecture; and outputting a final fraction result by performing the second normalization of the rounded single precision denormal result back to architected format, wherein performing the second normalization includes utilizing a second normalizer circuit of the improved floating-point unit architecture; wherein the architected format includes the final fraction result and one of the first exponent and the second exponent.
地址 Armonk NY US