发明名称 効率的なベクトルロール演算のための装置及び方法
摘要 A machine readable storage medium containing program code is described that when processed by a processor causes a method to be performed. The method includes creating a resultant rolled version of an input vector by forming a first intermediate vector, forming a second intermediate vector and forming a resultant rolled version of an input vector. The first intermediate vector is formed by barrel rolling elements of the input vector along a first of two lanes defined by an upper half and a lower half of the input vector. The second intermediate vector is formed by barrel rolling elements of the input vector along a second of the two lanes. The resultant rolled version of the input vector is formed by incorporating upper portions of one of the intermediate vector's upper and lower halves as upper portions of the resultant's upper and lower halves and incorporating lower portions of the other intermediate vector's upper and lower halves as lower portions of the resultant's upper and lower halves.
申请公布号 JP5759527(B2) 申请公布日期 2015.08.05
申请号 JP20130238546 申请日期 2013.11.19
申请人 インテル・コーポレーション 发明人 ウリエル、タル;ボルシェム、ボリス;オウルド−アハメド−バル、エルモウスタファ
分类号 G06F17/16;G06F9/30;G06F9/38 主分类号 G06F17/16
代理机构 代理人
主权项
地址