SPEECH RECOGNITION WITH ACOUSTIC MODELS, Application No. US201514983315 - 传众 Patent Search


Title of invention	SPEECH RECOGNITION WITH ACOUSTIC MODELS
Abstract	Methods, systems, and apparatus, including computer programs encoded on computer storage media for learning pronunciations from acoustic sequences. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a sequence of multiple frames of acoustic data at each of a plurality of time steps; stacking one or more frames of acoustic data to generate a sequence of modified frames of acoustic data; processing the sequence of modified frames of acoustic data through an acoustic modeling neural network comprising one or more recurrent neural network (RNN) layers and a final CTC output layer to generate a neural network output, wherein processing the sequence of modified frames of acoustic data comprises: subsampling the modified frames of acoustic data; and processing each subsampled modified frame of acoustic data through the acoustic modeling neural network.
Publication No.	US2016372119(A1)	Publication date	2016.12.22
Application No.	US201514983315	Filing date	2015.12.29
Applicant	Google Inc.	Inventors	Sak Hasim;Senior Andrew W.
Classification	G10L17/18;G10L17/02;G10L17/04	Main classification	G10L17/18
Agency		Agent
Main claim	1. A method comprising: receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a sequence of multiple frames of acoustic data at each of a plurality of time steps; stacking one or more frames of acoustic data to generate a sequence of modified frames of acoustic data; processing the sequence of modified frames of acoustic data through an acoustic modeling neural network comprising one or more recurrent neural network (RNN) layers and a final CTC output layer to generate a neural network output, wherein processing the sequence of modified frames of acoustic data comprises: subsampling the modified frames of acoustic data; and processing each subsampled modified frame of acoustic data through the acoustic modeling neural network.
Address	Mountain View CA US
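
The stacking and subsampling steps recited in the main claim can be illustrated with a minimal Python sketch. It is not the applicant's implementation: the function names `stack_frames` and `subsample`, the stack size, the subsampling factor, and the choice to pad by repeating the last frame are all assumptions made for illustration, since the record above does not specify them. The acoustic modeling neural network itself (RNN layers and CTC output layer) is not modeled here.

```python
from typing import List, Sequence

Frame = List[float]


def stack_frames(frames: Sequence[Frame], stack_size: int) -> List[Frame]:
    """Concatenate each frame with the following stack_size - 1 frames.

    Assumption: near the end of the sequence the last frame is repeated
    as padding, so the output has one modified frame per input frame.
    """
    if stack_size < 1:
        raise ValueError("stack_size must be at least 1")
    stacked: List[Frame] = []
    for t in range(len(frames)):
        window: Frame = []
        for k in range(stack_size):
            idx = min(t + k, len(frames) - 1)  # clamp to the last frame
            window.extend(frames[idx])
        stacked.append(window)
    return stacked


def subsample(frames: Sequence[Frame], factor: int) -> List[Frame]:
    """Keep every factor-th modified frame, starting with the first."""
    if factor < 1:
        raise ValueError("factor must be at least 1")
    return list(frames[::factor])


if __name__ == "__main__":
    # Seven toy one-dimensional frames; real frames would be feature vectors.
    acoustic_sequence = [[float(t)] for t in range(7)]
    modified = stack_frames(acoustic_sequence, stack_size=3)  # hypothetical value
    reduced = subsample(modified, factor=3)  # hypothetical value
    for frame in reduced:
        print(frame)
```

In this sketch, each retained modified frame would then be passed to the network, so with a factor of 3 the network processes roughly one third as many time steps as the original sequence contains, while the stacking keeps the information from the skipped frames in its input.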
