发明名称 System and method for addressing overfitting in a neural network
摘要 A system for training a neural network. A switch is linked to feature detectors in at least some of the layers of the neural network. For each training case, the switch randomly selectively disables each of the feature detectors in accordance with a preconfigured probability. The weights from each training case are then normalized for applying the neural network to test data.
申请公布号 US9406017(B2) 申请公布日期 2016.08.02
申请号 US201314015768 申请日期 2013.08.30
申请人 Google Inc. 发明人 Hinton Geoffrey E.;Krizhevsky Alexander;Sutskever Ilya;Srivastva Nitish
分类号 G06N3/08;G06N3/04 主分类号 G06N3/08
代理机构 Fish & Richardson P.C. 代理人 Fish & Richardson P.C.
主权项 1. A computer-implemented method comprising: obtaining a plurality of training cases; and training a neural network having a plurality of layers on the plurality of training cases, each of the layers including one or more feature detectors, each of the feature detectors having a corresponding set of weights, and a subset of the feature detectors being associated with respective probabilities of being disabled during processing of each of the training cases, wherein training the neural network on the plurality of training cases comprises, for each of the training cases respectively: determining one or more feature detectors to disable during processing of the training case, comprising determining whether to disable each of the feature detectors in the subset based on the respective probability associated with the feature detector,disabling the one or more feature detectors in accordance with the determining, andprocessing the training case using the neural network with the one or more feature detectors disabled to generate a predicted output for the training case.
地址 Mountain View CA US