发明名称 MODELING INTERESTINGNESS WITH DEEP NEURAL NETWORKS
摘要 An “Interestingness Modeler” uses deep neural networks to learn deep semantic models (DSM) of “interestingness.” The DSM, consisting of two branches of deep neural networks or their convolutional versions, identifies and predicts target documents that would interest users reading source documents. The learned model observes, identifies, and detects naturally occurring signals of interestingness in click transitions between source and target documents derived from web browser logs. Interestingness is modeled with deep neural networks that map source-target document pairs to feature vectors in a latent space, trained on document transitions in view of a “context” and optional “focus” of source and target documents. Network parameters are learned to minimize distances between source documents and their corresponding “interesting” targets in that space. The resulting interestingness model has applicable uses, including, but not limited to, contextual entity searches, automatic text highlighting, prefetching documents of likely interest, automated content recommendation, automated advertisement placement, etc.
申请公布号 US2015363688(A1) 申请公布日期 2015.12.17
申请号 US201414304863 申请日期 2014.06.13
申请人 Microsoft Corporation 发明人 Gao Jianfeng;Deng Li;Gamon Michael;He Xiaodong;Pantel Patrick
分类号 G06N3/04;G06N3/08 主分类号 G06N3/04
代理机构 代理人
主权项 1. A computer-implemented process, comprising: using a computer to perform process actions for: receiving a collection of source and target document pairs; identifying a separate context for each source document and each target document; mapping each context to a separate vector; mapping each of the vectors to a convolutional layer of a neural network; mapping the convolutional layer to a plurality of hidden layers of the neural network; and generating a learned interestingness model by learning weights for each of a plurality of transitions between the layers of the neural network, such that the learned weights minimize a distance between the vectors of interesting source and target documents.
地址 Redmond WA US