发明名称 Density estimation and/or manifold learning
摘要 Density estimation and/or manifold learning are described, for example, for computer vision, medical image analysis, text document clustering. In various embodiments a density forest is trained using unlabeled data to estimate the data distribution. In embodiments the density forest comprises a plurality of random decision trees each accumulating portions of the training data into clusters at their leaves. In embodiments probability distributions representing the clusters at each tree are aggregated to form a forest density which is an estimate of a probability density function from which the unlabeled data may be generated. A mapping engine may use the clusters at the leaves of the density forest to estimate a mapping function which maps the unlabeled data to a lower dimensional space whilst preserving relative distances or other relationships between the unlabeled data points. A sampling engine may use the density forest to randomly sample data from the forest density.
申请公布号 US8954365(B2) 申请公布日期 2015.02.10
申请号 US201213528866 申请日期 2012.06.21
申请人 Microsoft Corporation 发明人 Criminisi Antonio;Shotton Jamie Daniel Joseph;Konukoglu Ender
分类号 G06F17/00;G06K9/62 主分类号 G06F17/00
代理机构 Zete Law, P.L.L.C. 代理人 Sula Miia;Minhas Micky;Zete Law, P.L.L.C.
主权项 1. A method comprising: accessing, at a processor, observations which are unlabeled in that each observation belongs to one of a plurality of unknown classes; training a plurality of random decision trees to form a density forest using the unlabeled observations such that each random decision tree partitions the unlabeled observations into a plurality of clusters each represented by a probability distribution; aggregating the partitions from each of the trees to obtain a forest density which is a probability density function that describes the accessed observations.
地址 Redmond WA US