主权项 |
1. A method for creating a model capable of identifying one or more clusters in a healthcare dataset, the method comprising:
receiving, by one or more processors, an input pertaining to a range of numbers, wherein each number in the range of numbers is representative of a number of clusters in the healthcare dataset; for a cluster in the number of clusters: estimating, by the one or more processors, one or more first parameters of a distribution associated with the cluster; estimating, by the one or more processors, an inverse cumulative distribution of each of one or more n-dimensional variables in the healthcare dataset based on a threshold value and a cumulative distribution of each of the one or more n-dimensional variables; updating, by the one or more processors, the one or more first parameters to generate one or more second parameters based on the estimated inverse cumulative distribution, wherein the updating is performed using an expectation-maximization algorithm; and creating, by the one or more processors, the model for each number in the range of numbers based on the one or more second parameters associated with each cluster in the number of clusters. |