Skip to content
Prev 50930 / 398500 Next

A Few MCLUST Questions

I can answer for MCLUST specifically, but in general mixture modelling 
terms it is easier to think of a reasonable initial clustering of the 
data from which the M step will quickly produce initial parameter 
estimates, than to pick a large number of initial parameters values out 
of the air. (Perhaps you may use a random grouping to start things off 
if nothing else comes to mind.) Usually if you try to do this you will 
pick parameters that make some data values very improbable leading to 
numerical difficulties in the M-step.

On the other hand you may have a good set of parameter values from a 
previously-fitted data set and you have a new, but similar set of data, 
perhaps from a different time-period or location. Then it will make 
sense to start off from the parameter values that you have.

Don't worry about the software - it should be just as easy for it to 
begin at either the E- or the M- step - it is you own intentions and 
convenience that matter.

Murray Jorgensen
KKThird at Yahoo.Com wrote: