This book focuses on exploratory data analysis, the learning of latent structures in datasets, and the unscrambling of knowledge. Its coverage spans a broad range of methods from multivariate statistics, clustering and classification, visualization and scaling, as well as data and time series analysis. It presents new approaches to information retrieval and data mining and reports a number of challenging applications in various fields.

N (see Böhning (1992)). , zn ]T . t. Z. This contrasts with the direct manipulation of Y, which, due to its discrete nature, makes the problems combinatorial. , α(K−1) ]T , where α(k) is a global mean for z(k), and W is a matrix (with zeros on the diagonal) encoding the pairwise preferences: $W_{i,j} > 0$ expresses a preference (with strength proportional to $W_{i,j}$) for having points i and j in the same cluster; $W_{i,j} = 0$ expresses the absence of any preference concerning the pair (i, j).
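As a rough illustration of the preference matrix W described above, the following sketch builds such a matrix from a list of pairwise preferences. The function name `preference_matrix`, the triple format `(i, j, strength)`, and the choice to store each preference symmetrically are assumptions made for this example only; the excerpt does not state them.

```python
def preference_matrix(n, pairs):
    """Build an n x n preference matrix with zeros on the diagonal.

    `pairs` holds hypothetical (i, j, strength) triples. A positive strength
    means "prefer points i and j in the same cluster"; pairs that are not
    listed keep the value 0, meaning no preference.
    """
    W = [[0.0] * n for _ in range(n)]
    for i, j, strength in pairs:
        if i == j:
            raise ValueError("diagonal entries must stay zero")
        if strength <= 0:
            # This sketch only covers the two cases named in the text:
            # a positive preference or no preference at all.
            raise ValueError("strength must be positive")
        # Assumption: a preference about (i, j) is stored symmetrically.
        W[i][j] = strength
        W[j][i] = strength
    return W


# Four points; points 0 and 1 are strongly preferred together,
# points 2 and 3 only weakly.
W = preference_matrix(4, [(0, 1, 2.0), (2, 3, 0.5)])
```

Pairs that never appear in the input, such as (0, 2), keep the value 0 and therefore carry no preference.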

Controlling the level of separation of component distributions is more challenging. The (true) parameter values are shown in Table 1. These ad hoc values try to cover different situations found in empirical data sets. In particular, there is an attempt to include the persistent patterns usually observed in empirical data sets, with heavy retention probabilities (almost absorbing states). The level of separation is set by the distance between $a_{1kk}$ and $a_{2kk}$, $|a_{1kk} - a_{2kk}| = |P(X_{it} = k \mid X_{i,t-1} = k, Z_i = 1) - P(X_{it} = k \mid X_{i,t-1} = k, Z_i = 2)|$, and by the distance between $\lambda_{1k}$ and $\lambda_{2k}$, $|\lambda_{1k} - \lambda_{2k}| = |P(X_{i0} = k \mid Z_i = 1) - P(X_{i0} = k \mid Z_i = 2)|$.
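The two distances can be computed directly from the component parameters. The sketch below assumes two components with K states each, transition matrices given as nested lists, and initial probabilities given as lists. The function name `separation` and all numeric values are hypothetical and are not the values of Table 1.

```python
def separation(a1, a2, lam1, lam2):
    """Return per-state separation between two mixture components.

    a1, a2     : K x K transition matrices of components 1 and 2
    lam1, lam2 : length-K initial state probabilities of components 1 and 2
    """
    K = len(lam1)
    # |a_1kk - a_2kk|: gap between the retention (diagonal) probabilities
    retention_gap = [abs(a1[k][k] - a2[k][k]) for k in range(K)]
    # |lambda_1k - lambda_2k|: gap between the initial probabilities
    initial_gap = [abs(lam1[k] - lam2[k]) for k in range(K)]
    return retention_gap, initial_gap


# Hypothetical two-state example: component 1 has heavy retention
# probabilities (states that are almost absorbing), component 2 does not.
a1 = [[0.9, 0.1], [0.2, 0.8]]
a2 = [[0.5, 0.5], [0.4, 0.6]]
lam1 = [0.7, 0.3]
lam2 = [0.5, 0.5]

retention_gap, initial_gap = separation(a1, a2, lam1, lam2)
```

Larger gaps correspond to components that are easier to tell apart; gaps near zero describe nearly identical components.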

2003)), for each group a few well-known representatives are enumerated:

Indexes based on inertia (sum of squares):
• Caliński and Harabasz (1974) index (pseudo F-statistic),
• Hartigan (1975) index,
• Ratkowsky index (Ratkowsky and Lance (1978)),
• Ball (1965) index,
• Krzanowski and Lai (1988) index.

Indexes based on scatter matrices:
• Scott index (Scott and Symons (1971)),
• Marriott (1971) index,
• Friedman index (Friedman and Rubin (1967)),
• Rubin index (Friedman and Rubin (1967)).

Indexes based on distance matrices:
• Silhouette (Rousseeuw (1987), Kaufman and Rousseeuw (1990)),
• Baker and Hubert (Hubert (1974), Baker and Hubert (1975)),
• Hubert and Levine (1976).
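To make the inertia-based group concrete, here is a minimal sketch of the Caliński and Harabasz index in its usual form, the ratio of between-cluster to within-cluster sum of squares, each divided by its degrees of freedom (K − 1 and n − K). The function name `calinski_harabasz` and the toy data are assumptions made for this example; the other indexes in the list are not covered.

```python
def calinski_harabasz(points, labels):
    """Pseudo F-statistic: (B / (K - 1)) / (W / (n - K)).

    points : list of equal-length numeric tuples
    labels : cluster label of each point
    """
    n = len(points)
    dim = len(points[0])

    # Group the points by cluster label.
    clusters = {}
    for point, label in zip(points, labels):
        clusters.setdefault(label, []).append(point)
    k = len(clusters)
    if k < 2 or k >= n:
        raise ValueError("need 2 <= number of clusters < number of points")

    overall_mean = [sum(p[d] for p in points) / n for d in range(dim)]

    between = 0.0  # B: between-cluster sum of squares
    within = 0.0   # W: within-cluster sum of squares
    for members in clusters.values():
        size = len(members)
        centroid = [sum(p[d] for p in members) / size for d in range(dim)]
        between += size * sum((c - m) ** 2 for c, m in zip(centroid, overall_mean))
        within += sum(
            sum((x - c) ** 2 for x, c in zip(p, centroid)) for p in members
        )

    if within == 0.0:
        # Every point sits on its centroid: the ratio is unbounded.
        return float("inf")
    return (between / (k - 1)) / (within / (n - k))


# Toy data: two tight, well-separated groups in the plane.
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 0.0), (10.0, 1.0)]
labels = [0, 0, 1, 1]
score = calinski_harabasz(points, labels)
```

Higher values indicate compact, well-separated clusters, so the index is typically evaluated for several candidate numbers of clusters and the partition with the largest value is preferred.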

