Page 20 - Data Science Algorithms in a Week
P. 20

Unsupervised Ensemble Learning                        5

                                         Table 1. Traditional and Modern algorithms









































                                  CLUSTERING ALGORITHMS BASED ON ENSEMBLE

                          Clustering algorithms based on ensemble called unsupervised ensemble learning or
                       consensus  clustering  can  be  considered  as  a  modern  clustering  algorithm.  Clustering
                       results are prone to being diverse across the algorithm, and each algorithm might work
                       better for a particular dataset. This diversity is hypothetically illustrated by a toy example
                       in Figure 4. In this figure, samples are in the same group represented by the same symbol.
                       As shown in figure, different clustering methods might give us different partitions of the
                       data, and they can even produce the different number of clusters because of the diverse
                       objectives and methodological foundations (Haghtalab, Xanthopoulos, & Madani, 2015).
                          As  it  will  be  discussed  later,  to  deal  with  the  potential  variation  of  clustering
                       methods, one can use consensus clustering. The core idea of consensus clustering is to
                       combine good characteristics of different partitions to create a better clustering model. As
                       the simple logic of process is shown in Figure 5, different partitions (   ,    , … ,    ) need
                                                                                         2
                                                                                      1
                                                                                                 
                                                                                     ∗
                       to be somehow produced and combined to create optimum partition (   ).
   15   16   17   18   19   20   21   22   23   24   25