Page 16 - Data Science Algorithms in a Week
P. 16

In: Artificial Intelligence                            ISBN: 978-1-53612-677-8
                       Editors: L. Rabelo, S. Bhide and E. Gutierrez   © 2018 Nova Science Publishers, Inc.






                       Chapter 1



                                  UNSUPERVISED ENSEMBLE LEARNING


                                                                       *
                                                     Ramazan Ünlü
                                       Industrial Engineering and Management Systems,
                                         University of Central Florida, Orlando, FL, US


                                                        ABSTRACT


                              Clustering is used in identifying groups of samples with similar properties, and it is
                          one  of  the  most  common  preliminary  exploratory  analysis  for  revealing  “hidden”
                          patterns,  in  particular  for  datasets  where  label  information  is  unknown.  Even  though
                          clustering techniques  have been  well used to analyze a  variety of datasets in different
                          domains for years, the limitation of them is that each clustering method works better only
                          in certain conditions. This made the selection of the most suitable algorithm for particular
                          dataset  much  more  important.  Restrained  implementation  of  clustering  methods  has
                          forced  clustering  practitioners  to  develop  more  robust  methods,  which  is  reasonably
                          practicable  in  any  condition.  The  unsupervised  ensemble  learning,  or  consensus
                          clustering,  is  developed  to  serve  this  purpose.  It  consists  of  finding  the  optimal
                          combination strategy of individual partitions that is robust in comparison to the selection
                          of an algorithmic clustering pool. The goal of this combination process is to improve the
                          average quality of individual clustering methods. Due to increasing development of new
                          methods, their promising results and the great number of applications, it is considered to
                          make a crucial and a brief review about it. Through this chapter, first the main concepts
                          of clustering methods are briefly introduced and then the basics of ensemble learning is
                          given.  Finally,  the  chapter  is  concluded  with  a  comprehensive  summary  of  novel
                          developments in the area.

                       Keywords: consensus clustering, unsupervised ensemble learning


                       *  Corresponding Author Email: ramazanunlu@gmail.com.
   11   12   13   14   15   16   17   18   19   20   21