Page 31 - Data Science Algorithms in a Week
P. 31

16                               Ramazan Ünlü

                       result. By internal quality measures, they refer to the real-valued quality metrics that are
                       computed  directly  from  a  clustering  and  do  not  include  calculations  that  involve  data
                       sample class information as opposed to external quality measures (Ünlü & Xanthopoulos,
                       2016b). In the next step, they have tried to make this study better in terms of a well-
                       known  evaluation  metric;  variance.  They  have  optimized  internal  quality  measures  by
                       applying  Markowitz  Portfolio  Theory  (MPT).  Using  the  core  idea  of  MPT  which  is
                       constructing portfolios to optimize expected return based on a given level of market risk
                       which is considered as variance, they have taken not only value of the validity measures
                       itself  but  variation  on  them  into  consideration.  By  doing  this,  they  aimed  to  reduce
                       variance of the accuracy of the final partition which is produced by weighted consensus
                       clustering (Ünlü & Xanthopoulos, 2016a).
                          Throughout the section, some featured studies have been summarized. Researches on
                       consensus clustering are not limited to those summarized above, other contributions can
                       be seen in (Berikov, 2014; Gupta & Verma, 2014; Kang, Liu, Zhou, & Li, 2016; Lock &
                       Dunson, 2013; Parvin, Minaei-Bidgoli, Alinejad-Rokny, & Punch, 2013; Su, Shang, &
                       Shen, 2015; Wang, Shan, & Banerjee, 2011; Wu, Liu, Xiong, & Cao, 2013).


                                                       REFERENCES


                       Abello, J., Pardalos, P. M., & Resende, M. G. (2013).  Handbook of massive data sets
                          (Vol. 4): Springer.
                       Ailon, N., Charikar, M., & Newman, A. (2008). Aggregating inconsistent information:
                          ranking and clustering. Journal of the ACM (JACM), 55(5), 23.
                       Al-Razgan, M., & Domeniconi, C. (2006). Weighted clustering ensembles Proceedings
                          of the 2006 SIAM International Conference on Data Mining (pp. 258-269): SIAM.
                       Alizadeh, H., Minaei-Bidgoli, B., & Parvin, H. (2014). Cluster ensemble selection based
                          on a new cluster stability measure. Intelligent Data Analysis, 18(3), 389-408.
                       Ana,  L.,  &  Jain,  A.  K.  (2003).  Robust  data  clustering  Computer  Vision  and  Pattern
                          Recognition, 2003. Proceedings. 2003 IEEE Computer Society Conference on (Vol.
                          2, pp. II-II): IEEE.
                       Ayad, H. G., & Kamel, M. S. (2008). Cumulative voting consensus method for partitions
                          with  variable  number  of  clusters.  IEEE  Transactions  on  pattern  analysis  and
                          machine intelligence, 30(1), 160-173.
                       Ayad, H. G., & Kamel, M. S. (2010). On voting-based consensus of cluster ensembles.
                          Pattern Recognition, 43(5), 1943-1953.
                       Azimi, J., & Fern, X. (2009). Adaptive Cluster Ensemble Selection. Paper presented at the
                          IJCAI.
   26   27   28   29   30   31   32   33   34   35   36