Page 31 - Data Science Algorithms in a Week
P. 31
16 Ramazan Ünlü
result. By internal quality measures, they refer to the real-valued quality metrics that are
computed directly from a clustering and do not include calculations that involve data
sample class information as opposed to external quality measures (Ünlü & Xanthopoulos,
2016b). In the next step, they have tried to make this study better in terms of a well-
known evaluation metric; variance. They have optimized internal quality measures by
applying Markowitz Portfolio Theory (MPT). Using the core idea of MPT which is
constructing portfolios to optimize expected return based on a given level of market risk
which is considered as variance, they have taken not only value of the validity measures
itself but variation on them into consideration. By doing this, they aimed to reduce
variance of the accuracy of the final partition which is produced by weighted consensus
clustering (Ünlü & Xanthopoulos, 2016a).
Throughout the section, some featured studies have been summarized. Researches on
consensus clustering are not limited to those summarized above, other contributions can
be seen in (Berikov, 2014; Gupta & Verma, 2014; Kang, Liu, Zhou, & Li, 2016; Lock &
Dunson, 2013; Parvin, Minaei-Bidgoli, Alinejad-Rokny, & Punch, 2013; Su, Shang, &
Shen, 2015; Wang, Shan, & Banerjee, 2011; Wu, Liu, Xiong, & Cao, 2013).
REFERENCES
Abello, J., Pardalos, P. M., & Resende, M. G. (2013). Handbook of massive data sets
(Vol. 4): Springer.
Ailon, N., Charikar, M., & Newman, A. (2008). Aggregating inconsistent information:
ranking and clustering. Journal of the ACM (JACM), 55(5), 23.
Al-Razgan, M., & Domeniconi, C. (2006). Weighted clustering ensembles Proceedings
of the 2006 SIAM International Conference on Data Mining (pp. 258-269): SIAM.
Alizadeh, H., Minaei-Bidgoli, B., & Parvin, H. (2014). Cluster ensemble selection based
on a new cluster stability measure. Intelligent Data Analysis, 18(3), 389-408.
Ana, L., & Jain, A. K. (2003). Robust data clustering Computer Vision and Pattern
Recognition, 2003. Proceedings. 2003 IEEE Computer Society Conference on (Vol.
2, pp. II-II): IEEE.
Ayad, H. G., & Kamel, M. S. (2008). Cumulative voting consensus method for partitions
with variable number of clusters. IEEE Transactions on pattern analysis and
machine intelligence, 30(1), 160-173.
Ayad, H. G., & Kamel, M. S. (2010). On voting-based consensus of cluster ensembles.
Pattern Recognition, 43(5), 1943-1953.
Azimi, J., & Fern, X. (2009). Adaptive Cluster Ensemble Selection. Paper presented at the
IJCAI.