Article
Article name Application of the Homogeneity Analysis for the Visualization and Analysis of Heterogeneous Data
Authors Gordeev R.N. Candidate of Physical and Mathematical Sciences, Associate Professor, roman.gordeev@mail.ru
Bibliographic description
Section Scientific Research
UDK 004
DOI
Article type
Annotation Problems of classification and ranking arise in today’s information society. Whether it’s the need to rank users of information resources according to their interests and preferences, or analysis of consumer preferences of the visitors of internet shops, or may be an analysis and comparison of consumer properties of the some goods and more. And an empirical methods suit well for such problems solving, in particular, a random forest, which has proved its worth for making very accurate predictions for solving regression and classification. In this paper we consider the problem of ranking and classification. The adaptation of the analysis of homogeneity method had been proposed for effective visualization committees of decision trees, including visualization and new observations that were not included in the training set.
Key words classification, homogeneity analysis, visualization of graphs.
Article information
References 1. Goldberger J. Neighbourhood Components Analysis / Goldberger, J., Roweis, S., Hinton, G. and Salakhutdinov, R. Advances in Neural Information Processing System, 2005. Vol. 17. P. 513520. 2. Sugiyama M. Dimensionality Reduction of Multimodal Labeled Data by Local Fisher Discriminant Analysis // The Journal of Machine Learning Research. 2007. № 8. P. 1061. 3. Breiman L. Random Forests // Machine Learning. 1997. № 45. P. 532. 4. Breiman L. Bagging Predictors // Machine Learning. 1996. № 24. P. 123140. 5. Friedman J. Predictive Learning via Rule Ensembles / J. Friedman and B. Popescu // The Annals of Applied Statistics. 2008. № 2. P. 916954. 6 De Leeuw J. Homogeneity Analysis in R: The Package Homals / J. De Leeuw and P. Mair // Journal of Statistical Software. 2008. № 31. P. 121. 7. Michailidis G. The Gifi System of Descriptive Multivariate Analysis / Michailidis G. and De Leeuw J. // Statistical Science. 1998. Vol. 13. № 4. P. 307336. 8. Breiman L. Classification and Regression Trees / L. Breiman, J. Friedman, R. Olshen and C. Stone. 1998. Belmont: Wadsworth 9. Liaw A. Classication and Regression by Random Forest /A. Liaw and M. Wiener// R News. 2002. № 2. P. 1822. 10. Lin Y. Random Forests and Adaptive Nearest Neighbors /Y. Lin and Y. Jeon // Journal of the American Statistical Association. 2006. № 101. P. 578590. 11. Shi T. Unsupervised Learning With Random Forest Predictors /T. Shi and S. Horvath // Journal of Computational and Graphical Statistics. 2006. № 15. P. 118138. 12. Urbanek S. Visualizing Trees and Forests // in Handbook of Data Visualization. 2008. Berlin, Heidelberg: Spinger. P. 243264. 13. Borg I. and Groenen P. Modern Multidimensional Scaling: Theory and Applications 1997. New York. 14. Kruskal J. Nonmetric Multidimensional Scaling: A Numerical Method // Psychometrika. 1964. №29. P. 115–129. 15. Kruskal J., Wish M. Multidimensional Scaling. 1978. Beverly Hills: Sage Publications.
Full articleApplication of the Homogeneity Analysis for the Visualization and Analysis of Heterogeneous Data