Article |
---|
Article name |
Application of the Homogeneity Analysis for the Visualization and Analysis of Heterogeneous Data |
Authors |
Gordeev R.N. Candidate of Physical and Mathematical Sciences, Associate Professor, roman.gordeev@mail.ru |
Bibliographic description |
|
Section |
Scientific Research |
UDK |
004 |
DOI |
|
Article type |
|
Annotation |
Problems of classification and ranking arise in today’s information society. Whether it’s
the need to rank users of information resources according to their interests and preferences,
or analysis of consumer preferences of the visitors of internet shops, or may be an analysis
and comparison of consumer properties of the some goods and more.
And an empirical methods suit well for such problems solving, in particular, a random
forest, which has proved its worth for making very accurate predictions for solving regression
and classification.
In this paper we consider the problem of ranking and classification. The adaptation of
the analysis of homogeneity method had been proposed for effective visualization committees
of decision trees, including visualization and new observations that were not included in the
training set.
|
Key words |
classification, homogeneity analysis, visualization of graphs. |
Article information |
|
References |
1. Goldberger J. Neighbourhood Components Analysis / Goldberger, J., Roweis, S.,
Hinton, G. and Salakhutdinov, R. Advances in Neural Information Processing System, 2005.
Vol. 17. P. 513520.
2. Sugiyama M. Dimensionality Reduction of Multimodal Labeled Data by Local Fisher
Discriminant Analysis // The Journal of Machine Learning Research. 2007. № 8. P. 1061.
3. Breiman L. Random Forests // Machine Learning. 1997. № 45. P. 532.
4. Breiman L. Bagging Predictors // Machine Learning. 1996. № 24. P. 123140.
5. Friedman J. Predictive Learning via Rule Ensembles / J. Friedman and B. Popescu //
The Annals of Applied Statistics. 2008. № 2. P. 916954.
6 De Leeuw J. Homogeneity Analysis in R: The Package Homals / J. De Leeuw and
P. Mair // Journal of Statistical Software. 2008. № 31. P. 121.
7. Michailidis G. The Gifi System of Descriptive Multivariate Analysis / Michailidis G.
and De Leeuw J. // Statistical Science. 1998. Vol. 13. № 4. P. 307336.
8. Breiman L. Classification and Regression Trees / L. Breiman, J. Friedman, R. Olshen
and C. Stone. 1998. Belmont: Wadsworth
9. Liaw A. Classication and Regression by Random Forest /A. Liaw and M. Wiener// R
News. 2002. № 2. P. 1822.
10. Lin Y. Random Forests and Adaptive Nearest Neighbors /Y. Lin and Y. Jeon //
Journal of the American Statistical Association. 2006. № 101. P. 578590.
11. Shi T. Unsupervised Learning With Random Forest Predictors /T. Shi and S.
Horvath // Journal of Computational and Graphical Statistics. 2006. № 15. P. 118138.
12. Urbanek S. Visualizing Trees and Forests // in Handbook of Data Visualization. 2008.
Berlin, Heidelberg: Spinger. P. 243264.
13. Borg I. and Groenen P. Modern Multidimensional Scaling: Theory and Applications
1997. New York.
14. Kruskal J. Nonmetric Multidimensional Scaling: A Numerical Method //
Psychometrika. 1964. №29. P. 115–129.
15. Kruskal J., Wish M. Multidimensional Scaling. 1978. Beverly Hills: Sage Publications. |
Full article | Application of the Homogeneity Analysis for the Visualization and Analysis of Heterogeneous Data |