Serbian Journal of Management (May 2013)

HIGHLY ROBUST METHODS IN DATA MINING

  • Jan Kalina

DOI
https://doi.org/10.5937/sjm8-3226
Journal volume & issue
Vol. 8, no. 1
pp. 9 – 24

Abstract

Read online

This paper is devoted to highly robust methods for information extraction from data, with a special attention paid to methods suitable for management applications. The sensitivity of availabledata mining methods to the presence of outlying measurements in the observed data is discussed as a major drawback of available data mining methods. The paper proposes several newhighly robustmethods for data mining, which are based on the idea of implicit weighting of individual data values.Particularly it propose a novel robust method of hierarchical cluster analysis, which is a popular data mining method of unsupervised learning. Further, a robust method for estimating parameters in thelogistic regression was proposed. This idea is extended to a robust multinomial logistic classification analysis. Finally, the sensitivity of neural networks to the presence of noise and outlying measurements in the data was discussed. The method for robust training of neural networks for the task of function approximation, which has the form of a robust estimator in nonlinear regression, was proposed.

Keywords