Статистика України (Mar 2017)

Data Reduction in Socio-Economic Studies

  • V. V. Lypchuk,
  • О. M. Krupa

DOI
https://doi.org/10.31767/su.1(76).2017.01.02
Journal volume & issue
no. 1(76)
pp. 15 – 20

Abstract

Read online

The article is devoted to the problem of data reduction as an important step on the way of providing reliability and efficiency of socio-economic studies. Through the reduction the large amounts of raw data, generated from different sources, become more useful, convenient and clear for use. Meanwhile, the data reduction is not treated as a separate phase of studies in the national statistic practice. The aim of the article is to substantiate the importance of data reduction in economic studies and attempt to systematize and generalize the essence and components of the phase of data reduction as well as ways of their implementation. The study is based on methods of theoretical generalization, abstract and logic, analogy and others. The essence of data reduction is defined as the process of converting raw data into the pure form and reducing the number of units’ attributes (features), which are not significant to further analysis. In fact, this is part of the analysis involving selection of the data that are most important from the viewpoint of the study’s goals. The significance of data reduction in economic studies is outlined. It is found that it assures the validity of their results, reduces their time and costs, simplifies the representational complexity of the problem being addressed, eliminates the errors and redundant data from the investigated set, looses the requirements to calculation tools. The data resulting from reduction are much more informative. Many dependencies and relationships become more readable (visual). It is emphasized that reduction applies to the current data (on-line), as well as to historical data (off-line), contained in the already created databases. The phases of data reduction are described. They are: control of data collection, data editing, classification, data construction and grouping, coding and transmission (data transmission to the processing tools - computers). Data reduction techniques and methods most common in the global practice are shown. Future studies of data reduction problems are expected to focus on potential ways to implement its advanced methods in the domestic practice of statistical science. It will allow for enhancing significantly the speed and efficiency of economic analysis and the reliability of its results.

Keywords