Journal of Health Management & Informatics (Jan 2018)

Data Quality Assessment and Recommendations to Improve the Quality of Hemodialysis Database

  • Neda Firouraghi,
  • Shahrokh Ezzatzadegan Jahromi,
  • Ashkan Sami,
  • Mohamad Reza Morvaridi,
  • Roxana Sharifian

Journal volume & issue
Vol. 5, no. 1
pp. 21 – 26

Abstract

Read online

Introduction: Since clinical data contain abnormalities, quality assessment and reporting of data errors are necessary. Data quality analysis consists of developing strategies, making recommendations to avoid future errors and improving the quality of data entry by identifying error types and their causes. Therefore, this approach can be extremely useful to improve the quality of the databases. The aim of this study was to analyze hemodialysis (HD) patients’ data in order to improve the quality of data entry and avoid future errors. Method: The study was done on Shiraz University of Medical Sciences HD database in 2015. The database consists of 2367 patients who had at least 12 months follow up (22.34±11.52 months) in 2012-2014. Duplicated data were removed; outliers were detected based on statistical methods, expert opinion and the relationship between variables; then, the missing values were handled in 72 variables by using IBM SPSS Statistics 22 in order to improve the quality of the database. According to the results, some recommendations were given to improve the data entry process. Results: The variables had outliers in the range of 0-9.28 percent. Seven variables had missing values over 20 percent and in the others they were between 0 and 19.73 percent. The majority of missing values belong to serum alkaline phosphatase, uric acid, high and low density lipoprotein, total iron binding capacity, hepatitis B surface antibody titer, and parathyroid hormone. The variables with displacement (the values of two or more variables were recorded in the wrong attribute) were weight, serum creatinine, blood urea nitrogen, systolic and diastolic blood pressure. These variables may lead to decreased data quality. Conclusion: According to the results and expert opinion, applying some data entry principles, such as defining ranges of values, using the relationship between hemodialysis features, developing alert systems about empty or duplicated data and entering directly HD data or lab results into the database can improve the data quality drastically. Experts' opinion in detecting outliers as a complement to statistical methods can have an effective role in detection of real outliers. For the analysis of HD databases, the relationship between the variables because of their effect on the quality should be focused more to improve the quality of the database.

Keywords