Journal of the Serbian Chemical Society (Jan 2022)
Laboratory data clustering in defining population cohorts: Case study on metabolic indicators
Abstract
The knowledge on the general population health is important for creating public policies and organization of medical services. However, personal data are often limited, and mathematical models are employed to achieve a general overview. Cluster analysis was used in this study to assess general trends in population health based on laboratory data. Metabolic indicators were chosen to test the model and define population cohorts. Data on blood analysis of 33,049 persons, namely the concentrations of glucose, total cholesterol and triglycerides, were collected in a public health laboratory and used to define metabolic cohorts employing computational data clustering (CLARA method). The population was shown to be distributed in 3 clusters: persons with hypercholesterolemia with or without changes in the concentration of triglycerides or glucose, persons with reference or close to reference concentrations of all three analytes and persons with predominantly elevated all three parameters. Clustering of biochemical data, thus, is a useful statistical tool in defining population groups in respect to certain health aspect.
Keywords