World Allergy Organization Journal (Jan 2019)
The evaluation of pollen concentrations with statistical and computational methods on rooftop and on ground level in Vienna – How to include daily crowd-sourced symptom data
Abstract
Background: It is recommended to position pollen monitoring stations on rooftop level to assure a large catchment area and to gain data that are representative for a regional scale. Herein, an investigation of the representativeness of pollen concentrations was performed for 20 pollen types in the pollen seasons 2015–2016 in Vienna for rooftop and ground level and was compared with weather data and for the first time with symptom data. Methods: The complete data set was analyzed with various statistical methods including Spearmen correlation, ANOVA, Kolmogorov–Smirnov test and logistic regression calculation: Odds ratio and Yule's Q values. Computational intelligence methods, namely Self Organizing Maps (SOMs) were employed that are capable of describing similarities and interdependencies in an effective way taking into account the U-matrix as well. The Random Forest algorithm was selected for modeling symptom data. Results: The investigation of the representativeness of pollen concentrations on rooftop and ground level concerns the progress of the season, the peak occurrences and absolute quantities. Most taxa examined showed similar patterns (e.g. Betula), while others showed differences in pollen concentrations exposure on different heights (e.g. the Poaceae family). Maximum temperature, mean temperature and humidity showed the highest influence among the weather parameters and daily pollen concentrations for the majority of taxa in both traps. Conclusion: The rooftop trap was identified as the more adequate one when compared with the local symptom data. Results show that symptom data correlate more with pollen concentrations measured on rooftop than with those measured on ground level. Keywords: Symptom data, Sampling height, Pollen concentrations, Computational methods