JMIR Formative Research (Jan 2022)

Prediction of Age-Adjusted Mortality From Stroke in Japanese Prefectures: Ecological Study Using Search Engine Queries

  • Kazuya Taira,
  • Sumio Fujita

DOI
https://doi.org/10.2196/27805
Journal volume & issue
Vol. 6, no. 1
p. e27805

Abstract

Read online

BackgroundStroke is a major cause of death and the need for nursing care in Japan, with large regional disparities. ObjectiveThe purpose of this study was to clarify the association between stroke-related information retrieval behavior and age-adjusted mortality in each prefecture in Japan. MethodsAge-adjusted mortality from stroke and aging rates were obtained from publicly available Japanese government statistics. A total of 9476 abstracts of Japanese articles related to symptoms and signs of stroke were identified in Ichushi-Web, a Japanese web-based database of biomedical articles, and 100 highly frequent words (hereafter referred to as the Stroke 100) were extracted. Using data from 2014 to 2019, a random forest analysis was carried out using the age-adjusted mortality from stroke in 47 prefectures as the outcome variable and the standardized retrieval numbers of the Stroke 100 words in the log data of Yahoo! JAPAN Search as predictive variables. Regression analysis was performed using a generalized linear mixed model (GLMM) with the number of standardized searches for Stroke 100 words with high importance scores in the random forest model as the predictive variable. In the GLMM, the aging rate and data year were used as control variables, and the random slope of data year and random intercept were calculated by prefecture. ResultsThe mean age-adjusted mortality from stroke was 28.07 (SD 4.55) deaths per 100,000 for all prefectures in all data years. The accuracy score of the random forest analysis was 89.94%, the average error was 2.79 degrees, and the mean squared error was 13.57 degrees. The following 9 variables with high importance scores in the random forest analysis were selected as predictive variables for the regression analysis: male, age, hospitalization, enforcement, progress, stroke, abnormal, use, and change. As a result of the regression analysis with GLMM, the standardized partial regression coefficients (β) and 95% confidence intervals showed that the following internet search terms were significantly associated with age-adjusted mortality from stroke: male (β=−5.83, 95% CI −8.67 to −3.29), age (β=−5.83, 95% CI −8.67 to −3.29), hospitalization (β=−5.83, 95% CI −8.67 to −3.29), and abnormal (β=3.83, 95% CI 1.14 to 6.56). ConclusionsStroke-related search behavior was associated with age-adjusted mortality from stroke in each prefecture in Japan. Query terms that were strongly associated with age-adjusted mortality rates of stroke suggested the possibility that individual characteristics, such as sex and age, have an impact on stroke-associated mortality and that it is important to receive medical care early after stroke onset. Further studies on the criteria and timing of alerting are needed by monitoring information-seeking behavior to identify queries that are strongly associated with stroke mortality.