Machine Learning-based Models for Outpatient Prescription of Kampo Formulations: An Analysis of a Health Insurance Claims Database

Hayato Yamana; Akira Okada; Sachiko Ono; Nobuaki Michihata; Taisuke Jo; Hideo Yasunaga

doi:10.2188/jea.JE20220089

Journal of Epidemiology (Jan 2024)

Machine Learning-based Models for Outpatient Prescription of Kampo Formulations: An Analysis of a Health Insurance Claims Database

Hayato Yamana,
Akira Okada,
Sachiko Ono,
Nobuaki Michihata,
Taisuke Jo,
Hideo Yasunaga

Affiliations

Hayato Yamana: Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Akira Okada: Department of Prevention of Diabetes and Lifestyle-Related Diseases, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Sachiko Ono: Department of Eat-loss Medicine, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Nobuaki Michihata: Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Taisuke Jo: Department of Health Services Research, Graduate School of Medicine, The University of Tokyo, Tokyo, Japan
Hideo Yasunaga: Department of Clinical Epidemiology and Health Economics, School of Public Health, The University of Tokyo, Tokyo, Japan

DOI: https://doi.org/10.2188/jea.JE20220089
Journal volume & issue: Vol. 34, no. 1
pp. 8 – 15

Abstract

Read online

Background: Despite the widespread practice of Japanese traditional Kampo medicine, the characteristics of patients receiving various Kampo formulations have not been documented in detail. We applied a machine learning model to a health insurance claims database to identify the factors associated with the use of Kampo formulations. Methods: A 10% sample of enrollees of the JMDC Claims Database in 2018 and 2019 was used to create the training and testing sets, respectively. Logistic regression analyses with lasso regularization were performed in the training set to construct models with prescriptions of 10 commonly used Kampo formulations in 1 year as the dependent variable and data of the preceding year as independent variables. Models were applied to the testing set to calculate the C-statistics. Additionally, the performance of simplified scores using 10 or 5 variables were evaluated. Results: There were 338,924 and 399,174 enrollees in the training and testing sets, respectively. The commonly prescribed Kampo formulations included kakkonto, bakumondoto, and shoseityuto. Based on the lasso models, the C-statistics ranged from 0.643 (maoto) to 0.888 (tokishakuyakusan). The models identified both the common determinants of different Kampo formulations and the specific characteristics associated with particular Kampo formulations. The simplified scores were slightly inferior to full models. Conclusion: Lasso regression models showed good performance for explaining various Kampo prescriptions from claims data. The models identified the characteristics associated with Kampo formulation use.

Published in Journal of Epidemiology

ISSN: 0917-5040 (Print); 1349-9092 (Online)
Publisher: Japan Epidemiological Association
Country of publisher: Japan
LCC subjects: Medicine: Medicine (General)
Website: http://jeaweb.jp/english/journal/index.html

About the journal

Abstract

Keywords