Efficient generation of HPLC and FTIR data for quality assessment using time series generation model: a case study on Tibetan medicine Shilajit

Rong Ding; Shiqi He; Xuemei Wu; Liwen Zhong; Guopeng Chen; Rui Gu

doi:10.3389/fphar.2024.1503508

Frontiers in Pharmacology (Nov 2024)

Efficient generation of HPLC and FTIR data for quality assessment using time series generation model: a case study on Tibetan medicine Shilajit

Rong Ding,
Shiqi He,
Xuemei Wu,
Liwen Zhong,
Guopeng Chen,
Rui Gu

Affiliations

Rong Ding: State Key Laboratory of Southwestern Chinese Medicine Resources, School of Ethnic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China
Shiqi He: School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China
Xuemei Wu: State Key Laboratory of Southwestern Chinese Medicine Resources, School of Ethnic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China
Liwen Zhong: School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu, China
Guopeng Chen: State Key Laboratory of Southwestern Chinese Medicine Resources, School of Ethnic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China
Rui Gu: State Key Laboratory of Southwestern Chinese Medicine Resources, School of Ethnic Medicine, Chengdu University of Traditional Chinese Medicine, Chengdu, China

DOI: https://doi.org/10.3389/fphar.2024.1503508
Journal volume & issue: Vol. 15

Abstract

Read online

BackgroundThe scarcity and preciousness of plateau characteristic medicinal plants pose a significant challenge in obtaining sufficient quantities of experimental samples for quality evaluation. Insufficient sample sizes often lead to ambiguous and questionable quality assessments and suboptimal performance in pattern recognition. Shilajit, a popular Tibetan medicine, is harvested from high altitudes above 2000 m, making it difficult to obtain. Additionally, the complex geographical environment results in low uniformity of Shilajit quality.MethodsTo address these challenges, this study employed a deep learning model, time vector quantization variational auto- encoder (TimeVQVAE), to generate data matrices based on chromatographic and spectral for different grades of Shilajit, thereby increasing in the amount of data. Partial least squares discriminant analysis (PLS-DA) was used to identify three grades of Shilajit samples based on original, generated, and combined data.ResultsCompared with the originally generated high performance liquid chromatography (HPLC) and Fourier transform infrared spectroscopy (FTIR) data, the data generated by TimeVQVAE effectively preserved the chemical profile. In the test set, the average matrices for HPLC, FTIR, and combined data increased by 32.2%, 15.9%, and 23.0%, respectively. On the real test data, the PLS-DA model’s classification accuracy initially reached a maximum of 0.7905. However, after incorporating TimeVQVAE-generated data, the accuracy significantly improved, reaching 0.9442 in the test set. Additionally, the PLS-DA model trained with the fused data showed enhanced stability.ConclusionThis study offers a novel and effective approach for researching medicinal materials with small sample sizes, and addresses the limitations of improving model performance through data augmentation strategies.

Published in Frontiers in Pharmacology

ISSN: 1663-9812 (Online)
Publisher: Frontiers Media S.A.
Country of publisher: Switzerland
LCC subjects: Medicine: Therapeutics. Pharmacology
Website: http://journal.frontiersin.org/journals/pharmacology

About the journal

Abstract

Keywords