Scientific Reports (Jan 2024)

Broadening the capture of natural products mentioned in FAERS using fuzzy string-matching and a Siamese neural network

  • Israel O. Dilán-Pantojas,
  • Tanupat Boonchalermvichien,
  • Sanya B. Taneja,
  • Xiaotong Li,
  • Maryann R. Chapin,
  • Sandra Karcher,
  • Richard D. Boyce

DOI
https://doi.org/10.1038/s41598-023-51004-4
Journal volume & issue
Vol. 14, no. 1
pp. 1 – 10

Abstract

Increased sales of natural products (NPs) in the US and growing safety concerns highlight the need for NP pharmacovigilance. One challenge for NP pharmacovigilance is the ambiguity with which NPs are named in spontaneous reporting systems. We combined fuzzy string-matching with a neural network to reduce this ambiguity, with the aim of increasing the capture of reports involving NPs in the US Food and Drug Administration Adverse Event Reporting System (FAERS). We used Gestalt pattern-matching (GPM) and a Siamese neural network (SM) to identify potential mentions of NPs of interest in 389,386 FAERS reports with unmapped drug names. A team of health professionals then refined these candidates through manual review and annotation. After candidate adjudication, GPM had identified 595 unique NP names and SM 504. There was little overlap between the candidates identified by each method (non-overlapping: GPM 347, SM 248). In total, we identified 686 novel NP names from FAERS reports. Including these names in the FAERS collection yielded 3,486 additional reports mentioning NPs.
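To illustrate the fuzzy string-matching step, the following is a minimal sketch rather than the authors' pipeline. It assumes Python's standard-library difflib.SequenceMatcher, whose documentation describes its algorithm as closely related to the Ratcliff/Obershelp "gestalt pattern matching" approach. The function names, the example strings, and the 0.8 similarity threshold are all hypothetical choices made for illustration; the abstract does not state the threshold or preprocessing actually used.

```python
from difflib import SequenceMatcher


def gpm_similarity(a: str, b: str) -> float:
    """Return a gestalt-style similarity ratio in [0, 1] for two strings."""
    # Lowercasing is an assumed normalization step, not taken from the paper.
    return SequenceMatcher(None, a.lower(), b.lower(), autojunk=False).ratio()


def candidate_matches(unmapped, np_names, threshold=0.8):
    """Pair each unmapped drug string with its best-scoring NP name.

    Only pairs whose score reaches `threshold` (an assumed cutoff) are kept
    as candidates for later manual review.
    """
    hits = []
    for raw in unmapped:
        best_name, best_score = None, 0.0
        for name in np_names:
            score = gpm_similarity(raw, name)
            if score > best_score:
                best_name, best_score = name, score
        if best_name is not None and best_score >= threshold:
            hits.append((raw, best_name, round(best_score, 3)))
    return hits


if __name__ == "__main__":
    # Hypothetical inputs: one misspelled NP name and one unrelated drug name.
    reference = ["kratom", "ginkgo"]
    reported = ["KRATM", "aspirin"]
    for raw, name, score in candidate_matches(reported, reference):
        print(f"{raw!r} -> {name!r} (similarity {score})")
```

In a workflow like the one described above, the surviving pairs would be treated only as candidates; the abstract indicates that health professionals reviewed and adjudicated them before any name was accepted.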