npj Digital Medicine (Oct 2022)

Automated clinical coding: what, why, and where we are?

  • Hang Dong,
  • Matúš Falis,
  • William Whiteley,
  • Beatrice Alex,
  • Joshua Matterson,
  • Shaoxiong Ji,
  • Jiaoyan Chen,
  • Honghan Wu

DOI
https://doi.org/10.1038/s41746-022-00705-7
Journal volume & issue
Vol. 5, no. 1
pp. 1 – 8

Abstract

Clinical coding is the task of transforming medical information in a patient’s health records into structured codes so that they can be used for statistical analysis. This is a cognitive and time-consuming task that follows a standard process in order to achieve a high level of consistency. Clinical coding could potentially be supported by an automated system to improve the efficiency and accuracy of the process. We introduce the idea of automated clinical coding and summarise its challenges from the perspective of Artificial Intelligence (AI) and Natural Language Processing (NLP), based on the literature, our project experience over the past two and a half years (late 2019–early 2022), and discussions with clinical coding experts in Scotland and the UK. Our research reveals the gaps between the current deep learning-based approach applied to clinical coding and the need for explainability and consistency in real-world practice. Knowledge-based methods that represent and reason about the standard, explainable process of a task may need to be incorporated into deep learning-based methods for clinical coding. Automated clinical coding is a promising task for AI, despite the technical and organisational challenges. Coders need to be involved in the development process. There is much to achieve to develop and deploy an AI-based automated system to support coding in the next five years and beyond.