BMC Medical Informatics and Decision Making (Sep 2024)

Comparison of semi and fully automated artificial intelligence driven softwares and manual system for cephalometric analysis

  • Rumeesha Zaheer,
  • Hafiza Zobia Shafique,
  • Zahra Khalid,
  • Rooma Shahid,
  • Abdullah Jan,
  • Tooba Zahoor,
  • Ramsha Nawaz,
  • Mehak ul Hassan

DOI
https://doi.org/10.1186/s12911-024-02664-3
Journal volume & issue
Vol. 24, no. 1
pp. 1 – 11

Abstract

Read online

Abstract Background Cephalometric analysis has been used as one of the main tools for orthodontic diagnosis and treatment planning. The analysis can be performed manually on acetate tracing sheets, digitally by manual selection of landmarks or by recently introduced Artificial Intelligence (AI)-driven tools or softwares that automatically detect landmarks and analyze them. The use of AI-driven tools is expected to avoid errors and make it less time consuming with effective evaluation and high reproducibility. Objective To conduct intra- and inter-group comparisons of the accuracy and reliability of cephalometric tracing and evaluation done manually and with AI-driven tools that is WebCeph and CephX softwares. Methods Digital and manual tracing of lateral cephalometric radiographs of 54 patients was done. 18 cephalometric parameters were assessed on each radiograph by 3 methods, manual method and by using semi (WebCeph) and fully automatic softwares (Ceph X). Each parameter was assessed by two investigators using these three methods. SPSS was then used to assess the differences in values of cephalometric variables between investigators, between softwares, between human investigator means and software means. ICC and paired T test were used for intra-group comparisons while ANOVA and post-hoc were used for inter-group comparisons. Results Twelve out of eighteen variables had high intra-group correlation and significant ICC p-values, 5 variables had relatively lower values and only one variable (SNO) had significantly low ICC value. Fifteen out of eighteen variables had minimal detection error using fully-automatic method of cephalometric analysis. Only three variables had lowest detection error using semi-automatic method of cephalometric analysis. Inter-group comparison revealed significant difference between three methods for eight variables; Witts, NLA, SNGoGn, Y-Axis, Jaraback, SNO, MMA and McNamara to Point A. Conclusion There is a lack of significant difference among manual, semiautomatic and fully automatic methods of cephalometric tracing and analysis in terms of the variables measured by these methods. The mean detection errors were the highest for manual analysis and lowest for fully automatic method. Hence the fully automatic AI software has the most reproducible and accurate results.

Keywords