Learning more from the inter-rater reliability of interstitial fibrosis assessment beyond just a statistic

Peir-In Liang; Wei-Chou Lin; Mei-Chin Wen; Shun-Chen Huang; Pei-Wei Fang; Hao-Wen Chuang; Yi-Jia Lin; Hui-Ping Chien; Huan-Da Chen; Tai-Di Chen

doi:10.1038/s41598-023-40221-6

Scientific Reports (Aug 2023)

Learning more from the inter-rater reliability of interstitial fibrosis assessment beyond just a statistic

Peir-In Liang,
Wei-Chou Lin,
Mei-Chin Wen,
Shun-Chen Huang,
Pei-Wei Fang,
Hao-Wen Chuang,
Yi-Jia Lin,
Hui-Ping Chien,
Huan-Da Chen,
Tai-Di Chen

Affiliations

Peir-In Liang: Department of Pathology, Kaohsiung Medical University Hospital, Kaohsiung Medical University
Wei-Chou Lin: Department of Pathology, National Taiwan University Hospital
Mei-Chin Wen: Department of Pathology, China Medical University Hsinchu Hospital
Shun-Chen Huang: Department of Anatomic Pathology, Chang Gung Memorial Hospital Kaohsiung Branch
Pei-Wei Fang: Department of Pathology, Fu Jen Catholic University Hospital, Fu Jen Catholic University
Hao-Wen Chuang: Department of Pathology and Laboratory Medicine, Kaohsiung Veterans General Hospital
Yi-Jia Lin: Department of Pathology, Tri-service General Hospital, National Defense Medical Center
Hui-Ping Chien: Department of Pathology and Laboratory Medicine, Shin Kong Wu Ho-Su Memorial Hospital
Huan-Da Chen: Department of Pathology, Kaohsiung Medical University Hospital, Kaohsiung Medical University
Tai-Di Chen: Department of Anatomic Pathology, Chang Gung Memorial Hospital Linkou Main Branch

DOI: https://doi.org/10.1038/s41598-023-40221-6
Journal volume & issue: Vol. 13, no. 1
pp. 1 – 10

Abstract

Read online

Abstract Interstitial fibrosis assessment by renal pathologists lacks good agreement, and we aimed to investigate its hidden properties and infer possible clinical impact. Fifty kidney biopsies were assessed by 9 renal pathologists and evaluated by intraclass correlation coefficients (ICCs) and kappa statistics. Probabilities of pathologists’ assessments that would deviate far from true values were derived from quadratic regression and multilayer perceptron nonlinear regression. Likely causes of variation in interstitial fibrosis assessment were investigated. Possible misclassification rates were inferred on reported large cohorts. We found inter-rater reliabilities ranged from poor to good (ICCs 0.48 to 0.90), and pathologists’ assessments had the worst agreements when the extent of interstitial fibrosis was moderate. 33.5% of pathologists’ assessments were expected to deviate far from the true values. Variation in interstitial fibrosis assessment was found to be correlated with variation in interstitial inflammation assessment (r2 = 32.1%). Taking IgA nephropathy as an example, the Oxford T scores for interstitial fibrosis were expected to be misclassified in 21.9% of patients. This study demonstrated the complexity of the inter-rater reliability of interstitial fibrosis assessment, and our proposed approaches discovered previously unknown properties in pathologists’ practice and inferred a possible clinical impact on patients.

Published in Scientific Reports

ISSN: 2045-2322 (Online)
Publisher: Nature Portfolio
Country of publisher: United Kingdom
LCC subjects: Medicine; Science
Website: https://www.nature.com/srep/

About the journal