Heliyon (Oct 2023)
Relation of lumbar intervertebral disc height and severity of disc degeneration based on Pfirrmann scores
Abstract
Background: Disc height (DH) change is considered one of the most critical factors in assessing intervertebral disc degeneration (IVD). Pfirrmann et al. developed a scoring system for disc degeneration evaluation based on changes in DH in magnetic resonance imaging (MRI). While the relationship between DH measurements and Pfirrmann scores for disc degeneration has been explored, the validity of different DH measuring techniques or their connection with disc degeneration is yet uncertain. The present study investigates intra-rater and inter-rater agreement and reliability of different DH measurement methods on MRI and evaluates the relationship between different DH measurement methods and Pfirrmann scores of IVD degeneration, as well as between different Pfirrmann scores and clinical outcomes. Methods: Adult patients with MRI scans of the lumbar spine were recruited. Eight DH measuring techniques were tested for intra-rater and inter-rater agreement and reliability. Bland and Altman's Limits of Agreement (LOA) was used to evaluate intra-rater and inter-rater agreements. Intra-rater and inter-rater reliability were evaluated using intra-class correlations (ICC) with 95 % confidence intervals (95 % CI). The association between DH and Pfirrmann scores was examined using one-way ANOVA. Results: Excellent intra-rater reliability was reported for 332 participants on DH (ranging from 0.912 (0.901, 0.923) to 0.973 (0.964, 0.981) and from 0.902 (0.892, 0.915) to 0.975 (0.962, 0.985) by two independent raters). All measuring methods had high intra-rater agreement, except for methods 4 and 5. All methods had good-to-excellent of inter-rater reliability on DH (ICCs ranging from 0.812 (0.795, 0.828) to 0.995 (0.994, 0.995)) except for the posterior disc material length of method 5 (ICC 0.740 (0.718, 0.761)). Methods 1 to 6 for evaluating DH in patients with spondylolisthesis had poor inter-rater reliability. The IVD levels with grades IV and V in Pfirrmann scores had significantly lower DH than the IVD levels with grades I to III in Pfirrmann scores. IVD levels with grades IV and V in Pfirrmann scores had significantly higher VAS and ODI than IVD levels with grades I in Pfirrmann scores. Conclusion: A good-to-excellent intra-rater and inter-rater reliability was achieved on most DH measuring methods on MRI following a standardized and structured protocol. However, small anatomical structures and different tissue borders could influence measurements. Additionally, DH can differentiate between grade IV and V Pfirrmann scores, and severe IVD degeneration (IV and V Pfirrmann) is linked to clinical outcomes.