Ultrasonography (Apr 2014)

Intra- and interobserver reliability of gray scale/dynamic range evaluation of ultrasonography using a standardized phantom

  • Song Lee,
  • Joon-Il Choi,
  • Michael Yong Park,
  • Dong Myung Yeo,
  • Jae Young Byun,
  • Seung Eun Jung,
  • Sung Eun Rha,
  • Soon Nam Oh,
  • Young Joon Lee

DOI
https://doi.org/10.14366/usg.13021
Journal volume & issue
Vol. 33, no. 2
pp. 91 – 97

Abstract

Read online

Purpose: To evaluate intra- and interobserver reliability of the gray scale/dynamic range of the phantom image evaluation of ultrasonography using a standardized phantom, and to assess the effect of interactive education on the reliability. Methods: Three radiologists (a resident, and two board-certified radiologists with 2 and 7 years of experience in evaluating ultrasound phantom images) performed the gray scale/dynamic range test for an ultrasound machine using a standardized phantom. They scored the number of visible cylindrical structures of varying degrees of brightness and made a ‘pass or fail’ decision. First, they scored 49 phantom images twice from a 2010 survey with limited knowledge of phantom images. After this, the radiologists underwent two hours of interactive education for the phantom images and scored another 91 phantom images from a 2011 survey twice. Intra- and interobserver reliability before and after the interactive education session were analyzed using K analyses. Results: Before education, the K-value for intraobserver reliability for the radiologist with 7 years of experience, 2 years of experience, and the resident was 0.386, 0.469, and 0.465, respectively. After education, the K-values were improved (0.823, 0.611, and 0.711, respectively). For interobserver reliability, the K-value was also better after the education for the 3 participants (0.067, 0.002, and 0.547 before education; 0.635, 0.667, and 0.616 after education, respectively). Conclusion: The intra- and interobserver reliability of the gray scale/dynamic range was fair to substantial. Interactive education can improve reliability. For more reliable results, double- checking of phantom images by multiple reviewers is recommended.

Keywords