PLoS ONE (Jan 2019)

Comparison of shape quantification methods for genomic prediction, and genome-wide association study of sorghum seed morphology.

  • Lisa Sakamoto,
  • Hiromi Kajiya-Kanegae,
  • Koji Noshita,
  • Hideki Takanashi,
  • Masaaki Kobayashi,
  • Toru Kudo,
  • Kentaro Yano,
  • Tsuyoshi Tokunaga,
  • Nobuhiro Tsutsumi,
  • Hiroyoshi Iwata

DOI
https://doi.org/10.1371/journal.pone.0224695
Journal volume & issue
Vol. 14, no. 11
p. e0224695

Abstract

Read online

Seed shape is an important agronomic trait with continuous variation among genotypes. Therefore, the quantitative evaluation of this variation is highly important. Among geometric morphometrics methods, elliptic Fourier analysis and semi-landmark analysis are often used for the quantification of biological shape variations. Elliptic Fourier analysis is an approximation method to treat contours as a waveform. Semi-landmark analysis is a method of superimposed points in which the differences of multiple contour positions are minimized. However, no detailed comparison of these methods has been undertaken. Moreover, these shape descriptors vary when the scale and direction of the contour and the starting point of the contour trace change. Thus, these methods should be compared with respect to the standardization of the scale and direction of the contour and the starting point of the contour trace. In the present study, we evaluated seed shape variations in a sorghum (Sorghum bicolor Moench) germplasm collection to analyze the association between shape variations and genome-wide single-nucleotide polymorphisms by genomic prediction (GP) and genome-wide association studies (GWAS). In our analysis, we used all possible combinations of three shape description methods and eight standardization procedures for the scale and direction of the contour as well as the starting point of the contour trace; these combinations were compared in terms of GP accuracy and the GWAS results. We compared the shape description methods (elliptic Fourier descriptors and the coordinates of superposed pseudo-landmark points) and found that principal component analysis of their quantitative descriptors yielded similar results. Different scaling and direction standardization procedures caused differences in the principal component scores, average shape, and the results of GP and GWAS.