Methods in Ecology and Evolution (Dec 2023)

PENet: A phenotype encoding network for automatic extraction and representation of morphological discriminative features

  • Zhengyu Zhao,
  • Yuanyuan Lu,
  • Yijie Tong,
  • Xin Chen,
  • Ming Bai

DOI
https://doi.org/10.1111/2041-210X.14235
Journal volume & issue
Vol. 14, no. 12
pp. 3035 – 3046

Abstract

Read online

Abstract Digitalized natural history collections serve as vital ecological and evolutionary research resources. Specimen retrieval based on morphological features allows for the rapid acquisition of similar specimens from these collections, aiding in maximizing the utilization of their resources and catering to the requirements of related research. However, achieving this objective requires effective feature extraction and representation techniques. We developed a phenotype encoding network (PENet), a deep learning‐based model that combines hashing methods to automatically extract and encode discriminative features into hash codes. We evaluated the performance of PENet on six data sets, including a newly constructed beetle data set (6566 images), which covers over 60% of the genera within the six subfamilies of Scarabaeidae. Phenotype encoding network showed high performance in feature extraction and image retrieval, allowing users to input an image of a specimen and efficiently retrieve all specimens with similar morphology. Two visualization methods, t‐SNE and Grad‐CAM, were used to evaluate the representation ability of the hash codes. Additionally, by using the hash codes generated from PENet, a phenetic distance tree was constructed based on the beetle data set. The result indicated that the hash codes could reveal the phenetic distances and relationships among categories to a certain extent. PENet provides an automatic and efficient method to extract and represent morphological discriminative features. The generated hash code can be used as a low‐dimensional carrier of these features, enabling efficient specimen retrieval. Moreover, the distance information carried by these hash codes suggests their potential in systematics, deserving further exploration.

Keywords