ChatGPT and general-purpose AI count fruits in pictures surprisingly well without programming or training

Konlavach Mengsuwan; Juan C. Rivera-Palacio; Masahiro Ryo

Smart Agricultural Technology (Dec 2024)

Affiliations

Konlavach Mengsuwan: Research Platform Data Analysis & Simulation, Leibniz Centre for Agricultural Landscape Research (ZALF), Müncheberg, Germany; Environment and Natural Sciences, Brandenburg University of Technology Cottbus‐Senftenberg, Cottbus, Germany
Juan C. Rivera-Palacio: Research Platform Data Analysis & Simulation, Leibniz Centre for Agricultural Landscape Research (ZALF), Müncheberg, Germany; Environment and Natural Sciences, Brandenburg University of Technology Cottbus‐Senftenberg, Cottbus, Germany; Alliance of Bioversity International and CIAT, Rome, Italy
Masahiro Ryo: Research Platform Data Analysis & Simulation, Leibniz Centre for Agricultural Landscape Research (ZALF), Müncheberg, Germany; Environment and Natural Sciences, Brandenburg University of Technology Cottbus‐Senftenberg, Cottbus, Germany. Corresponding author at: Research Platform Data Analysis & Simulation, Leibniz Centre for Agricultural Landscape Research (ZALF), Müncheberg, Germany.

Journal volume & issue: Vol. 9, p. 100688

Abstract

General-purpose artificial intelligence (AI) can facilitate agricultural digitalization because many tools require no coding. Yet it remains unclear how well emerging general-purpose AI technologies perform object counting, a fundamental task in agricultural digitalization, compared with current standard practice. We show that ChatGPT (GPT-4V) achieved moderate performance in counting coffee cherries from images, while T-Rex, a foundation model for object counting, achieved high accuracy. Testing with a hundred images, we found that ChatGPT can count cherries and that its performance improves with human feedback (R² = 0.36 and 0.46 without and with feedback, respectively). The T-Rex foundation model required only a few samples for training but outperformed YOLOv8, the conventional best-practice model (R² = 0.92 and 0.90, respectively). Obtaining results with these models took 100 times less time than with the conventional best practice. These results bring two surprises for deep learning users in applied domains: a foundation model can drastically save effort while achieving higher accuracy than a conventional approach, and ChatGPT can achieve relatively good performance, especially when guided with examples and feedback. Because no coding skills are required, these tools can influence education, outreach, and real-world implementation of generative AI for supporting farmers.
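
For readers unfamiliar with the R2 values quoted in the abstract, the following is a minimal sketch of how agreement between manual and model-produced counts could be scored. It assumes R2 means the coefficient of determination (1 - SS_res / SS_tot); the paper may instead report the squared correlation from a fitted regression line, which can give a different number. The function name and the counts are hypothetical and are not taken from the study.

```python
def r_squared(actual, predicted):
    """Coefficient of determination: 1 - SS_res / SS_tot.

    Assumption: this is one common definition of R2; the source does not
    state which definition it uses.
    """
    if len(actual) != len(predicted) or len(actual) < 2:
        raise ValueError("need two equally sized sequences with at least 2 items")
    mean_actual = sum(actual) / len(actual)
    # Squared error between the manual counts and the model's counts.
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    # Total variation of the manual counts around their mean.
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    if ss_tot == 0:
        raise ValueError("actual counts are constant; R2 is undefined")
    return 1.0 - ss_res / ss_tot


# Hypothetical per-image cherry counts (illustrative only, not study data).
manual_counts = [12, 30, 45, 8, 27]
model_counts = [10, 33, 41, 9, 30]

score = r_squared(manual_counts, model_counts)
print(f"R2 = {score:.3f}")
```

Under this definition a value of 1 means the model's counts match the manual counts exactly, and lower values mean larger counting errors relative to the spread of the true counts.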

Published in Smart Agricultural Technology

ISSN: 2772-3755 (Online)
Publisher: Elsevier
Country of publisher: Netherlands
LCC subjects: Agriculture: Agriculture (General); Social Sciences: Industries. Land use. Labor: Special industries and trades: Agricultural industries
Website: https://www.journals.elsevier.com/smart-agricultural-technology