On-tree apple fruit size estimation using stereo vision with deep learning-based occlusion handling

Omeed Mirbod, Daeun Choi, Paul H. Heinemann, Richard P. Marini, Long He

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Apple (Malus domestica) fruit size plays an integral role in orchard management decision-making, particularly during chemical thinning, fruit quality assessment, and yield prediction. A machine vision system was developed using stereo cameras synchronised to a custom-built LED strobe to perform on-tree sizing of fruit in images with high measurement accuracy. Two deep neural network models (Faster R–CNN and Mask R–CNN) were trained to detect fruit candidates for sizing followed by extrapolation of occluded fruit regions to improve size estimation. The segmented fruit shapes were converted to metric surface areas and diameters using spatial resolutions and depth information from the stereo cameras. Monthly field trials from June to October using the camera system were conducted, measuring fruit diameters ranging from 22 to 82 mm, and compared against ground truth diameters. Diameter estimates had a mean absolute error ranging from 1.1 to 4.2 mm for the five-month trial period, an average error of 4.8% compared to ground truth diameter measurements. Standard deviation errors ranged from 0.7 to 1.9 mm. Using neural network models for intelligent sampling of fruit in images followed by extrapolation of missing regions can be an alternative method of handling fruit occlusion in agricultural imaging and improving sizing accuracy.

Original languageEnglish (US)
Pages (from-to)27-42
Number of pages16
JournalBiosystems Engineering
Volume226
DOIs
StatePublished - Feb 2023

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Food Science
  • Agronomy and Crop Science
  • Soil Science

Fingerprint

Dive into the research topics of 'On-tree apple fruit size estimation using stereo vision with deep learning-based occlusion handling'. Together they form a unique fingerprint.

Cite this