InsectAgent: Improving Insect Recognition through Dynamic Information Augmentation with Multimodal Large Language Models

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Insect recognition remains a critical challenge for biodiversity monitoring, conservation efforts, and agricultural sustainability. Current computer vision approaches struggle with accurate species identification due to subtle morphological differences. Our analysis reveals that while vision classifiers frequently fail to predict the correct species as their top choice, they consistently include the true species within top candidate predictions. This indicates that expert entomological knowledge is required to resolve ambiguities when vision classifiers fail. We present InsectAgent, a novel two-stage framework that enhances insect recognition through dynamic information augmentation using Multimodal Large Language Models (MLLMs). In the first stage, a vision classifier generates candidate species predictions with confidence scores. When confidence falls below a threshold, the second stage activates, retrieving relevant taxonomic knowledge from an expert knowledge base and invoking an MLLM for further analysis. This conditional MLLM invocation strategy significantly reduces computational costs by avoiding expensive model calls for high-confidence predictions while ensuring expertlevel reasoning for ambiguous cases. The information-augmented reasoning process combines visual cues with domain expertise, mirroring expert entomologists' workflow. Experimental results demonstrate that InsectAgent significantly outperforms standalone vision classifiers, achieving an average relative improvement of 14.24% in accuracy for insect identification tasks.

Original languageEnglish (US)
Title of host publicationIEEE Computer Society Annual Symposium on VLSI, ISVLSI 2025 - Conference Proceedings
PublisherIEEE Computer Society
ISBN (Electronic)9798331534776
DOIs
StatePublished - 2025
Event28th IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2025 - Kalamata, Greece
Duration: Jul 6 2025Jul 9 2025

Publication series

NameProceedings of IEEE Computer Society Annual Symposium on VLSI, ISVLSI
ISSN (Print)2159-3469
ISSN (Electronic)2159-3477

Conference

Conference28th IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2025
Country/TerritoryGreece
CityKalamata
Period7/6/257/9/25

All Science Journal Classification (ASJC) codes

  • Control and Systems Engineering
  • Hardware and Architecture
  • Electrical and Electronic Engineering

Fingerprint

Dive into the research topics of 'InsectAgent: Improving Insect Recognition through Dynamic Information Augmentation with Multimodal Large Language Models'. Together they form a unique fingerprint.

Cite this