A Survey of Methods for Addressing Imbalance Data Problems in Agriculture Applications

Tajul Miftahushudur, Halil Mertkan Sahin, Bruce Grieve, Hujun Yin*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

112 Downloads (Pure)

Abstract

This survey explores recent advances in addressing class imbalance issues for developing machine learning models in precision agriculture, with a focus on techniques used for plant disease detection, soil management, and crop classification. We examine the impact of class imbalance on agricultural data and evaluate various resampling methods, such as oversampling and undersampling, as well as algorithm-level approaches, to mitigate this challenge. The paper also highlights the importance of evaluation metrics, including F1-score, G-mean, and MCC, in assessing the performance of machine learning models under imbalanced conditions. Additionally, the review provides an in-depth analysis of emerging trends in the use of generative models, like GANs and VAEs, for data augmentation in agricultural applications. Despite the significant progress, challenges such as noisy data, incomplete datasets, and lack of publicly available datasets remain. This survey concludes with recommendations for future research directions, including the need for robust methods that can handle high-dimensional agricultural data effectively.
Original languageEnglish
Article number454
JournalRemote Sensing
Volume17
Issue number3
DOIs
Publication statusPublished - 29 Jan 2025

Keywords

  • data imbalance
  • agriculture
  • machine learning
  • sampling techniques
  • precision farming

Fingerprint

Dive into the research topics of 'A Survey of Methods for Addressing Imbalance Data Problems in Agriculture Applications'. Together they form a unique fingerprint.

Cite this