Benchmarking In-the-Wild Multimodal Plant Disease Recognition and A Versatile Baseline
Tianqi Wei | Zhi Chen | Zi Huang | Xin Yu |

Introduction
Existing deep-learning methods have achieved remarkable performance in recognizing in-laboratory plant disease images. However, their performance often significantly degrades in classifying in-the-wild images. Furthermore, we observed that in-the-wild plant images may exhibit similar appearances across various diseases (i.e., small inter-class discrepancy) while the same diseases may look quite different (i.e., large intra-class variance). Motivated by this observation, we propose an in-the-wild multimodal plant disease recognition dataset, PlantWild, which contains the largest number of disease classes but also text-based descriptions for each disease. PlantWild is currently the largest dataset containing wild plant disease images.

Method
Code of our baseline MVPDR is available through https://github.com/tqwei05/MVPDR.
The workflow of MVPDR is presented as the following figure.
It models text descriptions and visual data through multiple prototypes and can achieve outstanding performance on in-the-wild plant disease images.

Paper
![]() |
"Benchmarking In-the-Wild Multimodal Plant Disease Recognition and A Versatile Baseline", Tianqi Wei, Zhi Chen, Zi Huang, Xin Yu. ACM International Conference of Multimedia 2024 [PDF] |
Downloads
PlantWild
The PlantWild dataset is accessible through:
- Google Drive Download Link
- Hugging Face
- UQRDM (Password: plantwildv1)
PlantWild_v2
We invited experts in the field of agriculture to refine our dataset.
We also added images of new types of diseases according to experts' suggestions.
The obtained dataset, PlantWild_v2, has enhanced data reliability and the number of classes has been expanded to 115.
This version can be accessed through:
- Google Drive Download Link
- Hugging Face
- UQRDM (Password: plantwildv2)
PlantSeg with Segmentation annotations
On top of the plant disease recognition dataset PlantWild, we present the PlantSeg dataset that provides segmentation annotations focused on diseased parts for images.

Copyright
Our dataset follows the copyright Creative Commons BY-NC-ND 4.0 license.
BibTeX
@inproceedings{MVPDR,
title={Benchmarking In-the-Wild Multimodal Plant Disease Recognition and A Versatile Baseline},
author={Wei, Tianqi and Chen, Zhi and Huang, Zi and Yu, Xin},
booktitle={ACM International Conference of Multimedia},
year={2024}}