Document Type

Article

Journal/Book Title/Conference

Computers and Electronics in Agriculture

Volume

198

Publisher

Elsevier BV

Publication Date

7-1-2022

First Page

1

Last Page

12

Creative Commons License

Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License
This work is licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 4.0 License.

Abstract

Precision weed management offers a promising solution for sustainable cropping systems through the use of chemical-reduced/non-chemical robotic weeding techniques, which apply suitable control tactics to individual weeds or small clusters. Therefore, accurate identification of weed species plays a crucial role in such systems to enable precise, individualized weed treatment. Despite recent progress, the development of a robust weed identification and localization system in the presence of unstructured field environments remains a serious challenge, requiring supervised modeling using large volumes of annotated data. This paper makes a first comprehensive evaluation of deep transfer learning (DTL) for identifying common weed species specific to cotton (Gossypium hirsutum L.) production systems in southern United States (U.S.). A new dataset for weed identification was created, consisting of 5187 color images of 15 weed classes collected under natural light conditions and at varied weed growth stages, in cotton fields (primarily in Mississippi and North Carolina) during the 2020 and 2021 growth seasons. We evaluated 35 state-of-the-art deep learning models through transfer learning with repeated holdout validations and established an extensive benchmark for the considered weed identification task. DTL achieved high classification accuracy of F1 scores exceeding 95%, requiring reasonably short training time (less than 2.5 h) across models. ResNeXt101 achieved the best overall F1-score of 98.93 ± 0.34%, whereas 10 out of the 35 models achieved F1 scores near or above 98.0%. However, the performance on minority weed classes with few training samples was less satisfactory for models trained with a conventional, unweighted cross entropy loss function. To address this issue, a weighted cross entropy loss function was adopted, which achieved substantially improved accuracies for minority weed classes (e.g., the F1-scores for Xception and MnasNet on the Spurred Anoda weed increased from 48% to 90% and 50% to 82%, respectively). Furthermore, a deep learning-based cosine similarity metric was employed to analyze the similarity among weed classes, assisting in the interpretation of classifications. Both the codes (https://github.com/Derekabc/CottonWeeds) for model benchmarking and the weed dataset (https://www.kaggle.com/yuzhenlu/cottonweedid15) of this study are made publicly available, which expect to be a valuable resource for future research on weed identification and beyond.

Available for download on Monday, July 01, 2024

Share

COinS