Vegetable detection
deep learning
object detection
YOLO
precision agriculture
computer vision
Accurate detection of vegetables from images is an essential task in precision agriculture, automated food supply chains and smart retail systems. Even with a little labeled data, recent developments in deep learning, especially pretrained object detection models, have greatly enhanced performance on visual recognition tasks.. However, the relative effectiveness of modern pretrained object detection architectures on agricultural datasets remains underexplored. In this study, we present a comprehensive comparative evaluation of multiple state-of-the-art pretrained object detection models on a publicly available vegetable object detection dataset from Bangladesh. The evaluated models include RF-DETR (Large), YOLOv11 variants, Roboflow 3.0 Object Detection models and YOLOv12 variants with different capacity configurations (Fast, Accurate and Extra Large). Standard metrics like recall, precision and mAP@50 on a predetermined test set are used to evaluate performance. Experimental results demonstrate that YOLOv12 (Extra Large) significantly outperforms other models, achieving an mAP@50 of 79.0%, precision of 81.8% and recall of 80.7%. When implementing pretrained detectors in agricultural computer vision applications, the results emphasize the significance of model architecture and scale.