Video segmentation is a computer vision task, typically solved with deep learning, that enables autonomous vehicles to perceive and interpret real-world scenes in real time. Since video footage comprises a sequence of static frames, a sufficiently fast image segmentation algorithm can be applied frame by frame to perform video segmentation. Image segmentation is commonly divided into two categories: instance segmentation and semantic segmentation. Instance segmentation is often preferred over semantic segmentation in this setting because it separates individual objects and preserves the spatial location of each one, whereas semantic segmentation only assigns a class label to every pixel. Furthermore, an instance segmentation algorithm can be viewed as an object detection algorithm that also generates a pixel-wise mask for each object instance. This paper discusses the fundamental components of two model families, R-CNN and YOLO, as well as the evaluation metrics used to assess them, and compares one R-CNN model (Mask R-CNN) with one YOLO model (YOLOv5) on the image object detection task.
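The distinction between the two segmentation categories can be illustrated with a small sketch. The code below is not taken from the thesis; the toy frame, the `bounding_box` and `to_semantic` helpers, and the `"car"` label are all hypothetical names chosen for illustration. It assumes each instance is given as a binary mask and shows that per-instance masks yield one bounding box per object, while collapsing them into a semantic map loses the separation between objects of the same class.

```python
from typing import List, Tuple

Mask = List[List[int]]  # 1 where the pixel belongs to the object, else 0


def bounding_box(mask: Mask) -> Tuple[int, int, int, int]:
    """Return (row_min, col_min, row_max, col_max) of the nonzero pixels."""
    coords = [(r, c) for r, row in enumerate(mask) for c, v in enumerate(row) if v]
    if not coords:
        raise ValueError("mask contains no object pixels")
    rows = [r for r, _ in coords]
    cols = [c for _, c in coords]
    return min(rows), min(cols), max(rows), max(cols)


def to_semantic(instances: List[Tuple[str, Mask]], height: int, width: int) -> List[List[str]]:
    """Collapse per-instance masks into a single class label per pixel."""
    semantic = [["background"] * width for _ in range(height)]
    for label, mask in instances:
        for r in range(height):
            for c in range(width):
                if mask[r][c]:
                    semantic[r][c] = label
    return semantic


# Hypothetical 3x6 frame containing two objects of the same class.
car_a: Mask = [
    [1, 1, 0, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]
car_b: Mask = [
    [0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 1, 1],
    [0, 0, 0, 0, 1, 1],
]
instances = [("car", car_a), ("car", car_b)]

# Instance view: one box (and one mask) per object.
boxes = [bounding_box(mask) for _, mask in instances]

# Semantic view: every car pixel carries the same label, so the two cars
# can no longer be told apart from the label map alone.
semantic = to_semantic(instances, height=3, width=6)
```

In this toy frame the instance view keeps two separate boxes, one per car, whereas the semantic map only records which pixels are labeled `"car"`.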
Computer Science; Mathematics
Dao, Minh Duc, "An Exploration Into Image Object Detection and Image Instance Segmentation: Mask R-CNN and YOLOv5 Comparison in Image Object Detection Task" (2023). Senior Independent Study Theses. Paper 10459.
Bachelor of Arts
Senior Independent Study Thesis
© Copyright 2023 Minh Duc Dao