Submitted by somebodyenjoy t3_zc24rg in MachineLearning

I'm trying to figure out if the various versions of YOLO, such as YOLOv7 are better than the various versions of RCNN in terms of accuracy alone if speed is not much of an issue. Let's say I'm trying to detect various objects on a 2D floor plan, and I only care about accuracy.

How would a classifier that would go square by square to find the objects perform? This may not be as efficient as the standard object detection models, but would it be more accurate if I am willing to throw as much compute power as it wants for this brute force approach?

3

Comments

You must log in or register to comment.

SeucheAchat9115 t1_iyusz2k wrote

I guess on Coco the best accuracy is given by transformer networks like Swin, but I would assume your dataset is not as big as coco, therefore transformers might not generalize well.

4

somebodyenjoy OP t1_iyv5zlq wrote

Maybe in terms of speed, but what about accuracy? Wouldn’t it make sense that a classifier going around the image would be more accurate? Is there any research or articles comparing the modern algorithms to sliding windows

0

somebodyenjoy OP t1_iyv79r7 wrote

I understand, I was asking if we use something like an alexnet and train it on a specific object, like a dog or not detector. Then make this detector go around the entire image in a brute-force manner, would that be more accurate than the object detector models right now

0

bernhard-lehner t1_iywb1n5 wrote

if compute doesn't seem to be an issue, why not try what works best on your data?

2