Submitted by somebodyenjoy t3_zc24rg in MachineLearning

I'm trying to figure out whether the various versions of YOLO, such as YOLOv7, are better than the various versions of R-CNN in terms of accuracy alone, assuming speed is not much of an issue. Let's say I'm trying to detect various objects on a 2D floor plan, and I only care about accuracy.

How would a classifier that goes square by square over the image to find the objects perform? This may not be as efficient as the standard object detection models, but would it be more accurate if I am willing to throw as much compute power at this brute-force approach as it needs?

3

Comments

SeucheAchat9115 t1_iyusz2k wrote

I guess on COCO the best accuracy is given by transformer networks like Swin, but I would assume your dataset is not as big as COCO, so transformers might not generalize well.

4

somebodyenjoy OP t1_iyuz8tx wrote

In this case, what would be the better option?

1

SeucheAchat9115 t1_iyuzb7c wrote

I guess YOLOv7 is a good choice, but depending on your institute, be aware of the license of the code.

2

somebodyenjoy OP t1_iyv3jls wrote

What would be the accuracy of the brute-force approach, i.e. the sliding-window approach? Would the accuracy be better than all the others?

0

SeucheAchat9115 t1_iyv5t91 wrote

Sliding-window approaches are "conventional" image processing techniques, which are not competitive anymore nowadays.

3

somebodyenjoy OP t1_iyv5zlq wrote

Maybe in terms of speed, but what about accuracy? Wouldn't it make sense that a classifier going around the image would be more accurate? Is there any research or are there any articles comparing the modern algorithms to sliding windows?

0

SeucheAchat9115 t1_iyv631b wrote

Deep learning classifiers based on convolutions also go around the whole image. And sliding-window approaches are not competitive anymore in terms of accuracy either.

2

somebodyenjoy OP t1_iyv79r7 wrote

I understand. I was asking: if we take something like AlexNet and train it on a specific object, e.g. as a dog-or-not classifier, and then make this classifier go around the entire image in a brute-force manner, would that be more accurate than the object detection models we have right now?
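To make the brute-force idea concrete, here is a minimal sketch of what I mean, not a real detector. It assumes a single fixed window size, and `toy_classifier` is a hypothetical stand-in (it just returns the mean brightness of the patch) for a trained binary classifier such as the dog-or-not network; the names `sliding_windows` and `detect` are made up for illustration.

```python
import numpy as np

def sliding_windows(image, window, stride):
    """Yield (x, y, patch) for every window position that fits in the image."""
    h, w = image.shape[:2]
    for y in range(0, h - window + 1, stride):
        for x in range(0, w - window + 1, stride):
            yield x, y, image[y:y + window, x:x + window]

def detect(image, classify, window=32, stride=16, threshold=0.5):
    """Score every window with `classify`; keep boxes at or above `threshold`."""
    boxes = []
    for x, y, patch in sliding_windows(image, window, stride):
        score = classify(patch)
        if score >= threshold:
            # Box format: (x_min, y_min, x_max, y_max, score)
            boxes.append((x, y, x + window, y + window, score))
    return boxes

def toy_classifier(patch):
    """Hypothetical placeholder for a trained classifier: mean brightness."""
    return float(patch.mean())

# Synthetic 64x64 image with one bright 32x32 "object" in the middle.
image = np.zeros((64, 64), dtype=np.float32)
image[16:48, 16:48] = 1.0

detections = detect(image, toy_classifier, window=32, stride=16, threshold=0.9)
print(detections)
```

With one window size this only finds objects at roughly that scale, so a real version would have to repeat the scan over several window sizes and merge overlapping boxes, and the number of classifier calls grows with every extra scale and smaller stride.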

0

SeucheAchat9115 t1_iyv7fcl wrote

No, because the object detector can solve the problem in a single forward pass. Today's deep-learning-based object detectors like YOLO or R-CNN + Swin are very good choices for a detection task.

1

somebodyenjoy OP t1_iyv8ila wrote

You mean to say they can do better in terms of accuracy even though they detect in a single forward pass?

1

SeucheAchat9115 t1_iyv8t0s wrote

Yes, because deep learning is way better than conventional methods.

1

killver t1_iyv1zfo wrote

EfficientDet if you care about license.

1

bernhard-lehner t1_iywb1n5 wrote

If compute doesn't seem to be an issue, why not try what works best on your data?

2