Are there any pre-trained machine learning models out there that can detect any object in an image without me having to train them first?

I come across a few object detection libraries that require me to train it to a date set. Then it will only be able to detect the presence of those objects.

I want a machine learning model where I can feed is any image and it returns me name of objects present in that image. I need it for one of my projects.

Any help will be appreciated.

Thanks!

Comments

You must log in or register to comment.

smsorin t1_iwygv07 wrote on November 19, 2022 at 9:04 AM

#576,669

No, and if there was one you wouldn't want it.

The reason is that with a model, any model, you only get guarantees for inputs that match the training data distribution. If the model has never seen a cogwheel before there's no telling what it will do.

If you find one, there's no promise how well it will do on your task. It might work, might work but only half the time or it might just give you wrong answers. To know this, you will need a dataset with your images and the desired labels. If you have this, you might as well make it a bit larger and train the model on part of it.

You will get better results if you start from a model that already was trained on data similar to your, or start with a pretrained model and tune it on your data.

PegasusInvasion OP t1_iwyhe5f wrote on November 19, 2022 at 9:12 AM

#576,702

Replying to smsorin (#576,669)

Thanks for the explanation.

I actually wanted to use a machine learning model to sort my pictures folder based on the objects in the image.

For example, I have 10 images of cars and 5 images of buildings. I wanted to move images of cars and buildings into separate folders.

Do you have any idea, how I can achieve that?

sudhanv99 t1_iwyhvfj wrote on November 19, 2022 at 9:19 AM

#576,738

Replying to PegasusInvasion (#576,702)

it is doable but you have to know what images are in your folder. you can find models trained on coco dataset which can do this. but your images have to be in the coco dataset.

you can write a script which will move your files based on the predictions after that.

PegasusInvasion OP t1_iwyj8hb wrote on November 19, 2022 at 9:40 AM

#576,819

Replying to sudhanv99 (#576,738)

I was trying to avoid having to train on a coco dataset 🫤 Anyway, thanks for your help!

sudhanv99 t1_iwyjd7e wrote on November 19, 2022 at 9:42 AM

#576,823

Replying to PegasusInvasion (#576,819)

you dont have to train. there are pretrained models, you can just run inference.

PegasusInvasion OP t1_iwyjhlh wrote on November 19, 2022 at 9:44 AM

#576,829

Replying to sudhanv99 (#576,823)

Sorry I misunderstood what you meant by coco dataset. I get it now. Do you know any good models that I can try?

sudhanv99 t1_iwyjrr9 wrote on November 19, 2022 at 9:48 AM

#576,841

Replying to PegasusInvasion (#576,829)

try yolo, that ll be simple for your application.

who_ate_my_motorbike t1_iwyjwir wrote on November 19, 2022 at 9:50 AM

#576,850

How about this one? https://huggingface.co/facebook/detr-resnet-50-panoptic

You can drag an image into it in the browser to try it out. You get a Json output with recognised objects.

PegasusInvasion OP t1_iwyk2ch wrote on November 19, 2022 at 9:52 AM

#576,858

Replying to sudhanv99 (#576,841)

I will try that. Thanks a ton man!

PegasusInvasion OP t1_iwyk4vy wrote on November 19, 2022 at 9:53 AM

#576,860

Replying to who_ate_my_motorbike (#576,850)

Thanks, I will give it a try 🙌

WilliamWithering t1_iwylm4a wrote on November 19, 2022 at 10:16 AM

#576,933

You might want to take a look at DETIC : https://github.com/facebookresearch/Detic (Open Vocabulary Object Detection, trained on thousands of classes)

skaag t1_iwyntwj wrote on November 19, 2022 at 10:49 AM

#577,059

Check out YOLO

teenaxta t1_iwyr26s wrote on November 19, 2022 at 11:35 AM

#577,251

you can try YOLO

PegasusInvasion OP t1_iwyr5ma wrote on November 19, 2022 at 11:37 AM

#577,258

Replying to teenaxta (#577,251)

Thanks for sending the link!

IntelligenXia t1_iwyyhyn wrote on November 19, 2022 at 1:06 PM

#577,716

Detectron2 ( Facebook ) or for ease of use - Yolo v5 onwards

eg : For Yolo v5 based on how complex you want your inference to run, change the yaml file .

Here coco.yaml has around 80 objects

The model on Imagenet has around 1000 classes . That should be more than enough for anything you want to do !

Here are the 80 classes from coco.yaml file

0: person 1: bicycle 2: car 3: motorcycle 4: airplane 5: bus 6: train 7: truck 8: boat 9: traffic light 10: fire hydrant 11: stop sign 12: parking meter 13: bench 14: bird 15: cat 16: dog 17: horse 18: sheep 19: cow 20: elephant 21: bear 22: zebra 23: giraffe 24: backpack 25: umbrella 26: handbag 27: tie 28: suitcase 29: frisbee 30: skis 31: snowboard 32: sports ball 33: kite 34: baseball bat 35: baseball glove 36: skateboard 37: surfboard 38: tennis racket 39: bottle 40: wine glass 41: cup 42: fork 43: knife 44: spoon 45: bowl 46: banana 47: apple 48: sandwich 49: orange 50: broccoli 51: carrot 52: hot dog 53: pizza 54: donut 55: cake 56: chair 57: couch 58: potted plant 59: bed 60: dining table 61: toilet 62: tv 63: laptop 64: mouse 65: remote 66: keyboard 67: cell phone 68: microwave 69: oven 70: toaster 71: sink 72: refrigerator 73: book 74: clock 75: vase 76: scissors 77: teddy bear 78: hair drier 79: toothbrush

A slight change in the inference code is to needed to write into a csv file , in your case

Photo 1 | Car |

Photo 2 | Car | Person

...

etc

blimpyway t1_iwz31ut wrote on November 19, 2022 at 1:52 PM

#578,055

And how would an image without any object look like?

MaoamWins t1_iwz3eev wrote on November 19, 2022 at 1:55 PM

#578,083

YOLOv6 or YOLOv7 might do the job just fine.

https://github.com/meituan/YOLOv6

https://github.com/WongKinYiu/yolov7