Anaphylaxisofevil t1_ispq1tl wrote

In the most general sense, tracking is the same as detection, but with priors based on a prediction of what you expect to see given past history. So tracking requires an image sequence, whereas detection needs only a single timestep. Tracking can potentially be faster and more accurate, because the extra information limits your search space, but it is also reliant on the quality of your prediction: bad predictions mean tracking failures.
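To make the "limits your search space" point concrete, here is a minimal sketch. It assumes a constant-velocity motion model and axis-aligned boxes; `Box`, `predict_next`, `search_window` and the `margin` parameter are hypothetical names for illustration, not part of any particular library.

```python
from dataclasses import dataclass


@dataclass
class Box:
    x: float  # left edge, in pixels
    y: float  # top edge, in pixels
    w: float  # width
    h: float  # height


def predict_next(prev: Box, curr: Box) -> Box:
    """Constant-velocity guess (an assumption): repeat the last frame-to-frame shift."""
    return Box(curr.x + (curr.x - prev.x), curr.y + (curr.y - prev.y), curr.w, curr.h)


def search_window(pred: Box, margin: float, img_w: int, img_h: int) -> Box:
    """Grow the predicted box by `margin` (a fraction of its size) and clip to the image."""
    mx, my = pred.w * margin, pred.h * margin
    x0 = max(0.0, pred.x - mx)
    y0 = max(0.0, pred.y - my)
    x1 = min(float(img_w), pred.x + pred.w + mx)
    y1 = min(float(img_h), pred.y + pred.h + my)
    return Box(x0, y0, max(0.0, x1 - x0), max(0.0, y1 - y0))


# Two past boxes give a prediction; the detector then only needs to look here.
pred = predict_next(Box(100, 100, 50, 50), Box(110, 105, 50, 50))
window = search_window(pred, margin=0.5, img_w=640, img_h=480)
fraction = (window.w * window.h) / (640 * 480)  # share of the frame to search
```

With these made-up numbers the window covers only a few percent of a 640x480 frame, which is where the potential speed-up comes from. If the object moves outside the window, the detector never sees it, which is the "bad predictions mean tracking failures" case.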

I'm not completely sure if this is the level of answer you're looking for though.

Dense-Smf-6032 OP t1_ispqyw8 wrote

I see. How do I get video tracking to do fast inference (if I don't want to run the detector on every frame)?

Anaphylaxisofevil t1_isptitd wrote

It really depends on adapting (and training) your per-frame detector to incorporate prediction priors, then devising a method for generating those priors from previous frames' output. I'm not familiar enough with the specifics of your particular problem to add much more, I'm afraid.
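As a rough illustration of the second half (making predictions from previous frames' output), here is a simpler variant that needs no retraining: run the detector only on every N-th frame and extrapolate in between. This is a sketch under assumptions, not the adapted-detector approach described above. It assumes a single object, a constant-velocity model, and a hypothetical `detect` callable that returns an `(x, y)` position or `None`.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Point = Tuple[float, float]


def track_with_keyframes(
    frames: Sequence[object],
    detect: Callable[[object], Optional[Point]],  # hypothetical per-frame detector
    every: int = 5,
) -> List[Optional[Point]]:
    """Run `detect` on every `every`-th frame and extrapolate positions in between."""
    out: List[Optional[Point]] = []
    last: Optional[Point] = None  # most recent detected position
    last_idx = 0                  # frame index of that detection
    vel: Point = (0.0, 0.0)       # estimated motion per frame
    for i, frame in enumerate(frames):
        if i % every == 0:
            found = detect(frame)
            if found is not None:
                if last is not None:
                    dt = i - last_idx
                    vel = ((found[0] - last[0]) / dt, (found[1] - last[1]) / dt)
                last, last_idx = found, i
                out.append(found)
                continue
        # Not a keyframe (or the detector missed): fall back to the prediction.
        if last is None:
            out.append(None)
        else:
            dt = i - last_idx
            out.append((last[0] + vel[0] * dt, last[1] + vel[1] * dt))
    return out
```

The detector cost drops by roughly a factor of `every`, at the price of drift between keyframes whenever the motion is not close to constant.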
