Submitted by ChaosAdm t3_yhmlen in MachineLearning
I am trying to follow the steps of database creation that was followed in this research paper.
In the paper, they have stated they use sequential data as input for the model they are developing, wherein 20 frames are taken before every point-of-impact in a tennis match.
How can I go about doing that? Anyone could explain in as easy words as possible or guide me to an appropriate medium?
Quoting the paper:
>For this dataset, we used a tennis match video (1080 × 720 pixels, 25 fps) of a professional tennis match uploaded to YouTube. The video is input, making the players’ rectangular images of the time of impact of each player to 20 frames before impact
michelin_chalupa t1_iuekl9p wrote
The simple way would be to just annotate those impact frames. A more sophisticated way might involve tracking the ball, and annotating those frames where it’s estimated velocity is low (which will of course be noisy, depending on the angle of the camera wrt it’s trajectory).
If it were me, I’d just hunker down for an afternoon and manually annotate those impact frames.