Submitted by ChaosAdm t3_yhmlen in MachineLearning
I am trying to follow the steps of database creation that was followed in this research paper.
In the paper, they have stated they use sequential data as input for the model they are developing, wherein 20 frames are taken before every point-of-impact in a tennis match.
How can I go about doing that? Anyone could explain in as easy words as possible or guide me to an appropriate medium?
Quoting the paper:
>For this dataset, we used a tennis match video (1080 × 720 pixels, 25 fps) of a professional tennis match uploaded to YouTube. The video is input, making the players’ rectangular images of the time of impact of each player to 20 frames before impact
dual_carriageway t1_iueqojt wrote
Another option could be tracking the audio(if there is any) - the ball hitting the racket should make a similar ish noise each time and you may be able to automate detecting impacts using that