
mayiSLYTHERINyourbed t1_iu3dafc wrote

On a regular basis. We care, down to the millisecond, about how fast inference or training is. In my last organisation we had to process around 200k images at inference time. At that scale, even a 2ms per-image delay would cost 6.7 minutes just to get the feature vectors, which really matters.
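The arithmetic behind that figure is easy to check: a fixed per-image delay scales linearly with the dataset. A minimal sketch (the helper function name is mine, not from the comment):

```python
# Back-of-envelope check: a small fixed per-image delay, multiplied
# across a large batch job, adds up to minutes of extra wall-clock time.

def extra_latency_minutes(n_images: int, delay_ms: float) -> float:
    """Total added time, in minutes, from a fixed per-image delay."""
    return n_images * delay_ms / 1000 / 60

# The numbers quoted above: 200k images, 2 ms extra each.
print(round(extra_latency_minutes(200_000, 2.0), 1))  # → 6.7
```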

3

GPUaccelerated OP t1_iu4ve0v wrote

OK right. That's also a project with immense scale.

I guess the bigger the project, the more inference speed matters. But I've never heard of caring deeply about milliseconds in training. Mind sharing why that was important in that use case?

1

mayiSLYTHERINyourbed t1_iu7im0x wrote

Our use case was in biometrics, where the test set would usually run to millions of images that needed to be matched simultaneously. There, even accumulating 2-3ms per batch would lead to a huge delay.
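To see how a per-batch overhead compounds at that scale, here is an illustrative sketch. The gallery size, batch size, and helper name are assumed values for the example, not figures from the comment:

```python
# How a small fixed per-batch delay accumulates over a full pass
# through a large gallery of images.

def accumulated_delay_s(n_samples: int, batch_size: int,
                        per_batch_delay_ms: float) -> float:
    """Total extra seconds from a fixed per-batch delay over one pass."""
    n_batches = -(-n_samples // batch_size)  # ceiling division
    return n_batches * per_batch_delay_ms / 1000

# Hypothetical numbers: 5M images, batch size 256, 3 ms extra per batch.
print(accumulated_delay_s(5_000_000, 256, 3.0))  # → 58.596
```

Roughly a minute of pure overhead per pass, and that repeats on every matching run, which is why those milliseconds get counted.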

2