Viewing a single comment thread. View all comments

seek_it t1_iu1t7d4 wrote

Can I also use this project to improve inference time of projects like yolov5, etc.?

2

pommedeterresautee OP t1_iu2vg8y wrote

Right now the kernels cover linear layer, attention, and layer norm / rms norm. So the effect would be limited outside a transformer or assimilated. However we will increase the number of kernels, but convolution is not right now our priority

3