pommedeterresautee OP t1_iuh0y8u wrote
Reply to comment by fakesoicansayshit in [P] Up to 12X faster GPU inference on Bert, T5 and other transformers with OpenAI Triton kernels by pommedeterresautee
I think so, but I have not tried it. It would require writing search / replace patterns.