Submitted by fxmarty t3_z1titt in MachineLearning
killver t1_ixiah49 wrote
Reply to comment by fxmarty in [P] BetterTransformer: PyTorch-native free-lunch speedups for Transformer-based models by fxmarty
Thanks a lot for all these replies. I have one more question, if you do not mind: I sometimes use Hugging Face models as a backbone in my own model definitions. How would I go about applying the transformer-based quantization to the backbone only? The quantization is usually called on the full model, but if my full model is already in ONNX format, that gets complicated.