Viewing a single comment thread. View all comments

incrapnito t1_jdqamby wrote

I think you are looking for federated learning which is complete research field on its own. It digs into combining weights of two neural networks such that both tasks can still be accomplished. Existing approaches should apply to transformers too.

−2