Submitted by seraphaplaca2 t3_122fj05 in MachineLearning
Over the last few days I've been thinking about something, and I don't know whether it's possible or has already been done somewhere. Can you merge the weights of two transformer models, the way people merge Stable Diffusion models? For example, could I merge BioBERT and LegalBERT and get a model that can do both?
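To make concrete what I mean by "merging": as far as I understand it, Stable Diffusion checkpoint merging is basically a weighted average of the two models' parameters, name by name. Here is a minimal sketch of that idea. The function name `merge_state_dicts` and the `alpha` parameter are just my own placeholders, the "checkpoints" are toy dicts of plain float lists rather than real tensors, and it assumes both models have exactly the same architecture and parameter names. Whether the averaged model would actually be any good at both domains is exactly what I'm asking.

```python
def merge_state_dicts(sd_a, sd_b, alpha=0.5):
    """Linearly interpolate two parameter dicts: alpha * a + (1 - alpha) * b.

    Hypothetical helper for illustration; real checkpoints would hold
    tensors, but the arithmetic per parameter is the same.
    """
    # Merging only makes sense if both models expose the same parameters.
    if sd_a.keys() != sd_b.keys():
        raise ValueError("models must have identical parameter names")

    merged = {}
    for name, weights_a in sd_a.items():
        weights_b = sd_b[name]
        if len(weights_a) != len(weights_b):
            raise ValueError(f"shape mismatch for parameter {name!r}")
        # Element-wise weighted average of the two parameter vectors.
        merged[name] = [
            alpha * a + (1 - alpha) * b
            for a, b in zip(weights_a, weights_b)
        ]
    return merged


# Toy stand-ins for two fine-tuned checkpoints (made-up numbers).
bio = {"encoder.weight": [1.0, 2.0], "encoder.bias": [0.0]}
legal = {"encoder.weight": [3.0, 4.0], "encoder.bias": [1.0]}

merged = merge_state_dicts(bio, legal, alpha=0.5)
print(merged)
```

With `alpha=0.5` this is a plain average; moving `alpha` toward 1.0 keeps the result closer to the first model.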
[deleted] t1_jdq3nn5 wrote
[deleted]