Created by: guillaumekln
Patch Description This PR adds the CTranslate2 integration in the README. CTranslate2 is a fast inference engine for Transformer models, including OPT models.
Testing steps No tests are needed.
Created by: guillaumekln
Patch Description This PR adds the CTranslate2 integration in the README. CTranslate2 is a fast inference engine for Transformer models, including OPT models.
Testing steps No tests are needed.