Viewing a single comment thread. View all comments

phys_user t1_jbw7i59 wrote

Looks like text-embedding-ada-002 is already on the MTEB leaderboard! It comes in at #4 overall, and has the highest performance for clustering.

You might also want to look into SentEval, which can help you test the embedding performance on a variety of tasks: https://github.com/facebookresearch/SentEval

3

vintage2019 t1_jbzzadd wrote

Has anyone ranked models with that and published the results?

1