Viewing a single comment thread. View all comments

currentscurrents t1_j9nqcno wrote

What are you trying to do? Most of the cool features of language models only emerge at much larger scales.

5

Seankala OP t1_j9nqmf5 wrote

That's true for all of the models. I don't really need anything cool though, all I need is a solid model that can perform simple tasks like text classification or NER well.

4

Friktion t1_j9oxnz6 wrote

I have some experience with FastText for e-commerce product classification. Its super lightweight and performs well as a MVP.

5

cantfindaname2take t1_j9qov0f wrote

For simple NER tasks some simpler models might work too,like conditional random fields. The crfsuite package has a very easy to use implementation of it and it is using a C lib under the hood for the model training.

1