Desticheq
Desticheq t1_j9xiv9l wrote
Reply to comment by theLastNenUser in [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints? by johnhopiler
Well, in terms of "out-of-the-box," I'm not sure what else could be better. AWS, Azure, or Google basically provide empty units, and you'd have to configure all the "Ops" stuff yourself: networking, security, load balancing, etc. That's not that difficult if you do it once in a while, but for a "test-it-and-forget-it" project it might be too much overhead.
Desticheq t1_j9qo0mu wrote
Reply to [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints? by johnhopiler
Hugging Face actually offers a fairly easy deployment process for models trained with their framework.
Desticheq t1_j76ad08 wrote
Reply to comment by sponsored-by-potato in Information Retrieval book recommendations? [D] by Ggronne
RemindMe! 1 week
Desticheq t1_jd9c1fe wrote
Reply to [D] Do you have a free and unlimited chat that specializes only in teaching programming or computing in general? by Carrasco_Santo
I'm looking to apply PEFT (parameter-efficient fine-tuning) to an LLM for my Regis AI extension, which works on top of LeetCode. While GPT is fine for hints and general conversation, there are other applications, like code improvement or complexity estimation, where I might benefit from a customized model.