Desticheq t1_jd9c1fe wrote on March 22, 2023 at 7:31 PM

Reply to [D] Do you have a free and unlimited chat that specializes only in teaching programming or computing in general? by Carrasco_Santo

I'm looking to apply PEFT (parameter-efficient fine-tuning) to an LLM for use in my Regis AI extension, which works on top of LeetCode. While GPT is fine for hints and general conversation, there are other applications, like code improvement or complexity estimation, where I might benefit from a customized model.

Desticheq t1_j9xiv9l wrote on February 25, 2023 at 7:20 AM

Reply to comment by theLastNenUser in [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints? by johnhopiler

Well, in terms of "out-of-the-box," I'm not sure what else could be better. AWS, Azure, or Google basically provide bare machines, and you'd have to configure all the "Ops" stuff yourself: networking, security, load balancing, etc. That's not that difficult if you do it once in a while, but for a "test-it-and-forget-it" project it might be too much work.

Desticheq t1_j9qo0mu wrote on February 23, 2023 at 9:45 PM

Reply to [P] What are the latest "out of the box solutions" for deploying the very large LLMs as API endpoints? by johnhopiler

Hugging Face actually offers a fairly easy deployment process for models trained with their framework.

Desticheq t1_j76ad08 wrote on February 4, 2023 at 11:57 AM

Reply to comment by sponsored-by-potato in Information Retrieval book recommendations? [D] by Ggronne

RemindMe! 1 week