kross00
kross00 t1_jdjg4gq wrote
Reply to comment by LeN3rd in [D] Simple Questions Thread by AutoModerator
Do you know which datasets they use?
kross00 t1_jdgutr0 wrote
Reply to [D] Simple Questions Thread by AutoModerator
Is it feasible to train Llama 65B (or smaller models) to engage in chit-chatting in a manner that would not readily reveal whether one is conversing with an AI or a human? The AI does not need to answer highly complex questions and could decline them similarly to how a human would.
kross00 t1_jdaw9n6 wrote
Reply to [P] One of the best ChatGPT-like models (possibly better than OpenAssistant, Stanford Alpaca, ChatGLM and others) by [deleted]
Hey, did you used a custom dataset or a public one?
kross00 t1_jczd3i2 wrote
Reply to comment by Civil_Collection7267 in [D] Best ChatBot that can be run locally? by rustymonster2000
I’m having a hard time understanding what LoRA is and why it makes the 7B model better? I thought it only improves hardware requirements, but it also improves model coherency? This is all new for me
kross00 t1_jcre2hi wrote
Reply to [P] The next generation of Stanford Alpaca by [deleted]
I'm a newbie... but maybe take a look at this model: https://github.com/BlinkDL/RWKV-LM
kross00 t1_jcnr2yo wrote
Reply to [P] ControlNetInpaint: No extra training and you can use 📝text +🌌image + 😷mask to generate new images. by mikonvergence
How is it different from the inpain already built in controlnet?
kross00 t1_jcdmgl4 wrote
Reply to [P] ControlNetInpaint: No extra training and you can use 📝text +🌌image + 😷mask to generate new images. by mikonvergence
Do you plan on releasing an Automatic1111 plugin for this?
kross00 t1_jdui6ot wrote
Reply to [D] Simple Questions Thread by AutoModerator
Can AlphaTensor be utilized to solve math problems beyond matrix multiplication algorithms?