rikiiyer
rikiiyer t1_izew1lv wrote
Reply to [D] We're the Meta AI research team behind CICERO, the first AI agent to achieve human-level performance in the game Diplomacy. We’ll be answering your questions on December 8th starting at 10am PT. Ask us anything! by MetaAI_Official
I listened to Noam’s conversation with Lex Friedman the other day and he made the point that the model had to learn human like tendencies in order to work with humans to win at Diplomacy. Do you think it would be possible to use these learned features to somehow teach other models how to act more human-like?
rikiiyer t1_is07xlk wrote
Reply to [D] Career advice: Can one make a career in building machine learning models and then selling the IP for them? by likeamanyfacedgod
What you’re describing sounds like an AutoML tool, of which dozens already exist. Various companies offer a subscription service to access an API for their tool, and charge by usage/time. Maybe I’m missing something but I don’t see why companies would lease IP rather than just use a service. Like AutoML software is typically used by companies who have very few ML experts on their team so the API makes it easy for engineers to integrate ML into their products, while companies with many ML experts usually just build their own AutoML tools
rikiiyer t1_jddanig wrote
Reply to comment by djmaxm in [D] Simple Questions Thread by AutoModerator
The 30B params of the model are going onto your GPUs VRAM (which should be 24GB), which is causing the issue. You can try loading the model in 8bit which could reduce size