
genshiryoku t1_ja7pugt wrote

I agree with this. I'm a middle-aged engineer and, believe it or not, there used to be a time when assembly was considered "automation of programming".

Before assembly you had to hot-wire individual 1s and 0s into the hardware to program it, which was a labor-intensive job. You had to memorize the instruction and data sequences as strings of 1s and 0s.

Then assembly came along and suddenly a lot of that work was simplified to writing a mnemonic that was equivalent to those instructions.

Then there was another big paradigm shift with "high level languages" like C and C compilers.

Essentially, ever since C and other compiled languages existed, most people haven't truly programmed anymore, because you're just communicating to a computer program (the compiler) what that program should actually program for you.

The C/C++ or Python code you're writing today? That's not actually programming. It's just you telling the computer what it should program for you.
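To make that ladder concrete, here's a rough sketch of the same one-line addition at each level. It's my illustration only: the assembly and machine-code bytes are approximate x86-64 forms, not the exact output of any particular compiler.

```c
/* The same one-line addition, seen from each rung of the ladder.
   The assembly and byte encodings below are approximate x86-64
   (System V) forms, for illustration only. */
int add(int a, int b) {
    /* C (what you write today):      return a + b;                  */
    /* assembly (what a compiler      lea eax, [rdi + rsi]           */
    /* might emit):                   ret                            */
    /* machine code, in hex:          8D 04 37   C3                  */
    /* the hand-wired era, in bits:   10001101 00000100 00110111 ... */
    return a + b;
}
```

Each rung is just a more convenient way of telling the machine the same thing, which is the whole point.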

In a way, ChatGPT and other systems like it are just a newer, higher-level programming language: you're still communicating to the computer what it needs to program, just in a more intuitive, human way.

I don't think the job of programmer is going to go away at all, just like assembly didn't crash the occupation and C didn't crash it either. It's just yet another layer of abstraction on top.

As an old-school kind of guy I have to admit that I liked writing assembly more than C and I like C more than Python. And yet again I like Python more than typing into ChatGPT. But this is how software development has always been. You adapt to the new developments, you specialize into a very specific niche, or you exit the labor market and become a hobbyist.

Young people have too much anxiety about these things because the last ~15 years have been relatively stagnant in terms of big paradigm shifts within programming.

Big shifts like this used to happen every 2-3 years.

11

genshiryoku t1_ja6uw8w wrote

Not for Japanese. Because of how Japanese works, it's essentially impossible to translate it into English without having the full context, and that context isn't embedded within the language itself but conveyed through circumstance. Japanese routinely drops subjects and objects, so a sentence like 「行きました」 can mean "I went", "she went" or "they went" depending entirely on the situation. That's why AI models basically can't translate it properly: they tend to hallucinate the missing context and get it wrong.

3

genshiryoku t1_ja31syb wrote

As a Japanese person who speaks English, I agree; it's funny how extremely bad even the best AI tools are right now at translating Japanese into English. English to Japanese is a bit better but still not very good.

I recognize that it needs AGI to properly translate Japanese into English, because Japanese leaves so much context unstated that current AIs basically just "hallucinate" the missing context, like how ChatGPT bullshits code when it doesn't know what to do.

10

genshiryoku t1_ja2pugm wrote

I disagree with this, especially given the popularity of YouTube and TikTok, where everyone has a completely different video feed based on their own interests.

I think the recommendation engine just generating the media you want to watch is the clear next step and something that traditional media can't compete with.

I think you wanting to connect with others over shared media consumption is just a sign of our generation and not shared by Gen Z in the same way.

1

genshiryoku t1_j9svy3v wrote

No, the reason the median prediction barely moved is that we still have the exact same bottlenecks and issues on the path to AGI, and those haven't been solved over the past 6 years. So while we have made great strides scaling up transformers, and specifically large language models that display emergent properties, the real issue is still lurking behind the scenes.

The main issue and bottleneck is training data: we're rapidly running out of usable data on the internet, with the biggest models already being trained on 30% of all relevant data out there. If rates continue like this we might run out of usable data between 2025 and 2027.
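As a toy back-of-envelope (the ~30% figure is from above; the assumption that data appetite roughly doubles per model generation is mine, not a hard number):

```c
/* Toy projection of when training-data demand outgrows the supply of
   usable internet text. Assumes ~30% is consumed today and that demand
   roughly doubles per model generation (about one generation per year). */
#include <stdio.h>

int main(void) {
    double used = 0.30;              /* fraction of usable data consumed now */
    int year = 2023;
    while (used < 1.0) {
        printf("%d: ~%.0f%% of usable data consumed\n", year, used * 100);
        used *= 2.0;                 /* assumed doubling of data demand */
        year++;
    }
    printf("%d: demand exceeds the usable supply\n", year);
    return 0;
}
/* Prints 2023 (~30%) and 2024 (~60%), then crosses 100% in 2025; with a
   slower growth assumption the crossing lands in 2026-2027 instead. */
```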

We know we can't use synthetic or AI-generated data to train models on because of the overfitting problem that introduces. So we essentially need to either find some way to generate orders of magnitude more data (an extremely hard problem, if not outright impossible), or make breakthroughs in AI architecture so that models can be trained on less data (still a hard problem, and linear in nature).

The massive progress we're seeing right now is simply scaling models up and training them on more data, but once the data stops flowing these models will rapidly stagnate and we will enter a new AI winter.

This is why the median prediction barely changed. We'd need to solve these fundamental bottlenecks and issues before we'll be able to achieve AGI.

The outlier possibility of AGI already emerging before we run out of training data over the next 2-4 years is of course also a slight possibility.

So essentially, while the current progress and models are very cool and surprising, they are still within the realm of expected growth; nobody expected the AI boom to slow down before the training data ran out. What we're dreading is 2-4 years from now, when all usable internet data has essentially been exploited already.

8

genshiryoku t1_j6ahc38 wrote

I think the next 5 years will be a period of explosive AI progress, but sudden and rapid stagnation will follow, and then an AI winter.

The reason I think this is because we're rapidly running out of training data as bigger and bigger models essentially get trained on all the available data on the internet. After that data is used up there will be nothing new for bigger models to train on.

Since hardware is already stagnating and the data will be running out, the only way to make progress would be breakthroughs on the AI architecture front, which is going to be linear in nature again.

I'm a Computer Scientist by trade and while I work with AI systems on a daily basis and keep up with AI papers I'm not an AI expert so I could be wrong on this front.

13

genshiryoku t1_j6a85jx wrote

Because Moore's Law largely stopped paying off around ~2005, when Dennard scaling stopped being a thing. That's why clock speeds have hovered around 4-5 GHz for almost 20 years now.

We've started coping by going parallel with multi-core systems, but due to Amdahl's Law there are diminishing returns to adding more cores to your system.
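That diminishing return is just Amdahl's Law. A quick sketch with illustrative numbers (the 95%-parallel workload is an example of mine, not a measurement):

```c
/* Amdahl's Law: with parallel fraction p, n cores give a speedup of
   S(n) = 1 / ((1 - p) + p / n), capped at 1 / (1 - p) no matter how
   many cores you add. */
#include <stdio.h>

static double amdahl_speedup(double p, int n) {
    return 1.0 / ((1.0 - p) + p / (double)n);
}

int main(void) {
    const double p = 0.95;                 /* 95% of the work parallelizes */
    const int cores[] = {2, 4, 8, 64, 1024};
    for (int i = 0; i < 5; i++)
        printf("%5d cores -> %4.1fx speedup\n",
               cores[i], amdahl_speedup(p, cores[i]));
    return 0;                              /* tops out near 20x, never more */
}
```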

On the "Instructions Per Cycle" front we're only making slow linear progression similar to other non-IT industries so there's not a lot of gain to be had from this either.

The reason why 2003-2013 feels like a bigger step is because it was a bigger step than 2013-2023. At least from a hardware perspective.

The big innovation we have made, however, is using massively parallel GPU cores to accelerate machine learning on the enormous datasets that big social media sites sit on, which is what produced the current AI boom.

But yeah, you are correct in your assessment that computer technology has largely stagnated since about ~2005.

12

genshiryoku t1_j643mw8 wrote

I agree that AI models will become commodities over time, as we've already seen with Stable Diffusion essentially disrupting the entire business model of paid image generation like DALL-E and Midjourney.

I completely agree that the investment case and burn rate of these AI companies isn't worth it, and that it will play out just like the Industrial Revolution did historically: it won't be the AI companies benefiting from the creation of AI, it will be the companies that can rapidly scale up their production by using AI.

It wasn't the steam engine makers that benefited from the Industrial Revolution; it was the factories that could quickly scale up, with steam engines providing the labor.

It won't be the AI companies benefiting from AI. It will be companies that have lots of intellectual workers that can quickly scale up with AI providing intellectual labor.

I actually expect law firms, the medical field, education platforms and other almost purely intellectual businesses to benefit the most, from an economic windfall perspective.

30

genshiryoku t1_j5y03ci wrote

The problem is that AI benefits from economies of scale: bigger models reliably perform better, so the headline gains come from ever-larger training runs.

What this means is that it's a "winner-takes-all" situation: you can't compete as a smaller entity without a huge capital injection to buy the compute necessary to train large models.

The only alternative I can think of is distributed computing like SETI@home, where people volunteer their GPUs to collectively train large open-source AI models.

As cryptocurrency mining has shown us, most people won't do that on a volunteer basis, so there'd need to be some sort of financial incentive, but I wouldn't want to mix a neutral open-source AI model with perverse financial incentives like crypto.

So essentially that is not going to happen, and even StabilityAI is eventually going to have to go commercial like OpenAI to continue on its path, sadly enough.

8

genshiryoku t1_j57j6s1 wrote

It would be lower-quality data, but still usable if significantly altered. The question is: why would you do this instead of just using real data?

GPT is trained on human language; it needs real interactions to learn from, like the one we're having right now.

I'm also not saying this isn't possible. We are AGI-level intelligences and we absolutely consumed less data over our lifetimes than GPT-3 did, so we know it's possible to reach AGI with relatively little data.

My original argument was merely that it's impossible with current transformer models like GPT, and that we need another breakthrough in AI architecture to solve problems like this rather than merely scaling up current transformer models, because the training data is going to run out over the next couple of years as all of the internet gets used up.

0

genshiryoku t1_j57h1fb wrote

The "created data" is merely the AI mixing the training data in such a way that it "creates" something new. If the dataset is big enough this looks amazing and like the AI is actually creative and creating new things but from a mathematics perspective it's still just statistically somewhere in between the data it already has trained on.

Therefore it would be the same as feeding it its own data. To us it looks like completely new, genuinely usable data, which is why ChatGPT is so exciting, but for AI training purposes it's useless.

1

genshiryoku t1_j57dtsz wrote

Without going too deep into it: this is a symptom of transformer models, and my argument was about why transformer models like GPT can't keep scaling up.

It has to do with the mathematics behind training AI. Essentially, for every new piece of data the AI refines itself, but for copies of data it overcorrects, which results in inefficiency or worse performance. Synthetic data acts much the same as duplicate data: the model overcorrects and worsens its own performance.
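As a toy illustration of that overcorrection (my own example; it just estimates an average with stochastic gradient descent, it's not a real training run):

```c
/* Estimate the mean of a dataset with stochastic gradient descent.
   Duplicating one value drags the estimate toward it even though the
   duplicates carry no new information -- the same kind of
   overcorrection that duplicated or synthetic data causes in a model. */
#include <stdio.h>

static double sgd_mean(const double *x, int n, int epochs, double lr) {
    double m = 0.0;
    for (int e = 0; e < epochs; e++)
        for (int i = 0; i < n; i++)
            m += lr * (x[i] - m);   /* one gradient step toward each sample */
    return m;
}

int main(void) {
    const double clean[] = {1, 2, 3, 4, 5};             /* mean: 3.0      */
    const double duped[] = {1, 2, 3, 4, 5, 5, 5, 5};    /* "5" duplicated */
    printf("clean data -> %.2f\n", sgd_mean(clean, 5, 500, 0.01));
    printf("duped data -> %.2f\n", sgd_mean(duped, 8, 500, 0.01));
    return 0;
}
/* The estimate moves from roughly 3.0 to roughly 3.8 once the duplicates
   are added, without any genuinely new information being supplied. */
```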

If you are truly interested you can see for yourself here.

And yes, AI researchers are looking for models that can detect which data on the internet is synthetic, because it's inevitable that new data will be machine generated and can't be trained on. If we fail at that task we might even enter an "AI dark age" where models get worse and worse over time, because the internet will be filled with AI-generated garbage data that can't be trained on. That's the worst-case scenario.

4

genshiryoku t1_j56we7z wrote

The only reason Google doesn't have publicly usable models like ChatGPT is that Google rightly realizes it would cannibalize its ad-revenue-based search, which is still its core business and where most of its revenue comes from.

15

genshiryoku t1_j56btvq wrote

The problem is the total amount of data and the quality of that data. Humans using an AI like GPT-3 don't generate nearly enough data to properly train a new model, not even with decades of interaction.

The demand for training data grows roughly in step with the parameter count of a transformer model, while the returns on that data are only logarithmic. This essentially means that, mathematically, transformer models are a losing strategy and aren't going to lead to AGI unless you had an unlimited amount of training data, which we don't.
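To put rough numbers on it, here's a sketch using the published "Chinchilla" rule of thumb of roughly 20 training tokens per parameter; that ratio is an outside rule of thumb I'm borrowing for illustration, not a number from my argument above:

```c
/* Rough token budgets implied by a ~20-tokens-per-parameter rule of
   thumb (from the Chinchilla scaling work). Illustrative only. */
#include <stdio.h>

int main(void) {
    const double params_b[] = {1, 70, 500, 10000};     /* billions of parameters */
    for (int i = 0; i < 4; i++) {
        double tokens_t = 20.0 * params_b[i] / 1000.0; /* trillions of tokens */
        printf("%7.0fB params -> ~%6.2fT training tokens\n",
               params_b[i], tokens_t);
    }
    return 0;
}
/* A 10-trillion-parameter model would want on the order of 200T tokens,
   more curated text than the public internet holds by most estimates --
   hence the need for a different architecture. */
```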

We need a different architecture.

9

genshiryoku t1_j55te6y wrote

Because GPT-3 was trained on almost all publicly available data, and GPT-4 will be trained by transcribing all the video footage on the internet and feeding those transcripts in as well.

You can't scale the model up without scaling the training data with it. The bottleneck is the training data and we're running out of it.

It's not like the internet is suddenly going to 10x in size over the next couple of years, especially as population growth is slowing and most people are already connected online, so not a lot of new data is being created.

4