marr75

marr75 t1_j7ksi6o wrote

They should be. I think LLMs will totally upset how content is indexed and accessed. It's one of the easiest and lowest-stakes use cases for them, really.

Unfortunately, Google has such a huge incumbent advantage that they could produce the 5th- or 6th-best search-specialized LLM and still be the #1 search provider.

1

marr75 t1_j4da0ub wrote

Sure, there will probably be plenty of litigation in the next few years. I find it probable that these suits fail. Sorry for my imprecision on the origin and application of the four-part test. I think we'll both walk away from this holding the same opinions we came in with, so I don't care enough about this debate to keep formulating my sentences that carefully, or to continue.

0

marr75 t1_j4cuv4t wrote

I read the 30-word OP here and the Jukebox blog post, and I've read multiple analyses of Authors Guild v. Google. My best guess is that you're referring to the Jukebox post, which only references IP in the sentence:

> As generative modeling across various domains continues to advance, we are also conducting research into issues like bias and intellectual property rights

So I question whether you know what discussion you're replying to, whether you actually read the post yourself, or whether I'm just so confused that I can't trust my own reading comprehension anymore (which could happen any day now).

The multi-part fair use test at the center of Authors Guild v. Google is widely held to be applicable to AI and ML models. There are no guarantees when it comes to credible legal theories, and the winds can shift after a Supreme Court decision or two, but that's the state of the art today.

1

marr75 t1_j4ctyyh wrote

Things aren't "true"/"false" in this context, unfortunately. It is commonly held by IP and copyright lawyers to be the most credible legal theory available today. The multi-part fair use test it turned on has generally been upheld as usable in AI and machine learning scenarios.

1

marr75 t1_j3c7zik wrote

I'm not following what you're saying, but you can detect all local minima with a single function call, order them and compute their summary statistics with a second call, and come up with a threshold-based comparison for the end of the video if that's what you want.

None of this requires a machine learning model. You lost me when you mixed in "only when an ad occurs". Do you have any data that would help you train such a model? Are you just trying to detect ads? You could:

  • identify all local-minima attention drops
  • engineer features such as distance into the video, length of drop (time spent below average before and after the local minimum), and magnitude of drop
  • perform unsupervised learning, e.g. PCA/t-SNE/k-means
  • hope the "structural" features identified by unsupervised learning help you separate ads from non-ads (they might!)

Again, not a complicated system because you don't have complex features as you've described them.
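
Here's a minimal sketch of that pipeline, assuming the retention curve is a 1-D NumPy array sampled at regular intervals. The synthetic data, feature choices, and two-cluster setup below are my own assumptions, not something from your description:

    import numpy as np
    from scipy.signal import find_peaks
    from sklearn.preprocessing import StandardScaler
    from sklearn.cluster import KMeans

    # Toy attention curve for illustration only; swap in your real retention data.
    rng = np.random.default_rng(0)
    times = np.linspace(0, 600, 1200)             # 10-minute video, 0.5 s samples
    attention = 0.8 + 0.03 * rng.standard_normal(1200)
    attention[300:340] -= 0.30                    # synthetic "ad-like" drops
    attention[800:830] -= 0.25

    mean = attention.mean()
    minima, _ = find_peaks(-attention)            # local minima = peaks of the negated signal

    features = []
    for m in minima:
        # Length of drop: how long the signal stays below average around the minimum.
        left, right = m, m
        while left > 0 and attention[left] < mean:
            left -= 1
        while right < len(attention) - 1 and attention[right] < mean:
            right += 1
        features.append([
            times[m],                     # distance into the video
            times[right] - times[left],   # time spent below average around the drop
            mean - attention[m],          # magnitude of the drop
        ])
    features = np.asarray(features)

    # Unsupervised grouping of the drops; inspect the clusters by hand to see
    # whether one of them lines up with ad breaks.
    labels = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(
        StandardScaler().fit_transform(features)
    )

If one cluster's drops consistently land at plausible ad positions, you've basically already got your threshold comparison without any labels.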

Is this just a novelty project? The way you're asking about it makes me think there's a low chance of follow-through, and your questions are kind of "arguing" towards a more complicated model. Run whatever code you're capable of then, I guess. I will probably decline to give further advice if that trend of leading questions continues.

3

marr75 t1_izywb2h wrote

If they were using a custom Python pipeline for the statistical models, yeah, I could see this argument. But, like many of the Nixtla tools, the usage is roughly:

    !conda install -c conda-forge statsforecast
    from statsforecast import StatsForecast
    from statsforecast.models import AutoARIMA

    sf = StatsForecast(models=[AutoARIMA()], freq="D")
    sf.fit(train_df)               # long-format frame: unique_id, ds, y
    forecast = sf.predict(h=28)

This is a pretty common "marketing" post format from Nixtla. I think they make good tools and good points, so I'm not at all mad about it. They're providing a ready-to-use tool (StatsForecast) and making a great point about its performance and cost vs the AWS alternative. Asking for the total cost of developing and maintaining StatsForecast means you'd also have to account for the total cost and complexity of developing and maintaining Amazon Forecast...

12

marr75 t1_iymo8k3 wrote

Yeah

> Just guessing here, but

is a common US English idiom that typically means, "Obviously".

You're absolutely right, though. Just by comparing the training data to the training process and the serialized weights, you can see how clearly this should overfit. Once your model is noticeably bigger than a dictionary holding every (X, y) pair in your training data, it's very hard to avoid overfitting.
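
As a back-of-the-envelope version of that comparison (the sample count, feature count, and parameter count below are made up purely for illustration):

    # Hypothetical dataset: 10k samples, 32 float32 features plus a label each.
    n_samples, n_features = 10_000, 32
    data_bytes = n_samples * (n_features + 1) * 4

    # Hypothetical model: 5M float32 parameters (a smallish CNN).
    model_bytes = 5_000_000 * 4

    # Well above 1: the serialized weights could simply memorize the whole set.
    print(model_bytes / data_bytes)   # ~15x the size of the training data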

I volunteer with a group that develops interest and skills in science and tech for kids from historically excluded groups. I was teaching a lab on CV last month, and my best student was like, "What if I train for 20 epochs, tho? What about 30?" and the performance improved (but didn't generalize as well). He didn't understand generalization yet, so instead he looked at the improvement trend, had a lightbulb moment, and was like, "What if I train for 10,000 epochs???" I should check to see if his name is on the list of collaborators for the paper 😂

3

marr75 t1_itbnukd wrote

I edited down the flatly negative part of what I wrote above because you're engaging so sincerely to improve it. I can't imagine getting a feel for it without running a lot of queries (100 a month or 10 per hour or 1 per minute, something like this). On top of that, the job to be done here is a little suspect for me. Are there people who have a commercially viable need to get a phrase back for a description?

The two tests I wanted to try were two very specific words I can't remember. The first is one of those German multi-word compounds that means, "the problem is solved by the mere structure of the solution." I don't think that word is even in the dictionary, based on the results I was getting, and I also started to realize it was giving me back short phrases instead of words, which was disappointing. The second word means "distribution preserving"; I didn't get a chance to test it, but it's got Latin roots and I'm skeptical phraisely has it in the dictionary, too.

Overall, I was hoping the technology on display would be more powerful. I guess I'd pay $1 for either of those words.

1

marr75 t1_itblwi0 wrote

I was trying to get a feel for it and can't even remember how many queries I issued. 3-5 maybe? There was no indicator that I was using a quota (especially a quota that small) when suddenly I was told I needed to wait 224 hours for more queries.

1

marr75 t1_isxougd wrote

I read a good blog post from a guy talking about how modern IDEs encourage you to learn really weird "motions" (using PyCharm's refactor, codegen, and mid-stream code completion, for example). He wasn't saying it was bad per se, just that we should all remember the point isn't to be "good" at the IDE, it's to solve problems with the code.

I feel the same about pandas. If anything, the skill to focus on is vectorizing your operations. That's the biggest readability and performance improvement, and it's portable to dplyr, polars, etc.
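
For example (the DataFrame and column names here are hypothetical, just to show the shape of the habit):

    import pandas as pd

    df = pd.DataFrame({"price": [9.99, 4.50, 12.00], "qty": [3, 10, 1]})

    # Row-by-row: more code, slower, and harder to read at a glance.
    df["total_loop"] = [row.price * row.qty for row in df.itertuples()]

    # Vectorized: one expression that translates almost directly to polars or dplyr.
    df["total"] = df["price"] * df["qty"]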

3

marr75 t1_isxnz3w wrote

I think they're designed by very traditional engineering managers. The coding test trend gained popularity thanks to Jeff Atwood, who promoted it as an early screen for people applying for lucrative jobs they didn't actually know how to do (which is useful!). Managers kept using it as a higher and higher floor for skills, and we got the LeetCode style (a fresh bootcamper might pass FizzBuzz, but they're unlikely to have months to grind LeetCode). We've also seen an explosion of roles that code but are more responsible for the wisdom and value of their creations (the spec and visual design aren't enough, or even relevant, for a model, a Lagrangian relaxation, a recommendation engine, etc.).

A real conversation I had with another executive: "Hey, we've got that coding screener for engineers, can we whip up something similar for [name a role]?" You start combining these different forces - a desire for selectivity, a desire to lower hiring costs, more complex technology roles that have to chart some of their own spec, and plain human laziness - and you get what OP described.

1

marr75 t1_iram7mu wrote

Depends on what you mean by effective. This article summarizes and links a few quality studies.

For symptomatic Omicron infections: 2-dose ~50-60% effective, 3-dose ~70-80% effective, 4-dose ~90% effective. The death rate was not reliably calculable in these studies because it was so low for all groups. So, is 50% efficacy against symptomatic infection, and [some very high efficacy]% against death/severe illness, "effective"? It certainly slows the spread and keeps a lot of people alive. Plus, there's strong evidence you could "choose your own efficacy": if you were higher-risk or just didn't want the hassle of symptomatic Covid, you could opt for the first and second boosters.

For reference, the flu vaccine, which is becoming a better comparison now that we have vaccines against this family of coronaviruses and they have become endemic (there's a smaller and smaller population with no prior immunity), is typically 40-60% effective against the most common strains of flu each year.

3