utopiah

utopiah t1_je8vryb wrote

It's pretty cool, and thanks for providing the playground; I wouldn't have bothered without it. I think it's very valuable but also quite costly, both economically and computationally, while creating privacy risks (all your data going through OpenAI), so again, in some situations I can imagine it being quite powerful but in others an absolute no. That being said, there are other models, e.g. Alpaca or SantaCoder or BLOOM (I just posted on /r/selfhosted minutes ago about the HuggingFace/Docker announcement enabling us to run Spaces locally), that might enable us to follow the same principle, arguably with different quality, without the privacy risks. Have you considered relying on another "runtime"?
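To make the "runtime" idea concrete, here is a minimal sketch (every name in it is hypothetical, nothing here is a real API) of abstracting the completion backend so prompts could go to a locally hosted model, e.g. an Alpaca/SantaCoder/BLOOM Space running in Docker, instead of OpenAI:

```python
from typing import Callable

def make_completer(backend: Callable[[str], str]) -> Callable[[str], str]:
    """Wrap a backend so the rest of the app never knows which runtime runs."""
    def complete(prompt: str) -> str:
        return backend(prompt)
    return complete

# Stand-in "local" backend; in practice this would call a locally run
# model endpoint, so prompts never leave your machine.
def local_echo_backend(prompt: str) -> str:
    return f"[local] {prompt}"

complete = make_completer(local_echo_backend)
print(complete("summarize this document"))  # prints "[local] summarize this document"
```

Swapping `local_echo_backend` for an OpenAI-backed function (or back) is then a one-line change, which is the whole point of treating the model as a replaceable runtime.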

2

utopiah t1_je0zqae wrote

Well, I just did, so please explain why not; I'm genuinely trying to learn. I'd also be curious if you have a list of trained models compared by cost. I only saw some CO2eq order-of-magnitude equivalents but no rough price estimates, so that would help me build a better intuition, as you seem to know more about this.

That being said, the point was that you don't necessarily need to train anything from scratch or buy anything to have useful results; you can rent per hour in the cloud and refine existing work, no?

0

utopiah t1_jdh7hxy wrote

Reply to comment by sEi_ in [N] ChatGPT plugins by Singularian2501

Thanks, but that only clarifies the UX side; we don't know whether OpenAI saves them and could decide to include past sessions in some form, as context, even with the current model, do we?

1

utopiah t1_jdgu9aa wrote

Does ChatGPT actually do that currently, namely keep track of your past prompts and build a model of your tastes or values, so that "me" here is meaningful?

PS: not sure why the downvote. Is it an offensive or idiotic question?

1

utopiah t1_jbtx8iv wrote

> What we want is a model that can represent the "semantic content" or idea behind a sentence

We do, but is that what embeddings actually provide, or rather some kind of distance between items, how they might or might not relate to each other? I'm not sure that would be sufficient for most people to count as the "idea" behind a sentence, just relatedness. I'm not saying it's not useful, but I'm arguing against the semantic aspect here, at least from my understanding of that explanation.
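As a toy illustration of that distinction (the vectors below are made up; real embeddings come from a trained model), cosine similarity over embeddings only tells you how related two items are, not what either one means:

```python
import math

def cosine(a, b):
    """Cosine similarity: a relatedness score in [-1, 1], not a meaning."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

# Made-up 3-d "embeddings"; a real model would produce hundreds of dims.
emb = {
    "cat": [0.9, 0.8, 0.1],
    "dog": [0.8, 0.9, 0.2],
    "car": [0.1, 0.2, 0.9],
}

print(cosine(emb["cat"], emb["dog"]))  # high score: related
print(cosine(emb["cat"], emb["car"]))  # lower score: less related
```

The scores rank pairs by relatedness, but nothing in them says *why* cat and dog are close, which is the gap between "distance between items" and the "idea behind a sentence".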

2

utopiah t1_j4a9qq0 wrote

It's not controversial as long as you don't share it and make money with it; you are pretty much free to do whatever you want.

If you plan to publicly share the output (meaning here what's generated, not just the code and checkpoints) or a training set that's under copyright, then it's another question entirely, and if you are serious about it I recommend seeking legal advice.

1

utopiah t1_iun6a16 wrote

This is not my field but I find this question genuinely surprising.

Why would one even consider this unless prototyping from the actual jungle?

In any other situation, where you have even just a 3G connection, delegating to the cloud (or your own on-premise machines available online behind a VPN) seems much more efficient as soon as you have any inference, and even more so any training, to run.

Why do I find the question itself surprising? Well, because ML is a data-driven field, so the question can be answered with a spreadsheet. Namely, your "model" would be optimizing for faster feedback in order to learn more about your problem, with your hardware but also your time as the costs. If you spend X hours tinkering with an M1 (or M2, or even "just" a 4090) versus an A100 in a random cloud, e.g. AWS or a local OVH booting a generalist distribution like Ubuntu, versus dedicated setups like lambdalabs.com or coreweave.com, or even higher-level options like HuggingFace on their own infrastructure, then IMHO that does give you some insight.
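A minimal version of that spreadsheet, with placeholder numbers you would replace with your own hardware price, rental rate, and expected experiment hours:

```python
def total_cost(hours, hourly_rate, upfront=0.0):
    """Total cost of `hours` of experiments on one option."""
    return upfront + hours * hourly_rate

hours = 200                                     # assumed experiment time
local = total_cost(hours, 0.0, upfront=2500.0)  # e.g. buying a GPU outright
cloud = total_cost(hours, 2.0)                  # e.g. a $2/h A100 rental

print("local:", local, "cloud:", cloud)   # prints "local: 2500.0 cloud: 400.0"
print("break-even hours:", 2500.0 / 2.0)  # prints "break-even hours: 1250.0"
```

With these made-up numbers, renting wins until roughly 1250 hours of use; the point is just that the answer falls out of arithmetic, not opinion, once you plug in your own rates (ideally including your time spent on setup as a cost too).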

Everything else seems anecdotal because others might not have your workflow.

TL;DR: no, unless they are minuscule models (and of course it's fine if you use it to ssh into remote machines), but IMHO you have to figure it out yourselves, as we all have different needs.

PS: to clarify, and not to sound like an opinionated idiot: even though it's not my field, I did run and train dozens of models locally and remotely.

2