curiousshortguy

curiousshortguy t1_jaf3aab wrote

Yeah, about 2-3. You can easily shove layers of the networks on disk, and then load even larger models that don't fit in vram BUT disk i/o will make inference painfully slow.

10

curiousshortguy t1_j6ak1cj wrote

Why are you using euclidiean distance? Use cosine distances. The former cares about vector magnitue, the latter doesn't. As a general rule of thumb for comparing vector embeddings, you don't care about magnitude, at best, that typically captures document length.

Do you have more than product titles, such as product descriptions? Where do you get the user queries from? Do you use a default tokenizer for BERT?

3

curiousshortguy t1_j617zzd wrote

The keyword you want, similar to DevOps where Github plays a role as the code storage, is MLOps, and within that you want to look for data and model management and versioning. There are quite a number of companies offering various aspects of that, see for example this random infographic: https://adataanalyst.com/wp-content/uploads/2021/05/Infra-Tooling3.png

1

curiousshortguy t1_j4mt461 wrote

Think of chatGPT as a multi-task meta-learner where the prompt you give it specifies the task. It's essentially only trained on text generation (with some fine-tuning to make it more conversational). So you need to set-up a prompt to make it generate reasonable answers. It can't think or calculate, but by showing it how to generate a right answer in the prompt, it can leverage that information to give you better answers.

1

curiousshortguy t1_j1s504a wrote

> whilst some conferences are American yes, not all are

The ones OP mentions (ICML / ICLR / NeurIPS) are though.

> Blacklist is a universal term and is not offensive and trying to say so is stupid so changing the language to appease a small minority of people is stupid

There's enoguh science to back up that this is probably not stupid:

> https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6007773/
but of course, if you don't care about a minority because they're a minority, you can go ahead and do whatever you want to do. You just might be an inconsiderate asshat then.

−2

curiousshortguy t1_j1ppptp wrote

Most ML conferences are American, happen to have an extensive code of conduct and diversity, inclusion, and accessibility goals, and thus happen in a very American cultural space.

If you want to be part of a community, you can't unsubscribe from rules and be rude as you like by simply saying that you're not American. You're not to discriminate members of the community or use micro-aggressions against members because your culture doesn't acknowledge the societal issues in other countries.

−2

curiousshortguy t1_iuvswa7 wrote

>You just described the role of a discriminator in a gan

I disagree. The discriminator is just used in a binary fashion and doesn't add a lot of explanatory value.

​

Just to clarify, I am not trying to say that OP is unqualified. But I think just thinking about it in a binary way isn't enough for good peer review and a functioning system.

1

curiousshortguy t1_irc2htk wrote

> This might be a silly question, but do you think that asking to be engaged for no pay, emphasizing you're very determined to gain experience would generally be a good idea?

Doesn't hurt to ask. A good approach is to see if the lab has a list of research grants on their website. Look at those, and you might be able to infer where funding is available. Same goes for the recent papers, they give you good indications. It's even better if you're taking a class and can engage the team via that.

​

>I am in no position to try and aim for these places, it does make me worry that I would have to compete with an abundance of people with such qualifications at every step. I've recently burned myself trying hard to get chosen for summer research programs.

Don't be discouraged, you only need to be accepted once :D

To be fair, 75% your success in applications to Faang and Ivy League will be via recommendations and network. You have a full 2 years to build that.

1

curiousshortguy t1_irbnvzk wrote

It really depends on the country, the EU is not quite as homogeneous at it seems from the outside.

Typically, master students are expected to write a master thesis, and that often is research-focused. Often, they're supervised in a daily fashion but a PhD student with a professor being responsible. This happens in the labs, but because every student goes through it, these students are not always listed as lab members (because they aren't).

Sometimes, labs also have funding for student researchers. They do work that's not part of their thesis, and that's much rarer, and in most countries, the pay is shit compared to jobs in industry (I guess: welcome to academia).

If you want to get published and don't happen to find a lab that's hiring students: Choose a good thesis topic where the lab is 1) doing research 2) you contribute to an ongoing effort in the lab 3) you are supervised by someone who wants you to succeed (i.e. your project isn't a side-project, or very specific nieche follow-up), and 4) you make your ambitions clear from the beginning.
Don't expect the supervisors to give you an idea that end-to-end will lead to a publication. You need to use your own judgement, and your research on the field, to make a somewhat educated guess.

Unless you're trying to join Ivy League like places where currently a whole pipeline of FAANG engineers pushes their high-school students as interns through a pipeline to have their names top-tier conference publications before they even graduate high school, you'll end up being a decent candidate with:

  • good enough grades
  • a decent selection of courses that gives you solid background to understand enough math and theory
  • side-projects, interesting term papers, ba/ma thesis projects
  • network, get to know your lecturers, engage with them
  • recommendation letters are often a requirement. Just having good grades doesn't make you a good candidate for a letter. Wtf is the professor going to write? "He was good in my class" is just a weak recommendation letter, barely better than none at all.
  • try to particpiate in summer schools, there often is funding for it. typically, summer schools target phd candidates, but you can join them as a master student as well
  • if you're rich/well-funded enough, you can even attend academic conferences on your own/as a student
  • don't wait for research student jobs to be advertised. Go ask. That's how they're all gone before they make it to the job boards.

Probably more, but I can't think of more rn.

6