NamerNotLiteral
NamerNotLiteral t1_jcf8eo1 wrote
Reply to comment by WH7EVR in [N] A $250k contest to read ancient Roman papyrus scrolls with ML by nat_friedman
Former CEO of Github as well.
NamerNotLiteral t1_j41fy66 wrote
Reply to [P] Looking for someone with good NN/ deep learning experience for a paid project by CuriousCesarr
So as far as I understand the project, you want to estimate the price of real estate. There're a few ways to do this. Forget pictures for the moment, just go with listed/numeric information.
You have information like Area/Square Footage, Listed Amenities, Age, Location, etc.If you have existing data of this sort, where it lists all the above and then a price, then it is fairly straightforward to pull off – but no guarantees on the accuracy. This has been done by plenty of people, so if you just do this your investors will probably ask you about how you're going to compete with established Real Estate companies who have much bigger teams and much more data.
Now let's consider images: you have pictures of the house, and you want to use those pictures as a way to measure how broken-down/upscale the house is and use that as a parameter to base the price of. You are going to combine this with the above, of course, because it's ridiculous otherwise. I'll say this frankly – this hasn't really been done, and it's a research problem. Not a 'product problem'. You could do a whole PhD thesis on this alone. There are so many different ways to approach this.
- You can use ML to extract furniture from the picture individually, then assign a value to each item of furniture. Aggregate that value to get how well furnished the place.
- Massive Pitfall - How do you assign a value to a furniture? A minimalistic luxury sofa and an antique cabinet could be worth equally high. Designing this NN would be a huge challenge to start with.
- Second Pitfall - You need labelled data. You would need a whole team manually annotating the data by looking through hundreds/thousands of furniture images and assigning a value to them.
- You can use ML to determine the quality of the whole room. Forget individual objects, just rate the whole picture from "broken down" to "fancy af" on a scale from 1 to 10 or something.
- Pitfall - Again, you need labelled data. You'd need a whole team going through images of rooms and marking them. And since you're applying the model into such a very abstracted and broad problem, your results are not really going to be reliable.
- You can use ML at a more micro level. Maybe you could detect broken or damaged furniture.
- Massive Pitfall - There is very little data available for this, and moreover detecting such issues is still an issue for state of the art models. Some research has been done, such as detecting defects in wooden surfaces and stuff, but it's still at a fairly basic level. Making an algorithm that would detect, say, a crack on a chair, a stain on a cushion, scratches on glass, etc is possible... individually, by zooming in on that thing specifically. Doing this for a whole room on low-mid resolution images would be a nightmare.
Honestly I've given you the entire business plan you're looking for here lmao. Only reason I'm comfortable doing this is because what you're imagining is not really a feasible business plan except for at the very, very basic level.
Like, if you had a team that could pull any of these off, they would be working at AirBnB, Zillow or some other major real estate company already.
If those investors are feeling particularly generous and give you several years and an 7-figure budget, then this might be worth considering. Otherwise...
NamerNotLiteral t1_j41dryy wrote
Reply to comment by [deleted] in [P] Looking for someone with good NN/ deep learning experience for a paid project by CuriousCesarr
Circumventing bot blocking protocols is a trivial matter.
The potential lawsuit, on the other hand, is not.
NamerNotLiteral t1_j41cbv0 wrote
Reply to comment by CuriousCesarr in [P] Looking for someone with good NN/ deep learning experience for a paid project by CuriousCesarr
A few hundreds is way too little. I would be comfortable with a few thousand homes' data, and more comfortable yet if I could scrape Zillow or something on top of that.
(but that has its own issues, both legally and in terms of data drift, since Zillow data would be American while you're European).
NamerNotLiteral t1_j2pwcve wrote
Reply to comment by answersareallyouneed in [D] life advice to relatively late bloomer ML theory researcher. by notyourregularnerd
As someone who's 25 in a month and applying to MS programs (and not expecting to get in, not for Fall 23), I expect I'll be 27 or 28 by time I start a PhD.
This's been a huge source of insecurity for me, especially considering so many people I see and interact with in the field are younger than me and yet already 1-2 years ahead of me in the same trajectory.
Empirically, late 20s is still young, but it never feels like that when you're the one in your 20s.
NamerNotLiteral t1_isjwba5 wrote
Reply to comment by ZodiacShadow in Tifu by giving my 70 yr. old teacher a book by Sayuri_olg
Yeah, dear god! There's a difference between a book that's just a book and one where all the sauciest parts are annotated and highlighted.
NamerNotLiteral t1_je533mj wrote
Reply to comment by TheFriendlyArtificer in The guy behind the viral fake photo of the Pope in a puffy coat says using AI to make images of celebrities 'might be the line' — and calls for greater regulation by Lakerlion
I only see one way to regulate models whose weights are public already.
Licenses hard-built into the GPU itself, through driver code or whatever. Nvidia and AMD can definitely do this. When you load the model into the GPU, they could check the exact weights, and if it's a 'banned' model they could shut it down.
Most of these models are too large for individuals to train from scratch, so you'd only need to ban the weights floating around. Fine tuning isn't possible either, since you need to load the original model first before you fine-tune it.
Yes, there would be ways to circumvent this, speaking as a lifelong pirate. But it's something that could be done by Nvidia, and would immediately massively increase the barrier to entry.