FamousSuccess t1_jc27bzt wrote on March 13, 2023 at 2:40 PM

I'm not sure if the data will be sold, rather than just tools to gather it.

Even still, from what I've seen in the past not much stands in the way of "ownership" of tweets/FB posts/Social media. It tends to fall in the public IP territory

tyler1128 t1_jc27nqp wrote on March 13, 2023 at 2:42 PM

I'm personally thinking about writing a service to sell the data at something like 1/10,000th the cost twitter is charging or less. It'd cache most of the tweet data in LRU form up to a specific data limit in a central database, and dynamically grab new data in the case it isn't already there. There's also be a constantly running scraper for new data to throw it in the central DB cache. Only think stopping me is understanding the legal ramifications. On-demand access to historical data is too slow for large cohorts.

FamousSuccess t1_jc2ry05 wrote on March 13, 2023 at 4:56 PM

Well. Keep in mind that google effectively sells advertising based on user data, and their services/users depend entirely on content and data of non google entities.

So I’d say if google can build a business on other entities public data, so can you.

Not a perfect parallel but a parallel nonetheless

dubiousadvocate t1_jc2jom8 wrote on March 13, 2023 at 4:03 PM

I don’t think legality enters into it. At worst it’s a EULA violation. Like any public facing website. Grounds for banning the account but these would be throw away accounts to begin. Musk would whine about it but he’d probably also embrace the artificial user numbers at the same time.

One thing we’ve all learned about the man during this debacle is he’s self destructively impulsive and undisciplined.

Mr_ToDo t1_jc2lulp wrote on March 13, 2023 at 4:17 PM

Well it doesn't use the API, and assuming that it doesn't use a login then it's probably not bound by the EULA since it would all be public data with no agreement to see it.

Could be a bit of fun if it removes the login prompt, but it's pretty random normally and if there isn't an actual hard limit to what you can load then removing it is likely just a technicality at best(It seems more concerned about how long I stare at old tweets then how far down I scroll. I know sometimes I've gone years down if I don't stop scrolling)

dubiousadvocate t1_jc2qc1a wrote on March 13, 2023 at 4:46 PM

All good points.

haux_haux t1_jc3q8f1 wrote on March 13, 2023 at 8:36 PM

Didn't LinkedIn sue an organisation for scraping a while back. Did that fly?

bobartig t1_jc42cxs wrote on March 13, 2023 at 9:57 PM

There’s been a lot of misreporting regarding the recent HiQ v. LinkedIn case from the 9th Circuit. The best write up I've encountered is by an Internet and Web Scraping attorney, Kieran McCarthy

The key takeaway is that in the 9th Circuit (which has the most developed law in this area) web scraping a publicly available website doesn’t necessarily constitute a CFAA violation, but that doesn’t mean what you did was either legal, or that you won’t face legal liability.

dubiousadvocate t1_jc3uc8z wrote on March 13, 2023 at 9:03 PM

I haven't heard about that. I'm curious too.

Of course anyone can file a SLAP lawsuit and hope to intimidate legal behavior through financial burden.

Twitter’s $42,000-per-Month API Prices Out Nearly Everyone | Tiers will start at $500,000 a year for access to 0.3 percent of the company’s tweets. Researchers say that’s too much for too little data

tyler1128 t1_jc1vmhb wrote on March 13, 2023 at 1:11 PM