this post was submitted on 31 Jul 2024
577 points (97.2% liked)

Technology

59111 readers
3902 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Reddit says Microsoft’s Bing, Anthropic, and Perplexity have scraped its data without permission. “It has been a real pain in the ass to block these companies.”

top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 544 points 3 months ago* (last edited 3 months ago) (24 children)

"Without these agreements, we don’t have any say or knowledge of how our data is displayed and what it’s used for, which has put us in a position now of blocking folks who haven’t been willing to come to terms with how we’d like our data to be used or not used,” Huffman said in an interview this week

It's not your data.

Fuck off.

[–] [email protected] 78 points 3 months ago (4 children)

My only regret was not deleting all my comments before deleting my reddit account :P

[–] [email protected] 72 points 3 months ago (2 children)

Don't regret too much. I wouldn't be surprised if reddit's "delete" function was really just "move to the "suckers-wanted-to-delete-this" file.

[–] [email protected] 29 points 3 months ago (3 children)

I "deleted" all my posts, then randomly had someone reply to a 3 year old post that wasn't showing up in my profile but still showed on the page.

Don't delete your comments, edit them to be useless.

load more comments (3 replies)
[–] [email protected] 20 points 3 months ago (2 children)

If you delete your content do it in the form form of a GDPR takedown request

load more comments (2 replies)
[–] [email protected] 28 points 3 months ago (2 children)

Same. 13+ years of insanity and grouchy comments are gonna mess up the AIs

[–] [email protected] 27 points 3 months ago (6 children)

Same, plus or minus a year.

It took me a week, but I scrambled every comment and post with lorem ipsum and bee movie scripts, deleted the comments, then after verifying I could no longer find any of my original content on any search engine outside archive sites, I deleted the account.

It took so long because r*ddit started limiting API access when they realized people were automating their profile scrubbing.

As I've said before about certain countries, if you're doing everything you can to prevent people from leaving [THING/PLACE] then you might just be shit.

load more comments (6 replies)
load more comments (1 replies)
load more comments (2 replies)
[–] [email protected] 28 points 3 months ago (7 children)

Part of the ToS. Whatever you put on there is effectively theirs. Same with Facebook and your photos etc.

load more comments (7 replies)
load more comments (22 replies)
[–] [email protected] 203 points 3 months ago (6 children)
[–] [email protected] 104 points 3 months ago (1 children)

Spez gets a ton of online hate and it is still somehow not enough hate.

load more comments (1 replies)
load more comments (5 replies)
[–] [email protected] 119 points 3 months ago

Without these agreements, we don’t have any say or knowledge of how our data is displayed and what it’s used for, which has put us in a position now of blocking folks who haven’t been willing to come to terms with how we’d like our data to be used or not used

It’s the users’ data, not yours, you rent seeking fuck

[–] [email protected] 85 points 3 months ago (3 children)

“It has been a real pain in the ass to block these companies.” makes me regret ever using Reddit in my life. Get your profit whatever, but this is just beyond greed.

load more comments (3 replies)
[–] [email protected] 84 points 3 months ago (2 children)

"Let's see ... how do we get more people to visit our site? I know! We'll prevent search engines from sending people to it!"

...

[–] [email protected] 26 points 3 months ago* (last edited 3 months ago) (3 children)

It's phase three of the enshittification cycle. In phase one, you attract users by providing a good service. Once they're locked in, you squeeze them for all they're worth by switching focus to business customers. Once they're locked in, you squeeze them by threatening to deny them access to the users on whom they now depend.

load more comments (3 replies)
[–] [email protected] 22 points 3 months ago* (last edited 3 months ago) (4 children)

Big profit now is better than our long term image

~ Reddit shareholders

load more comments (4 replies)
[–] [email protected] 66 points 3 months ago (6 children)

Ok, now I'm miffed that Google caved to Reddit's demands and paid up.

Because this set a dangerous precedent.

Earlier, Google got a lot of demands from various publications to pay up for indexing the publicly available news sites. And they always responded with "Ok, guess you leave us no other choice than just exclude you from indexing altogether." Let the site simmer for a while until they went "oh shit, not being indexed by major search engines sucks. we didn't really mean it please come back"

It's especially jarring because Reddit doesn't even produce their own news content anyway. That search engine money isn't going to the content creators. News sites at least could say they need to pay for their content to be written by their employees.

load more comments (6 replies)
[–] [email protected] 64 points 3 months ago

Fuck U/spez

[–] [email protected] 62 points 3 months ago

"People who fecklessly farm other people's data upset at other companies are farming their data."

[–] [email protected] 56 points 3 months ago (2 children)
load more comments (2 replies)
[–] [email protected] 49 points 3 months ago (5 children)

Well Reddit should just sue these companies and see if these companies are actually breaking any laws. Holding sizeable chunk of the internet hostage also sounds like something the EU and US might want to look in to as it very much sounds like anti-competitive conduct or market manipulation.

Also if these companies want to have greater ownership over the content generated by their users they should also be much more liable for the content posted to their sites. I mean when something like the Section 230 was written they probably did not take this in to account. If these companies want to start selling user generated content then they should simply lose the immunity from liability.

load more comments (5 replies)
[–] [email protected] 48 points 3 months ago (8 children)

Remember when Reddit was pro net neutrality?

[–] [email protected] 21 points 3 months ago

Spez has a bunker to build!

load more comments (7 replies)
[–] [email protected] 45 points 3 months ago (3 children)
load more comments (3 replies)
[–] [email protected] 43 points 3 months ago (1 children)

Reddit should have to pay to work on windows....

[–] [email protected] 21 points 3 months ago* (last edited 3 months ago) (1 children)

Microsoft paying Reddit to pay Microsoft to pay Reddit to pay...

Stock prices absolutely skyrocketing with the news of this infinite revenue stream.

load more comments (1 replies)
[–] [email protected] 42 points 3 months ago (2 children)

fuck spez little piggy greedy soy boy

load more comments (2 replies)
[–] [email protected] 33 points 3 months ago (2 children)

the exodus must have worked if they're having to do this to shore up income.

[–] [email protected] 22 points 3 months ago (5 children)

We can be so glad we left. That placees quality has dropped so much.

Reddit is for porn and perhaps age old posts now.

load more comments (5 replies)
load more comments (1 replies)
[–] [email protected] 32 points 3 months ago

This is for you, Reddit: 🎻

[–] [email protected] 32 points 3 months ago

lol. That’s not how any of this works.

If you have content freely and publicly accessible, it will be read freely by humans and bots.

[–] [email protected] 28 points 3 months ago (3 children)

Scraping isn't illegal, they can't do anything

load more comments (3 replies)
[–] [email protected] 28 points 3 months ago (1 children)

and spez will pay you for creating content.

/s

load more comments (1 replies)
[–] [email protected] 25 points 3 months ago

Reddit only exists because of an open net and sharing content, noe they just suddenly determined that an open net is bad.

A common strategy, but it fucking sucks.

Fuck you reddit.

[–] [email protected] 22 points 3 months ago* (last edited 3 months ago) (1 children)

Pay for what now, it's full of bots. And disingenuous people. Add to that the self moderation people put themselves through to interact in that website

load more comments (1 replies)
[–] [email protected] 21 points 3 months ago

Reddit says web clients have visited their public, free website without permission.

Fixed.

[–] [email protected] 20 points 3 months ago (2 children)

I've said once and I'll say it again. Either the information on your site is free to all or to none. You can't have some people/entities pay and some not!

load more comments (2 replies)
load more comments
view more: next ›