this post was submitted on 26 Jun 2023
213 points (100.0% liked)

Reddit Migration

83 readers
2 users here now

### About Community Tracking and helping #redditmigration to Kbin and the Fediverse. Say hello to the decentralized and open future. To see latest reeddit blackout info, see here: https://reddark.untone.uk/

founded 1 year ago
 

This video shows that Reddit refused to delete all comments and posts of its users when they close their account via a CCPA / GDPR request.

top 50 comments
sorted by: hot top controversial new old
[–] [email protected] 38 points 1 year ago (5 children)

The creator of tildes.net is a former Reddit backend developer, and believes this behavior is likely due to how Reddit caching works (or doesn't work), rather than an intentional subversion of user intent:

Yes, this is almost certainly a technical issue. The way reddit caches things probably isn't the standard way you're thinking of, like a short-term cache that expires and refreshes itself. There are multiple layers of "cached" listings and items for almost everything, and a lot of these caches are actually data that's stored permanently and kept up to date individually.

For example, when you view your comments page, Reddit uses a cached (permanent) list of which comments are in that page. There is a separate list stored for each sorting method. For example, maybe you'd have something like this with some made-up comment IDs:

Deimos's comments by new: 948, 238, 153
Deimos's comments by hot: 238, 153, 948
Deimos's comments by controversial: 153, 238, 948
If I post a new comment, it will go through each list and add the new ID in the right spot (for example, in the "new" list it always just goes at the start). If I delete a comment, it goes through every list, and removes the ID if it can find it in there.

One of the problems with this system (which is probably what's causing @phedre's issues, and affecting many other people trying to delete their whole history) is that all of these listings are capped at 1000 items. If you already have more than 1000 comments and you post a new one, the 1000th comment currently in the new list gets "pushed off the end". The comment still exists, but you won't be able to see it by looking through your comments page, because it's no longer in that listing.

Deleting comments also doesn't cause previously "pushed off" ones to get re-added. If you have 5000 comments, your listing will only include 1000 of them. If you delete 50 of the ones in the listing, your listing now has 950 comments in it. If you delete all 1000 from the listing, your comments page will appear empty, but you actually still have 4000 comments that will be visible in the comments pages they were posted in.

And this is only one aspect of it. There are also multiple other places and ways that comments are cached—comment trees are cached (order and nesting of comments on a comments page, for all the different sorting methods), rendered HTML versions of comments are cached, API data is probably cached, and so on.

All of these issues are probably just some combination of all of your posts being difficult to find and access due to the listing limits or certain cached representations of posts not being cleared or updated properly.

[–] [email protected] 50 points 1 year ago (1 children)

Luckily GDPR deletion requests don't care about how they are implemented. And failures to comply en masse tends to get really expensive.

[–] [email protected] 10 points 1 year ago* (last edited 1 year ago) (2 children)

Yup. I'm waiting for Reddit to come back with my GDPR data request (which has a time limit of 30 days, after which they can tell their excuses to extend it by another 30 days I believe), and assuming they have not reversed the API decision I'm ordering them to delete it all afterwards. And they even now have a handy list, the one they just gave me, of everything they have to purge - if they didn't, it wouldn't be on that list in the first place :)

[–] [email protected] 8 points 1 year ago

Still waiting for the GDPR request i made at the start of this shitshow, will be funny to witness the mass GDPR deletion requests of accounts at the start of July

[–] [email protected] 4 points 1 year ago (1 children)

It's been 3-4 weeks since I submitted my CCPA request, and I still haven't gotten my data yet. CCPA has a time limit of 45 days.

[–] [email protected] 2 points 1 year ago (1 children)

That's what's so awful about this. Prices were announced May 31st, so for a CCPA request that was done that very instant, they can delay until mid July, when the API changes will make it much more difficult to delete your data, and there's no recourse.

Even for GDPR, maybe you'd get it the day before, for the shorter 30 day limit. But a day of a few hours could easily mean you've gone past and API is also a problem for you.

This is some messed up timing, mates.

[–] [email protected] 2 points 1 year ago (1 children)

I would hope that someone reaching out to press from ModCoord would pass these concerns on to journalists. A persistent journalist can uncover the extent of compliance to the GDPR and CCPA through proper questions. "Have you seen an increase in GDPR/CCPA requests wince the controversy started? What percent of those have you completed? What about reports that users are unable to delete their data?" etc. (only better because I'm not a journalist and probably oversimplifying).

[–] [email protected] 3 points 1 year ago

Reddit stopped answering requests for comment from objective journalists.

People just need to start filling complains with their Data Protection Authority. Then the mainstream media will be forced to cover the stories to get the clicks.

[–] [email protected] 21 points 1 year ago

Based on this, I'd say that Reddit fully deserves to be banned in Europe and California, and fined into potential bankruptcy. Having deeply flawed technology that prevents them from ever being in compliance of a very serious law is no excuse.

[–] [email protected] 17 points 1 year ago* (last edited 1 year ago)
[–] [email protected] 5 points 1 year ago (1 children)

This sounds like malicious incompetence...

load more comments (1 replies)
[–] [email protected] 2 points 1 year ago (1 children)

Oh God. Somewhat unrelated, but I felt like I knew the name "Deimos" from somewhere. Couldn't put my finger on it. Finally realized who he was.

[–] [email protected] 7 points 1 year ago (1 children)

Greek god of dread and terror. Also, the smaller and outermost of the two satellites of Mars, named after said god.

load more comments (1 replies)
[–] [email protected] 26 points 1 year ago (3 children)

This is one of the many legal issues Reddit now has.
Reddit is very clearly eying an ipo, but who really wants to invest in this dumpster fire.

I’m not an investor but I l personally wouldn’t invest in a website as shortsighted as Reddit.

In an industry as cutthroat as social media having a site as active as Reddit, for 18 years. Should be celebrated.

In this world, where platforms live and die in the span of single years, why would Reddit throw away a formula that has worked for nearly 20 years.

[–] [email protected] 12 points 1 year ago

As an investor, I can say with near certainty that the objective is extremely "short" sided.

[–] [email protected] 10 points 1 year ago

Easy question to answer: they aren't profitable and the free money of years of near 0 or 0% interest rates is over. The constant VC dried up and the website is insolvent. They have a massively bloated staff roster. They're going to die if they don't make a major change.

And at the same time, all "traditional" monetization strategies for websites like these just... don't work with the way Reddit works. Making the changes they need to make will kill the site.

They never cooked up a monetization strategy that would work for them. They procrastinated. They felt free money would continue forever and underestimated how reliant their site was on volunteer labor. They got distracted by stupid side projects instead of refining the core product.

Reddit will absolutely survive all this. I expect it to still exist, at the end of the day. But it'll be smaller, and what remains will be a soulless shit hole. And it'll still be borderline insolvent.

[–] [email protected] 2 points 1 year ago

If I could get a controlling interest for fifty bucks I'd chip in on that.

[–] [email protected] 25 points 1 year ago* (last edited 1 year ago) (1 children)

I wonder at what point people start taking them to court. It seems like the usual idiot tech bro excuse of thinking terms of service/use somehow override the law which is hilariously naive.

You cannot override the law in a TOS.

Like if they wrote down that they were allowed to murder you written into their TOS and proceeded to murder you they'd still go to jail for murder.

[–] [email protected] 6 points 1 year ago

Username's research checks out.

(Sorry, I know people are kind of sick of funny tropes that were common on Reddit, but I couldn't resist. I"ll see myself out now...)

[–] [email protected] 22 points 1 year ago* (last edited 1 year ago)
  • reddit in violation of privacy laws
  • spez a pedophile
  • subs closing
    reddit is doomed
[–] [email protected] 13 points 1 year ago (4 children)

Wow, their legal department shot themselves in the foot putting that in writing. Idiots.

I submitted a CCPA request weeks ago and have yet to hear from them. They also restored tons of content I deleted.

Time for a class action yet?

[–] [email protected] 5 points 1 year ago (12 children)

Their explanation for restored content will likely be something about the nature of how their CDN works.

Granted, this excuse won't hold up much, but it's probably true and will limit their liability in the sense that it isn't intentional.

I've deleted my comments multiple times with PowerDeleteSuite and had things come back, a couple times over. However now I'm going through with shreddit (github version) using my GDPR files. It's taking a long time because things panic every so many comments (I'm backing up everything, on file 75 so far but still 46,000 lines left from a 75,000 line file, however it's panicking less now that the comments are more recent) and I haven't had it restore any of the links I've checked from that process.


Reddit changed the way they display comments in the profile a few months back. Now, you only see a limited number of comments under New, Hot, Top & Controversial. These are the lists that most deletion services access. So, if you use PowerDeleteSuite or any other service it will likely miss things. In particular, I opened up links to my older Top comments, ran the script, then found it had completely ignored replies underneath my comment that had low but positive karma - these wouldn't have appeared on the lists. My new list only went back about 3 months (although I think it's about number of comments rather than time).

You really need to use the GDPR files to get everything. These contain CSV files with links to every single post and comment you have. However, it seems that reddit are delaying following through with most requests until after 1 July, when API requests (such as those that shreddit uses) will be blocked.


Also PSA don't use the shreddit website, they charge you $15. The github version is free and will take CSV files with the appropriate tag. But, again, in my experience it panics and hangs fairly often, so it will take a lot of work to use. I've had to run it, back up the terminal output, use the last link and delete everything in posts.csv and comments.csv before the one it stuck on, then resume with ammended files.

Reddit really isn't making it easy to follow through with your rights. Make records of this, then this can be used to convince local Data Protection Authorities to collectively throw down a bigger hammer than Huffman ever wielded, or even imagined.

Also another PSA, reddit's terms do not deny you ownership of your content. So even if they try to claim ownership themselves (as Steve Huffman has frequently publicly stated) they cannot deny you the right to edit your content and restrict what they do with it. It's your information, and reddit hasn't even paid for it.

You can't sell a microwave without paying for the nuts and bolts.

load more comments (12 replies)
[–] [email protected] 5 points 1 year ago (1 children)

Time for a massive fine from the EU. Something large enough to bankrupt them.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (1 children)

Sadly probably not. The GDPR fine can be "up to €20 million, or up to 4% of the annual worldwide turnover of the preceding financial year, whichever is greater" which would be around 26 million based on their 2022 revenue. The company has gathered over $1.3 billion in funding and was "valued" at around $10 billion quite recently.

And that's only around what a year of API calls would have cost for Apollo so clearly by discontinuing the API they are going to save that amount back in no time!

load more comments (1 replies)
[–] [email protected] 2 points 1 year ago

This is the comment I was looking for. A class action from European citizens, for example, under the European privacy law, would really be bad news for Reddit (and good news for the Internet)

load more comments (1 replies)
[–] [email protected] 12 points 1 year ago (2 children)

What I noticed is that when restoring your comments they prioritize the ones with the most upvotes. Some I even deleted manually before the blackout reappeared too.

I find this shit to be likely illegal. I understand that we gave Reddit permission to use our content by agreeing to their terms of service, but if my comment was "A" and I edit it so that it displays "B", it is wrong for Reddit to still display "A" below my username without my authorization. They can exploit the content "A" however they want, but to show it under my username as if it were what I consented to display under my name feels like a breach to me.

[–] [email protected] 6 points 1 year ago (2 children)

I have been overwriting and deleting manually and I haven't seen anything come back yet... But it's also a nightmare to delete old comments that they have archived and don't show up on your profile. I just gave up

[–] [email protected] 3 points 1 year ago

It is illegal under the EU law.

[–] [email protected] 11 points 1 year ago (1 children)

I love it, it makes their intentions so obvious. Milking our content for AI training. Nobody will read our old conversations, except for AI´s

[–] [email protected] 7 points 1 year ago (2 children)

Lovely thing is that there isn't even a option to delete data via a LGPD (Lei Geral de Proteção de Dados) request. Well, for what I know at least

[–] [email protected] 13 points 1 year ago (1 children)

The product owners over at Reddit are going to be surprised to learn that Brazil exists.

[–] [email protected] 8 points 1 year ago

"Brazil? Isn't there where the Amazon offices are at"

[–] [email protected] 3 points 1 year ago

Ditto for the Canadian PIPEDA.

[–] [email protected] 5 points 1 year ago (5 children)

I don't think they are actually restoring posts/comments. This whole thing is based on confusion about the blackout and many subreddits going private. Most people would think you can see all of your own posts and comments if you are logged in and go to your profile page, but if a subreddit goes private you cannot even see your own submissions in that sub.

So after the blackout ended and most subreddits went public again, people who nuked their account history are now discovering that there's still posts remaining. They think these posts were restored, but they weren't even deleted in the first place.

This is obviously a huge oversight on how Reddit handles your data and your profile page, but don't attribute to malice which is adequately explained by stupidity.

[–] [email protected] 16 points 1 year ago* (last edited 1 year ago) (1 children)

That still makes it impossible for a user to ever delete all their comments, which is the CCPA complaint

load more comments (1 replies)
[–] [email protected] 10 points 1 year ago (1 children)

If you watch the video, you would understand that this individual is deleting specific comments, then saw the exact same comments that he deleted return some time later.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (2 children)

Yes, but if you look closely all of those submissions were made on the javascript subreddit. It's entirely possible that this sub was still private on the 24th, and went public on 25th. I don't know for sure but that seems to be the most likely scenario.

Edit: Looking at the blackout tracker, javascript was still private on June 24th, which is the day that the OP of the video was manually deleting his submissions.

[–] [email protected] 9 points 1 year ago (1 children)

If the sub were private that time, wouldnt it have prevented him from being able to delete the comment in there in the first place (bc he wouldnt be able to see them when its privet?) In this case he was able to see them i guess because he was able to delete them specific. But am not sure

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago)

Yup, I couldn't see my own comments or posts on subs that were private. When I tried to delete them via API/script it got me an error too.

However, there's an exception. If you are a mod or approved user for a sub, then you can see and edit/delete as normal. I have never tried this scenario but maybe in this case when it go public again, any deletions are undone (because of the caching issue).

load more comments (1 replies)
[–] [email protected] 9 points 1 year ago (1 children)

The law doesn't care how they handle the data or if a subreddit is private. If someone requests their data to be deleted, everything must be deleted.

[–] [email protected] 4 points 1 year ago

Correct, which is why reddit must ultimately do it. Only they would have access.

load more comments (2 replies)
[–] [email protected] 4 points 1 year ago (1 children)

I'm honestly not surprised at all. The content you created for them is valuable and they're expecting individual users not to fight back or even notice. They have the power and thicker wallets on their side.

[–] [email protected] 3 points 1 year ago (1 children)

Good thing about gdpr is the data commissioner will fight it for you

load more comments (1 replies)
load more comments
view more: next ›