this post was submitted on 20 May 2024
400 points (98.1% liked)

Technology

59132 readers
5810 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
all 34 comments
sorted by: hot top controversial new old
[–] [email protected] 122 points 5 months ago (5 children)

remember kids, everything you post on the internet stays forever*

*unless it cannot be monetized anymore

[–] [email protected] 54 points 5 months ago (1 children)

Everything you post has potential to remain forever even if it's not monetized directly. Cautioning people about it makes sense now and has always made sense.

[–] [email protected] 10 points 5 months ago

I know a lot of people still have terrible fanfiction they wrote as teens on the internet somewhere, so the warning is very appropriate.

[–] [email protected] 25 points 5 months ago (1 children)
[–] [email protected] 35 points 5 months ago

Even the Wayback Machine has limits to what is available.

[–] [email protected] 17 points 5 months ago (1 children)

Oh, that stuff is out there somewhere... in a database

[–] [email protected] 6 points 5 months ago

Or on a server hanging out in a landfill.

[–] [email protected] 13 points 5 months ago (1 children)

Don't worry, it might still bubble up to the surface in the hallucinations of an AI.

[–] [email protected] 3 points 5 months ago (1 children)

You can't train an AI on data that's no longer in existence

[–] [email protected] 9 points 5 months ago

But a decade from now, there will be AI trained on data that will no longer exist. And many websites that GPT trained on probably don't exist anymore.

[–] [email protected] -3 points 5 months ago (2 children)

Really? Because I don't think my dick pic can be monetized

[–] [email protected] 12 points 5 months ago

Sure it can. People will pay to not see it.

[–] [email protected] 2 points 5 months ago

Maybe start a charity and raise money that way?

[–] [email protected] 61 points 5 months ago (3 children)

54% of Wikipedia pages contain at least one link in their “References” section that points to a page that no longer exists.

It would be interesting to know how many of these references don't exist anymore and how many have just moved. Web has come a very long way since 2013 and I bet that websites hosting the references have undergone several iterations altering the URLs in some way.

[–] [email protected] 27 points 5 months ago

That, in and of itself, is also a problem! First of all, because such pages often fail to return a HTTP 301 moved permanently response, and second (but perhaps even more importantly) the reason they move is because the site transitioned from using static, human-readable URLs to some kind of unstable CMS-managed non-descriptive gibberish that breaks caching and linking. It's an intentional siloing and hoarding of content.

[–] [email protected] 7 points 5 months ago

And how many are the site completely re-jigging their CMS with no forwarding set up.

[–] [email protected] 47 points 5 months ago (2 children)

The online era is going to be a thousand Library of Alexandria's worth of lost information, records, journals, news, ... everything. It will all just digital-rot into the memory hole.

[–] [email protected] 38 points 5 months ago

And when trying to find a backup, no search engines can find it due to the AI garbage fucking up the SEO

[–] [email protected] 2 points 5 months ago
[–] [email protected] 30 points 5 months ago (1 children)

I wonder how this compares the the number of businesses that existed in 2013 that no longer exist. I wonder for two reasons:

  • Is 38% similar to the typical rate of failure for businesses and other ventures?
  • How much of the 38% can be explained by closure of high-risk businesses like restaurants?

Something else that could explain a lot of it is webpages that were always intended to be ephemeral. Political campaign websites for instance.

[–] [email protected] 16 points 5 months ago* (last edited 5 months ago) (1 children)

Also.... I once created hellblade.com. We sold gaming computers with cases that changed colour with heat in the UK. Was a total disaster. Now it's some big game franchise. Wish I'd kept the domain.

[–] [email protected] 11 points 5 months ago (1 children)
[–] [email protected] 7 points 5 months ago

Yeah i often wonder.... Would they have changed the name of the game, paid me off, or done something like hellbladegame.com.

[–] [email protected] 22 points 5 months ago

disgusting. it's like early TV where people thought it was low-rent crap and not worth saving.

it always seems impractical to store this stuff but then it goes away and you realize how much you're missing.

[–] [email protected] 11 points 5 months ago

Perhaps something like IPFS would help mitigate this. Popular stuff would be pinned by someone.

[–] [email protected] 8 points 5 months ago

well a friend made a site called lookingupthebuttofadeadbear.com and it isn't accessible anymore but i don't think anyone misses it

[–] [email protected] 7 points 5 months ago (1 children)

Makes me wonder how many dead links and webpages there must be

[–] [email protected] 3 points 5 months ago

Going through top posts on some subreddits is pretty grim nowadays because of the Gfycat collapse. Turns out Gfycat was a huge chunk of all the links on the internet.

[–] [email protected] 7 points 5 months ago* (last edited 5 months ago) (1 children)

I was just listening to a YouTube playlist of mine that goes back at least 10 years and was disappointed how much of it was deleted. And not only that, but in many cases I couldn't even tell what the videos were.

Literally just today, I picked one music video that just seemed to be gone from youtube and the internet, but thankfully was able to find a Wayback machine link to the artists website in 2008 with a .mov download link.

[–] [email protected] 4 points 5 months ago

I was going through my YouTube subscriptions on an account that’s been active since 2010ish. I didn’t recognize several accounts at all. They had deleted all their older videos and changed their account names.

I found myself subscribed to things that I would never have subscribed to. Either I had done it accidentally or they changed their name and took their videos in a different direction.

It’s a bummer because there are some old videos that were pretty funny/creative and now they are just gone.

[–] [email protected] 6 points 5 months ago* (last edited 5 months ago)

Information has a half life. And for digital information it's really short. I always thought the digitization of documents and media is a bad idea for this very reason. Photo albums are not as common anymore, more people read through screens. All the information is getting stored in devices that expire, get thrown away, or that won't be able to be accessed in a couple decades.

Think about all of the information that we have stored right now digitally. If nothing is actively done to keep it safe, how much of it do you think will survive in 100 years? Instagram, Facebook, and Google will not be around forever. Your personal photo galleries videos and files WILL be lost unless someone deliberately curates them for preservation.

[–] [email protected] 3 points 5 months ago

I have said multiple times before that 2013 was the worst year ever. I'm still proud of that opinion, but maybe, just MAYBE, there was something good about that year after all, so it wasn't all darkness and rainstorms.

It had MOAR websites to access.

[–] [email protected] 2 points 5 months ago* (last edited 5 months ago)

Welcome to the future of the internet. Good luck trying to find a 2 days old news article on facebook from a big publisher, because you wanted to read the comments.