• @BreakDecks@lemmy.ml
    link
    fedilink
    English
    515 months ago

    Google never did make backups of the Internet, why are we pretending like they ever did? Cached webpages were a basic workaround for third-party website downtime; a guarantee that you could reliably see the information you searched for, even if the linked site was down. It was nothing more than a snapshot of the webpage their crawlers saw, where older copies are permanently deleted with every new crawl of the page.

    It was never an archival effort, it was a rotating cache. If you were under the impression for all these years that Google was preserving Internet history, I don’t know why, because Google never claimed to be doing that. Maybe it’s time to reevaluate any other altruistic things you’re assuming that mega corporations are up to…

  • BoisZoi
    link
    fedilink
    English
    295 months ago

    If possible, please use the internet archive extension and upload pages that haven’t been uploaded ever, or in the last year.

    Likewise, if you know or use another service, archive it there too!

  • BudgieMania
    link
    fedilink
    10
    edit-2
    5 months ago

    Well surely this means that archive.org will be allowed to exist in peace, since it would be ridiculous to make the information and culture produced in the year of our lord 20fucking24 the most ephemeral it has ever been in human history, right?

    Right?

  • Willie
    link
    fedilink
    55 months ago

    I feel like this is so they can deny that they fed all the webpages that they cached to their ‘AI’ training datasets later when someone accuses them of that. Now when asked about the copies of webpages that they have they can be like “What copies?” and end the conversation there.

  • @linearchaos@lemmy.world
    link
    fedilink
    English
    45 months ago

    I wonder if this is related to why their searches have been going to hell. Like They changed how the engine indexes or something.

  • @astanix@lemmy.world
    link
    fedilink
    English
    35 months ago

    I noticed this yesterday when I tried to load a cached version of a site. How disappointing.

  • @wizardbeard@lemmy.dbzer0.com
    link
    fedilink
    English
    -15 months ago

    Three guesses at if they even attempted to donate this data to Internet Archive/Wayback Machine, and the first two don’t count.

    • @BreakDecks@lemmy.ml
      link
      fedilink
      English
      75 months ago

      Google cached content is pruned down into a space-saving format and rotated/deleted after less than a year, so it would be pretty worthless to the IA.

    • Chozo
      link
      fedilink
      15 months ago

      Internet Archive likely wouldn’t be able to handle it. They’re already struggling currently, as it is, and dumping a few petabytes of caches of the entire internet onto them probably won’t help.

  • @TCB13@lemmy.world
    link
    fedilink
    English
    -25 months ago

    You can’t cache stuff, politicians and the media needs ways to be able to delete content whenever they please.