32 comments

  • duxup a day ago ago

    People who steal from Americans get pardons... archive.is gets the the Feds on them.

    The current administration would be a good joke if it wasn't real.

    • black_13 20 hours ago ago

      [dead]

    • walletdrainer a day ago ago

      FWIW circumventing various paywalls is probably the bad thing archive.is is being investigated for, not the archiving bit.

      • dogma1138 a day ago ago

        Do they actually do anything to circumvent paywalls or do websites just whitelist their crawlers?

        • walletdrainer 15 hours ago ago

          Websites don’t whitelist their crawlers, they maintain custom bypasses for a wide variety of websites.

          If the websites were inclined to whitelist these crawlers, they’d also whitelist archive.org which is actually easy to whitelist. Archive.is is not

        • undefined a day ago ago
          [deleted]
      • rekabis a day ago ago

        An AdGuard employee working their Reddit subreddit let slip that the legal order that forced them to block those domains (from their ad-blocking DNS) was a - claimed! - result of Archive.today having saved CP and refusing to delete it.

        Methinks someone accidentally archived the Epstein files, and the FBI is desperately trying to scrub the unredacted backups before the archive URL becomes well-known. That alone would align somewhat with the CP claim,

  • dogma1138 a day ago ago

    Regardless of this situation I actually think that websites like Archive and TWBM should be fully transparent.

    A very large partition of citations in Wikipedia for example relies on them. Most of the pages that cite archived copies do so because the live version is no longer available I would like to have some assurances that archive.is and the likes are not altering their content in any way over time.

    Unironically content sensitive hashing of archival pages might be one of the few use cases where something like a blockchain might actually be useful for.

    • dpark a day ago ago

      Why would the blockchain be useful here? You don’t need a blockchain to store a hash.

      • Galanwe a day ago ago

        You do need some kind of reliable, distributed storage though. The sequential nature of a blockchain also ensures that such stored content is held no matter what by any full node.

        • egberts1 a day ago ago

          No, just no.

          A simple four-hash like BSD or Gentoo Linux do with their repository is more than sufficient.

          No need to record who is requesting the recording, much leas fetchibg.

          • Galanwe 15 hours ago ago

            The hash to verify content is only half of the problem. You also need to store the _actual_ content of the page. What's the point of having Wikipedia reference a URL + hash if the page does not exist anymore?

            A blockchain is, at its core, a distributed database, it is exactly made for this use case.

            • dpark 7 hours ago ago

              A quick check indicates that storing something on the Bitcoin blockchain costs about a dollar. How many millions (billions?) would Wikipedia need to spend to stash everything they reference in the blockchain?

              > What's the point of having Wikipedia reference a URL + hash if the page does not exist anymore?

              It would be way cheaper for Wikipedia to run a durable archive service themselves than to use the blockchain as an archive.

              • Galanwe 5 hours ago ago

                > A quick check indicates that storing something on the Bitcoin blockchain costs about a dollar

                That's nonsensical, the price of using a service on a blockchain is essentially a floating value. That is the whole point of having a token in the first place: people willing to store and people storing are participating in the price of the service.

                Last I checked, filecoin was a few cents per GB per month.

                You can create a blockchain of kind hearted people to store Wikipedia as well, it's really up to you. But comparing apples and oranges makes no sense.

                • dpark 5 hours ago ago

                  > That's nonsensical, the price of using a service on a blockchain is essentially a floating value.

                  This is kind of a ridiculous response. The price of oil is also a floating value and yet it is not nonsensical to discuss the price of a barrel of oil.

                  Yes, the cost to store something on the bitcoin blockchain floats. Several sources indicate that roughly a dollar is a reasonable approximation currently. If you disagree I’d be interested in seeing your data.

                  > Last I checked, filecoin was a few cents per GB per month.

                  I don’t know a ton about filecoin but it seems like retrieving data is pretty cumbersome. It’s not clear that this would actually be useable for a Wikipedia reference archive.

                  > You can create a blockchain of kind hearted people to store Wikipedia as well, it's really up to you. But comparing apples and oranges makes no sense.

                  Blockchain for its own sake. Sure, you could create a custom blockchain. You could also just pay AWS for georeplicated blob storage and it would be way less complex.

            • egberts1 10 hours ago ago

              You should experiment with embedded hash within HTML pages and discover their limitations.

              These same limitations amplifies when going to outer scopes like URL itself, blockchain isn't immune to this.

              Blockchain also has the same problem when attempting to track/verify each single vote.

              W3C Subresource Integrity Recommendation

              Source: https://www.w3.org/TR/sri-2/

        • AlienRobot a day ago ago

          It's a hash.

          • dogma1138 a day ago ago

            You both need to generate the hash at the point of archival correctly and store it in a way that cannot be modified later on.

            Doing that with a blockchain like tech is one of the few use cases where the tech itself actually adds value.

            Heck you might be able to store the entire pages on a blockchain or a blockchain linked storage.

            The problem with these sites is that we implicitly trust them and unlike a book or other handprint media where editing or destroying all unedited existing copies is effectively impossible if a shady actor can easily start editing archived news articles and other sites that are no longer publicly available.

            • dpark a day ago ago

              This is getting to blockchain for the sake of blockchain.

              If Wikipedia recorded the hash of every referenced page you could verify that the archive.is page is unchanged.

              You could certainly argue that archive.is isn’t the right place to store archives (I have no idea) but attempting to move all this to the blockchain would be very expensive.

            • AlienRobot a day ago ago

              You only need the hash of the original content. No blockchain is necessary. The problem is that there is no source for that hash except for the scraper that archives it since people don't put the hash in a hyperlink.

              If you download an ISO for a Linux OS for example, they give you the hash of the file so you can check it. They don't build an entire blockchain whatever to validate the hash.

    • JumpCrisscross a day ago ago

      > very large partition of citations in Wikipedia for example relies on them

      Is the Internet Archive related to archive.is?

  • toast0 a day ago ago

    The subpoena is for archive.today.

    The .is TLD is run by ISNIC and they process registrations directly, and operating out of Iceland, it would be very strange if they took orders from the FBI.

  • lschueller a day ago ago

    I would bet, even when the fbi is able to track down archive.today, it will be a matter of hours until the archive is shifted to another network and reinstated.. Even though if a certain amount of archived data might be lost, the core service will be rehosted quite fast somewhere else, i would think.

  • walletdrainer a day ago ago

    > The subpoena is supposed to be secret

    Ars inventing their own colour here. This is simply not true.

    • tw04 a day ago ago

      >Ars inventing their own colour here. This is simply not true.

      What are you talking about? Right at the top of the subpoena it literally says in bold and all caps, and I quote:

      >YOU ARE REQUESTED NOT TO DISCLOSE THE EXISTENCE OF THIS SUBPOENA INDEFNITELY AS ANY SUCH DISCLOSURE COULD INTERFERE WITH AN ONGOING INVESTIGATION AND ENFORCEMENT OF THE LAW.

      • t0mas88 a day ago ago

        The "requested" part sounds like it's not mandatory, otherwise they would have used something like "you're not allowed to..." or similar language.

        • namibj a day ago ago

          Sounds like a notice that disclosure could result in outcomes that make the discloser liable of (the crime? of) interfering-with-an-investigation.

          And thus to avoid that risk, to think twice before disclosing. Disclosure would only be punished if the specific circumstances actually result in legally-considered-(unlawful-)interference.

        • AlienRobot a day ago ago

          The "supposed" part as well.

      • walletdrainer 17 hours ago ago

        You are absolutely allowed to do whatever you want with the subpoena unless you’re ordered by some authority not to do so.

  • ChrisArchitect a day ago ago
  • undefined a day ago ago
    [deleted]
  • undefined a day ago ago
    [deleted]