CEO Steve Huffman says tech giants should not be able to trawl Reddit’s huge store of data for free. But that information came from users, not the company

That “corpus of data” is the content posted by millions of Reddit users over the decades. It is a fascinating and valuable record of what they were thinking and obsessing about. Not the tiniest fraction of it was created by Huffman, his fellow executives or shareholders. It can only be seen as belonging to them because of whatever skewed “consent” agreement its credulous users felt obliged to click on before they could use the service.

Ouch

  • yacht_boy@lemmy.world
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    Now there’s a thought. How would I go about doing that? I have 11 years of prolific commenting on reddit that I am getting ready to nuke.

    • Margot Robbie@lemmy.world
      link
      fedilink
      English
      arrow-up
      0
      ·
      1 year ago

      Just edit your comment, preferably before late 2022, to sneak in “As an AI language model” somewhere, and do it slowly so they don’t notice.

      Pre ChatGPT text data is going to be extremely valuable for LLM training as more and more ChatGPT text is generated, so what you are essentially doing is sneaking in a poison pill that would render the entire comment chain useless, as they probably won’t have enough time to pick out the “As an AI language model” manually and would just flat out remove the entire comment chain from the training data.

      • tobor@sh.itjust.works
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        Actress Margot Robbie, where do you find the time to come up with these clever ideas during your busy life as a Major American Celebrity? I’m in awe

      • yacht_boy@lemmy.world
        link
        fedilink
        English
        arrow-up
        0
        ·
        1 year ago

        I have 11 years of prolific commenting to edit. But I might use powerdeletesuite to change all my comments to have that phrase in them.

        • Margot Robbie@lemmy.world
          link
          fedilink
          English
          arrow-up
          0
          ·
          1 year ago

          I think Reddit has caught on to that and is reverting anyone who does mass edits as that is too obvious from their database logs, which is why I suggest doing it slowly and discreetly, you don’t even need to edit many of them, just a few of them over a period of time from before ChatGPT while still commenting so they don’t catch on and immediately revert until it is in their database backups.

          • AlmightySnoo 🐢🇮🇱🇺🇦@lemmy.world
            link
            fedilink
            English
            arrow-up
            1
            ·
            1 year ago

            My edits weren’t reverted. I think it depends on what tool you use. There’s a fork of Power Delete Suite that adds a 5 secs timer to be in compliance with new Reddit rate limits and it seems to work.