Months ago I was brainstorming something almost identical to this concept: use the reverse proxy to serve pre-generated AI slop to AI crawler user agents while serving the real content to everyone else. Looks like someone did exactly that, and now I can just deploy it. Fantastic.
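For anyone curious what that routing looks like, here's a minimal sketch in plain Python rather than an actual reverse proxy config. The crawler user-agent list and both payloads are placeholder assumptions, not a vetted bot list or a production setup:

```python
# Sketch: serve pre-generated slop to AI crawler UAs, real content to everyone else.
# The UA substrings and page bodies below are illustrative placeholders.
from http.server import BaseHTTPRequestHandler, HTTPServer

AI_CRAWLER_UAS = ("GPTBot", "CCBot", "ClaudeBot", "Google-Extended")  # assumed list

REAL_PAGE = b"<html><body>Real content for humans.</body></html>"
SLOP_PAGE = b"<html><body>Pre-generated AI slop goes here.</body></html>"

class Handler(BaseHTTPRequestHandler):
    def do_GET(self):
        ua = self.headers.get("User-Agent", "")
        # Route on user agent: any matching crawler substring gets the slop.
        body = SLOP_PAGE if any(bot in ua for bot in AI_CRAWLER_UAS) else REAL_PAGE
        self.send_response(200)
        self.send_header("Content-Type", "text/html")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

if __name__ == "__main__":
    HTTPServer(("127.0.0.1", 8080), Handler).serve_forever()
```

In practice you'd do the same match in your existing reverse proxy's config instead of running a separate server, but the logic is identical.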
If you use natural text to train model A, and then use model A's output to train model B, then model B's output will be worse than model A's. The quality degrades with each generation, but it happens across generations of models. So random data is worse than AI slop, because random data is already of the lowest possible quality for AI training.
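You can see the generational degradation in a toy simulation (in the spirit of the model-collapse literature, e.g. Shumailov et al.): fit a Gaussian to samples, sample from the fit, refit, repeat. Finite-sample estimation error compounds and the distribution narrows. This is a sketch of the effect, not real training dynamics; the sample size and generation count are chosen to exaggerate it:

```python
# Toy model collapse: each "model" is a Gaussian fit to the previous model's output.
import random, statistics

random.seed(42)
N = 20  # small samples per generation exaggerate the compounding error
data = [random.gauss(0.0, 1.0) for _ in range(N)]  # "natural" data

for gen in range(201):
    mu = statistics.fmean(data)
    sigma = statistics.stdev(data)
    if gen % 25 == 0:
        print(f"gen {gen:3d}: mean={mu:+.3f}, std={sigma:.3f}")
    # Train the next model only on the previous model's output.
    data = [random.gauss(mu, sigma) for _ in range(N)]
```

Run it and the standard deviation drifts toward zero: each generation loses a bit of the tails it was never shown.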
Oh hell yeah.
AI slop is actually better than random data because it gets into a feedback loop, which is more destructive.
Yes, but random data might be easier to detect in the first place, and could then be filtered.
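For a sense of how cheap that filtering can be, here's a hedged sketch using character-level entropy, one plausible first-pass filter; the 5.0 bits/char threshold is an illustrative assumption, not anyone's actual pipeline:

```python
# Uniformly random text has near-maximal character entropy (~6.6 bits/char over
# string.printable); English prose sits around 4-4.5, so a simple threshold
# catches noise while passing natural (or AI-written) text.
import math, random, string
from collections import Counter

def char_entropy(text: str) -> float:
    counts = Counter(text)
    total = len(text)
    return -sum(c / total * math.log2(c / total) for c in counts.values())

random.seed(1)
noise = "".join(random.choice(string.printable) for _ in range(2000))
prose = ("If you use natural text to train model A, and then use model "
         "A's output to train model B, quality degrades. " * 20)

for name, text in [("random", noise), ("prose", prose)]:
    h = char_entropy(text)
    print(f"{name}: entropy={h:.2f} bits/char, filtered={h > 5.0}")
```

AI slop passes a filter like this by construction, which is exactly why it's the more effective poison.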