Alphane Moon@lemmy.world to Technology@lemmy.world · English · 3 days ago
Nepenthes: a dangerous tarpit to trap LLM crawlers – OSnews (www.osnews.com)
catloaf@lemm.ee · English · 2 days ago
Good luck getting any of them to actually crawl it though. Most models are trained on datasets like reddit comments, not by crawling sites like search indexers.