med-mastodon.com is one of the many independent Mastodon servers you can use to participate in the fediverse.
Medical community on Mastodon

#scraping

Ramin Honary<p>Bookmarking this: <a href="https://billauer.co.il/blog/2025/05/phpbb-attack-bots-ip-addresses/" rel="nofollow noopener" target="_blank">https://billauer.co.il/blog/2025/05/phpbb-attack-bots-ip-addresses/</a><br><br><a class="hashtag" href="https://fe.disroot.org/tag/tech" rel="nofollow noopener" target="_blank">#tech</a> <a class="hashtag" href="https://fe.disroot.org/tag/webadmin" rel="nofollow noopener" target="_blank">#WebAdmin</a> <a class="hashtag" href="https://fe.disroot.org/tag/bots" rel="nofollow noopener" target="_blank">#Bots</a> <a class="hashtag" href="https://fe.disroot.org/tag/scraping" rel="nofollow noopener" target="_blank">#Scraping</a> <a class="hashtag" href="https://fe.disroot.org/tag/scraperbots" rel="nofollow noopener" target="_blank">#ScraperBots</a> <a class="hashtag" href="https://fe.disroot.org/tag/devops" rel="nofollow noopener" target="_blank">#DevOps</a> <a class="hashtag" href="https://fe.disroot.org/tag/security" rel="nofollow noopener" target="_blank">#security</a></p>
PrivacyDigest<p><a href="https://mas.to/tags/Browser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Browser</span></a> <a href="https://mas.to/tags/Extensions" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Extensions</span></a> Turn Nearly 1 Million <a href="https://mas.to/tags/Browsers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Browsers</span></a> Into Website-Scraping <a href="https://mas.to/tags/Bots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Bots</span></a> - Slashdot <br><a href="https://mas.to/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://mas.to/tags/hijack" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>hijack</span></a></p><p><a href="https://tech.slashdot.org/story/25/07/09/2257245/browser-extensions-turn-nearly-1-million-browsers-into-website-scraping-bots?utm_source=rss1.0mainlinkanon&amp;utm_medium=feed" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">tech.slashdot.org/story/25/07/</span><span class="invisible">09/2257245/browser-extensions-turn-nearly-1-million-browsers-into-website-scraping-bots?utm_source=rss1.0mainlinkanon&amp;utm_medium=feed</span></a></p>
Jonathan Bailey<p>An architecture firm has filed a lawsuit against Pinterest over alleged scraping. However, the case is a real blast from the past.</p><p><a href="https://www.plagiarismtoday.com/2025/07/09/architect-sues-pinterest-over-scraping/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">plagiarismtoday.com/2025/07/09</span><span class="invisible">/architect-sues-pinterest-over-scraping/</span></a></p><p><a href="https://mastodon.world/tags/Copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Copyright</span></a> <a href="https://mastodon.world/tags/Pinterest" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pinterest</span></a> <a href="https://mastodon.world/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a></p>
Kevin Russell<p><span class="h-card" translate="no"><a href="https://front-end.social/@zeldman" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>zeldman</span></a></span> </p><p>Watt is being Dunn about AI scraping images and descriptions?</p><p>Make RED sure you fill your gravy description meat with AI hostile get em on the beaches words.</p><p>Images uploaded to mastodon should have AI poison added to them.</p><p><a href="https://mstdn.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mstdn.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mstdn.social/tags/ZuckSucks" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ZuckSucks</span></a></p>
Rod2ik 🇪🇺 🇨🇵 🇪🇸 🇺🇦 🇨🇦 🇩🇰 🇬🇱<p>Paid <a href="https://mastodon.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> (<a href="https://mastodon.social/tags/payant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>payant</span></a>): toward a radical change in the business model of generative <a href="https://mastodon.social/tags/IA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IA</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> (<a href="https://mastodon.social/tags/g%C3%A9n%C3%A9rative" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>générative</span></a>)?</p><p><a href="https://www.journaldugeek.com/2025/07/04/le-scraping-payant-vers-un-changement-radical-du-modele-economique-de-lia-generative/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">journaldugeek.com/2025/07/04/l</span><span class="invisible">e-scraping-payant-vers-un-changement-radical-du-modele-economique-de-lia-generative/</span></a></p>
Marcel SIneM(S)US<p><a href="https://social.tchncs.de/tags/Cloudflare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Cloudflare</span></a> blocks AI crawlers unless they pay for <a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> | heise online <a href="https://www.heise.de/news/Cloudflare-laesst-KI-Crawler-auflaufen-wenn-nicht-fuer-Scraping-bezahlt-wird-10467015.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">heise.de/news/Cloudflare-laess</span><span class="invisible">t-KI-Crawler-auflaufen-wenn-nicht-fuer-Scraping-bezahlt-wird-10467015.html</span></a> <a href="https://social.tchncs.de/tags/PayPerCrawl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PayPerCrawl</span></a> <a href="https://social.tchncs.de/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialIntelligence</span></a> <a href="https://social.tchncs.de/tags/copyright" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>copyright</span></a> <a href="https://social.tchncs.de/tags/Urheberrecht" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Urheberrecht</span></a></p>
Petra van Cronenburg<p><span class="h-card" translate="no"><a href="https://indieweb.social/@akamran" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>akamran</span></a></span> <span class="h-card" translate="no"><a href="https://me.dm/@davidtoddmccarty" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>davidtoddmccarty</span></a></span> If you search Google for <a href="https://mastodon.online/tags/Mastodon" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mastodon</span></a> hashtag scraping, you find software and programs that help AI do that. It exists.</p><p>The fact is that, as of today, the main instances mastodon.social and mastodon.online officially prohibit <a href="https://mastodon.online/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a>: <a href="https://techcrunch.com/2025/06/17/mastodon-updates-its-terms-to-prohibit-ai-model-training/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">techcrunch.com/2025/06/17/mast</span><span class="invisible">odon-updates-its-terms-to-prohibit-ai-model-training/</span></a></p><p>The problem with decentralisation: admins and users of other instances must become aware of the problem and change their terms, too.</p><p>It may be funny but it's no joke.</p><p><a href="https://mastodon.online/tags/gravy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>gravy</span></a></p>
Sozialwelten<p><a href="https://ifwo.eu/tags/Hinweis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Hinweis</span></a> (note) on the <a href="https://ifwo.eu/tags/Nutzbarkeit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Nutzbarkeit</span></a> (usability) of the <a href="https://ifwo.eu/tags/Data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Data</span></a> <a href="https://ifwo.eu/tags/Analytics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Analytics</span></a> / <a href="https://ifwo.eu/tags/Data" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Data</span></a> <a href="https://ifwo.eu/tags/Science" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Science</span></a> <a href="https://ifwo.eu/tags/Methode" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methode</span></a>n (methods) <a href="https://ifwo.eu/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a>, <a href="https://ifwo.eu/tags/Pattern" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Pattern</span></a> <a href="https://ifwo.eu/tags/Recognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Recognition</span></a>, <a href="https://ifwo.eu/tags/Machine" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Machine</span></a> <a href="https://ifwo.eu/tags/Learning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Learning</span></a> or <a href="https://ifwo.eu/tags/Text" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Text</span></a> <a href="https://ifwo.eu/tags/Mining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mining</span></a> for <a href="https://ifwo.eu/tags/soziologisch" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>soziologisch</span></a>e <a href="https://ifwo.eu/tags/Forschung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Forschung</span></a> (sociological research). </p><p><a href="https://ifwo.eu/tags/Sutter" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sutter</span></a> / <a href="https://ifwo.eu/tags/Maasen" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Maasen</span></a> - <a href="https://ifwo.eu/tags/Neuerfindung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Neuerfindung</span></a> <a href="https://ifwo.eu/tags/Soziologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Soziologie</span></a>, p. 76 f., 2020, DOI: 10.5771/9783845295008-73</p><p><a href="https://ifwo.eu/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://ifwo.eu/tags/ML" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ML</span></a> <a href="https://ifwo.eu/tags/TextMining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextMining</span></a> <a href="https://ifwo.eu/tags/Soziologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Soziologie</span></a> <a href="https://ifwo.eu/tags/BigData" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BigData</span></a> <a href="https://ifwo.eu/tags/Methodologie" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methodologie</span></a> <a href="https://ifwo.eu/tags/Methodik" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Methodik</span></a> <a href="https://ifwo.eu/tags/Sozialforschung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sozialforschung</span></a> <a href="https://ifwo.eu/tags/Sozialwissenschaft" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sozialwissenschaft</span></a></p>
Kevin Karhan :verified:<p><span class="h-card" translate="no"><a href="https://mastodon.social/@anirvan" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>anirvan</span></a></span> <span class="h-card" translate="no"><a href="https://mastodon.social/@404mediaco" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>404mediaco</span></a></span> the only way to deal with this is the same as with any other <a href="https://infosec.space/tags/malware" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>malware</span></a> and <a href="https://infosec.space/tags/DDoS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DDoS</span></a>:</p><ul><li>Block the entire <a href="https://infosec.space/tags/ASN" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASN</span></a>|s of <em>every single hoster that allows "<a href="https://infosec.space/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a>"</em>!</li></ul><p>I do maintain a <a href="https://infosec.space/tags/blocklist" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blocklist</span></a> of those and will happily accept suggestions and pull requests...</p><p><a href="https://github.com/greyhat-academy/lists.d/blob/main/scrapers.ipv4.block.list.tsv" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/greyhat-academy/lis</span><span class="invisible">ts.d/blob/main/scrapers.ipv4.block.list.tsv</span></a></p>
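As a rough illustration of the range-based blocking described in the post above, here is a minimal Python sketch that checks a client address against a list of blocked CIDR ranges. The ranges are placeholders taken from the reserved documentation blocks, not entries from the linked blocklist, and the helper names `load_networks` and `is_blocked` are hypothetical. Mapping an ASN to its announced prefixes is assumed to have happened elsewhere.

```python
import ipaddress

# Hypothetical blocklist entries. These are reserved documentation ranges
# (RFC 5737), NOT real scraper networks and NOT taken from the linked list.
BLOCKED_RANGES = ["192.0.2.0/24", "198.51.100.0/24"]


def load_networks(cidrs):
    """Parse CIDR strings into network objects once, up front."""
    return [ipaddress.ip_network(cidr) for cidr in cidrs]


def is_blocked(ip, networks):
    """Return True if the address falls inside any blocked network."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in networks)


networks = load_networks(BLOCKED_RANGES)
print(is_blocked("192.0.2.55", networks))
print(is_blocked("203.0.113.9", networks))
```

In practice this kind of check usually lives in the firewall or reverse proxy rather than in application code; the sketch only shows the membership test itself.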
stgreenie<p>My anti society<br>collision course<br>I charted to address<br>my many anxieties<br>is rapidly<br>approaching the end of the line</p><p>The impending<br>gravity induced crash<br>should be quite the event<br>given the acceleration<br>of my descent</p><p>The "I told you so"<br>chorus of the status quo<br>will be very pleased<br><a href="https://universeodon.com/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> up my bloody carcass<br>to mount on their Warning Wall</p><p>I sure would hate<br>to give these smug bastards<br>the confirmation validation <br>they so desperately need</p><p>Deceased me<br>playing the lead<br>in their future forewarning<br>history stories<br>to unborn rebellious <br>non conformist generations<br>about the folly<br>of living life brazenly<br>outside the rigid boundaries<br>constructed with bricks of bullshit</p><p>I guess it's time to confess<br>my internal trepidation</p><p>Those pointless<br>could and should ofs<br>as if somehow<br>we are the captains<br>of our destinies</p><p>For you see my soon to be<br>grieving comrades<br>the die was cast<br>for ill fated us<br>the day we were born</p><p>Playing the roles<br>fate precisely defines<br>scribed in the unwavering stars<br><a href="https://universeodon.com/tags/vss365" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>vss365</span></a></p>
René Voorburg<p>Vici.org is experiencing large-scale distributed scraping, with bots fraudulently posing as regular users. Presumably harvested data is being used for AI training.<br><a href="https://archaeo.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://archaeo.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://archaeo.social/tags/ethics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ethics</span></a></p>
@reiver ⊼ (Charles) :batman:<p>3/</p><p>For more on scraping (as in web-scraping) see here:<br><a href="https://mastodon.social/@reiver/114353728684249608" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">mastodon.social/@reiver/114353</span><span class="invisible">728684249608</span></a></p><p>CC: <span class="h-card" translate="no"><a href="https://mastodon.social/@404mediaco" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>404mediaco</span></a></span> </p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraping</span></a></p>
@reiver ⊼ (Charles) :batman:<p>2/</p><p>Scraping (as in Web Scraping) is the act of extracting data from HTML web-pages where the data is NOT machine-legible.</p><p>If the data, even in an HTML web-page, is in a machine-legible format, then it is NOT scraping.</p><p>...</p><p>And, getting data in JSON (key-value pairs) is definitely NOT scraping — as JSON's purpose is to communicate data in a machine-legible manner.</p><p>CC: <span class="h-card" translate="no"><a href="https://mastodon.social/@404mediaco" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>404mediaco</span></a></span> </p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraping</span></a></p>
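The distinction drawn in the post above can be sketched in Python. The sample page, the sample JSON body, and the names `PostScraper`, `scrape_posts`, and `read_posts_from_api` are all invented for illustration. The first path has to infer structure from presentational markup, which is scraping in the post's sense; the second simply parses data that was already published in a machine-legible format.

```python
import json
from html.parser import HTMLParser

# Hypothetical sample data, made up for this sketch.
HTML_PAGE = """
<html><body>
  <div class="post"><b>alice</b> said: <i>hello world</i></div>
  <div class="post"><b>bob</b> said: <i>good morning</i></div>
</body></html>
"""
JSON_RESPONSE = (
    '[{"author": "alice", "text": "hello world"},'
    ' {"author": "bob", "text": "good morning"}]'
)


class PostScraper(HTMLParser):
    """Recovers author/text pairs by guessing meaning from layout tags."""

    def __init__(self):
        super().__init__()
        self.posts = []
        self._field = None  # field that the current text node belongs to

    def handle_starttag(self, tag, attrs):
        if tag == "div" and ("class", "post") in attrs:
            self.posts.append({"author": "", "text": ""})
        elif tag == "b" and self.posts:
            self._field = "author"  # assumption: bold text is the author
        elif tag == "i" and self.posts:
            self._field = "text"    # assumption: italic text is the message

    def handle_endtag(self, tag):
        if tag in ("b", "i"):
            self._field = None

    def handle_data(self, data):
        if self._field and self.posts:
            self.posts[-1][self._field] += data


def scrape_posts(html):
    """Scraping: extract data from markup that was written for humans."""
    parser = PostScraper()
    parser.feed(html)
    parser.close()
    return parser.posts


def read_posts_from_api(body):
    """Not scraping: the data is already machine-legible JSON."""
    return json.loads(body)


print(scrape_posts(HTML_PAGE) == read_posts_from_api(JSON_RESPONSE))
```

The scraper breaks as soon as the page layout changes (for example, if the author moves from `<b>` to `<strong>`), while the JSON path depends only on the documented keys.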
@reiver ⊼ (Charles) :batman:<p>1/</p><p>If these researchers used a typical HTTP-based API that returns JSON, then —</p><p>What these researchers did is NOT scraping.</p><p>CC: <span class="h-card" translate="no"><a href="https://mastodon.social/@404mediaco" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>404mediaco</span></a></span></p><p>RE: <a href="https://www.404media.co/researchers-scrape-2-billion-discord-messages-and-publish-them-online/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">404media.co/researchers-scrape</span><span class="invisible">-2-billion-discord-messages-and-publish-them-online/</span></a></p><p><a href="https://mastodon.social/tags/Scraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraper</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/WebScraper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraper</span></a> <a href="https://mastodon.social/tags/WebScraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WebScraping</span></a></p>
Marcel SIneM(S)US<p><a href="https://social.tchncs.de/tags/Facebook" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Facebook</span></a>-<a href="https://social.tchncs.de/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a>: as of today, users can join the <a href="https://social.tchncs.de/tags/Datenschutz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Datenschutz</span></a> (data protection) lawsuit | heise online <a href="https://www.heise.de/news/Facebook-Scraping-Nutzer-koennen-sich-ab-heute-Datenschutz-Klage-anschliessen-10372426.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">heise.de/news/Facebook-Scrapin</span><span class="invisible">g-Nutzer-koennen-sich-ab-heute-Datenschutz-Klage-anschliessen-10372426.html</span></a> <a href="https://social.tchncs.de/tags/DSGVO" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DSGVO</span></a> <a href="https://social.tchncs.de/tags/GDPR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GDPR</span></a> <a href="https://social.tchncs.de/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a></p>
Kevin Karhan :verified:<p><span class="h-card" translate="no"><a href="https://chaos.social/@fx" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>fx</span></a></span> <span class="h-card" translate="no"><a href="https://chaos.social/@julialuna" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>julialuna</span></a></span> I think that this makes <a href="https://infosec.space/tags/Anubis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Anubis</span></a> really <a href="https://infosec.space/tags/ableist" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ableist</span></a> and bad for <a href="https://infosec.space/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> people cuz <a href="https://infosec.space/tags/JavaScript" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>JavaScript</span></a> won't work on <a href="https://infosec.space/tags/LynxBrowser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>LynxBrowser</span></a>.</p><ul><li>The better option would be to literally <a href="https://infosec.space/tags/block" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>block</span></a> all the <a href="https://infosec.space/tags/GAFAMs" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GAFAMs</span></a> and their <a href="https://infosec.space/tags/ASN" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASN</span></a>|s as well as any hoster allowing <a href="https://infosec.space/tags/bots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>bots</span></a> and <a href="https://infosec.space/tags/scrapers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scrapers</span></a>.</li></ul><p>Given how <a href="https://infosec.space/tags/IRC" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>IRC</span></a>, <a href="https://infosec.space/tags/Tor" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Tor</span></a> and <a href="https://infosec.space/tags/Mining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Mining</span></a> is a big no-no on most hosters, it stands to reason that it's trivial to force them to ban <em>"<a href="https://infosec.space/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a>"</em> and related <a href="https://infosec.space/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> workloads as well!</p><ul><li>There are better alternatives, especially on <a href="https://infosec.space/tags/OnionServices" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OnionServices</span></a>, to prevent and stall <a href="https://infosec.space/tags/DDoS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DDoS</span></a>|ing like several <a href="https://infosec.space/tags/OnionServices" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OnionServices</span></a> deployed presently...</li></ul>
Winbuzzer<p>German Court Orders Meta to Pay a Tiny 200€ Fine Over GDPR Non-Compliance</p><p><a href="https://mastodon.social/tags/Meta" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Meta</span></a> <a href="https://mastodon.social/tags/Facebook" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Facebook</span></a> <a href="https://mastodon.social/tags/GDPR" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>GDPR</span></a> <a href="https://mastodon.social/tags/DataPrivacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataPrivacy</span></a> <a href="https://mastodon.social/tags/DataBreach" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataBreach</span></a> <a href="https://mastodon.social/tags/Scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Scraping</span></a> <a href="https://mastodon.social/tags/Privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Privacy</span></a> <a href="https://mastodon.social/tags/Regulation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Regulation</span></a> <a href="https://mastodon.social/tags/EU" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EU</span></a> <a href="https://mastodon.social/tags/Germany" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Germany</span></a> <a href="https://mastodon.social/tags/OLGFrankfurt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OLGFrankfurt</span></a> <a href="https://mastodon.social/tags/Damages" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Damages</span></a> <a href="https://mastodon.social/tags/BigTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>BigTech</span></a> <a href="https://mastodon.social/tags/UserRights" class="mention hashtag" rel="nofollow noopener" 
target="_blank">#<span>UserRights</span></a> <a href="https://mastodon.social/tags/DMA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DMA</span></a> <a href="https://mastodon.social/tags/DataProtection" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DataProtection</span></a> </p><p><a href="https://winbuzzer.com/2025/04/27/german-court-orders-meta-to-pay-a-tiny-200e-fine-over-gdpr-non-compliance-xcxwbn/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">winbuzzer.com/2025/04/27/germa</span><span class="invisible">n-court-orders-meta-to-pay-a-tiny-200e-fine-over-gdpr-non-compliance-xcxwbn/</span></a></p>
Pyrzout :vm:<p>Wikipedia releases a dataset for training artificial intelligence <a href="https://blog.elhacker.net/2025/04/wikipedia-dataset-ia-stop-scraping.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.elhacker.net/2025/04/wiki</span><span class="invisible">pedia-dataset-ia-stop-scraping.html</span></a> <a href="https://social.skynetcloud.site/tags/inteligenciaartificial" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>inteligenciaartificial</span></a> <a href="https://social.skynetcloud.site/tags/wikipedia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>wikipedia</span></a> <a href="https://social.skynetcloud.site/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> <a href="https://social.skynetcloud.site/tags/robots" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>robots</span></a></p>
Bagolina<p><a href="https://sciences.social/tags/Wikip%C3%A9dia" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Wikipédia</span></a> opens structured access to its data for training <a href="https://sciences.social/tags/IA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>IA</span></a> (AI) models<br><a href="https://www.blogdumoderateur.com/wikipeda-ouvre-acces-structure-donnees-entrainer-modeles-ia" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">blogdumoderateur.com/wikipeda-</span><span class="invisible">ouvre-acces-structure-donnees-entrainer-modeles-ia</span></a><br>Heavy use of Wikipedia by <a href="https://sciences.social/tags/scraping" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>scraping</span></a> bots generates considerable traffic<br>The harvesting is often done without necessarily following good technical or ethical practice<br>65% of this resource-intensive traffic on our site comes from bots<br>Since January 2024, the bandwidth used for downloading content from its servers has increased by 50%.</p>
@reiver ⊼ (Charles) :batman:web scraper