8th World
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 9 months ago

How to block AI Crawler Bots using robots.txt file

www.cyberciti.biz

external-link
message-square
63
fedilink
80
external-link

How to block AI Crawler Bots using robots.txt file

www.cyberciti.biz

Cynicus Rex@lemmy.ml to Privacy@lemmy.mlEnglish · 9 months ago
message-square
63
fedilink
Just a moment...
www.cyberciti.biz
external-link
  • asudox@lemmy.world
    link
    fedilink
    arrow-up
    6
    arrow-down
    1
    ·
    9 months ago

    Not sure if that is effective at all. Why would a crawler check the robots.txt if it’s programmed to ignore it anyways?

    • ɐɥO@lemmy.ohaa.xyz
      link
      fedilink
      arrow-up
      16
      ·
      9 months ago

      cause many crawlers seem to explicitly crawl “forbidden” sites

    • Crashumbc@lemmy.world
      link
      fedilink
      English
      arrow-up
      3
      ·
      9 months ago

      Google and script kiddies copying code…

    • MangoPenguin@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      1
      ·
      9 months ago

      You could also place the same page as a hidden link on your home page.

Privacy@lemmy.ml

privacy@lemmy.ml

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !privacy@lemmy.ml

A place to discuss privacy and freedom in the digital world.

Privacy has become a very important issue in modern society, with companies and governments constantly abusing their power, more and more people are waking up to the importance of digital privacy.

In this community everyone is welcome to post links and discuss topics related to privacy.

Some Rules

  • Posting a link to a website containing tracking isn’t great, if contents of the website are behind a paywall maybe copy them into the post
  • Don’t promote proprietary software
  • Try to keep things on topic
  • If you have a question, please try searching for previous discussions, maybe it has already been answered
  • Reposts are fine, but should have at least a couple of weeks in between so that the post can reach a new audience
  • Be nice :)

Related communities

  • Lemmy.ml libre_culture
  • Lemmy.ml privatelife
  • Lemmy.ml DeGoogle
  • Lemmy.ca privacy

much thanks to @gary_host_laptop for the logo design :)

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 517 users / day
  • 3.37K users / week
  • 6.82K users / month
  • 16.4K users / 6 months
  • 1 local subscriber
  • 37.7K subscribers
  • 3.44K Posts
  • 93.6K Comments
  • Modlog
  • mods:
  • k_o_t@lemmy.ml
  • tmpod@lemmy.pt
  • ranok@sopuli.xyz
  • Yayannick@lemmy.ml
  • BE: 0.19.10
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org