Should publishers block or monetise AI crawlers?

Block or monetise is the wrong way to frame the choice, because it is not one decision but many, and for most publishers the answer is some of each. Blocking every AI crawler protects your content but forfeits the revenue and erases your brand from the AI answers that increasingly shape demand. Allowing everything for free is the default that costs publishers billions. The useful approach sits in between: decide bot by bot and page by page, based on what each crawler does with your content and whether you can earn from it. Blocking and monetising are not opposites. They are two settings on the same control.

The three things an AI crawler can be doing

To decide, first know what the crawler is for. There are three broad purposes, and they call for different responses.

Training crawlers gather your content in bulk to train a model. The value to you is indirect and deferred, and the case for charging or licensing is strongest here, because the content becomes part of a commercial product with no attribution and no traffic back.

Search and retrieval crawlers build an index that an assistant queries to find and cite sources. Allowing these can keep your content eligible to be cited in AI answers, which preserves brand presence even when clicks do not follow.

Live Search Agents fetch a specific page in real time to answer a live user question. This is the category closest to purchase intent and the one most worth monetising, because the read is happening at the moment a user is making a decision.

When blocking makes sense

Block when the crawler takes value you cannot recover and offers nothing in return. Low-value scrapers and bots that ignore your rules are the clearest case. Blocking is also right for content you never want ingested at all, for legal, contractual, or editorial reasons. And selectively blocking training crawlers while allowing retrieval crawlers is a legitimate strategy if you want to be cited in AI search without contributing to model training. The major AI companies increasingly separate these, so you can allow one and disallow the other.

When monetising makes sense

Monetise when the traffic represents a real, high-intent audience you can earn from. Live Search Agent retrievals are the prime case: the visit is happening anyway, the user is researching a decision, and content-layer advertising can turn the read into same-day revenue without affecting human readers. Blocking that traffic instead of monetising it forfeits both the income and a presence in the answer the user receives. For most publishers, the largest single category of AI traffic is exactly this kind, which is why monetising rather than blocking is increasingly the default for it.

The cost of blocking everything

Blanket blocking feels safe but has two hidden costs. It removes your content from the sources AI assistants read, so your brand stops appearing in AI answers while competitors who allow access keep showing up. And it converts a growing, monetisable audience into nothing. As AI assistants become a primary way people find information, a publisher invisible to them is invisible to a rising share of demand. The defensive instinct to block is understandable, but applied to everything it trades a manageable problem for a structural one.

A practical decision framework

Begin with visibility, because you cannot make good decisions about traffic you cannot see. Classify your AI traffic at the CDN edge by type and owner. Then set policy by category. Block scrapers and rule-breakers. Decide on training crawlers based on whether you want a licensing conversation or simply to keep content out of models. Allow retrieval crawlers to preserve citation eligibility. Monetise Live Search retrievals on your commercial, high-intent pages, while keeping sensitive editorial content in an observe-only state. With edge-level control, these are per-category and per-page settings, not an all-or-nothing switch. blankspace provides this as observe-and-monetise control, so you can watch traffic first and turn on monetisation page by page when you are ready.

Frequently asked questions

Is it bad for SEO or AI visibility to block AI crawlers?

Blocking retrieval and search crawlers can remove your content from AI answers, reducing your visibility in the tools people increasingly use to find information. Blocking training crawlers has less effect on citation, which is why many publishers block training but allow retrieval.

Can I block some AI crawlers and allow others?

Yes. The major AI companies increasingly use separate identifiers for training and for retrieval, so you can allow a search or live agent while disallowing the training crawler. This is the basis of a selective strategy.

Does monetising AI crawlers mean letting them take my content?

It means earning from the read that is already happening. Content-layer advertising inserts a paid brand mention into the content an agent retrieves, so you are compensated for the access rather than giving it away for free.

What is the risk of doing nothing?

Doing nothing is the same as allowing everything for free, which is why publishers collectively lose billions to uncompensated AI traffic. The traffic grows whether or not you act, so inaction is a choice to forgo the revenue.

How do I start without committing to a full policy?

Begin in observe mode. Classify your AI traffic at the edge without changing anything, see the volume and mix, and then set block or monetise policy per category and per page once you understand what you have.