Technical SEO Cheat Sheet

Blocked by robots.txt

One line of text is hiding your client's site from Google. Search spiders see it, but are ordered to walk away.

Where to find it: Indexing > Pages > 'Blocked by robots.txt'

What It Is

This status means your robots.txt file contains a Disallow rule that tells Googlebot not to crawl these URLs. Google knows the pages exist but follows your instruction and does not fetch or parse them. Because it cannot read the content, blocked pages generally cannot rank for it, although a blocked URL can still appear in results without a description if other pages link to it.
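To make the mechanism concrete, here is a minimal sketch using Python's standard-library urllib.robotparser. The rule, the example.com URLs, and the helper name is_blocked are illustrative assumptions, not taken from any real site. This parser does plain prefix matching and does not implement Google's * and $ wildcard extensions, so treat it as an approximation of Googlebot's behavior.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content used only for illustration.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def is_blocked(robots_txt: str, url: str, user_agent: str = "Googlebot") -> bool:
    """Return True if the given rules forbid user_agent from crawling url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for url in (
        "https://example.com/private/report.html",
        "https://example.com/blog/",
    ):
        print(url, "blocked" if is_blocked(ROBOTS_TXT, url) else "crawlable")
```

The first URL sits under /private/ and is reported as blocked; the second matches no Disallow rule and stays crawlable.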

Why It Matters

This is one of the most damaging issues to encounter. A single faulty line can silently drop entire sections of a site from search results within weeks. It most often happens when restriction rules from a staging or test environment are carried straight into production.

Root Diagnostics

5 Common Root Causes

Mismatched wildcards or staging remnants typically lie behind unintended crawl blocks.

01

Staging Remnants

The test environment's Disallow: / rule is accidentally carried over to production at launch.

02

Overly Broad Wildcards

Wildcard patterns written to block dashboard or admin folders inadvertently match public URLs as well.

03

Outdated Legacy Rules

Directives written for an earlier site architecture remain active because they were never cleaned up.

04

Aggressive Plugin Rules

Security plugins, CDNs, or local firewalls generate highly restrictive rules automatically, without the developer's knowledge.

05

Asset-Level Blocking

CSS, JavaScript, or other asset files are disallowed, which prevents Google from rendering and analyzing pages properly.
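The staging, wildcard, and asset causes above can be checked with a small script. What follows is a rough sketch, not a full robots.txt implementation: it reads only the User-agent: * group, ignores Allow overrides, and approximates Google's documented * and $ matching with a regular expression. The function names, sample rules, and paths are all hypothetical.

```python
import re

ASSET_EXTENSIONS = (".css", ".js")


def rule_matches(pattern: str, path: str) -> bool:
    """Approximate Google-style matching: '*' is any run of characters, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, path) is not None


def disallow_rules(robots_txt: str, agent: str = "*") -> list[str]:
    """Collect non-empty Disallow patterns from groups that name the given agent."""
    rules, agents, in_rules = [], [], False
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:  # a User-agent line after rules starts a new group
                agents, in_rules = [], False
            agents.append(value.lower())
        elif field in ("allow", "disallow"):
            in_rules = True
            if field == "disallow" and value and agent in agents:
                rules.append(value)
    return rules


def audit(robots_txt: str, public_paths: list[str]) -> list[str]:
    """Flag a site-wide block, plus any public page or asset path hit by a Disallow rule."""
    findings = []
    for rule in disallow_rules(robots_txt):
        if rule == "/":
            findings.append("site-wide block: Disallow: /")
            continue
        for path in public_paths:
            if rule_matches(rule, path):
                kind = "asset" if path.endswith(ASSET_EXTENSIONS) else "public page"
                findings.append(f"{kind} blocked: {path} (by Disallow: {rule})")
    return findings


# Hypothetical rules and paths for illustration only.
SAMPLE = """\
User-agent: *
Disallow: /*admin
Disallow: /static/
"""
PUBLIC = ["/blog/admin-tips", "/static/site.css", "/pricing"]

if __name__ == "__main__":
    for finding in audit(SAMPLE, PUBLIC):
        print(finding)
```

In this sample, the /*admin pattern meant for an admin area also catches /blog/admin-tips, and the /static/ rule blocks a stylesheet Google would need for rendering. A narrower rule such as Disallow: /admin/ would avoid the first finding.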

Interactive Standard Operating Procedure

The Fix Blueprint (Interactive SOP)

Tools

  • Search Console robots.txt report
    Free | Settings > robots.txt
  • Google Documentation
    Free | Official guidance on crawling and indexing
  • Screaming Frog SEO Spider
    Free tier | Simulate a full site crawl with current rules active

Time to Fix

15 min to diagnose root obstacles
10 min to resolve problematic rules

Pro Tip

Always validate changes with a robots.txt testing tool BEFORE updating production environments.

A single misplaced wildcard, slash, or stray space can block your entire site from being crawled as soon as Google picks up the new file. Keep rules narrow, explicit, and scoped to the paths you actually mean to block!
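As a local complement to that check, a short script can compare the live file with a proposed one and list URLs that would become blocked. This is a minimal sketch assuming Python's standard-library urllib.robotparser, which uses prefix matching only and does not evaluate Google's * and $ wildcards, so wildcard rules still need a wildcard-aware tester. The function name newly_blocked, both rule sets, and the example.com URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser


def _load(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a parser without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def newly_blocked(current: str, proposed: str, urls: list[str],
                  user_agent: str = "Googlebot") -> list[str]:
    """Return URLs that are crawlable under the current rules but blocked by the proposed ones."""
    before, after = _load(current), _load(proposed)
    return [
        url for url in urls
        if before.can_fetch(user_agent, url) and not after.can_fetch(user_agent, url)
    ]


# Hypothetical example: the proposed rule was truncated from /cart/ to /c.
CURRENT = "User-agent: *\nDisallow: /cart/\n"
PROPOSED = "User-agent: *\nDisallow: /c\n"
MUST_STAY_CRAWLABLE = [
    "https://example.com/contact",
    "https://example.com/cart/checkout",
    "https://example.com/pricing",
]

if __name__ == "__main__":
    for url in newly_blocked(CURRENT, PROPOSED, MUST_STAY_CRAWLABLE):
        print("would become blocked:", url)
```

Here the truncated rule /c also matches /contact, so that URL is reported, while /cart/checkout was already blocked and /pricing is unaffected.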