Add

Lorem ipsum

Lorem ipsum

A
A
A
SEO / GEO

0 views

5 min

How to Optimize Your robots.txt File to Block Crawling of Unnecessary Pages in Webflow

Learn how to set up an advanced robots.txt file in Webflow to block crawlers from useless pages and focus Google’s crawl budget on your strategic pages.

The robots.txt file tells crawlers which pages on your site they can and can’t visit. Set it up properly, and it focuses Google’s crawl budget on your strategic pages while speeding up the indexing of important content. Sandro, cofounder of Gemeos Agency, a Webflow agency, walks you through the essentials you need to know.

Prerequisites

  • A Webflow site published on a custom domain
  • Access to Project Settings > SEO in Webflow
  • Google Search Console to check the impact

1. Open robots.txt in Webflow

In Webflow, go to Project Settings > SEO > Robots.txt. Webflow generates a default robots.txt file that allows everything. You can edit it directly in this interface. Changes take effect the next time you publish the site.

2. Understand the basic syntax

# Rules for all robots
User-agent: *
Disallow: /admin/
Disallow: /thank-you
Disallow: /404

# Explicitly allow a blocked section
Allow: /admin/public/

# Point to your XML sitemap
Sitemap: https://votresite.com/sitemap.xml

3. Identify the pages to block

On a typical Webflow site, these pages are worth blocking to save crawl budget:

  • Confirmation and thank-you pages (/thank-you, /thank-you, /confirmation)
  • Utility pages with no SEO value (/404, /search)
  • Preview or staging pages if they’re publicly accessible
  • Privacy policy and terms pages if they don’t have SEO value

good to know

Blocking a page in robots.txt doesn’t stop it from being indexed if Google finds it through an external link. To make sure a page isn’t indexed, combine robots.txt Disallow with a noindex meta tag. robots.txt alone isn’t enough to de-index a page Google already knows about.

4. Optimized robots.txt template for Webflow

User-agent: *
# Utility pages with no SEO value
Disallow: /thank-you
Disallow: /thank-you
Disallow: /confirmation
Disallow: /404

# Block AI crawlers if needed
User-agent: GPTBot
Disallow: /

User-agent: ClaudeBot
Disallow: /

User-agent: PerplexityBot
Disallow: /

Sitemap: https://votresite.com/sitemap.xml

5. Check in Search Console

In Search Console, the "URL Inspection" tool shows whether a page is blocked by robots.txt. The "Coverage" section also lists pages excluded by robots.txt. Make sure no strategic page is blocked by mistake.

Page typerobots.txtMeta robotsReason
Thank-you pageDisallownoindexDouble protection
404 pageDisallownoindexNo SEO value
Service pagesAllowindexStrategic pages
Blog articlesAllowindexMain SEO content
Policy pagesDisallow or AllownoindexDepends on strategy

Conclusion

A well-configured robots.txt is a sign of technical quality sent to Google. It doesn’t replace other SEO optimizations, but it does support a controlled indexing strategy.

  • Use case 1: an e-commerce site with hundreds of filter pages that don’t need to be crawled
  • Use case 2: an agency that wants to block AI crawlers from its premium content
  • Use case 3: a site with limited crawl budget that wants to prioritize its conversion pages

Good to know

Heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse varius enim in eros elementum tristique. Duis cursus, mi quis viverra ornare, eros dolor interdum nulla, ut commodo diam libero vitae erat. Aenean faucibus nibh et justo cursus id rutrum lorem imperdiet. Nunc ut sem vitae risus tristique posuere.

Lorem ipsum dolor sit amet consectetur.

Lorem ipsum dolor sit amet consectetur. Aliquam orci sagittis dignissim sapien praesent donec.

Lorem ipsum

Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla pariatur.

Published on

You might be interested in these tutorials

Similar tutorials

SEO / GEO

5 min read

5 views

How to Set Up a Redirect in Webflow? (2026)

Updated on 19.12.2025 by Sandro DA SILVA

SEO / GEO

5 min read

5 views

Add structured data to your Webflow site?

Updated on 21.08.2025 by Sandro DA SILVA

No-code

5 min read

5 views

How to Obfuscate a Link in Webflow

Updated on 23.04.2025 by Sandro DA SILVA

Let’s f*****G GO !!

Ready to launch
Your business?

Alexandre

Max

Enora

Bryan

Cannelle

Tiphaine

You'll :heart: our collaboration...