About this tool
Generate a robots.txt file to control how search engines crawl and index your website. Essential for SEO, preventing duplicate content issues, protecting private pages, managing crawl budget, and communicating with search engine bots about which pages to index.
The robots.txt file sits at your domain root (yoursite.com/robots.txt) and tells search engines like Google, Bing, and others which pages they can and cannot access. Proper configuration prevents indexing of admin pages, duplicate content, and resource-intensive pages.
Perfect for SEO specialists, web developers, site owners, and digital marketers who need to control search engine behavior. This generator creates properly formatted robots.txt files with common rules, sitemap declarations, and crawl-delay directives.
Used by websites of all sizes to manage search engine crawling, block sensitive pages (login, admin, checkout), prevent indexing of staging sites, and declare XML sitemap locations for better discoverability.
Practical Usage Examples
Basic site protection
Block admin and private sections from all search engines.
User-agent: *
Disallow: /admin/
Disallow: /private/
Disallow: /wp-admin/
Sitemap: https://site.com/sitemap.xml

E-commerce site
Allow products but block checkout and cart pages.
User-agent: *
Allow: /products/
Disallow: /cart/
Disallow: /checkout/
Disallow: /my-account/

Block specific bots
Allow Google but block aggressive scrapers.
User-agent: Googlebot
Allow: /
User-agent: BadBot
Disallow: /
Crawl-delay: 10

Step-by-Step Instructions
Enter the user agent — use * to target ALL search engines, or type a specific bot name (e.g., Googlebot, Bingbot).
In "Disallow Paths", enter one directory or URL path per line that you want to block from crawlers (e.g., /admin/).
In "Allow Paths", enter paths within a blocked directory that you want to explicitly permit (overrides Disallow).
Optionally enter a Crawl Delay in seconds — use only if your server struggles with crawler traffic.
Enter your Sitemap URL so crawlers can immediately discover all your site pages.
Click Run — copy the generated robots.txt and upload it to your website root at yoursite.com/robots.txt.
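For readers who want to see how these inputs fit together, here is a minimal Python sketch (an illustration, not the tool's actual implementation) that assembles a robots.txt file from the same fields the generator collects:

```python
def build_robots_txt(user_agent="*", disallow=(), allow=(),
                     crawl_delay=None, sitemap=None):
    """Assemble a robots.txt string from the generator's inputs.

    Allow lines are emitted before Disallow lines so that more
    specific Allow exceptions are not shadowed by order-sensitive
    parsers; the Sitemap line goes last by convention.
    """
    lines = [f"User-agent: {user_agent}"]
    lines += [f"Allow: {path}" for path in allow]
    lines += [f"Disallow: {path}" for path in disallow]
    if crawl_delay is not None:
        lines.append(f"Crawl-delay: {crawl_delay}")
    if sitemap:
        lines.append(f"Sitemap: {sitemap}")
    return "\n".join(lines) + "\n"


print(build_robots_txt(disallow=["/admin/", "/private/"],
                       sitemap="https://yoursite.com/sitemap.xml"))
# User-agent: *
# Disallow: /admin/
# Disallow: /private/
# Sitemap: https://yoursite.com/sitemap.xml
```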
Core Benefits
Prevents search engines from indexing sensitive pages (admin, login, checkout)
Controls crawl budget by blocking low-value pages
Declares sitemap location for better search engine discovery
Blocks aggressive bots and scrapers
Prevents duplicate content issues
Properly formatted and validated syntax
Essential for every website's SEO strategy
Frequently Asked Questions
What is robots.txt and why do I need one?
Robots.txt is a plain text file at your website root (yoursite.com/robots.txt) that tells search engine crawlers which pages to access or skip. It's essential for controlling crawl budget on large sites, blocking admin/private pages from appearing in search results, preventing duplicate content from being indexed, and guiding crawlers to your sitemap for complete page discovery.
Where does the robots.txt file go?
Always in the root directory of your website, accessible at the exact URL https://yourdomain.com/robots.txt. It cannot be in subdirectories — /blog/robots.txt, /public/robots.txt, etc. will not work. Most hosting control panels let you access your root directory via FTP, cPanel File Manager, or CMS-specific settings (WordPress: upload to /public_html/).
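Because the file must live at the root, its URL can be derived mechanically from any page URL by discarding everything after the host. A small Python sketch (the helper name robots_url is hypothetical):

```python
from urllib.parse import urlsplit, urlunsplit

def robots_url(page_url: str) -> str:
    """Return the only valid robots.txt location for the given page's host."""
    parts = urlsplit(page_url)
    # Keep scheme and host; replace path/query/fragment with /robots.txt.
    return urlunsplit((parts.scheme, parts.netloc, "/robots.txt", "", ""))


print(robots_url("https://yourdomain.com/blog/post?id=1"))
# https://yourdomain.com/robots.txt
```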
What does the asterisk (*) in User-agent mean?
The asterisk (*) is a wildcard meaning "all bots" — rules under User-agent: * apply to every search engine and crawler that respects robots.txt. To target a specific bot, use its exact name: User-agent: Googlebot, User-agent: Bingbot, User-agent: GPTBot, etc. Bot-specific rules take precedence over wildcard rules for that specific bot.
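You can observe this precedence with Python's standard urllib.robotparser, using a hypothetical two-group file:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: the wildcard group blocks /private/, but
# Googlebot gets its own group that only blocks /tmp/.
ROBOTS = """\
User-agent: *
Disallow: /private/

User-agent: Googlebot
Disallow: /tmp/
"""

rp = RobotFileParser()
rp.parse(ROBOTS.splitlines())

# Googlebot matches its own group, so the wildcard rules don't apply to it.
print(rp.can_fetch("Googlebot", "https://example.com/private/page"))  # True
print(rp.can_fetch("Googlebot", "https://example.com/tmp/file"))      # False
# Any other bot falls back to the wildcard group.
print(rp.can_fetch("Bingbot", "https://example.com/private/page"))    # False
```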
Does Disallow guarantee a page won't appear in Google?
Not necessarily. Disallow in robots.txt prevents Google from crawling the page, but the page can still appear in search results if other sites link to it. Google can infer a page exists from links without crawling it. For complete removal from search results, use a "noindex" meta tag on the page OR the URL removal tool in Google Search Console.
What's the difference between Disallow and noindex?
Disallow in robots.txt prevents crawling (bots skip the page entirely). Noindex is a tag on the page itself that tells bots "don't add this to your index." Critical: if you Disallow a URL, Google cannot read the noindex tag on that page. Use noindex for pages you want deindexed but crawled; use Disallow only to save crawl budget on pages you don't care about at all.
Can I block specific bots, such as AI crawlers?
Yes. Add a separate rule group with the specific bot name: User-agent: Bingbot / Disallow: /private/. To block OpenAI's training crawler: User-agent: GPTBot / Disallow: /. To block Common Crawl (used for AI datasets): User-agent: CCBot / Disallow: /. Well-behaved bots from major companies follow these rules; malicious scrapers typically ignore them.
How do I declare my sitemap in robots.txt?
Add this line anywhere in your robots.txt file (conventionally at the end): Sitemap: https://yoursite.com/sitemap.xml. Use the full absolute URL including https://. You can add multiple Sitemap: lines for different site sections (Sitemap: https://example.com/news-sitemap.xml). This is recommended as it helps all search engines discover your pages, independent of which user-agent section they read.
Does Google respect Crawl-delay?
Google has officially documented that Googlebot ignores the Crawl-delay directive in robots.txt; Googlebot adjusts its crawl rate automatically based on how your server responds, and the old Search Console crawl-rate limiter has been retired. Other bots (Bing, Yandex) do respect Crawl-delay. Setting it to 2-5 seconds helps with bandwidth-limited servers for non-Google crawlers.
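Python's urllib.robotparser also reads Crawl-delay, which is useful if you are writing a polite crawler of your own. A small sketch with a hypothetical file:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical file: a 10-second delay for Bingbot, no delay for others.
ROBOTS = """\
User-agent: Bingbot
Crawl-delay: 10
Disallow: /search/

User-agent: *
Disallow:
"""

rp = RobotFileParser()
rp.parse(ROBOTS.splitlines())

# A polite crawler would sleep for this many seconds between requests.
print(rp.crawl_delay("Bingbot"))        # 10
print(rp.crawl_delay("SomeOtherBot"))   # None (no delay declared)
```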
How do I test my robots.txt file?
Check your robots.txt in Google Search Console: the robots.txt report under Settings shows whether Google can fetch the file and flags any parse errors. Also visit your URL directly (yoursite.com/robots.txt) to verify it's publicly accessible and has correct syntax. Bing Webmaster Tools offers a similar robots.txt tester. For quick validation, tools like ryte.com/free-tools/robots-txt can analyze your file for syntax errors.
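Before uploading, you can also sanity-check your rules locally with Python's standard urllib.robotparser (paste your generated file into the ROBOTS string below). One caveat: Python applies rules in file order (first match wins), while Google prefers the most specific rule, so listing Allow exceptions before the broader Disallow gives the same result in both:

```python
from urllib.robotparser import RobotFileParser

# Paste your generated robots.txt here to test it before uploading.
ROBOTS = """\
User-agent: *
Allow: /admin/public/
Disallow: /admin/
"""

rp = RobotFileParser()
rp.parse(ROBOTS.splitlines())

print(rp.can_fetch("*", "https://yoursite.com/admin/secret"))       # False
print(rp.can_fetch("*", "https://yoursite.com/admin/public/page"))  # True
print(rp.can_fetch("*", "https://yoursite.com/blog/post"))          # True
```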
Should I block CSS and JavaScript files?
No — never block CSS, JavaScript, or image directories in robots.txt. Google needs to access and render these files to understand your page layout, design, and functionality. Blocking them prevents Google from properly understanding your pages, which can hurt rankings. Only block directories you truly don't want crawled: admin panels, private user areas, duplicate content paths.