noindex

Q: What does noindex mean?

Noindex is a directive that tells search engines not to include a page or resource in search results. It can be applied through a meta robots tag or an X-Robots-Tag HTTP header.

Q: Does noindex stop Google from crawling a page?

No. Noindex controls indexing, not crawling. Google still needs to crawl the page to see the noindex directive. If the page is blocked by robots.txt , Google may not detect the noindex instruction.

Q: Where should a noindex tag be placed?

For HTML pages, the noindex tag should be placed inside the section of the page. For non-HTML files, noindex can be applied through the HTTP response header using X-Robots-Tag .

Q: Can noindex be used on PDFs?

Yes. For PDFs and other non-HTML resources, noindex is usually applied through the X-Robots-Tag HTTP header instead of a meta tag.

Q: What is the difference between noindex and robots.txt?

Noindex tells search engines not to show a page in search results. robots.txt tells crawlers which URLs or paths they should not crawl. They should not be treated as the same control. Google also states that specifying noindex inside robots.txt is not supported.

Q: Should noindex and canonical tags be used together?

Usually, they should not be combined unless there is a specific reason. A canonical tag suggests the preferred version of similar content, while noindex tells search engines not to index the page. For duplicate consolidation, use canonical. For removing a page from search results, use noindex.

Q: How long does it take for noindex to remove a page from Google?

It depends on when Google crawls the page again. If the page is still appearing in results after noindex is added, Google may not have recrawled it yet, or the directive may be blocked or incorrectly implemented. Google recommends using URL Inspection and the Page Indexing report in Search Console to debug noindex issues.

Q: Is noindex a security measure?

No. Noindex should not be used to protect private or sensitive information. The page can still be accessed directly if someone has the URL. Sensitive content should be protected through authentication, permissions, or proper access control.

Controlling What Search Engines Should Not Index

SEOWebsiteTechnical

Author: Steven Hsu
Published: 23/05/2026
Updated: 23/05/2026

Noindex is a search engine directive that tells crawlers not to include a page or resource in search results.

It is one of the most important tools in technical SEO because not every accessible URL deserves to be indexed. Some pages need to exist for users, systems, campaigns, testing, filtering, or account flows, but they should not appear as search results.

noindex does not hide a page from the internet. It tells search engines that the page should not be stored and shown in their index.

What Is `noindex`?

noindex is a directive used to prevent a page from appearing in search engine results.

It can be added as a meta robots tag in the HTML of a page or as an HTTP X-Robots-Tag header. Google supports both methods. The meta tag is usually used for HTML pages, while the HTTP header is useful for non-HTML files such as PDFs or other resources. Google also states that a page must be crawlable for the noindex directive to be seen; if the URL is blocked by robots.txt, Google may not be able to detect the noindex instruction.

A common meta robots implementation looks like this:

<meta name="robots" content="noindex">

This tells search engines that support the directive not to index the page.

What `noindex` Does

noindex controls indexation. It does not control access, security, crawling permission, or user visibility.

When a search engine crawls a page and sees a valid noindex directive, it should remove that page from the index or avoid adding it in the first place. The page may still be accessible to users, linked internally, visited directly, tracked in analytics, or used as part of a website flow.

For example, a thank-you page after a form submission may need to exist so analytics can record a conversion. But it usually should not rank in search results because users should only reach it after completing the form.

`noindex` vs `robots.txt`

noindex and robots.txt are often confused, but they solve different problems.

Control	Main Purpose	What It Does	What It Does Not Do
`noindex`	Index control	Tells search engines not to show a page in search results	Does not prevent crawling by itself
`robots.txt`	Crawl control	Tells crawlers which URLs or paths they should not crawl	Does not reliably remove URLs from the index
Password protection	Access control	Prevents unauthorized users and crawlers from accessing content	Requires authentication setup
Canonical tag	Duplication signal	Suggests the preferred version of similar content	Does not guarantee non-indexation

This distinction matters. If a page is blocked in robots.txt, search engines may not be able to crawl the page and see the noindex tag. For noindex to work reliably, the crawler needs access to the page or resource where the directive is present.

When to Use `noindex`

noindex should be used when a page has a valid purpose on the site but should not appear as an organic search result.

Common examples include internal search result pages, filtered or faceted pages with weak search value, staging or temporary pages, thank-you pages, login-related pages, account pages, duplicate utility pages, campaign pages that are not meant for organic discovery, and thin system-generated pages.

The goal is not to hide poor work. The goal is to keep the search index focused on pages that are useful, intentional, and appropriate for search users.

When Not to Use `noindex`

noindex should not be used as a shortcut for every SEO problem.

Do not use noindex on important pages just because they are not ranking well. If a page is strategically important, improve the page instead. Strengthen the content, search intent alignment, internal links, structured data, page experience, and technical foundation.

Do not use noindex when you actually need to protect private information. Noindex is not security. A noindexed URL can still be visited directly if someone has the link.

Do not use noindex when a canonical tag is the better solution. If two pages are similar and one should consolidate signals into another, a canonical tag may be more appropriate. If a page should not appear in search results at all, noindex is the clearer instruction.

How to Implement `noindex`

The most common method is the meta robots tag:

<meta name="robots" content="noindex">

This should be placed in the <head> section of the HTML page.

For non-HTML resources or server-level control, noindex can also be sent through the HTTP response header:

X-Robots-Tag: noindex

This is useful when you need to prevent indexing of files such as PDFs or when the directive is easier to manage at the server or application layer. Google documents both the meta robots tag and the X-Robots-Tag header as valid ways to apply noindex.

You can also target Google specifically:

<meta name="googlebot" content="noindex">

However, in most normal SEO cases, using robots is cleaner because it applies broadly to search engines that support the directive.

`noindex` and `nofollow`

noindex and nofollow can be combined, but they should not be treated as the same thing.

<meta name="robots" content="noindex, nofollow">

noindex tells search engines not to index the page. nofollow tells crawlers not to follow the links on the page. Google’s documentation shows that noindex can be combined with other robots directives, including nofollow.

In many cases, noindex, follow or simply noindex is more useful because the page can still pass crawl paths through its links. For example, a filtered listing page may not need to be indexed, but its links may still help crawlers discover useful products, articles, or category pages.

Use nofollow only when there is a clear reason to discourage link following.

`noindex` and Canonical Tags

noindex and canonical tags should be used carefully together.

A canonical tag tells search engines which version of a similar page should be treated as the preferred URL. Noindex tells search engines not to index the page. If both are used on the same page, the signals can become less clean because one instruction says “do not index this page,” while the other says “treat another page as the preferred version.”

For most cases:

Situation	Better Choice
Duplicate or near-duplicate page should consolidate into another URL	Canonical tag
Page should not appear in search results at all	Noindex
Page is private or sensitive	Authentication, not noindex
Page should not be crawled	Robots.txt or access control, depending on the case

The decision should be based on the page’s role, not just whether the URL feels “unimportant.”

How `noindex` Affects SEO

noindex can improve SEO hygiene when it is used correctly.

It helps keep low-value, duplicate, temporary, or utility pages out of search results. This can make the indexed version of a site cleaner and easier to understand. It also reduces the risk of search users landing on pages that were never designed as entry points.

However, noindex can damage SEO if it is applied to important pages by mistake. A misplaced noindex tag on a homepage, category page, article, service page, or product page can remove valuable content from search results.

That is why noindex should be treated as a technical control, not a casual content setting.

How to Check If a Page Has Noindex

A page can be checked in several ways.

You can inspect the page source and look for:

<meta name="robots" content="noindex">

You can also check HTTP headers for:

X-Robots-Tag: noindex

In Google Search Console, the URL Inspection tool can show whether Google detected a noindex directive. Google also recommends checking the Page Indexing report to monitor pages where Google extracted a noindex rule.

For technical audits, it is also common to crawl the site with SEO crawling tools and filter URLs by indexability status.

Common noindex Mistakes

Blocking a URL in robots.txt and expecting Google to see the noindex tag.
Adding noindex to important pages by mistake.
Using noindex as a security method.
Confusing noindex with canonical tags.
Applying noindex globally through a CMS, theme, plugin, or staging setting.
Forgetting to remove noindex after a website launch.
Noindexing pages that still need to build organic visibility.
Assuming noindex will remove a page from search results immediately.

Best Practices for `noindex`

Noindex works best when it is managed intentionally, documented clearly, and checked before major site changes.

Use `noindex` Only for Clear Page Types

Noindex should be applied to page types with a clear reason for being excluded from search.

Examples include internal search results, thank-you pages, test pages, staging pages, account pages, or thin system-generated pages. Avoid deciding page by page without a consistent rule, because this increases the risk of mistakes.

Keep Important Pages Indexable

Any page that is meant to attract organic search traffic should remain indexable unless there is a strong technical or strategic reason to exclude it.

This includes core service pages, articles, category pages, product pages, location pages, and other search-focused content.

Do Not Block Noindexed Pages in `robots.txt`

If search engines cannot crawl the page, they may not see the noindex directive. For noindex to work properly, the crawler needs to access the page and read the instruction.

Audit `noindex` Before and After Launches

Website launches, migrations, redesigns, CMS changes, and staging-to-production moves are common moments where noindex problems happen.

Before launch, check that staging pages are noindexed or protected. After launch, check that production pages are indexable.

Document Why `noindex` Was Used

noindex decisions should not be invisible.

For important templates or page types, document why they are noindexed, where the directive is applied, and who owns the decision. This helps prevent future teams from removing or adding noindex without understanding the impact.

Conclusion

noindex is a precise SEO control for keeping specific pages or resources out of search results.

Used correctly, it helps maintain a cleaner search index and prevents utility, duplicate, temporary, or low-value pages from appearing where they do not belong. Used carelessly, it can remove important pages from organic visibility and create serious indexing problems.

Simple Rule

The simplest rule is this: use noindex when a page needs to exist, but should not be found through search.

Frequently Asked Questions

noindex

What does noindex mean?

Does noindex stop Google from crawling a page?

Where should a noindex tag be placed?

Can noindex be used on PDFs?

What is the difference between noindex and robots.txt?

Should noindex and canonical tags be used together?

How long does it take for noindex to remove a page from Google?

Is noindex a security measure?

noindex

What Is noindex?

What noindex Does

noindex vs robots.txt

When to Use noindex

When Not to Use noindex

How to Implement noindex

noindex and nofollow

noindex and Canonical Tags

How noindex Affects SEO

How to Check If a Page Has Noindex

Common noindex Mistakes

Best Practices for noindex

Use noindex Only for Clear Page Types

Keep Important Pages Indexable

Do Not Block Noindexed Pages in robots.txt

Audit noindex Before and After Launches

Document Why noindex Was Used

Conclusion

Frequently Asked Questions

What Is `noindex`?

What `noindex` Does

`noindex` vs `robots.txt`

When to Use `noindex`

When Not to Use `noindex`

How to Implement `noindex`

`noindex` and `nofollow`

`noindex` and Canonical Tags

How `noindex` Affects SEO

Best Practices for `noindex`

Use `noindex` Only for Clear Page Types

Do Not Block Noindexed Pages in `robots.txt`

Audit `noindex` Before and After Launches

Document Why `noindex` Was Used