We onboarded a dental practice last year that had invested $8,000 in a new website. Custom design. Great content. Proper schema markup. They launched it, waited two months, and saw zero organic traffic. Not low traffic: zero.
When we looked at Google Search Console, the reason was immediately obvious: none of their pages were indexed. Not one. The site existed on the internet, but as far as Google was concerned, it didn't.
The culprit was a single checkbox in their WordPress settings that said "Discourage search engines from indexing this site." The developer had turned it on during the staging phase and forgotten to uncheck it when the site went live. Two months of invisible existence because of one checkbox.
That's an extreme example, but indexing problems are far more common than most people realize. And if your pages aren't indexed, nothing else you do in SEO matters. You can have the best content on the internet, but if Google hasn't added it to the index, no one will ever find it through search.
Indexing is the process by which Google adds a web page to its searchable database. Think of Google's index like a massive library catalog. When Google "indexes" your page, it's filing a card for that page in the catalog. When someone searches for something, Google checks the catalog and decides which cards β which pages β are the most relevant results.
If your page isn't in the catalog, it can't show up in search results. Period. It doesn't matter how good the page is.
Indexing is actually the second step in a three-step process. Here's how the whole thing works:

1. Crawling: Googlebot discovers your page (through links or your sitemap) and reads its content.
2. Indexing: Google evaluates what it crawled and, if the page makes the cut, adds it to its searchable database.
3. Ranking: when someone searches, Google orders the indexed pages it considers most relevant.
The key thing most people miss: crawling and indexing are not the same thing. Google can crawl your page (meaning its bots visited and read it) without actually indexing it. If Google decides the page is low quality, duplicate, or not useful enough to include in the index, it will crawl it and then walk away.
There are two quick ways to check. The first takes about three seconds:
Search Google for site: followed by the full URL of the page, for example site:yourdomain.com/services/. If Google shows a result, the page is indexed. If it shows nothing, it isn't.
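That manual check is easy to script. Here's a minimal Python sketch that builds the site: query URL you'd open in a browser; the example URL is a placeholder:

```python
from urllib.parse import quote_plus

def site_search_url(page_url: str) -> str:
    """Build the Google 'site:' query URL for one specific page.

    A result for this query means the page is indexed; an empty
    result page means it isn't.
    """
    return "https://www.google.com/search?q=" + quote_plus("site:" + page_url)

# "example.com/services/" is a placeholder; substitute your own page's URL.
print(site_search_url("example.com/services/"))
```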
The more thorough method is Google Search Console. The "Pages" report in Search Console tells you exactly how many pages are indexed, how many are not, and, critically, why they're not indexed. Every site we manage is connected to Search Console, and it's the first place we look when diagnosing traffic problems.
Go to Search Console → Pages (under Indexing in the sidebar). You'll see two numbers: indexed pages and not-indexed pages. Click "Not indexed" to see a breakdown of reasons. Common statuses include "Discovered – currently not indexed," "Crawled – currently not indexed," and "Excluded by 'noindex' tag." Each one tells you something different about why Google isn't including those pages.
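If you export that report, a few lines of Python will tally the statuses for you. This is a sketch only: the column names ("URL", "Status") and status strings below are assumptions standing in for whatever your actual export contains:

```python
import csv
import io
from collections import Counter

# Illustrative stand-in for a Pages-report export from Search Console.
# Column names and status strings are assumptions; adjust to your file.
sample_export = """\
URL,Status
https://example.com/,Indexed
https://example.com/services/,Crawled – currently not indexed
https://example.com/blog/post-1/,Discovered – currently not indexed
https://example.com/old-page/,Excluded by 'noindex' tag
https://example.com/contact/,Indexed
"""

def count_statuses(csv_text: str) -> Counter:
    """Tally how many URLs fall under each indexing status."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["Status"] for row in reader)

counts = count_statuses(sample_export)
for status, n in counts.most_common():
    print(f"{n:3}  {status}")
```

Sorting by frequency tells you which problem to chase first: one noindexed page is a quick fix, while hundreds of "Crawled – currently not indexed" pages point at a content-quality issue.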
After auditing hundreds of sites, these are the indexing problems we encounter most frequently, ranked roughly by how often we see them:
A noindex meta tag tells Google explicitly not to index a page. Sometimes this is intentional: you don't want your internal search results pages or staging pages in Google's index. But more often than we'd like to admit, it's accidental.
We've seen WordPress plugins add noindex to entire post types. We've seen developers leave noindex tags from the staging environment on production pages. We've seen Yoast settings that noindex all category pages by default β including category pages that were meant to rank.
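For auditing pages in bulk, you can scan each page's HTML for the robots meta tag. A minimal standard-library sketch (the sample HTML strings are illustrative):

```python
from html.parser import HTMLParser

class NoindexFinder(HTMLParser):
    """Detect <meta name="robots" content="...noindex..."> in a page."""

    def __init__(self):
        super().__init__()
        self.noindex = False

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = {name: (value or "").lower() for name, value in attrs}
        # Google also honours a googlebot-specific meta tag.
        if attr.get("name") in ("robots", "googlebot") and "noindex" in attr.get("content", ""):
            self.noindex = True

def has_noindex(html: str) -> bool:
    finder = NoindexFinder()
    finder.feed(html)
    return finder.noindex

print(has_noindex('<head><meta name="robots" content="noindex, nofollow"></head>'))  # True
print(has_noindex('<head><meta name="robots" content="index, follow"></head>'))      # False
```

Note this only catches the meta tag; a noindex can also arrive via the X-Robots-Tag HTTP header, which you'd need to check separately.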
How to check: view your page's source and look in the <head> section of the HTML for <meta name="robots" content="noindex">. In WordPress, check your SEO plugin settings (Yoast → Search Appearance, or Rank Math → Titles & Meta). Also check Settings → Reading for the "Discourage search engines" checkbox.

Your robots.txt file tells search engines which parts of your site they're allowed to crawl. If a page (or an entire directory) is blocked by robots.txt, Google can't crawl it, and if Google can't crawl it, it can't index it.
The tricky part: sometimes developers block directories that contain important resources. We've seen robots.txt files that block the /wp-content/uploads/ directory, preventing Google from reading images. We've seen rules that block entire subdirectories of the site without anyone realizing it.
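You can test robots.txt rules offline with Python's urllib.robotparser. This sketch uses a made-up robots.txt resembling the misconfiguration described above:

```python
from urllib.robotparser import RobotFileParser

# A made-up robots.txt resembling the misconfiguration described above.
robots_txt = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-content/uploads/
"""

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

for path in ("/services/", "/wp-content/uploads/logo.png"):
    url = "https://example.com" + path
    verdict = "crawlable" if parser.can_fetch("Googlebot", url) else "BLOCKED"
    print(f"{url}  ->  {verdict}")
```

Running your important URLs through a check like this surfaces blocked directories that nobody noticed.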
How to check: visit yoursite.com/robots.txt. Use Google's robots.txt tester in Search Console to verify that your important pages aren't being blocked. Make sure your XML sitemap URL is included in the robots.txt file.

Google doesn't have to index every page it crawls. If it determines that a page doesn't provide enough unique value to justify inclusion in the index (because the content is too thin, too similar to another page, or simply not useful), it'll skip it.
This shows up in Search Console as "Crawled – currently not indexed." It means Google saw the page, evaluated it, and decided it wasn't worth including. This is Google's way of saying "we don't think this page is good enough."
If a page isn't linked to from anywhere else on your site, Google may never discover it β even if it's in your sitemap. These are called "orphan pages." They exist on your server but have no connection to the rest of your site's link structure.
We see this a lot with blog posts that were published but never linked to from category pages, related posts sections, or the main navigation. Google's crawler follows links. If there are no links to follow, the page is effectively invisible.
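Finding orphans is ultimately a set-difference problem: pages your sitemap declares that no internal link reaches. A sketch with hypothetical URL sets (in practice you'd fill these from your sitemap and a crawl of your own site):

```python
# Hypothetical data: what the sitemap declares vs. which URLs a crawl
# of the site actually reaches through internal links.
sitemap_urls = {
    "https://example.com/",
    "https://example.com/services/",
    "https://example.com/blog/post-1/",
    "https://example.com/blog/post-2/",
}
internally_linked = {
    "https://example.com/",
    "https://example.com/services/",
    "https://example.com/blog/post-1/",
}

# In the sitemap, but with no internal link pointing to it: an orphan page.
orphans = sitemap_urls - internally_linked
for url in sorted(orphans):
    print("orphan:", url)
```

The fix for each orphan is the same: link to it from a category page, a related-posts section, or the navigation.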
Google allocates a "crawl budget" to each site: a limit on how many pages it will crawl within a given timeframe. For most small-to-medium sites, this isn't a problem. But for larger sites with thousands of pages, low-quality pages, or slow server response times, Google might run out of budget before reaching your important pages.
This is especially relevant for sites that generate hundreds of thin pages automatically: tag archive pages, parameter-based filter pages, paginated results. Those pages consume crawl budget that could be spent on your high-value content.
An XML sitemap is a file that lists all the pages you want Google to index. It's not strictly required (Google can discover pages through links), but it significantly speeds up the discovery and indexing process, especially for new pages.
We regularly find sites with no sitemap at all, sitemaps that return 404 errors, sitemaps that haven't been updated in years, or sitemaps that list URLs returning errors. Each of these scenarios slows down indexing.
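A quick sanity check is to parse the sitemap and list the URLs it actually declares. A standard-library sketch with an inline example sitemap:

```python
import xml.etree.ElementTree as ET

# Inline example in the standard sitemaps.org format.
sitemap_xml = """\
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc><lastmod>2024-01-15</lastmod></url>
  <url><loc>https://example.com/services/</loc></url>
</urlset>"""

NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}
root = ET.fromstring(sitemap_xml)
urls = [loc.text for loc in root.findall("sm:url/sm:loc", NS)]
print(urls)  # every URL the sitemap asks Google to index
```

From there you can spot-check each listed URL for a 200 response and an up-to-date lastmod.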
Sometimes the answer is just patience. New pages can take anywhere from a few hours to several weeks to be indexed, depending on your site's crawl frequency and authority. A page on the New York Times gets indexed in minutes. A page on a brand-new website with no authority might take 2–4 weeks.
If a page has been live for over a month and still isn't indexed, that's a problem worth investigating. If it's been three days, give it time.
Not every page on your site should be indexed. Some pages actively dilute your site's quality signal if they're in Google's index. Here's our general rule for what to index and what to keep out:

Index: service pages, location pages, blog posts, and any other page that provides unique value to a searcher.
Keep out (noindex): internal search results pages, staging pages, and thin tag or parameter-based archive pages.
The general principle: if a page provides unique value to a searcher, index it. If it exists only for internal site functionality or would add thin content to Google's impression of your site, noindex it.
For our clients in financial services, law, and healthcare, indexing issues carry additional weight because the pages themselves are more valuable. A single well-indexed service page for a personal injury attorney can generate $50,000–$200,000 in annual case revenue. A dermatologist's procedure page ranking for "Botox near me" can drive $300,000+ in treatment bookings per year.
When one of those pages isn't indexed, the cost isn't theoretical. It's real revenue that's going to competitors whose pages are indexed and ranking.
The audit that pays for itself: We've found unindexed high-value pages on nearly every new client site we audit. Last quarter, we discovered that a law firm's highest-potential service page, targeting a keyword with $180 CPCs, had been accidentally noindexed for eight months. Fixing that single issue was worth more than a year of SEO retainer fees.
Indexing is the most fundamental requirement of SEO. It's not glamorous. Nobody writes case studies about getting a page indexed. But if your pages aren't in Google's index, everything else (content, links, schema, speed optimization) is completely irrelevant.
The good news is that indexing problems are usually straightforward to identify and fix. Connect your site to Google Search Console. Check the Pages report. Look for pages that are "not indexed" and read the reasons Google gives you. Most issues come down to accidental noindex tags, robots.txt blocks, thin content, or orphan pages with no internal links.
If you've never checked your indexing status, do it today. You might be surprised by how many of your pages Google is quietly ignoring.
Our free SEO audit includes a complete indexing analysis: how many pages Google has indexed, which pages are missing, why they're excluded, and exactly what to fix. It's one of the first things we check, because nothing else matters until this is right.
Our free audit checks your indexing status, identifies missing pages, and tells you exactly why Google is excluding them, plus the fix for each one.