August 5, 2021

What are orphan pages and how to fix them

Orphan pages are bad for SEO and for your website's rankings. Unfortunately, they are easy to create unintentionally and can be difficult to identify. Fixing orphan pages can improve your site's SEO and avoid Google penalties.

What are orphan pages?

Orphan pages are web pages that exist on your website but have no internal links to them from any other pages on your site. However, there may be links to them from external sources.

Orphan pages may occasionally be created intentionally, but in the vast majority of cases they are unintended mistakes that webmasters may be unaware of. They are bad for SEO, and too many may cause Google to lower your site's ranking.
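
To make the definition concrete, here is a minimal Python sketch that models a site as a map from each page to the pages it links to, then reports the pages with no inbound internal links. The example site, its URLs, and the function name find_orphans_in_graph are hypothetical and exist only for illustration.

```python
from __future__ import annotations


def find_orphans_in_graph(internal_links: dict[str, set[str]]) -> set[str]:
    """Return pages that no other page on the site links to."""
    all_pages = set(internal_links)
    linked_to: set[str] = set()
    for source, targets in internal_links.items():
        # A page linking to itself does not make it reachable from elsewhere.
        linked_to.update(t for t in targets if t != source)
    return all_pages - linked_to


# Hypothetical site: each key is a page, each value is the set of pages it links to.
site = {
    "/": {"/about", "/blog"},
    "/about": {"/"},
    "/blog": {"/", "/blog/post-1"},
    "/blog/post-1": {"/blog"},
    "/old-landing-page": {"/"},  # links out, but nothing links in
}

orphans = find_orphans_in_graph(site)
```

In practice you rarely have the complete page list up front, which is exactly why orphan pages are hard to spot: a page that nothing links to never shows up in a link-following crawl.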

Why are orphan pages bad for SEO?

Here are some of the more important negative impacts of orphan pages:

  • Orphan pages were previously used as a black-hat SEO technique: some pages were hidden from users while remaining discoverable by search engines. As a result, search engines may presume the webmaster is attempting to trick them.
  • Search engines view pages with no internal links as unimportant.
  • Too many orphan pages may cause Google to lower the ranking of the entire site.
  • Orphan pages waste crawl budget, and the crawl rate slows when pages have few internal links.
  • Search engines cannot understand how orphan pages fit within the overall site structure. This means they will struggle to calculate their relevance and pass no authority to them.
  • Orphan page content can disrupt contextual keyword targeting and impact SERP rankings.

10 common causes of orphan pages

  1. A website migration that wasn't successfully managed.
  2. Pages that were created early in the site build process and submitted to Google, but then abandoned and excluded from the site's navigation architecture.
  3. Pages that were added to the XML sitemap when they were created, but no longer form part of the site's flow.
  4. Pages used in A/B testing that were never deleted.
  5. Landing pages that are no longer used. Typically landing pages have no internal links leading to them.
  6. A blogger wanted to remove pages from public view without deleting them, for example by deleting an old blog category. The category is gone, but all of its pages are now orphaned.
  7. Pages that have been forgotten over time. The site was restructured, or its navigation changed, leaving those pages behind.
  8. Product pages that still exist for items that are out of stock or discontinued, or expired classified ads.
  9. Old videos, articles, or other content that is no longer relevant and has had its internal links removed.
  10. Poor use of a CMS to create pages, leaving orphan pages undetected.

How to fix orphan pages

  • Step 1: Identify orphan pages through URL mapping

    A crawl that follows internal links will not find orphan pages, because nothing links to them. Instead, use data from search engines, especially Google and Bing, to extract all known URLs for the website.

    In Google Analytics you can extract a list of all URLs that have recorded page views and sort them by “least visited”. Do this by navigating to Behavior > Site Content > All Pages. In Bing, the corresponding tool is Indexed Pages Checker. Then export the URLs into a spreadsheet.

    Next, crawl your website to build a corresponding list of “official” valid URLs. You can easily find suitable tools by searching for “website crawler tool”.

    Comparing the two lists highlights the orphan pages: they are the URLs that appear in the exported list but not in the crawl.

    Note: the process may be a little more detailed than this summary describes. But this is the basic essence of how to find a list of orphan pages.

    You can also use Labrika’s own sitemap validator tool, which lists any pages that may be on your site but aren’t indexable, making it a quick and easy way to get a list of orphan pages.

  • Step 2: Assess the orphan pages and decide on an action for each one

    Start by asking yourself the following questions. This will then affect the action you take.

    • Q1. How important is the page? If it is important, integrate it back into the site; otherwise, delete it.
    • Q2. Does the page rank for your keywords? If so, integrate it back into the site; otherwise, delete it.
    • Q3. Is the page a duplicate or almost a duplicate? Perhaps it can be merged with another non-orphan page.
    • Q4. Are there backlinks to the page from other websites?

    For pages that you reintegrate into the website, take the opportunity to assess the page’s quality:

    • Does it need to be optimized?
    • Where should it be linked from internally?
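
The two steps above can be sketched in Python as follows. This is only an illustration under stated assumptions, not a definitive implementation: the URLs are made up, the function names normalize, find_orphans, and suggest_action are hypothetical, and the normalization rules (lowercasing the host, dropping query strings and trailing slashes) are one possible choice. Treating backlinks (Q4) as a reason to keep a page is also an assumption, since the questions above do not prescribe an action for that case.

```python
from urllib.parse import urlsplit


def normalize(url):
    """Reduce a URL to host + path so trivially different forms compare equal.

    Assumption: query strings, fragments, and trailing slashes are ignored.
    """
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    return parts.netloc.lower() + path


def find_orphans(known_urls, crawled_urls):
    """Step 1: URLs known from exports but never reached by the crawl."""
    crawled = {normalize(u) for u in crawled_urls}
    known = {normalize(u) for u in known_urls}
    return sorted(known - crawled)


def suggest_action(important=False, ranks=False, duplicate=False, has_backlinks=False):
    """Step 2: turn the answers to Q1-Q4 into a suggested action."""
    if duplicate:
        return "merge"      # Q3: fold into a non-orphan page
    if important or ranks or has_backlinks:
        return "integrate"  # Q1, Q2; Q4 handling is an assumption
    return "delete"


# Hypothetical exports: URLs from analytics/search engines vs. URLs found by a crawler.
known = [
    "https://example.com/",
    "https://example.com/about/",
    "https://Example.com/old-promo",
    "https://example.com/blog?utm_source=newsletter",
]
crawled = [
    "https://example.com",
    "https://example.com/about",
    "https://example.com/blog",
]

orphans = find_orphans(known, crawled)
```

Without the normalization step, a plain comparison would wrongly flag pages such as /about/ versus /about as orphans, so it is worth checking a few results by hand before acting on the list.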

How to manage expired pages and old listings to avoid creating orphan pages

Think of eBay for a moment. Every day, millions of auctions end, and their listings expire. eBay does not delete those expired listings. Many will have been picked up by search engines and, in some cases, will appear on SERPs for years to come. The last thing eBay wants is for a prospective customer to be directed to a “404 Page not found” error on the eBay site.

Instead, eBay treats expired listings as valuable lead generators. Visitors who click on an expired listing in the SERP will be shown alternative product suggestions as well as the original expired listing.

This strategy applies just as well to e-commerce sites where products are permanently out of stock or discontinued. Those product pages are still indexed in search engines and can be treated as potential landing pages.

However, you may have valid reasons for not retaining expired pages on your website. In that case, it is best to ensure they return a 404 (Not Found) or 410 (Gone) status code that you can control. To do this, you can use a custom 404 page.
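
As a rough illustration of controlling these status codes, the sketch below uses Python's standard http.server module to answer 410 for pages known to be expired and 404 for anything else that is missing, each with a custom message. The paths, page texts, and the names LIVE_PAGES, EXPIRED_PAGES, resolve, and serve are hypothetical; a real site would normally configure this in its web server or CMS rather than in a hand-written handler.

```python
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical content: pages that exist, and pages deliberately retired.
LIVE_PAGES = {"/": "Home", "/products/widget": "Widget product page"}
EXPIRED_PAGES = {"/products/old-widget"}


def resolve(path):
    """Return (status code, body text) for a request path."""
    if path in LIVE_PAGES:
        return 200, LIVE_PAGES[path]
    if path in EXPIRED_PAGES:
        # 410 Gone: the page existed but was removed on purpose.
        return 410, "This listing has expired. Browse our current products instead."
    # 404 Not Found: custom message instead of a bare error.
    return 404, "Page not found. Try the home page or the site search."


class StatusHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        # Ignore any query string when looking up the page.
        status, body = resolve(self.path.split("?", 1)[0])
        payload = body.encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "text/plain; charset=utf-8")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)


def serve(port=8000):
    """Start a local test server; call this manually to try the handler."""
    HTTPServer(("127.0.0.1", port), StatusHandler).serve_forever()
```

Calling serve() and requesting /products/old-widget should return the 410 response, while an unknown path returns the custom 404 text.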

In summary - website management best practices prevent orphan pages

Any SEO professional or website builder is well aware of the dangers to SEO if orphan pages are found. Normally, they build checks and detection mechanisms into their processes to stop this.

A thorough site audit using the above steps should uncover any orphan pages. If you have a larger site you may want to bring in professional SEO services to stop you wasting time and money.

Don’t forget that Labrika offers a sitemap validator tool which can give you a list of orphan pages quickly and easily.

