Broken Link Statistics 2026: Link Rot, 404 Errors, SEO Risk & Fixes

Broken Link Statistics 2026: Link Rot, 404 Errors, SEO Risk & Fixes

A broken link looks small until it blocks a sale, wastes a crawl, or sends a good backlink into a dead end.

That matters because the web is rotting faster than most business owners realize. Pew Research Center found that 25% of webpages that existed at some point between 2013 and 2023 were no longer accessible by October 2023. For pages from 2013, the missing share was 38%.

Ahrefs found the backlink side is even messier. In a study of more than 2 million domains, 66.5% of links pointing to sampled websites since January 2013 had rotted, and 74.5% of links were considered lost for SEO purposes.

If you run a business website, this is not just a webmaster chore. Broken links affect trust, leads, rankings, analytics, and the return you get from content you already paid to create.

Here are the broken link statistics worth knowing in 2026, plus a practical repair framework you can use without turning your site into a science project.

A broken link is a link that points to a page, file, image, or resource that no longer works. MDN defines a 404 Not Found response as a client error status code meaning the server cannot find the requested resource. MDN also says links that lead to 404 pages are often called broken links or dead links.

A 404 is not always bad. If a page was removed and has no useful replacement, a real 404 or 410 is honest. MDN says a 404 means the missing resource may be temporary or permanent, while a permanently removed resource should use 410 Gone.

A soft 404 is worse because it confuses the system. Google defines a soft 404 as a URL that shows users a page saying the page does not exist while returning a 200 success status code. In plain English, the page looks broken to a person but looks successful to a bot.

Link rot is the long-term decay of links across the internet. Ahrefs defines link rot as what happens when links stop pointing to their intended file, page, or server. Pew describes the broader problem as digital decay, where online content that once existed becomes inaccessible over time.

Use these numbers when you’re making the case for a website cleanup, content audit, migration plan, or monthly maintenance budget.

  1. 25% of webpages from 2013 to 2023 are no longer accessible. Pew Research Center analyzed pages from Common Crawl and found that a quarter of webpages that existed at some point between 2013 and 2023 were inaccessible by October 2023.

  2. 38% of webpages from 2013 disappeared by 2023. Pew found that 38% of pages collected from the 2013 snapshot were no longer accessible, compared with 8% of pages collected from 2023.

  3. 23% of news webpages contain at least one broken link. Pew’s analysis found that 23% of news webpages had at least one broken link.

  4. 21% of government webpages contain at least one broken link. The same Pew study found that 21% of government webpages had at least one broken link, with local government pages especially likely to have link decay.

  5. 54% of Wikipedia pages contain at least one broken reference link. Pew found that 54% of Wikipedia pages had at least one broken link in their References section.

  6. 66.5% of links to sampled websites rotted over nine years. Ahrefs studied links pointing to 2,062,173 websites and found that 66.5% had rotted since January 2013.

  7. 74.5% of studied links were lost for SEO purposes. Ahrefs counted rotted links plus other SEO-blocking issues and found that 74.5% of links in the study were considered lost.

  8. 47.7% of lost links came from dropped pages. Ahrefs reported that 47.7% of lost links were from dropped pages, meaning the linking pages were removed from Ahrefs’ index for reasons such as crawlability, indexability, or domain problems.

  9. 34.2% of lost links were removed from pages that still existed. Ahrefs found that 34.2% of lost links disappeared because the linking page still existed but no longer linked to the target.

  10. 6.45% of lost links were tied to crawl errors. Ahrefs placed 6.45% of lost links in the crawl error bucket, which means the crawler hit an error when checking the page.

  11. 5.99% of lost links came from redirected pages. Ahrefs found that 5.99% of lost links were lost because the page containing the link redirected somewhere else.

  12. 4.11% of lost links were marked not found. Ahrefs reported that 4.11% of lost links were classified as not found, the bucket most people think of when they hear “broken backlink.”

  13. A 404 only says the resource is missing, not whether it’s gone forever. MDN says a 404 status code only indicates that the resource is missing without saying whether the absence is temporary or permanent.

  14. A 410 is the better signal for permanently removed resources. MDN says that if a resource is permanently removed, servers should send 410 Gone instead.

  15. 404 pages create user frustration when broken links are not minimized. MDN says 404 errors can lead to poor user experience and that internal and external broken links should be minimized.

  16. Common 404 causes include mistyped URLs and deleted or moved pages without redirects. MDN lists mistyped URLs and pages moved or deleted without redirection as common causes of 404 responses.

  17. Google treats site availability problems as crawl blockers. Google says availability issues prevent Google from crawling a site as much as it might want to.

  18. Google recommends using the Crawl Stats report to find availability issues. Google tells site owners to use the Crawl Stats report to see Googlebot’s crawling history and identify availability errors or warnings.

  19. Google says most sites should not expect same-day crawling. Google says most sites should not expect same-day crawling for URLs, except for some time-sensitive sites such as news publishers.

  20. Google recommends updated sitemaps when adding pages. Google says one way to help new pages get crawled is to update sitemaps to reflect new URLs.

  21. Google says server trouble can cause Googlebot to scale back crawling. Google says Googlebot will scale back crawling if it detects that a server is having trouble responding to crawl requests.

  22. 32% of customers would leave a brand they loved after one bad experience. Baymard cites PwC research showing that 32% of customers would leave a brand they loved after a single bad experience.

  23. 88% of online consumers are less likely to return after a bad site experience. Baymard cites research that 88% of online consumers report being less likely to return after a bad experience.

  24. 91% of unsatisfied customers leave without complaining. Baymard cites research that 91% of unsatisfied customers do not complain about a bad experience, they simply leave.

  25. Semrush classifies 4xx errors as pages that cannot be accessed. Semrush says a 4xx error means a webpage cannot be accessed and is usually the result of broken links.

What the numbers mean for business websites

The headline is simple: broken links are normal, but unmanaged broken links are expensive.

A one-page brochure site can usually survive a few old 404s. A manufacturer, contractor, medical practice, ecommerce shop, SaaS company, or local service business has more at stake. These sites depend on old service pages, PDFs, blog posts, forms, product URLs, location pages, and referral links.

The Pew data shows that web content naturally disappears over time. The Ahrefs data shows that backlinks disappear too. That means a site owner who never audits links is making a bet that old URLs, old vendors, old documents, old product pages, and old third-party resources will keep working on their own.

They won’t.

The damage usually falls into four buckets.

1. Lost trust

People don’t separate your website plumbing from your business. When a quote request link fails, a PDF returns a 404, or a product link goes nowhere, the visitor reads it as sloppiness. That reaction is harsh, but it’s real.

This is why the UX numbers matter. Baymard cites research that 32% of customers would leave a brand they loved after one bad experience, and 88% of online consumers are less likely to return after a bad site experience. A broken link is not the only bad experience, but it is one of the easiest to prevent.

2. Wasted content investment

Most businesses have useful pages sitting in the back of the shop: old case studies, quote guides, warranty PDFs, installation instructions, financing pages, spec sheets, blog posts, and service pages. Those assets may still earn visits or backlinks.

When the URL changes without a redirect, the asset doesn’t just disappear from navigation. It can disconnect from search, email campaigns, old social posts, partner websites, customer bookmarks, and AI tools that learned the old address.

Ahrefs’ finding that 66.5% of links to sampled websites had rotted should make every content-heavy business pay attention. If you spent money creating a page, protect the address or redirect it properly.

3. Messy SEO signals

Google does not need every dead URL redirected. A deleted page with no replacement can return a real 404 or 410. The problem is when valuable pages, internally linked pages, or externally linked pages break by accident.

Google says availability issues prevent Google from crawling your site as much as it might want, and Google recommends using the Crawl Stats report to diagnose crawl problems. If your site is small, crawl budget probably isn’t your biggest issue. If your site has hundreds or thousands of URLs, broken links, redirect chains, and soft 404s can make technical maintenance harder than it needs to be.

4. Bad measurement

Broken links make reporting dirtier. A campaign can look weak because the landing page URL changed. A lead source can look dead because a button points to an old form. A product page can look like it lost demand because internal links stopped sending visitors there.

This is where business owners get misled. They think they have a marketing problem when they actually have a maintenance problem.

Not every 404 deserves a meeting. Fix the ones with business value first.

Use this priority order:

  1. Revenue paths: quote forms, checkout pages, booking pages, phone links, financing pages, product pages, service pages, and location pages.
  2. Backlinked URLs: pages with external links from vendors, directories, news mentions, partners, associations, customers, or industry sites.
  3. High-traffic URLs: pages still receiving organic search, paid traffic, email traffic, referral traffic, or direct visits.
  4. Internal navigation: header links, footer links, sidebar links, related-post links, CTA buttons, breadcrumbs, and sitemap links.
  5. Customer support assets: PDFs, manuals, warranties, spec sheets, onboarding pages, and policy pages.
  6. Old campaign URLs: email links, QR codes, printed material, trade show links, social posts, and ads.

The key is intent. If the old URL has a clear replacement, redirect it. If it was removed for a reason and has no useful replacement, return a proper 404 or 410. If the page still exists but moved, update the internal links so visitors and crawlers don’t have to travel through a redirect forever.

This is the maintenance process we use because it keeps the work grounded in business impact.

1. Crawl the site

Start with a crawler like Screaming Frog, Semrush Site Audit, Ahrefs Site Audit, Sitebulb, or another technical SEO tool. Screaming Frog says its SEO Spider audits websites for over 300 SEO issues, and Semrush lists 4xx errors among the issues Site Audit can identify.

Export internal 404s, external broken links, redirect chains, and broken images. Don’t fix from memory. Crawl first.

2. Check Google Search Console

Google recommends using the Crawl Stats report for crawl history and availability issues. Use Search Console to find URLs Google has tried to crawl, pages excluded as not found, and server issues that may not show up during a quick site crawl.

Search Console is especially useful after a redesign or migration because Google may still be discovering old URLs from backlinks, sitemaps, and previous crawls.

3. Match each broken URL to a business decision

There are only four useful outcomes:

  • Restore the page because it was removed by mistake.
  • Redirect the old URL to the closest relevant replacement.
  • Update the source link because the target URL changed.
  • Leave it as a real 404 or 410 because the page is intentionally gone.

Avoid the lazy fix: redirecting every old URL to the homepage. That may hide the error in a report, but it creates a bad visitor experience and can look like a soft 404 pattern when the replacement has nothing to do with the old page.

Broken links are not a one-time cleanup. Pew’s research shows that webpages disappear over time, and Ahrefs’ research shows that backlinks decay over time. Your maintenance process has to assume decay.

For most small business websites, a monthly crawl is enough. For ecommerce, publishing-heavy, multi-location, or frequently edited sites, check weekly or after every major content batch. After a redesign, check immediately, then again after Google has had time to recrawl the site.

A perfectly clean website is nice, but perfection is not the benchmark. The useful benchmark is whether broken links are blocking business outcomes.

A small service business website should aim for zero broken links in navigation, CTAs, service pages, location pages, forms, and footer links. A larger site may always have a few external links that break because third-party websites change. That’s normal. What isn’t normal is leaving broken internal paths in the parts of the site that produce leads.

Use these targets:

Site areaTargetWhy it matters
Main navigation0 broken linksVisitors rely on it to understand your business
CTA buttons0 broken linksThese directly affect leads and sales
Forms and checkout0 broken linksA broken conversion path is lost revenue
XML sitemap URLs0 intentional 404sSitemaps should reflect live, indexable pages
Backlinked URLsRedirect when a relevant replacement existsProtects referral traffic and link equity
External resource linksReview monthly or quarterlyThird-party pages decay over time

The standard is not “no 404s ever.” The standard is “no accidental dead ends where a customer, crawler, partner, or sales campaign expected something useful.”

Broken links are not glamorous work. They’re maintenance. But good maintenance is what keeps a website earning after launch day.

If your website has been redesigned, migrated, expanded, neglected, or patched by three different vendors, assume there are dead ends hiding somewhere. Find them before your customers do.

Need help cleaning up broken links, redirects, technical SEO issues, and conversion paths? Start here and we’ll help you turn your website back into a working sales asset.

  • broken links
  • 404 errors
  • technical SEO
  • website maintenance
  • link rot
Share:
Richard Kastl

Richard Kastl

Founder & Lead Engineer

Richard Kastl has spent 14 years engineering websites that generate revenue. He combines expertise in web development, SEO, digital marketing, and conversion optimization to build sites that make the phone ring. His work has helped generate over $30M in pipeline for clients ranging from industrial manufacturers to SaaS companies.

Related Articles

← Back to Blog