HomeTrainingSEO Mastery › Technical SEO Foundations
SEO Mastery
RGM° · Training

Technical SEO Foundations

The floor under which no amount of content or links can save you. Crawl, index, render, Core Web Vitals, structured data, mobile-first — the technical SEO discipline in depth.

What you will learn

  1. Why technical SEO is the floor, not the ceiling
  2. Crawlability: robots.txt, sitemaps, internal linking
  3. Indexability: canonicals, noindex, hreflang, parameters
  4. Rendering: server-side, client-side, hybrid, and the JS problem
  5. Core Web Vitals and page experience
  6. HTTPS, mixed content, and security headers
  7. Mobile-first indexing and responsive vs adaptive
  8. Structured data and the rich result economy
  9. Site architecture and URL design
  10. Advanced playbook
  11. Common mistakes
  12. Operating checklist

Why technical SEO is the floor, not the ceiling

Technical SEO doesn't rank pages by itself. Content, intent matching, authority, and user satisfaction do that. But broken technical SEO blocks every other lever: a page that can't be crawled can't be indexed, a page that can't be indexed can't rank, a page that loads in 9 seconds can't compete in 2024 SERPs against pages that load in 1.5. Technical work is the floor under which no amount of content or links can save you.

The good news is that most technical issues are binary — either fixed or not. Unlike content quality or link earning, which are gradient and competitive, technical SEO has a clear endpoint per issue. The bad news: there are many more technical issues than most teams recognize, and they compound. A site with five medium technical problems can lose 30–60% of its potential organic traffic.

Crawlability

Crawlability is whether search engine bots can find and access your pages. Three foundational artifacts:

robots.txt

The first file every crawler requests. Located at the domain root (example.com/robots.txt). It controls which crawlers can access which paths.

User-agent: *
Disallow: /admin/
Disallow: /cart/
Disallow: /*?sort=
Allow: /

Sitemap: https://example.com/sitemap.xml

Important nuances: robots.txt blocks crawling, not indexing. If a page is linked externally and Google can't crawl it, Google may still index it without content. Use noindex meta tag (which requires crawl access) to prevent indexing, not robots.txt.

Common errors: blocking /wp-content/ (kills CSS and JS for Googlebot which then ranks the site poorly), blocking the root of a section accidentally, allowing test/staging environments to be crawled in production.

XML sitemaps

Sitemaps tell crawlers which URLs to prioritize. Best practices:

Internal linking and crawl paths

The most important crawlability factor on large sites isn't robots.txt or sitemap — it's internal linking depth. A page 5+ clicks from the homepage may get crawled infrequently or not at all on sites with limited crawl budget.

Indexability

Once crawled, can the page be indexed? The signals:

Canonical tags

Canonical tags (<link rel="canonical" href="...">) tell Google which URL is the preferred version when content is similar across multiple URLs. Critical for:

Common errors: canonicalizing every page to the homepage (deindexes everything but the homepage), canonicalizing across language versions (use hreflang instead), canonical pointing to a 404 or noindex page.

Meta robots and X-Robots-Tag

Page-level indexing signals. Use noindex for thin or duplicate content you don't want indexed. Use nofollow sparingly — Google now treats it as a hint, not a directive.

X-Robots-Tag is HTTP-header equivalent; use for non-HTML resources (PDFs, images) you want noindexed.

hreflang

For multilingual or multi-regional sites. Tells Google which language/region version to serve which user. Implementation options: HTML link tags, HTTP headers, or sitemap annotations. Most common implementation error: missing return links between language versions, or pointing to incorrect language code.

URL parameters and faceted navigation

The hardest indexability problem for ecommerce sites. Product listing pages with filter parameters (color, size, price range, sort order) generate near-infinite URL combinations. Approach:

Rendering: the JS problem

Modern web increasingly relies on JavaScript to render content. Google CAN render JavaScript, but it does so in a second pass after initial crawl, sometimes days or weeks later. JS-rendered content that depends on user interaction or that errors in the rendering pipeline may never make it into the index.

Rendering options

Debugging rendering issues

Core Web Vitals and page experience

Google's Core Web Vitals are three metrics measuring real-user page experience:

MetricWhat it measuresGoodNeeds work
LCP (Largest Contentful Paint)How fast the main content loads< 2.5s2.5–4s
INP (Interaction to Next Paint)How fast the page responds to interaction (replaced FID in March 2024)< 200ms200–500ms
CLS (Cumulative Layout Shift)How much the layout jumps as it loads< 0.10.1–0.25

Page Experience signals also include HTTPS, no intrusive interstitials, and mobile-friendly. CWV is a real but modest ranking factor — it tiebreaks more often than it determines top-3 placement. But it affects user experience materially, which feeds back to rankings through engagement signals.

Improving LCP

Improving INP

Improving CLS

HTTPS, mixed content, and security headers

Mobile-first indexing

Since 2021, Google indexes the mobile version of your site by default. Implications:

Structured data and the rich result economy

Schema.org structured data lets you mark up your content semantically, enabling rich results in SERPs — star ratings, FAQs, product prices, recipe images, events, job postings, breadcrumbs.

Site architecture and URL design

URL structure

Information architecture

Advanced playbook

Common mistakes

Operating checklist

Sources and further reading


Part of the SEO Mastery series. Continue to the next module or take the series exam.