
The technical SEO checklist for search engines and AI search


Technical SEO focuses on helping search engines discover, understand, and index your website. Technical SEO foundations now also determine whether AI systems can access and use your content in responses.

You need to get these foundations right before your other tactics can drive results because, without them, your content might not be crawled, indexed, or surfaced at all. That’s true whether you’re optimizing for traditional search rankings, AI Overviews, or LLMs that pull content from the web in real time.

We’ve put together this technical SEO checklist to help you work through the fundamentals systematically.

Technical SEO checklist covering indexing, user experience, site structure, code, and AI readiness

Let’s start with what the rise of AI search means for technical SEO, and why the same foundations now have two jobs to do.

Technical SEO now serves two search systems

For most of SEO’s history, there was one platform to optimize for: Google. Its bots crawled your site, stored what they found, and used that data to rank your pages in search results.

That’s still happening. But the same content Google crawls and indexes is now also being used by AI systems to generate answers to user queries.

This isn’t a new discipline or a separate technical SEO checklist you need to follow.

The fundamentals that help search engines find and understand your site are the same ones that determine whether AI systems can access and use your content: crawl access, clean HTML, accurate schema, fresh content, and logical structure.

Get these right, and your site is well-positioned for both search systems.

What has changed is the consequence of getting it wrong. A technical issue that once hurt your rankings now has the potential to make you invisible across search and AI surfaces at the same time.

This checklist covers the core technical requirements for both. Most of it will be familiar territory if you’ve been doing SEO for a while. And we’ve called out anywhere that AI readiness changes or adds to the standard checks.

1. Check for crawling and indexing issues

Make sure search engines can discover (crawl) and save (index) your site properly so your pages can rank in search results.

These checks still support the traditional search index. Some also affect whether AI retrieval systems can access and use your content in responses.

Check whether your site is indexed

Check whether your site is indexed, as your site won’t show in search results if Google hasn’t indexed it.

Check your indexing status using Google Search Console (GSC) and Bing Webmaster Tools (BWT).

In GSC, head to the “Pages” report. It will show you which pages are indexed in Google and which are excluded.

Google Search Console showing 90 indexed and 46 not indexed pages over time

The pages that aren’t indexed will be grouped by the specific reason.

Google Search Console table listing reasons pages aren't indexed and affected page counts

Here are a few reasons you might see:

  • Crawled – currently not indexed: Google looked at these pages but decided they weren’t worth indexing. This usually means the content is low quality or the pages are too similar to existing pages. If these are important pages for your business, prioritize updating them.
  • Blocked by robots.txt: Your robots.txt file is telling Google not to crawl these pages. Double-check your robots.txt file to make sure you’re not accidentally blocking important content.
  • Excluded by ‘noindex’ tag: You might have accidentally added noindex tags to your pages. This explicitly tells Google, “Don’t put these pages in search results.” Remove this tag from important pages you want to be indexed.
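
If you want to spot-check a page for a stray noindex directive before opening GSC, a short script can do it. The following is a minimal sketch that assumes you already have the page's HTML and response headers in hand; the function name `has_noindex` is ours, and it only looks at the robots meta tag and the `X-Robots-Tag` header.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collects the content of <meta name="robots"> and <meta name="googlebot"> tags."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = {name: (value or "") for name, value in attrs}
        if attr_map.get("name", "").lower() in ("robots", "googlebot"):
            self.directives.append(attr_map.get("content", "").lower())


def has_noindex(html, headers=None):
    """Return True if the markup or the response headers carry a noindex directive."""
    parser = RobotsMetaParser()
    parser.feed(html)
    header_value = ""
    for name, value in (headers or {}).items():
        if name.lower() == "x-robots-tag":
            header_value = value.lower()
    in_markup = any("noindex" in directive for directive in parser.directives)
    return in_markup or "noindex" in header_value
```

Run it against the important pages listed under “Excluded by ‘noindex’ tag” to confirm where the directive is coming from.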

You can also check which of your pages Bing has indexed using the “Site Explorer” report in Bing Webmaster Tools.

Bing Webmaster Tools Site Explorer with the Indexed URLs filter selected

Don’t assume this will match Google’s index. Check both for complete coverage.

You can also use IndexNow to submit URLs directly to Bing and the other search engines that support the protocol to encourage them to index your content faster.
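
As a rough illustration, an IndexNow submission is a JSON POST listing the URLs that changed. The sketch below only builds the request and does not send it. It assumes the shared `api.indexnow.org` endpoint and a key you have already generated and hosted as a text file on your site; the host, key, and URL in the usage note are placeholders.

```python
import json
from urllib.request import Request

INDEXNOW_ENDPOINT = "https://api.indexnow.org/indexnow"


def build_indexnow_request(host, key, urls):
    """Build (but do not send) a POST request submitting `urls` for `host`."""
    payload = {"host": host, "key": key, "urlList": list(urls)}
    return Request(
        INDEXNOW_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json; charset=utf-8"},
        method="POST",
    )
```

For example, `build_indexnow_request("yourdomain.com", "your-key", ["https://yourdomain.com/new-page"])` returns a request you could pass to `urllib.request.urlopen` once your key file is live.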

Microsoft Bing Webmaster Tools IndexNow page promoting real-time indexing

Check for any duplicates of your site

Having duplicate versions of your site can harm your SEO efforts because search engines view these as separate websites, even though they display the same content. That’s because each one is technically a different version.

For example, your site might be accessible at:

  • https://yourdomain.com
  • https://www.yourdomain.com
  • http://yourdomain.com
  • http://www.yourdomain.com

Check if your site is accessible through multiple URLs by entering each variation in your browser and checking the address bar.

If multiple versions load (for example, both the http and https versions work without redirecting to one version), you need to choose one preferred version and redirect all others to it.

Side-by-side browser views comparing a secure HTTPS site with a non-secure HTTP site showing the same content

Use the HTTPS version as your primary URL (either with or without www; that’s your choice). Then implement a 301 permanent redirect, so users and search engines are forwarded to your preferred version.
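
The redirect itself is normally configured on your server or CDN, but the rule it needs to implement is simple. Here is a minimal sketch of that mapping, assuming the www version is the preferred one (`PREFERRED_HOST` is a placeholder you would change):

```python
from urllib.parse import urlsplit, urlunsplit

PREFERRED_HOST = "www.yourdomain.com"  # assumption: the www version is preferred


def preferred_url(url):
    """Map any http/https, www/non-www variant onto the single preferred version."""
    parts = urlsplit(url)
    bare_host = parts.netloc.lower().removeprefix("www.")
    if bare_host != PREFERRED_HOST.removeprefix("www."):
        return url  # a different site: leave it untouched
    return urlunsplit(
        ("https", PREFERRED_HOST, parts.path or "/", parts.query, parts.fragment)
    )
```

Every variant in the list above should map to the same URL. If any of them doesn’t redirect there in your browser, that’s the one to fix.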

Make sure your robots.txt file is accurately set up

Configure your robots.txt file to avoid blocking important pages from being crawled.

A robots.txt file is a text file that tells search engine crawlers which parts of your site they’re allowed to access.

The file may contain lines that look like this:

User-agent: *
Disallow: /admin/
Disallow: /login/
Allow: /

Your robots.txt file will be located at “yourdomain.com/robots.txt.” Check the “Disallow” directives specifically to make sure you’re not blocking important folders or pages.
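
One way to double-check the directives is to test your important URLs against the file. This is a small sketch using Python’s built-in `urllib.robotparser`; the sample rules and URLs are placeholders, and the standard-library parser can interpret edge cases differently from Google’s own parser, so treat the result as a first pass.

```python
from urllib.robotparser import RobotFileParser

ROBOTS_TXT = """\
User-agent: *
Disallow: /admin/
Disallow: /login/
Allow: /
"""


def blocked_urls(robots_txt, urls, user_agent="*"):
    """Return the URLs that the given robots.txt rules stop `user_agent` from crawling."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [url for url in urls if not parser.can_fetch(user_agent, url)]
```

Pass in the pages you most want crawled. Anything that comes back in the list is being blocked.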

Robots.txt now applies to more than traditional search crawlers. AI retrieval bots (the ones that fetch your content to surface real-time answers in AI search) are distinct from training scrapers, and each can be managed separately. We cover how to audit and configure this properly in Section 5.

Fix redirect chains & loops

Fix redirect chains and loops to avoid slowing your site down for users, wasting crawl budget (search engine resources used to crawl your site), and affecting how much authority is passed between your pages.

A redirect chain occurs when one URL redirects to another URL, which then redirects to another URL, instead of linking directly to the final destination.

Diagram showing a redirect chain where URL A redirects to URL B, which redirects to URL C

A redirect loop happens when a URL redirects to a URL that redirects back to the original URL, creating an infinite loop.

Diagram illustrating a redirect loop where Page A redirects to Page B and back again

Both of these issues can occur when you restructure URLs and don’t carefully manage redirects during the process.

Use Semrush’s Site Audit tool to find redirect-related issues on your site.

Set up a project in the tool to run a full audit. Then, go to the “Issues” tab and search for “redirect” to identify any redirect chains or loops. Click the “# redirect chains and loops” text to see the URLs that have issues.

Semrush Site Audit issues filtered for redirect problems, including chains and loops

To fix redirect chains, update any links or redirects to point directly to the final destination URL. To fix redirect loops, make sure URLs don’t point to URLs that redirect back to the original URL.
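
If you keep your redirects in a simple source-to-target list, you can detect both problems yourself. The sketch below assumes a dictionary of redirects (the paths are made up) and classifies each starting URL as fine, part of a chain, or part of a loop. It also shows the fix for chains: pointing every source straight at its final destination.

```python
def trace_redirects(start, redirects, max_hops=10):
    """Follow `start` through a {source: target} map.

    Returns (path, status), where status is "ok", "chain", or "loop".
    """
    path = [start]
    seen = {start}
    current = start
    while current in redirects and len(path) <= max_hops:
        current = redirects[current]
        if current in seen:
            return path + [current], "loop"
        path.append(current)
        seen.add(current)
    hops = len(path) - 1
    return path, ("chain" if hops > 1 else "ok")


def flatten(redirects):
    """Point every source directly at its final destination, skipping loops."""
    flat = {}
    for source in redirects:
        path, status = trace_redirects(source, redirects)
        if status != "loop":
            flat[source] = path[-1]
    return flat
```

Loops are left out of the flattened map on purpose, since there is no final destination to point to until someone decides which URL should win.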

Fix broken links

Fix broken links that direct users to webpages that no longer exist to avoid a poor user experience. Broken links also don’t pass authority, which can affect your pages’ visibility in search and AI systems.

Semrush 404 page example showing 'We go lost' message and button to go to homepage

Broken links can be either internal links to your own content or external links pointing to other websites.

Use Site Audit to find broken links. Go to the “Issues” tab and search for “broken.” Click the detected items to see the affected pages.

Semrush Site Audit issues filtered for broken links, images, pages, and files

To fix broken internal links, restore the deleted page if possible, or set up a 301 redirect to send users to a similar relevant page.

To fix broken external links, replace the link with an updated version of the page if it exists elsewhere, remove the link entirely if no suitable replacement exists, or find an alternative resource that provides similar information and link to that instead.
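
Because the fix differs for internal and external links, it helps to split a crawl export before you start. Here is a minimal sketch that assumes you already have each link’s HTTP status (for example, from a crawler export) and treats 404 and 410 as broken; the URLs are placeholders.

```python
from urllib.parse import urljoin, urlsplit


def broken_links(page_url, link_statuses):
    """Group links that returned 404 or 410 into internal and external lists."""
    site_host = urlsplit(page_url).netloc
    report = {"internal": [], "external": []}
    for href, status in link_statuses.items():
        if status not in (404, 410):
            continue
        absolute = urljoin(page_url, href)  # resolve relative links against the page
        kind = "internal" if urlsplit(absolute).netloc == site_host else "external"
        report[kind].append(absolute)
    return report
```

Internal entries are candidates for restoring the page or adding a 301 redirect. External entries need a replacement link or removal.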

Fix server errors

Fix server errors (5xx errors) to ensure search engines can crawl and index your content.

A server error means that there’s something wrong with the server that hosts your site. Check for server errors by searching for “5xx” in Site Audit. Click on the link to see the problematic pages and the specific error codes they’re returning. Pass the details on to your developer to fix the issues.

Semrush Site Audit showing 66 pages returning a 5XX status code

2. Optimize for a good user experience

Search engines tend to reward websites that provide a good user experience (UX). Plus, a good experience keeps visitors engaged and encourages them to explore more of your content.

The same UX fundamentals that help users, such as fast load times, stable layouts, and accessible interactions, also help AI agents interpret and navigate your site.

Here are the main factors to address:

Make sure your site is mobile-friendly

Make sure your site displays and functions properly on smartphones since search engines primarily use the mobile version of your site for ranking and indexing.

To check your site’s mobile-friendliness, open it on your phone and look for these common issues:

  • Text that’s too small to read without zooming
  • Buttons or links that are placed too close together to tap accurately
  • Content wider than the screen, causing horizontal scrolling
  • Pop-ups that block the main content and are difficult to dismiss

Flag any issues for your design and development team to fix.

Improve your Core Web Vitals

Poor Core Web Vitals scores indicate your site may have issues with loading speed, interactivity, and layout shifts.

The three Core Web Vitals metrics are:

  • Largest Contentful Paint (LCP): Measures how quickly the main content of your page loads. Aim for an LCP within 2.5 seconds.
  • Interaction to Next Paint (INP): Measures how quickly your page responds visually after a user interacts with it (like clicking a button or tapping a link). This should happen in less than 200 milliseconds.
  • Cumulative Layout Shift (CLS): Measures visual stability (i.e., how much elements jump around as the page loads). Aim for a CLS score under 0.1.
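
To make the targets concrete, here is a small sketch that rates a measured value against them. The “good” limits are the ones listed above. The “poor” cut-offs (4 seconds, 500 milliseconds, and 0.25) are not from this checklist; they are an assumption taken from Google’s published Core Web Vitals thresholds, so check them against current documentation.

```python
# (good_max, poor_min) per metric.
# LCP is in seconds, INP in milliseconds, CLS is unitless.
# The poor_min values are an assumption based on Google's published thresholds.
THRESHOLDS = {
    "LCP": (2.5, 4.0),
    "INP": (200, 500),
    "CLS": (0.1, 0.25),
}


def rate(metric, value):
    """Classify a measurement as "good", "needs improvement", or "poor"."""
    good_max, poor_min = THRESHOLDS[metric]
    if value <= good_max:
        return "good"
    if value > poor_min:
        return "poor"
    return "needs improvement"
```

For example, `rate("INP", 350)` falls between the two limits and comes back as "needs improvement".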

Use Google Search Console to see your Core Web Vitals performance.

Navigate to “Core Web Vitals” from the sidebar and click “Open Report” to see the data.

Google Search Console mobile Core Web Vitals report with poor, needs improvement, and good URLs

Then, look for pages marked as “Poor” or “Need improvement.” These pages have failed the Core Web Vitals assessment and need optimization.

Google Search Console table showing mobile INP and LCP issues by severity and URL count

Take these URLs and run them through Google’s PageSpeed Insights tool to get specific recommendations on how to fix the issues.

PageSpeed Insights diagnostics listing JavaScript, CSS, and network payload performance issues

Work with your developer to implement the suggested fixes.

Avoid intrusive interstitials

Avoid intrusive interstitials because they create a poor user experience, especially on mobile devices with limited screen space.

Intrusive interstitials are pop-ups or overlays that cover a significant portion of your content, making it difficult for users to access the information they came for.

Intrusive interstitials include full-screen pop-ups that block content on arrival, overlays that users must dismiss to properly view the page, and layouts in which ads push the main content below the fold.

Legitimate pop-ups like cookie notices, age verification, and paywall logins don’t count as intrusive interstitials.

3. Work on your site navigation

Make sure your site has a fairly simple navigation system that will allow users to find important content easily and help search engines and AI systems understand your site.

Improve your site structure

A clear, logical site structure helps users, search engines, and AI agents understand how pages relate to each other.

The ideal site structure resembles a logical hierarchy. Your homepage is at the top, followed by main category pages, then subcategories, and finally individual pages.

This structure creates clear paths for users, search engines, and AI agents to follow. Each page should be accessible within three or four clicks from your homepage.

Ideal website structure diagram with home page, categories, subcategories, and product pages
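
Click depth is easy to measure once you have a map of which pages link to which. This sketch runs a breadth-first search from the homepage over a hypothetical link map and lists any pages that sit more than four clicks away.

```python
from collections import deque


def click_depths(links, home="/"):
    """Return the minimum number of clicks from `home` to each reachable page."""
    depths = {home: 0}
    queue = deque([home])
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in depths:  # first visit is the shortest path in BFS
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


def too_deep(links, home="/", max_clicks=4):
    """List reachable pages that need more than `max_clicks` clicks."""
    depths = click_depths(links, home)
    return sorted(page for page, depth in depths.items() if depth > max_clicks)
```

Pages that never show up in the result of `click_depths` at all can’t be reached from the homepage through links, which is a separate problem worth checking.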

Interlink your pages

Use internal linking to create pathways between different pages on your site, allowing search engine crawlers to discover your content while helping users find related information.

Diagram showing an internal link connecting one website page to another page on the same site

Look for opportunities to add contextual links within your content.

When adding links:

  • Use descriptive anchor text rather than generic “click here” or “read more” phrases
  • Create hub pages (main topic pages) that bring together and link to all your related content
  • Add “related posts” sections at the end of articles to link to relevant content

Use breadcrumbs

Use breadcrumbs to help both users and search engines better understand your site’s structure.

Breadcrumbs appear at the top of a page and show the path to that page within your site. Users can click on them to easily return to previous sections.

Breadcrumbs annotated on Anthony Edwards 1 Low Shoes product page

Fix orphan pages

Orphan pages are difficult for users and search engines to find since they have no incoming internal links. Fix them to provide a better user experience and potentially improve the visibility of that page in search and AI systems.

Site structure diagram highlighting a group of orphan pages disconnected from the main website hierarchy

You can check if your site has any orphan pages using Semrush’s Site Audit tool. Go to the “Issues” tab and search for “orphan.” Click into the issue to see which of your pages are orphaned.

Semrush Site Audit issues filtered for orphaned pages in Google Analytics and sitemaps

Fix the issue by adding links to the orphan page from other relevant pages.
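
The underlying check is a comparison between the pages you know exist (for example, from your sitemap) and the pages that receive at least one internal link. A minimal sketch, using a made-up set of pages and links:

```python
def find_orphans(known_pages, links, home="/"):
    """Return known pages that no other page links to (the homepage is exempt)."""
    linked_to = {
        target
        for source, targets in links.items()
        for target in targets
        if target != source  # a page linking to itself doesn't count
    }
    return sorted(page for page in known_pages if page != home and page not in linked_to)
```

Note that a page can link out to the rest of your site and still be an orphan. What matters is whether anything links in.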

4. Clean up your site’s code and configuration

Code and configuration issues are some of the most common causes of crawling and indexing problems. Fixing these issues can therefore improve your search visibility, and possibly also your AI visibility.

Use HTTPS

Using hypertext transfer protocol secure (HTTPS) provides a secure connection between your site and your users. Google has treated it as a (lightweight) ranking signal since 2014.

It’s used to encrypt the connection between the user’s browser and your site to protect sensitive information like login credentials, payment details, and other personal data.

Modern browsers also mark non-HTTPS sites as “Not Secure,” which can erode user trust and increase bounce rates.

Browser security warning showing an invalid SSL certificate and non-private connection error

Implement HTTPS on your site by getting a Secure Sockets Layer (SSL) certificate. Many web hosting services offer this when you sign up, often for free.

Implement hreflang for international pages

Using hreflang tags tells search engines which language or regional version of your page to serve to which audience. Use it if your site targets users in more than one country or language.

For example, if you search for the official Disney website in the U.S., you see the American English version:

Disney site SERP result showing American English version in the US

If you do the same in Germany, you see the German version of the page:

Disney site SERP result showing German version in Germany

To implement hreflang, add the appropriate tags to the <head> section of each language/country-specific version of your page. You’ll only need to do this if your site operates internationally.

For example, if your site targets audiences in the United States, Germany, and Japan, the hreflang tags might look like this:

<link rel="alternate" hreflang="x-default" href="https://yourwebsite.com" />
<link rel="alternate" hreflang="en-us" href="https://yourwebsite.com" />
<link rel="alternate" hreflang="de-de" href="https://yourwebsite.com/de/" />
<link rel="alternate" hreflang="ja-jp" href="https://yourwebsite.com/jp/" />

The first tag indicates the default or fallback page that should be shown to users when no other variant is appropriate.

Other tags specify the different language or country versions available on your site, helping Google serve the right one based on a user’s location and language settings.

For more guidance on what tags you should implement, read our beginner-friendly guide to hreflang attributes.
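
Writing these tags by hand gets error-prone as you add markets, so it can help to generate them from a single list. This is a minimal sketch with a helper name of our own (`hreflang_tags`) and the same placeholder URLs as the example above.

```python
from html import escape


def hreflang_tags(versions, default_url):
    """Build the hreflang <link> tags for a {language-region: URL} mapping."""
    lines = [
        f'<link rel="alternate" hreflang="x-default" href="{escape(default_url)}" />'
    ]
    for code, url in versions.items():
        lines.append(
            f'<link rel="alternate" hreflang="{escape(code)}" href="{escape(url)}" />'
        )
    return "\n".join(lines)
```

The same full set of tags goes on every version of the page, including a tag that points to the page itself.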

Add schema markup

Adding schema markup helps search systems understand exactly what your content is about, making it easier for them to surface it accurately in both search and AI-generated results.

It’s a type of code that also helps search systems identify entities, authorship, dates, page type, discrete facts, and more.

While it isn’t a direct ranking factor, schema markup enables rich results (special listings on search results pages), which can improve click-through rates.

There are many types of schema markup, but you should focus on the most relevant ones for your specific content types. These may include:

  • Organization
  • Product
  • Article
  • Event
  • Recipe
  • Review

An easy way to generate schema is to use a Schema Markup Generator.

Schema Markup Generator showing article fields and generated JSON-LD code

Once the code is generated, add it to the <head> section of your page’s HTML. Then, use Google’s Rich Results Test tool to verify that your schema is implemented correctly.

Google Rich Results Test showing two valid structured data items detected

Schema may also help AI systems pick out key details from your page, like prices, author names, and publication dates. This may help them better understand when your brand or content is relevant to user prompts.
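
If you’d rather produce the markup from your own data than from a generator, the structure is plain JSON. Here is a minimal sketch for the Article type using schema.org property names. The function name and the choice of fields are ours, and a real page will usually need more properties than this.

```python
import json


def article_json_ld(headline, author_name, date_published, url):
    """Return a JSON-LD <script> block describing an article."""
    data = {
        "@context": "https://schema.org",
        "@type": "Article",
        "headline": headline,
        "author": {"@type": "Person", "name": author_name},
        "datePublished": date_published,
        "mainEntityOfPage": url,
    }
    # Escape "</" so a value can never close the script tag early.
    body = json.dumps(data, indent=2).replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + body + "\n</script>"
```

Whatever tool produces the markup, run the finished page through the Rich Results Test before relying on it.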

5. Audit your site for AI grounding and agent readiness

A lot of what determines your visibility in AI search comes back to the same technical SEO fundamentals we’ve covered above.

However, there are a few additional checks specific to how AI systems work that could directly affect your visibility in AI responses.

Check robots.txt for AI retrieval access

Check your robots.txt file to ensure AI crawlers can access your content. By default, most bots will follow your existing directives, so any unintentional blocks could be limiting AI access you actually want.

Here’s what blocking specific AI crawlers looks like in a robots.txt file:

Robots.txt directives blocking AI bots including GPTBot, ChatGPT-User, and Claude-Web

For most sites, the goal is to ensure AI retrieval bots have access to the content you want surfaced in AI responses. If you see “disallow” directives for AI crawlers, check with your development team that they were added intentionally, as they could limit your AI visibility.

Use Semrush’s Site Audit to see whether you’re blocking any AI crawlers from accessing your content.

Semrush Site Audit AI Search Health report showing blocked crawlers and issue count
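
You can also run a quick check yourself. The sketch below uses Python’s built-in `urllib.robotparser` to test a robots.txt file against the three crawler names shown in the example above. The list is illustrative, not complete, and the parser may not match each bot’s own interpretation exactly.

```python
from urllib.robotparser import RobotFileParser

# Crawler names taken from the example above; extend this list as needed.
AI_USER_AGENTS = ["GPTBot", "ChatGPT-User", "Claude-Web"]


def blocked_ai_crawlers(robots_txt, url="https://yourdomain.com/"):
    """Return the AI user agents that the given rules stop from fetching `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return [agent for agent in AI_USER_AGENTS if not parser.can_fetch(agent, url)]
```

An empty list means none of the listed crawlers are blocked for that URL. Test a few of your most important pages, not just the homepage.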

Audit semantic HTML and page accessibility

Checking your HTML and page accessibility ensures AI systems can read your content properly.

When an agent interacts with a webpage, it works from:

  • Raw HTML: The underlying code structure the agent parses to identify content and elements
  • Screenshots: A visual rendering of the page used to interpret layout and design context
  • Page accessibility: The underlying structure that screen readers rely on, which agents may use to identify and interact with interactive elements on the page

If your HTML is poorly structured, agents may struggle to interpret your page correctly.

The image below shows the difference between generic HTML and well-structured, semantic HTML.

Non-semantic HTML code vs semantic HTML code side-by-side comparison

The version on the left uses generic tags like <div> and <span> throughout. While the code still functions fine, these tags lack any meaning or nuance that machines can use to understand how a page is structured.

Compare that to the semantic HTML example on the right, which replaces the generic tags with specific ones that clearly indicate what part of the page they relate to. Semantic HTML tags include:

  • Header: Marks the top section of the page, typically containing the logo and site-wide navigation
  • H1: Identifies the main heading of the page
  • Nav: Signals that this group of links is the site’s navigation menu
  • Main: Wraps the primary content of the page, distinct from headers, footers, and sidebars
  • H2: Marks a subheading within the main content, sitting one level below the H1
  • Footer: Marks the bottom section of the page, typically containing copyright info, links, and contact details

If you’re unsure if your site is using semantic HTML, speak to your developer.
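
Before that conversation, a rough count of tags can tell you which way a page leans. This sketch counts the six semantic tags listed above against generic <div> and <span> tags in a page’s raw HTML. It’s a blunt heuristic with names of our own, not an accessibility audit.

```python
from collections import Counter
from html.parser import HTMLParser

SEMANTIC_TAGS = {"header", "nav", "main", "footer", "h1", "h2"}
GENERIC_TAGS = {"div", "span"}


class TagCounter(HTMLParser):
    """Counts how often each opening tag appears."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        self.counts[tag] += 1


def semantic_report(html):
    """Summarize which semantic tags are missing and how many generic tags are used."""
    counter = TagCounter()
    counter.feed(html)
    return {
        "missing": sorted(tag for tag in SEMANTIC_TAGS if counter.counts[tag] == 0),
        "generic": sum(counter.counts[tag] for tag in GENERIC_TAGS),
        "h1_count": counter.counts["h1"],
    }
```

A long “missing” list alongside a high generic count suggests the page looks more like the left-hand example than the right-hand one.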

Agentic commerce readiness: Additional checks for ecommerce sites

The technical SEO checks above apply to every site. For ecommerce, there are a few additional things worth verifying as AI agents move from retrieving information to completing purchases on behalf of users.

First, make sure your product schema is accurate and reflects real-time inventory, as it may influence how your products appear in search and AI results.

Product structured data mapped to Google Shopping product images, price, and availability

The simplest way to test whether an agent can navigate your checkout is to try it yourself with ChatGPT’s shopping assistant (or a similar tool) and complete a purchase on your site. If it stalls, fails to find key fields, or can’t progress through a step, that’s an issue that other agents might face too.

ChatGPT browsing messages showing navigation issues with Best Buy store locator pages

Beyond these two checks, here’s what else to verify:

  • Keep key policy pages like returns, shipping, and FAQs in plain HTML so agents can read them without hitting a technical barrier
  • Make sure form fields, buttons, and checkout steps are built with standard HTML elements so agents can interact with them reliably
  • Check that your site works without JavaScript for critical pages (some agents can’t execute scripts and will only see a blank page)
  • Ensure cookie consent banners and login modals can be dismissed with a clearly labeled button built in standard HTML
  • Avoid checkout flows that rely heavily on dynamic content updates. If the page state changes after an action, make sure the updated information is reflected in the underlying HTML, not just visually.
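
One way to approximate what a script-less agent sees is to look only at the text in the raw HTML response, ignoring anything inside <script> and <style> tags. The sketch below does that and reports which key phrases are missing. The phrases and markup in the usage note are placeholders, and the check says nothing about whether interactive elements actually work.

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects visible text, skipping the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1

    def handle_data(self, data):
        if not self._skip:
            self.chunks.append(data)


def missing_phrases(raw_html, phrases):
    """Return the phrases that do not appear in the page's script-free text."""
    extractor = TextExtractor()
    extractor.feed(raw_html)
    text = " ".join(" ".join(extractor.chunks).split()).lower()
    return [phrase for phrase in phrases if phrase.lower() not in text]
```

For a returns page, you might pass in the policy’s key sentence, such as “free returns within 30 days.” If it comes back as missing, that text is probably being injected by JavaScript.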

Further reading: Agentic commerce is here: What it means for the ecommerce industry

Put this technical SEO audit checklist into action

Technical SEO is how you make sure nothing stands between your brand and every search platform that can surface it to your target audience, including AI platforms.

This technical SEO audit checklist helps you verify that nothing is broken across the discovery, crawl, indexing, or visibility layers of modern search systems.

Semrush’s Site Audit tool helps you identify and fix technical issues quickly and efficiently to ensure your brand stays visible. Try it today.
