Accessibility / Findability Issues

Index in GoogleSearch google with site:sitename.com
Crawl vs. Index StatusNeed google webmaster
Server UptimeCheck status on https://www.uptrends.com/tools/uptime 
Robots.txtCheck status on http://sitename.com/robots.txt 
Use of Meta Robots TagGo to screaming frog tool and run a scan. In the ‘internal all’ report, scroll to the right to get the column named ‘Meta Robots 1’. Scroll down and check if any pages are marked as ‘no index’. Check if the marked pages are good and if so, why they have been ‘no-indexed’. Mark has fail if there’s a high proportion of pages no-indexed.
Meta RefreshUse Screaming frog internal_all report to check the meta refresh column. It should be blank for all pages.
Sitemap htmlhttp://www.gatherby.org/sitemap.html & without html
Blocked By Robots.txt, Blocked ResourcesScreaming frog > Response Code

  • Less than 7% = Pass
  • 7-15% = OK
  • More than 15% = Fail
No responseShould ideally be 0.
Success 2xx
  • 60% To 70 % Ok
  • Less than 60% = fail
  • 70% above  = Pass
Redirections 3xx , 4xx , 5xx
  • Less than 7% = Pass
  • 7-15% = OK
  • More than 15% = Fail

 

Page XML Sitemaps

Found Page XML Sitemaps?Check if www.website.com/sitemap.xml exists. Yes – pass, no – fail

Then check https://www.xml-sitemaps.com/validate-xml-site and http://freetools.webmasterworld.com/tools/site-validator/map.html  by entering  website.com. If valid, pass, else fail.

Page XML Sitemap(s) Listed in Robots.txtOpen the robots.txt file and check if the sitemap has been listed in it. 
Page XML Sitemap with Broken LinksCheck your XML Sitemap for errors with Screaming Frog

Download and save your xml sitemap
Choose Mode->List in Screamingfrog
Select URL list file
Choose the XML file and click “Open”
Click Start to start crawling.

Sort on Status code.

 If there are 4xx, 5xx, it’s a fail, else pass.

 

Mobile XML Sitemaps

Found Mobile Pages?Check if the site has a separate mobile version like http://m.website.com. (Most  websites now are responsive and don’t have a separate mobile site, in which case, mark as NA and the below checks can be skipped)
Found Mobile XML SitemapsIf there is a mobile version of the site, there should be a separate xml site map of the form http://m.website.com/sitemap.xml. Check if it exists manually and validate using https://www.xml-sitemaps.com/validate-xml-site and http://freetools.webmasterworld.com/tools/site-validator/map.html
Mobile XML Sitemap(s) Listed in Robots.txt Open the robots.txt file and check if the mobile sitemap has been listed in it
Mobile XML Sitemap(s) with Broken LinksCheck the mobile Sitemap for errors with Screaming Frog

Download and save your xml sitemap
Choose Mode->List in Screaming frog
Select URL list file
Choose the XML file and click “Open”
Click Start to start crawling.

Sort on Status code.

 If there are 4xx, 5xx, it’s a fail, else pass.

 

Video XML Sitemaps

Found Videos?If the site has videos, mark as pass, else NA and skip the following rows in this section 
Found Video XML SitemapsA video sitemap helps search engines to quickly identify video content and index it for video searches.
Video XML Sitemap(s) Listed in Robots.txtVisit yoursite.com/sitemap_video.xml – check if one exists.
Video XML Sitemap(s) with Broken LinksCheck the video sitemap using the screaming frog tool as mentioned in the page XML sitemap section.

 

Image XML Sitemaps

Found Images?Check if the site has images. (Most websites do)
Found Images XML SitemapsCheck if https://gatherby.org/sitemap_image.xml is found. If yes, pass, else fail.
Image XML Sitemap(s) Listed in Robots.txtIf the image sitemap exists, check if it is listed in the robots.txt.
Image XML Sitemap(s) with Broken LinksRun screaming frog check for the image sitemap and see if there are broken links.

Pagination

Pagination Present?Check if the site has prev and next buttons on each page. Many blog sites have this, when a large article is broken into multiple pages.This also applies to eCommerce sites with multiple product pages and long guides that is broken down into multiple pages.
View-All page AvailableMake sure view-all page is quick to load. User can easily navigate and find relevant information. Whenever a view all page is present for paginated content google makes effort to index the view-all page and google will also consolidate indexing properties to the view-all page. 
Use of session IDs or other unique identifiers/parametersCheck what happens when you more across paginated content. <link rel=”next” href=”page2&sid=9876” />. The sid should be the same. Keeping session id consistent you can tell google what paginated content belongs together
Use of splash or gateway pagesCheck if there is a splash page which loads before the home page is displayed.

 

Subdomains

Found subdomains?:Check if the website has subdomains like http://subdomain.website.com. Go to https://pentest-tools.com/information-gathering/find-subdomains-of-domain and enter the website URL into the box. If there are subdomains, they will be listed. Ignore the email.website.com, smtp.website.com etc which are used for emails.
Sub-domains have robots.txtIf there is a separate sub-domain, it should have its own robots.txt – http://subdomain.website.com/robots.txt  
Sub-domains have sitemapsIf there is a separate sub-domain, it should have its own robots.txt – http://subdomain.website.com/sitemap.xml    
Sub-domain indexingGoogle for site:subdomain.website.com and see if there are any pages returned. If the subdomain is intentionally created, like a blog which you want to be indexed and you can’t see any results, mark as fail, else pass. However, if it is an old version of the website which you don’t want Google to index, then mark pass if it doesn’t return any results. 

 

Breadcrumbs

Breadcrumbs Found and Available in All The PagesA trail, or secondary navigation, clearly visible to website users to help them navigate your website. Check for items in the submenu. Is it showing something like Home> Blog> How to check for Breadcrumbs for sub topics?
Breadcrumb Visual Representation (last trail path NOT clickable)If the breadcrumb path is visible, check if the individual items are clickable and if you are able to go back to the previous step in the path. 

 

Architecture Issues

Site accessible without Javascript (disable JS)Disable javascript on the browser and check if the site is loading normally. Navigate through the site. If everything functions as usual, mark as pass, else fail. 
Breadcrumbs NOT in JSThe breadcrumb path should work normally even if javascript is disabled. Check a few links after disabling javascript on browser. 
Links coded in HTML (NOT JavaScript)Turn off javascript and try clicking links on the site. They should work as usual.
TLM Reflects Order in Terms of Relevance of ContentWhether every page has a link from the home page via Top level menu. Check some random URLs obtained after doing a Screaming Frog crawl. 
Footer optimizationCheck if the page footer has items like FAQ, locations page, privacy policy, careers, HTML sitemap and other pages important pages that search engines look for to determine the quality of your website. 
Site Directory StructureDo a screaming frog scan, choose a few random URLs and see how far they are from the home page. Ideally, they should be within 4 clicks from the home page. 

 

URL Issues

Hyphens used as default delimiter in URLsRun screaming frog crawl and check the URLs in the list for delimiters other than -.
Overall URL friendliness (short and easy to share)Check the same URL list on screaming frog and see if the URLs are short and easy to share. 
Overall URL optimization (usage of target keywords)Check the screaming frog URL list to see if all URLs have meaningful slugs and if you are able to make out what the page is about, just by looking at the URL. Do a quick scan of the list. 
Absolute vs. Relative URL References in Source CodeOn any given page on the website, right click and select “view source code”.
Find an internal link.
If it’s coded as <a href = “http://yoursite.com/link/” > anchor</a>, it’s an absolute link.. Absolute URLs are better from SEO point of view but relative are better from page load time point of view.

 

On-Page Tagging/Markup Issues

Page Source META tags (ie.: <title>, meta description in head)Check the overview box on the right side of screaming frog. Under page title, the ‘missing’ should be 0% and under meta description, the missing should be 0%. 

 

<title> tag issues

Missing <title>Check Screaming frog results box on the right side. Missing percentage should be 0 for it to Pass. 1-10% – Ok. Above 10% – fail.
Duplicate <title>Duplicate under title should be 0 for pass.1-10% – Ok. Above 10% – fail.
Over 65 characters0 for pass.1-10% – Ok. Above 10% – fail.
Below 30 characters0 for pass.1-10% – Ok. Above 10% – fail.
Over 571 pixels0 for pass.1-10% – Ok. Above 10% – fail.
Below 200 pixels0 for pass.1-10% – Ok. Above 10% – fail.
Same as H1The H1 and Title should ideally be the same but it cannot be strictly enforced. One can make use of 2 variations of a keyword for better SEO (for eg. ‘Digital Marketing’ and ‘Online Marketing’. However, you need to make sure these are not completely different as it can lead to poor user experience and SEO penalty. 

Screaming frog tool will only check exact matches. If this score is above 70% it can be marked as a pass. Between 50-70% is ok and below 50% needs to be investigated by checking the title and H1 of each non-matching page individually and if the variation is only a little (keyword synonyms used for SEO benefit) they can be manually marked and the percentage of matches can be calculated manually. 

MultiplePages which have more than one <title> tag. This has to be 0% for pass; >0% but < 15% – OK; > 15% – Fail

<meta description> issues

 

Missing Meta Description0% – pass; >0% but < 15% – OK; > 15% – Fail
Duplicate Meta Descriptions0% – pass; >0% but < 15% – OK; > 15% – Fail
Over 156 characters0% – pass; >0% but < 15% – OK; > 15% – Fail
Below 70 characters0% – pass; >0% but < 15% – OK; > 15% – Fail
Over 928 pixels0% – pass; >0% but < 15% – OK; > 15% – Fail
Below 400 pixels0% – pass; >0% but < 15% – OK; > 15% – Fail

 

The meta description has to be between  70-156 characters in length for best result. Below 70 characters is bad because it is an opportunity to showcase your page to search engine visitors and entice them to click the link to your page. Above 156 is bad because search engines will truncate the characters. 

 

The pixel length is another way of looking at it because some characters are wider than the others, eg ‘l’ takes up lesser number of pixels than ‘w’.


Page Headings (H1-H6))

Check screaming frog results in the right side box.

 

Missing H1 Heading0% – pass; >0% but < 15% – OK; > 15% – Fail
Duplicate H1 Headings0% – pass; >0% but < 15% – OK; > 15% – Fail
Over 70 characters0% – pass; >0% but < 15% – OK; > 15% – Fail
Multiple H10% – pass; >0% but < 15% – OK; > 15% – Fail
Missing H20-20%- pass; 20-50% – OK, >50% fail  (It’s ok to have missing H2 tags but not for a large proportion of pages. H2 tags are used when a page is logically split into sections. If most of the pages have a missing H2 tag, it could mean that the content is not split into sections and is written as one big paragraph, which makes it less readable.)
Over 70 characters0% – pass; >0% but < 15% – OK; > 15% – Fail

 

Structured Data

 

Use of Structured DatamarkupGo to
https://search.google.com/structured-data/testing-tool and enter the URL of the site. Check if there is any data on the right side box. If yes, pass, else fail.

 

Main Body Content

Key Content LocationChoose a few random pages, right click and select click view source. Check the location of H1, H2 tags. They should be near the top of the pages. Mark as pass if the header tags and main content of page are near the top.
Rich Semantic Content (NOT thin)Open Screaming Frog and enter the domain.
Scroll over to the right to find “Word Count” and look for pages with less than 300 words (outside of trivial pages ie. contact, about).

(Consider downloading the internal_all file and checking thru excel)

It should ideally be less than 20%. 

Expired ContentManually check the pages – blog, news etc to see if there is any time-related content and if there are very old articles – dated 3-4 years back. 

SEO Video

Found Videos?Check if the site has any videos posted
Video XML SitemapIf there are videos, there has to be a video XML sitemap

 

YouTube Optimization

Applies to sites which have official YouTube channel.

 

Average Optimization Score
Channel Optimization:

Domain hosted Videos

Applies when the videos are hosted on the same domain.

 

Video files in one directory
Keyword in the filename and URL


SEO Images

Run screaming frog scan. Check the summary section on the right. Check the ‘Images’ section.

 

Image File Size (>100 kb)Larger image sizes slow load time on a page.

Slow load time leads to bad user experience having to wait for images to render. Mark as fail if >20% are over 100kb, OK if less than 20% is over 100kb. Pass for 0%

Missing Alt TextCheck screaming frog result under images for the Missing Alt Text value

0 – pass; <20% is ok. > 20% – fail

Alt Text over 100 CharactersCheck screaming frog result under images

0 – pass; <20% is ok. > 20% – fail

 

International SEO

 

Applies to international websites 

 

Client’s business targets: multiple countries or languages?
Use of Multiple ccTLDS
Use of subfolders or subdomains (within one ccTLD)
Language Markup
Language set
Alternate Tags (Rel=”Alternate” Hreflang=”x”)
Alternate language HREFLANG directive set in head of Source Code
Alternate language directive set in page XML sitemap.

 

Potential Triggers of Duplicate Content

 

Potential Trigger: www vs. non-wwwOpen the www version and non-www version of the site and a few of the pages. Either all should redirect to www or non-www. There shouldn’t be a situation where some pages load as www and some load as non-www and there are multiple versions of the same page – one on www and one on non-www. Screaming frog scan can reveal this, if the problem exists. Mark as pass if there are no two version and fail if there are two versions 
Potential Trigger: htttp vs. httpsCheck if the site has both http and non http pages. If yes, there is a potential duplicate content issue and mark as fail. If there is only one version, mark as pass. 
Potential Trigger: Redirected Index File NamesCheck if the home page is being redirected to some other page. (Screaming frog will show any 301/302 redirects if they exist; ignore the redirections to www or non www versions). If there are redirections, mark as fail. Otherwise, pass. 
Potential Trigger: Redirected Slash and Non Slash URLS

(ie.: www.example.com/sample/ vs. www.example.com/sample)

A quick scan of the internal_all file downloaded from screaming frog will reveal if there are multiple versions of the same page with ‘/’ and without ‘/’. Download the file and sort on the first column. The same URLs with / and without / will appear adjacent to each other. If there are 2 versions, mark as fail, else mark as pass.
Potential Trigger: Uppercase/Lower Characters Automatically FixedSelect a few random URLs on the site from screaming frog internal_all spreadsheet. Change some random characters in the URL to upper case and check what happens. If the pages load fine, pass. Else fail. 
Other Duplicate Content Found Within the Same DomainCheck if there is any chance of duplicate content by submitting the URL to http://siteliner.com/. If there are any duplicates listed, examine them and see if they are just snippets and headers/footers or any main content has been duplicated.
Other Subdomains or Client’s DomainsCheck if the page has subdomains where the content can be replicated, for example www.website.com/subpage and subpage.website.com Mark as pass if there are no subdomains and fail if there are subdomains and the possibility exists.
Duplicate Content Handling Use of the Rel=Canonical Tag:If there is duplicate content as shown in siteliner, check if there is a redirect set up from one page to the other. Pass if it is set, fail if not set and NA if there is no duplicate content.
Duplicate Content Handling : Within Pages in DomainIf there is any other method of handling duplicate content, mention pass.

Social Issues

 

Google + account:Pass if the page has a G+ page associated.Type in the name, phone and google plus in the Google search bar.
Implemented rel=”author”Shows a rich snippet of the author in the SERP.

Attributes content on a page or post to a particular author (i.e. digital signature). Currently Google has suspended authorship markup; however may bring it back in future.

Since this feature isn’t in use now, mark as NA and move on.

Social Sharing:Check if the articles have FB/Twitter/G+ sharing links which allow visitors to post the page link onto their own social media pages. Mark pass if yes and fail if no.
Open Graph Snippet: Facebook (custom search in SF for meta property=”og:”)Check the source code.
Type in the meta tags “og” manually. If tags like these exist, mark pass, else fail.

meta property=”og:title” content=”Gather By MGO 1,069 Bioactive Honey”/>
  <meta property=”og:url” content=”http://www.gatherby.org/gather_by_active_honey”>
    <meta property=”og:type” content=”article”>

Does have OG directive?
Does have Facebook sharing buttonCheck if articles have the Facebook sharing button. If yes, mark pass, else fail.
Not multiple
Editorial optimized
Twitter Card Snippet (searched “<custom search in SF for meta name=”twitter:”):Manually check for the meta name “twitter” in the source code. Mark pass if it’s present, and fail if not present.
Does have Twitter directive? 
Does have Twitter sharing buttonsCheck if there are twitter sharing buttons, mark pass if they are present and fail if they are not.
Not multiple
Optimized for social sharing

 

Page Speed

Go to https://developers.google.com/speed/pagespeed/insights/ and enter URL. Pass if the test is in green, fail if Red and Ok if Amber

Google Page Speed Insights (desktop) – home page, landing pagePass if the test is in green, fail if Red and Ok if Amber
Google Page Speed Insights (mobile) – home page, landing pagePass if the test is in green, fail if Red and Ok if Amber
Speed test using GTMetrix – home page, landing page (PageSpeed)
Speed test using GTMetrix – home page, landing page (YSlow Score)

 

Protocols

Check Screaming frog results. It will give the percentage of pages which are http and https.

 

HTTP
HTTPS

 

Crawler Tools

Xenu Crawler
  1. Enter main website url and start scan
  2. Save report file and upload in project folder in google drive
Screaming Frog Crawler
  1. Enter main website url and start scan
  2. Save individual results in  the technical audit sheet

 

Leave a Reply

Your email address will not be published. Required fields are marked *