Skip navigation.

ODIN Blog

Opera Developer Network

Opera MAMA - a sneak peek at headings, images and summary

, , , , ,

MAMA has been busy these last few weeks and we have some early results coming in. Based on feedback and requests here is a snapshot of what the Web's made of and some quick and dirty  analysis. The results are based on 4,225,113 URLs tested from over 250 countries; more about the methodology can be found in Dev Opera. It's worth noting that the numbers do not all add up as URLs may fulfill a number of the criteria. Therefore, percentages are guidelines only.

Headings

  • No headings used: 2473197, 58.5%
  • H1 used multiple times in a doc: 336523, 7.9%
  • Headings don't start with H1: 681316, 16.1%
  • Heading levels were skipped: 479294, 11.3%
  • Heading levels were out of order: 298695, 7.1%

Structure on the web is fundamental for readable web pages by both search engines and assistive technologies such as screen readers and refreshable braille displays. With a heading structure missing on over half the pages tested, and implemented incorrectly on the majority of others these results are pretty disappointing - if not very surprising.

WebAim's screen reader survey findings for headings found that 52% of respondents relied on headings to navigate around pages. I think this is a woefully low number but is perhaps an understandable given less than half of the websites tested had any headings at all.

Constantly debated is the use - or rather misuse - of multiple H1s in a page, so much so there is even a website dedicated to the H1 debate. Should a page have just have one heading describing the unique page content or can it have two and include the website name as a second H1?

I'm not a fan of multiple H1s as it interrupts the logical page structure for screen reader users who rely on headings to get a mental overview of the page content and navigate it. Coding the website name as H1 also duplicates unnecessary information - a user can easily find out what site they are on by reading the page title, logo or other branded elements as well as copy.

The 11.3% of pages that skip heading levels may be evidence of headings are being used for presentation rather than structure. Often this is found in sites that fail to use good CSS and style text using H1-H6. This is especially confusing for a screen reader or refreshable braille display users as they get a partial and confused structure - a bit like a text book with half the index missing.

What MAMA shows is that we're far off from getting structure right yet. In the meantime we're snubbing search engines and assistive technologies, which get left out in the cold as they can't make sense of content. To get an idea of what I mean check out the Importance of HTML Headings for Accessibility video from Aaron Cannon.

Images

  • title used in an img element: 647651, 15.3%
  • alt used in an img element: 3176199, 75.1%
  • title and alt used together in same img element: 575544, 13.6%

The thorny issue of images doesn't really contain many surprises with still a quarter of web pages lacking any alt attribute whatsoever. This of course doesn't take into account pages that have a mix of images both with and without alt attributes. What is interesting is that just over 13% use both title and alt around the same image.

There's rarely a good use case for this and has more of a negative impact of bloating your code and creating usability woes for screen reader and refreshable braille display users. In my experience, title text is very misunderstood and generally used incorrectly. Many page authors duplicate alt - or link - text leading to a classic case of too much accessibility as some screen reader users are forced to hear text twice.

Once we have had more time to analyse the data I hope to be able to let you know how many of the title and alt text combinations replicate text exactly in both.

Summary

  • Table summary: 102821 times, 2.4%
  • 1829 out of 102821 URLs using summary use "layout table", 1.7%

A summary on a data table is a useful way for a screen reader user to get information about the data in the table and how it is organised. Its intended use is not, however, to describe layout tables which should be avoided (and CSS used) and also should not be made evident to the screen reader user.

In the results we can see that 1.7% of URLS are using layout tables due to the misguided use of "Layout table" as the summary. This is really a conservative estimate as we can't predict every value to be tested however MAMA did unveil numerous instances of other misguided summary text such as: "layout", "layout table", "header", "navigation", "content", "banner", "main", "main table", "breadcrumb", "category"...the list goes on. Terms like this are of little use to the user when read out and bloat the amount of information they are forced to wade through.

This is a prime example of how there is a disconnect between the HTML 4.01 specification and WCAG 2.0 with actual usage. As HTML 4.01 states "This attribute offers advisory information about the element for which it is set. It should never contain primary information and should not duplicate information. WCAG 2.0 makes explicit reference warning against usage of the summary attribute on layout tables:

...if a layout table is used, then the summary attribute is not used or is null. The purpose of a layout table is simply to control the placement of content; the table itself is “transparent" to the user. A summary would “break" this transparency by calling attention to the table. A null summary (summary="") on layout tables is acceptable.
Partly because of such bmisuse of the summary attribute on the web today the HTML 5 Working Group are advocating the removal of summary from HTML5. This is something that is not a popular with many accessibility advocates who argue that if used correctly on data tables to provide information about the table contents summary is beneficial. Fellow Opera Evangelist Bruce Lawson has some compelling ideas around summary in HTML5 as well as other accessibility implications for the new specification.

Coming next

We're only just starting to look at the data so if you have any further requests of what you want us to look at leave a comment. Full analysis and data will be published in due course together with breakdown by country. To get an idea of what we'll cover check out last year's MAMA results

.

Unleashed in the East: Ukraine University tour April 26-30Installing multiple Opera instances with separate profiles on Mac

Comments

Anonymous 19. May 2009, 12:26

Aaron Gustafson writes:

I think the majority of your analysis here is spot-on (the prevalence of missing headings is shocking), but I do disagree with you on your interpretation of why heading levels may be skipped.

While I do often focus on the overall outline of a document when considering use of headings, I also feel strongly that the site's name and supplementary information (such as sidebar content) warrant use of additional heading levels (for semantic reasons) even though they don't necessarily fit into the overall document outline. In my work, I usually use an H1 for the site name only on the home page, replacing it with a lower-level heading type (usually H4) on interior pages because it is important, but not as important as the content title. Similarly, I routinely employ mid-level headings (H3 & H4) in sidebars because they do have headings even if they don't necessarily relate to the overall document outline.

Semantically, my employment of headings makes sense, but if viewed solely through the lens of the document outline, it doesn't. I'm aware of this trade-off and have come to terms with it; I believe it's the best I can hope to do right now. Perhaps in future iterations of HTML, we'll have the ability to indicate whether or not certain heading levels should be considered part of the document outline at all. Maybe HTML5's SECTIONs will help solve that issue or perhaps ARIA roles are really the answer (after all the "sectionhead" base role allows for the declaration of a heading without specifying a level). In other words, it may not be an issue for much longer, but in

Bruce Lawson 19. May 2009, 14:32

The absence of table summaries could either support the html5 position that summaries are completely unused by authors and therefore dispensible.

Or, it could indicate that authors are doing the right thing and dropping the (WCAG 1-mandated) summary attribute for layout tables. (I'm assuming there are many more layout tables on the web than data tables).

Perhaps we should adjust Aaron Leventhals' heuristic for identifying layout tables http://markmail.org/message/t56dz7nuk2r7azh3 and see how many time summary is used where it should be on data tables?

Anonymous 19. May 2009, 15:04

Jared Smith writes:

One minor clarification - the WebAIM screen reader user survey found that 76% of users always or often use headings for navigation. Only 2% of respondents never use them. I'd consider this quite significant.

Henny Swan 19. May 2009, 15:39

@Aaron, thanks for the comments, all great points. Actually I don't think we disagree on why headings are skipped at all, you've simpley added more possiblies as to why it happens - and well put.

The issue of headings in the template and the content comes up quite a lot and as you suggest does not have to signify the end of the world even if it is not technically significant. The link to "multiple H1's" above also goes into how HTM5 may offer a solution for this too.

@Bruce - I think we should certainly see if we can get that added, great suggestions.

@Jared - I was looking at the "Whenever they're available" numbers but agree that the combined stats of "always" or "often" is significant. I just wish people could get a better experience than is obviously the case now.

John Faulds 21. May 2009, 12:31

With regards images that use both title and alt attributes: I doubt this is a conscious choice by most people. The high percentage is more likely due to the default that current versions of Wordpress force on an author when uploading and inserting an image. A title is mandatory for an image to be inserted, and if you don't manually change it, WP will take the file name of the image and make that the value for both title and alt attributes.

Henny Swan 21. May 2009, 16:39

I think you have hit on a good point here Tyssen in that many blogging software's such as Wordpress and other, larger, content management systems force the duplication.

I use Wordpress on my personal blog and am religious about removing TITLE. On larger sites where 3rd party CMS's are used it may be that the site owner needs to look at having it customised so that TITLE test is a choice not a requirement.

I guess this takes us to another point which is to choose your CMS with care and referencing the Authoring Tools Accessibility Guidelines.

How to use Quote function:

  1. Select some text
  2. Click on the Quote link

Write a comment

Comment
(BBcode and HTML is turned off for anonymous user comments.)

If you can't read the words, press the small reload icon.


Smilies