Opera MAMA - a sneak peek at headings, images and summary
By Henny Swan. Tuesday, 19. May 2009, 09:07:36
MAMA has been busy these last few weeks and we have some early results coming in. Based on feedback and requests here is a snapshot of what the Web's made of and some quick and dirty analysis. The results are based on 4,225,113 URLs tested from over 250 countries; more about the methodology can be found in Dev Opera. It's worth noting that the numbers do not all add up as URLs may fulfill a number of the criteria. Therefore, percentages are guidelines only.
Headings
- No headings used: 2473197, 58.5%
-
H1used multiple times in a doc: 336523, 7.9% - Headings don't start with
H1: 681316, 16.1% - Heading levels were skipped: 479294, 11.3%
- Heading levels were out of order: 298695, 7.1%
Structure on the web is fundamental for readable web pages by both search engines and assistive technologies such as screen readers and refreshable braille displays. With a heading structure missing on over half the pages tested, and implemented incorrectly on the majority of others these results are pretty disappointing - if not very surprising.
WebAim's screen reader survey findings for headings found that 52% of respondents relied on headings to navigate around pages. I think this is a woefully low number but is perhaps an understandable given less than half of the websites tested had any headings at all.
Constantly debated is the use - or rather misuse - of multiple H1s in a page, so much so there is even a website dedicated to the H1 debate. Should a page have just have one heading describing the unique page content or can it have two and include the website name as a second H1?
I'm not a fan of multiple H1s as it interrupts the logical page structure for screen reader users who rely on headings to get a mental overview of the page content and navigate it. Coding the website name as H1 also duplicates unnecessary information - a user can easily find out what site they are on by reading the page title, logo or other branded elements as well as copy.
The 11.3% of pages that skip heading levels may be evidence of headings are being used for presentation rather than structure. Often this is found in sites that fail to use good CSS and style text using H1-H6. This is especially confusing for a screen reader or refreshable braille display users as they get a partial and confused structure - a bit like a text book with half the index missing.
What MAMA shows is that we're far off from getting structure right yet. In the meantime we're snubbing search engines and assistive technologies, which get left out in the cold as they can't make sense of content. To get an idea of what I mean check out the Importance of HTML Headings for Accessibility video from Aaron Cannon.
Images
-
titleused in animgelement: 647651, 15.3% -
altused in animgelement: 3176199, 75.1% -
titleandaltused together in sameimgelement: 575544, 13.6%
The thorny issue of images doesn't really contain many surprises with still a quarter of web pages lacking any alt attribute whatsoever. This of course doesn't take into account pages that have a mix of images both with and without alt attributes. What is interesting is that just over 13% use both title and alt around the same image.
There's rarely a good use case for this and has more of a negative impact of bloating your code and creating usability woes for screen reader and refreshable braille display users. In my experience, title text is very misunderstood and generally used incorrectly. Many page authors duplicate alt - or link - text leading to a classic case of too much accessibility as some screen reader users are forced to hear text twice.
Once we have had more time to analyse the data I hope to be able to let you know how many of the title and alt text combinations replicate text exactly in both.
Summary
- Table
summary: 102821 times, 2.4% - 1829 out of 102821 URLs using
summaryuse "layout table", 1.7%
A summary on a data table is a useful way for a screen reader user to get information about the data in the table and how it is organised. Its intended use is not, however, to describe layout tables which should be avoided (and CSS used) and also should not be made evident to the screen reader user.
In the results we can see that 1.7% of URLS are using layout tables due to the misguided use of "Layout table" as the summary. This is really a conservative estimate as we can't predict every value to be tested however MAMA did unveil numerous instances of other misguided summary text such as: "layout", "layout table", "header", "navigation", "content", "banner", "main", "main table", "breadcrumb", "category"...the list goes on. Terms like this are of little use to the user when read out and bloat the amount of information they are forced to wade through.
This is a prime example of how there is a disconnect between the HTML 4.01 specification and WCAG 2.0 with actual usage. As HTML 4.01 states "This attribute offers advisory information about the element for which it is set. It should never contain primary information and should not duplicate information.
WCAG 2.0 makes explicit reference warning against usage of the summary attribute on layout tables:
...if a layout table is used, then thePartly because of such bmisuse of thesummaryattribute is not used or is null. The purpose of a layout table is simply to control the placement of content; the table itself is “transparent" to the user. Asummarywould “break" this transparency by calling attention to the table. A nullsummary(summary="") on layout tables is acceptable.
summary attribute on the web today the HTML 5 Working Group are advocating the removal of summary from HTML5. This is something that is not a popular with many accessibility advocates who argue that if used correctly on data tables to provide information about the table contents summary is beneficial. Fellow Opera Evangelist Bruce Lawson has some compelling ideas around summary in HTML5 as well as other accessibility implications for the new specification.
Coming next
We're only just starting to look at the data so if you have any further requests of what you want us to look at leave a comment. Full analysis and data will be published in due course together with breakdown by country. To get an idea of what we'll cover check out last year's MAMA results
.






Anonymous # 19. May 2009, 12:26
I think the majority of your analysis here is spot-on (the prevalence of missing headings is shocking), but I do disagree with you on your interpretation of why heading levels may be skipped.
While I do often focus on the overall outline of a document when considering use of headings, I also feel strongly that the site's name and supplementary information (such as sidebar content) warrant use of additional heading levels (for semantic reasons) even though they don't necessarily fit into the overall document outline. In my work, I usually use an H1 for the site name only on the home page, replacing it with a lower-level heading type (usually H4) on interior pages because it is important, but not as important as the content title. Similarly, I routinely employ mid-level headings (H3 & H4) in sidebars because they do have headings even if they don't necessarily relate to the overall document outline.
Semantically, my employment of headings makes sense, but if viewed solely through the lens of the document outline, it doesn't. I'm aware of this trade-off and have come to terms with it; I believe it's the best I can hope to do right now. Perhaps in future iterations of HTML, we'll have the ability to indicate whether or not certain heading levels should be considered part of the document outline at all. Maybe HTML5's SECTIONs will help solve that issue or perhaps ARIA roles are really the answer (after all the "sectionhead" base role allows for the declaration of a heading without specifying a level). In other words, it may not be an issue for much longer, but in
Bruce Lawson # 19. May 2009, 14:32
Or, it could indicate that authors are doing the right thing and dropping the (WCAG 1-mandated) summary attribute for layout tables. (I'm assuming there are many more layout tables on the web than data tables).
Perhaps we should adjust Aaron Leventhals' heuristic for identifying layout tables http://markmail.org/message/t56dz7nuk2r7azh3 and see how many time summary is used where it should be on data tables?
Anonymous # 19. May 2009, 15:04
One minor clarification - the WebAIM screen reader user survey found that 76% of users always or often use headings for navigation. Only 2% of respondents never use them. I'd consider this quite significant.
Henny Swan # 19. May 2009, 15:39
The issue of headings in the template and the content comes up quite a lot and as you suggest does not have to signify the end of the world even if it is not technically significant. The link to "multiple H1's" above also goes into how HTM5 may offer a solution for this too.
@Bruce - I think we should certainly see if we can get that added, great suggestions.
@Jared - I was looking at the "Whenever they're available" numbers but agree that the combined stats of "always" or "often" is significant. I just wish people could get a better experience than is obviously the case now.
John Faulds # 21. May 2009, 12:31
Henny Swan # 21. May 2009, 16:39
I use Wordpress on my personal blog and am religious about removing TITLE. On larger sites where 3rd party CMS's are used it may be that the site owner needs to look at having it customised so that TITLE test is a choice not a requirement.
I guess this takes us to another point which is to choose your CMS with care and referencing the Authoring Tools Accessibility Guidelines.