Skip navigation.

exploreopera

| Help

Sign up | Help

Tom Heath's Displacement Activities

Posts tagged with "workshop"

VoCamp, Day Zero

, , , ...

Tomorrow is the first day of the first ever VoCamp. It may also be the last ever VoCamp, but I hope and believe that will not be the case. Around 20 of us have gathered in Oxford for two days for an event from which none of us quite knows what to expect. The goal is to help drive forward the creation, publication and utilisation of vocabularies for describing data on the Web.

In the last couple of years we, as in the Semantic Web community, have learned a great deal about how to publish data on the Web. As we've become more familiar with this process we've got better at knowing where to look to find existing data that could be published online according to Semantic Web and Linked Data principles. What hasn't kept pace with this process is the availability of vocabularies/ontologies for describing this data. I may now be able to get hold of data about changes in polar ice caps and polar bear migration patterns, but would bet money that there's no vocabulary with which to describe this data. Choose almost any domain and the situation will be the same.

If we're serious about building a Web of Data, then this issue has to change. I see this from my own work, but Peter Mika's experiences at Yahoo!, and the strength of his conviction (conveyed very nicely in this blog post), provide some great confirmation that I'm not alone in this perception. The vocabulary bottleneck has to be eased.

So, tomorrow is a chance for us to start changing that. The solution won't come overnight, but I hope that we can start the ball rolling. VoCampOxford2008, and any VoCamp in fact, is about creating some dedicated time and space to create and publish vocabularies in domains that interest us. We all have grand ideas while waiting at the bus stop/traffic lights, doing the washing up, wherever, about cool domains we could model and in which we could publish data, but without some ring-fenced time in which to do so these plans can easily come to nothing. VoCamp aims to solve that.

The primary success criteria for the next two days will be the publication of new vocabularies on the Web that increase the availability of Linked Data. That's the main goal, but there are many others. I am confident that this first VoCamp will be an opportunity to share issues, expertise, modeling techniques and design patterns. In doing so we will all become smarter. There is an opportunity to scope requirements in the wider Semantic Web field that impact upon the availability and reuse of vocabularies. Collectively we can identify missing pieces of the technical infrastructure required by the Web of Data, and begin to build a social infrastructure that helps us collectively ease the vocabulary bottleneck.

These are grand goals. Even if none of them were to be achieved, there is one other goal which I'm sure will be met; that is determining whether the VoCamp format works, and if so how. If the format fails, we'll need to look elsewhere for a solution. If it succeeds, fully or partially, we'll be closer to knowing how to do it even better next time.

Making Links at the BBC

, , , ...

Ian and I spent last Friday at BBC Television Centre in London. For anyone of my generation who grew up in the UK this place probably has an almost mythical status, as the place to send your competition entries or milk bottle tops for the latest Blue Peter appeal. We were there for a workshop on the theme of the Semantic Web, organised by Nicholas Humfrey and Patrick Sinclair from BBC Audio and Music Interactive.

Not only was it a privilege to get a look inside this great institution, it was great to see so many BBC people turn up to hear about the Semantic Web. Nick and Patrick had put together a very nicely structured programme, introducing people to the Semantic Web from the conceptual level of Linked Data (that was my bit), through a talk on DBpedia by Georgi Kobilarov, to the highs and lows of enterprise scale RDF storage as revealed by Steve Harris of Garlik, and finally to interfaces for structured data as presented by Daniel Smith from the University of Southampton. Hope all the slides will be linked to from the BBC Radio Labs blog in due course. In the meantime you can find mine here.

Aside from the inherent pleasure associated with talking to people about the Semantic Web, the highlight of the day for me was getting a sneak preview of the Linked Data work that's going on within the BBC, and will hopefully soon see the light of day on the public Web. The /programmes area of the BBC site will be home to large amounts of RDF data about programmes going out across all channels, and each will be identified by a dereferenceable URI.

This is a huge deal, and testament to the hard work put in by people like Nick, Patrick and Michael Smethhurst from the BBC, with input from people like Yves. There is already a public commitment to linked data principles at http://www.bbc.co.uk/programmes/developers , but what impressed me most was the extent to which linking to external data sets seems to be baked into the thinking from day one. Expect to see strong links to Musicbrainz in the first instance, and no doubt to many more data sets over time.

The BBC are well ahead of the game here. They don't have an angry mob of license-fee payers at the gates demanding access to BBC data in RDF, with chants of "give us our data, we've paid for it already" (or hopefully something more poetic). This mob will never materialise. They've seen the willingness of the BBC in this area with previous initiatives such as the Catalogue, and are down the pub dreaming up ways to use this data. With the advent of the current work on Linked Data and /programmes the non-mob have even more to dream about.

Perhaps as a publicly-funded organisation the BBC is obliged (morally or otherwise) to be a good citizen of the Web of Data. However, I don't get the impression that that's what this is about, in the first instance at least. I'm left with the feeling that this is a result of a bunch of guys really getting the Web of Data, and seeing the value that links can bring to their organisation.