OK, so the Data.gov.uk stuff hit a raw nerve with me yesterday. In itself, it was pretty disappointing and the reporting also touched on one of my pet hates: Why do a lot of journalists not ask questions any more? They just seem to repost what they’re told. But, I’m past all that now, so here are a few thoughts on how a government data website could be better implemented.
The W3 put a bit of effort into a data browser extension for Firefox called Tabulator. It’s nice and deserves to be better. If I look at a page on Wikipedia with Firefox, say for example
http://en.wikipedia.org/wiki/Ystrad_Meurig
Then if there is data I am interested in as a starting point, I can go to Dbpedia and at http://dbpedia.org/resource/Ystrad_Meurig my data browser extension will find a graph of data that it can understand.
This is really nice.
The Data.gov.uk site has a query page for looking at its semantic data, but no clues as to what I can ask for. Dbpedia has a query page and from my Ystrad Meurig example I already know a lot about how Dbpedia might be storing its data. I know that I can ask for latitudes and longitudes using the W3 positioning predicates (using Tabulator I can browse from the data page to those predicates and find out more about them – it’s all linked). I know Dbpedia has a thing called ‘distance to Cardiff‘ so I could query Dbpedia for all things with a distance to Cardiff that have latitudes and longitudes and then I could plot them on a map.
This is properly linked data. This is what a government data site should be like.
I mentioned the Ordinance Survey ontology in yesterday’s rant. I like it, but it could be better. It has a solid structure for an administrative geography of the UK (not including Northern Ireland). However, the current version 2, is already out of date. A number of the unitary authorities were merged last year. This information is already reflected on the corresponding pages in Dbpedia, along with nice tagging to link the new authorities to the data on the authorities they have replaced.
The OS version 2 ontology replaced version 1 in a fairly unhelpful way, but that was OK because they said they were still playing around with how to work with ontologies. Will the next version play well with the current one?
The Dbpedia way of doing things means that not only do we end up with an up to date administrative structure but we also maintain a history of that structure. That history can be useful if we have to consider people and not administrations. A person might get around to acknowledging a change in the administrative make up of his area – eventually – but it wont happen immediately. The online structures need to be able to link them from old knowledge to new concepts. Here is the advantage in all these new ideas: Nothing leads to a dead end, everything is given more meaning by its connections.
The other Tim









Most Recent Comments