Background Image for Why Open Data leads to Higher Quality Data

Open Data

Why Open Data leads to Higher Quality Data

We are often told Geolytix are foolish to ‘give away’ data we have created, especially when we know we could license that data for good money.

20th February 2013

Well, more foolishness… today we have released the full data for the top ~350 town and city centre Retail Places, to join our fantastic postal sector boundaries and a beautifully processed set of Census Data in the Open Data world. The reasons that a for-profit small business may choose to release their work as open are well rehearsed; reputation building; making money from services; licensing fuller and/or more recent versions are all valid. But one driver is a little more subtle.

QUALITY

In today’s agile world the mantra is release early, release often. But I worry that the ‘minimally viable product’ concept can sometimes be used as an excuse for releasing half-baked work. I do not want Geolytix to ever be thought of as doing shoddy work. Releasing some of our data as open forces us to get it as right as we possibly can straight out the gate. It may seem perverse, but when you have a handful of paying clients it is easier to release poor work; you knock up a patch, send out an update, re-do the release notes, apologise, make your excuses and move on. That is hard to do with Open Data.

Over 400 people have already downloaded our postal boundaries. In just four days and based of little more than a tweet and a handful of e-mails, nearly 100 people have downloaded the processed Geolytix UK Census pack. We very deliberately have made downloading the data as easy as possible, and we harvest no contact details (another thing people tell us we’re mad not to doing). This means we have no control, cannot recall or apologise. We have no way of knowing how many actual ‘users’ of the our data are out there – maybe only a handful, probably several hundred, maybe 1000′s.

As we go through our data production, document writing and quality checking, about once every 10 minutes I think about those end-users. Releasing Open Data is about building a reputation, and we know several hundred of our peers, competitors and potential partners and customers will be using our data. That puts a huge tension in the quality bow. Our data isn’t perfect (data never can be), but we spent long evenings and weekends working to get it as good as we possibly could. I’m sure producers of ‘closed data’ (heck that includes us!) work just as hard on quality, but when you *know* that 1000′s of people will be using your data from the the get-go…. you really do push that little bit harder to get it right first time. So that’s one more reason we ‘do’ Open Data.

Releasing Open Data forces us to keep our standards high

Download our open Retail Places here.

Related Posts

  • Geolytix Supermarket Retail Points V19

    Geolytix Supermarket Retail Points V19

    18th February 2021

    Our comprehensive data set of supermarket and convenience store locations across the UK, Retail Points, has now been updated and released the 19th version.

  • Supermarket Retail Points

    Supermarket Retail Points

    18th January 2021

    Retail Points is a comprehensive data set of supermarket and convenience store locations across the UK. Geolytix release this as open data allowing for unrestricted use, no licensing requirements and without cost.

  • Geolytix Retail Points - August 2020 - Including UK coverage of Spar

    Geolytix Retail Points - August 2020 - Including UK coverage of Spar

    25th August 2020

    As a result of the COVID-19 outbreak, many supermarket store openings and closures for this quarter were delayed.