It seems somewhat absurd that publicly funded institutions in South Africa should be allowed to copyright data produced using public funds. Of course, it is reasonable to expect that physical assets such as buildings, vehicles or machinery should appear on their balance sheets and be reserved for their exclusive use. But knowledge? I’m not so sure.
Assuming that the information “freely” available on the websites of publically funded institutions cannot reasonably be considered a state secret, the revelation of which would harm national interests, I would expect that it cannot be owned or its use restricted to taxpayers.
Here’s a recent example:
The Municipal Demarcation Board is an independent organisation tasked by government with determining the borders between municipalities in South Africa. It is publicly funded, having received R38.5m ($4.3m) last year from the national treasury. Boundaries are published in the Government Gazette and are considered to be public knowledge. I am not a lawyer, but intuitively, this data cannot be considered to enjoy protection of copyright laws.
In order to clarify the situation, I contacted the board and asked whether, as a citizen of South Africa, I was allowed to download the data from its site and use it for commercial purposes. The response that I received was: “Unfortunately there is a lot of commitment to our data and it is copyright to us and we cannot allow you to use it commercially at all.”
As an exercise, I surveyed copyrights on the websites of four national agencies, including the national statistics agency, and the electoral commission, as well as the websites of all nine provinces and the eight metropolitan municipalities. In some cases, licensing was unclear or missing. In the remaining cases, copyright was claimed and commercial use excluded, even for derived data products. This is reminiscent of the GPL (general public licence) but in reverse (all derived products should be restricted under the same rules). Most licences allowed for personal use and for reference purposes while some granted re-distribution rights (to third-parties for personal use or reference purposes).
It is of course possible that these copyrights have been applied as boilerplate by over-zealous lawyers, but our one data point – the response of the Municipal Demarcation Board – suggests that this is not the case.
As an exercise, I emailed each of these agencies requesting special permission for commercial rights (commercial is important here – in lieu of an actual open data policy, commercial rights give me the broadest possible scope for data use that I can hope for). I am eagerly awaiting their responses. You can follow the action on this spreadsheet as I update their policies as the replies come in.
Taking a step back, we should understand why this is important and not just a bunch of navel-gazing. After all, we have a few really fantastic datasets “freely” available (free as in beer). The elections datasets (e.g. here and here) are unprecedented in their depth and detail. The City of Cape Town manages a very rich dataset on municipal valuations, and the National Assembly hosts an intriguing database on gifts received by parliamentarians and other elected officials. However, aside from the fact that most of our data is not in machine-readable format (easily solved by web-scraping although this technology is itself encumbered by sticky legal questions), we are explicitly excluded from using this data for anything but personal use.
Open data has not yet percolated into the collective national consciousness. Some momentum has started to build through a number of niche groups of individuals seduced by the goings-on up north and even on the African continent. For this small sign of life to grow into a fully-fledged ecosystem of data consumers and producers, we need greater access to data under the stewardship of public entities. We need to be able to access that data for personal and for commercial use. We need to be able to download it and distribute derived products. And we should not have to ask for permission every time we do so.
Our post-Apartheid Constitution guarantees the right to access of information. It’s time that citizens of South Africa asserted that right. DM
Adi Eyal (@soapsudtycoon) is a data geek working towards creating an open data movement in South Africa. He came over from the data dark-side, collecting personal data for nefarious purposes. He now believes the data can be used to encourage public discourse in a country with an historically disengaged citizenry. He is a co-founder of the Open Data and Democracy Initiative, an organiser of HacksHackers Cape Town and a member of the Open Knowledge Foundation Cape Town chapter.
Bladerunner (1980s version) is a visual feast due in large part to the Hollywood Actors Strike. This allowed the designers an extra three months to refine the sets and props.