Company Update
• 22 Nov, 2019
Why the world still struggles with open geospatial data publishing
Realising the potential of open geospatial data will require us to think of data publishing in a fundamentally different way.
Recently I gave a presentation at FOSS4G Oceania 2019. The conference brought together folks from various ‘open’ communities around the region, including open source developers, open street map editors and open data users.
This wasn’t my first open data talk. In fact, we’ve been talking about the potential — but also the limitations — of open data publishing for a few years now. It’s a topic the team at Koordinates is passionate about. And from the very positive response afterwards, it seems most of the industry is as well.
What is open data?
First and foremost, what is open data? The strict definition is that open data is free of technical, price, and most legal restrictions on reuse. The basic principle of open data is that anyone on the planet should be able to access and reuse publicly-funded data — even for commercial purposes — without any requirement except to credit the copyright holders.
The decision to make data ‘open’ is made by the owner of the data, such as the government agency, the researcher, or the NGO that produced the data in the first place. For government agencies and research organisations, this decision is structured by various institutional and governmental policies and mandates.
In New Zealand, we have NZGOAL (a great acronym for the New Zealand Government Open Access and Licensing framework) and the Declaration on Open and Transparent Government. Australia has the originally named AUSGOAL (NZ copied Australia, to be fair), and other federal, state, and local governments have their own variations. Lots of universities and research institutions have open data policies, too.
What’s the problem?
This sounds great, doesn’t it? So what’s the problem? It’s fair to say that, as of 2019, progress with open data publishing has stalled. Some agencies, like Land Information New Zealand, have done an excellent job of publishing its data to tens of thousands of users. Others — and it’s fair to say these are in the majority — dump their data on a server, or FTP, or require you to formally email them with a data request.
Where portals are used, they generally haven’t changed or improved in the last six years despite rapid developments elsewhere in the geospatial industry. Metadata publication in portals is inconsistent. Raster imagery often isn’t supported. Service levels are unreliable. Licensing is absent, contradictory, or ‘pretty legal’. Data publishing is clearly an afterthought for most agencies and standards are persistently, frustratingly, low. Why is this still the case, well over a decade into the open data movement?
Lack of progress
There are a few reasons. From an outsider’s perspective, it’s easy to argue organisations simply aren’t treating data publishing as core business. This is true, but I think it risks letting technology providers off the hook. Data platforms also haven’t made it easy to publish data, especially large and complex geospatial data — or at least not as easy as publishers would like. This blog from the Civil Analytics Network at Harvard University, “An Open Letter to the Open Data Community,"outlines some of the issues government agencies have that are limiting their publishing activities.
I believe solving these issues will require data platforms to think of data publishing — and data distribution — in fundamentally different ways. Before I outline how our thinking at Koordinates has changed, I’d like to discuss an example of how data distribution works in the real world.
Real world data distribution
You may be aware that the New Zealand Government has an ambitious target of making New Zealand pest free by 2050. There is an enormous network of volunteer groups, local and central government agencies, NGOs, businesses, and individuals all working together towards this goal (noting, of course, this network existed prior to the Government’s target).
One of the ways these people can connect is by sharing data — after all, this is the only way the government is going to be able to track progress. Across the country, there are already hundreds of disparate, frequently updated datasets with no single point of truth or scalable way to collaborate.
Data complexity in the real world
This complexity is hard to incorporate into our usual way of thinking about open data, which is very much a “publisher publishes, user uses”, A-to-B relationship. In the real world, data distribution is simply much more complex. In our example of conservation data, we’re talking about a network, with an enormous number of potential transactions across many different nodes.
At present, this complex distribution of geospatial data — and real data collaboration — isn’t easy to achieve. It is usually managed by a messy system of cloud file-sharing systems, email and USBs. It would obviously be a great help for these volunteers to be able to immediately access up-to-date versions of all the geospatial data relevant to their project from one place.
A directly connected network
A more accurate view of data distribution is the model of a directly connected network, where organisations have multiple sources lying ‘upstream’ from them for a particular dataset, and other sources lying ‘downstream.’ An organisation might be the upstream source for dataset A, but a midstream node for dataset B, receiving the source, making changes, and distributing the augmented data to other stakeholders.
This view of data distribution helps explain some of the issues we face with the 'A-B' model of open data publishing. At each node we’re seeing a bunch of operations going on to make the data they have sourced fit for their specific purpose. It might be edited in a GIS. It might be clipped. It might have its attributes renamed. It might be combined with one or more other datasets to make a new product. There are actually dozens of potential operations used to make data fit for purpose or to add value.
This process actually similar to how products are made in other industries — it’s a supply chain.
Geospatial data supply chains
Our working hypothesis is that every organisation on the planet producing or using geospatial data is in a geospatial data supply chain. Part of that chain may involve accessing open data — authoritative data from government agencies are fundamental to the operation of the supply chain — but that ‘public’ side of the chain is by no means the entire picture.
At present, data supply chains take a lot of work to maintain. Geospatial data is big, complex, and often requires expensive tools and a lot of labour to maintain. That means people currently at the bottom of the supply chain are often using data which is quite out of date.
It’s all well and good for government agencies to aim to release more frequently updated data — but this in itself can become a problem as there’s a cost to bringing those updates into the internal systems of organisations across the network. While more updates and more data sounds exciting, it can be difficult to realise the potential of that geospatial data. Our current systems used to manage this supply chain —across government, industry, and civil society — can’t always cope.
Working towards a new model for open data
To solve the problems with open geospatial data publishing, we must solve the problems with geospatial data distribution writ large. We need to build an end-to-end solution for the geospatial data flows across these supply chains, while also significantly lowering the barriers for government agencies to publish their geospatial data openly.
At Koordinates, we’re working extremely hard to solve these problems — and we’ll be releasing some exciting features in the months ahead. The first of these, Data Forking, was released to existing customers last week (we’ll be making a bigger announcement on this blog shortly). Our next step will be new plans for our new-and-vastly-improved Koordinates Data Management, including free open data publishing for individuals and small teams. Watch this space.
Other News & Updates
Product
• 16 Feb, 2022
New License and Site Category Management
We have fully rebuilt and are about to launch our license and site category management functions. This brings improved performance and usability to the platform, and helps you to curate and refine your data in more ways.Read MoreProduct
• 23 Feb, 2022
Latest Desktop App release - v0.9.2
The latest version of our Desktop App has been released (v0.9.2) This represents extensive performance increases and other improvements to the App.Read MoreProduct
• 23 Feb, 2022
Introducing Multi-factor authentication and Okta support for Koordinates ID
Built with OAuth 2.0, the dominant web authentication standard in 2022Read MoreProduct
• 23 Feb, 2022
Koordinates QGIS Kart Plugin is now available
Allowing for direct Kart integration, get ready to unlock the full potential of QGISRead MoreProduct
• 17 Nov, 2021
Koordinates now supports GDA2020
The latest Australian national datum for mappingRead MoreProduct
• 29 Nov, 2021
Latest Desktop App release (v0.9)
Performance upgrades to enhance and improve speed and usabilityRead MoreProduct
• 24 Nov, 2021
Spatial filtering for Kart and QGIS Kart Plugin
The latest Kart release completes a major new feature for spatial filtering repositories and adds an upcoming plugin for QGISRead MoreProduct
• 30 Nov, 2021
New changes to our Data Management functionality
This feature lets a publisher/owner of data collect layers into a group, name it, and host it on their web portal.Read MoreProduct
• 04 Mar, 2020
Sno, our new open source tool for distributed data versioning
Sno brings the distributed, version-controlled workflows of software development to geospatial and tabular data.Read MoreProduct
• 19 Mar, 2020
Announcing Koordinates’ New Pro User Plan
We’re excited to announce our new entry-level Pro User plan for Koordinates Data Management.Read MoreProduct
• 10 Dec, 2019
Announcing new-and-improved Data Management for Teams
We've upgraded our Data Management Team Plan — and reduced the price.Read MoreProduct
• 18 Sep, 2019
Why Koordinates supports many (many) different file formats
Koordinates supports a wide range of formats beyond GIS, including DWG, CSV, KML, and geospatial PDF.Read MoreProduct
• 10 Sep, 2019
How Koordinates Solves the Problem of XML Metadata
Koordinates supports connected metadata sources, making it easier to publish and update XML metadata.Read MoreProduct
• 20 Aug, 2019
How Koordinates Handles Data Uploads
Koordinates supports a classic 'drag-and-drop' workflow, with a few special features under the hood.Read MoreProduct
• 15 Oct, 2017
Data is the key to unlocking the spatial revolution
Geospatial industry CMO Linda Stevens writes about the importance of unlocking Earth's data.Read MoreProduct
• 22 Jun, 2017
A data service, not a data portal
A data service is designed for all users, in all industries, and promises radically higher levels of reuse.Read MoreProduct
• 23 May, 2017
From open data to active data
In 2017, it’s no longer good enough to just ‘get open data out there.’Read MoreProduct
• 11 May, 2017
Solving the problem of dormant data
We now have the tools — policy, legal, cultural and technical — to solve the problem of data that is open but underused.Read MoreCompany Update
• 03 May, 2017
Talking Geospatial Data and Design
Next week our Head of Design, Dan Newman is talking at Techweek as part of the SIBA ‘Talk Geospatial’ breakfast series.Read MoreProduct
• 23 Mar, 2017
Realising the vision of open data
The connected data lifecycle shows how agencies can raise the bar and publish their open data better.Read MoreCompany Update
• 15 Mar, 2017
Open data and the Stats NZ Datarama
We’re excited to be a partner for the Stats NZ Datarama next Friday, 24 March.Read MoreProduct
• 10 Nov, 2016
The birds and the bees of software
A bit of magic happens when people from our various teams get together and cross-pollinate.Read MoreProduct
• 06 Nov, 2016
The future of open data publishing
The internet is awash in authoritative government data, much of it freely available. But are we seeing a radical increase in reuse?Read MoreCompany Update
• 21 Oct, 2016
Design Research
The design team at Koordinates has been busy researching the workflows and requirements of our publishers and users.Read MoreCompany Update
• 17 Dec, 2015
Welcome Statistics NZ!
Statistics NZ recently launched the prototype of their data service on the Koordinates platform, with a view to making their statistical data more easily consumable for users.Read MoreCompany Update
• 17 Apr, 2013
Koordinates & the LDS win
The LINZ Data Service, built using Koordinates and supported by our engineers, won both the Spatial Enablement category and the JK Barrie Award for Overall Excellence at the Asia Pacific Spatial Excellence Awards.Read MoreCompany Update
• 23 Apr, 2013
Our CEO chats with Kim Hill
Koordinates CEO Ed Corkery was invited by the Royal Society of New Zealand and the National Library to talk about open data with Radio New Zealand’s Kim Hill.Read MoreCompany Update
• 05 Jul, 2013
Minister Williamson praises the LINZ Data Service
By making it quick and easy to find, check and view datasets, our portals have enabled public agencies to reuse public datasets that already exist, saving a heap of time and money.Read MoreCompany Update
• 13 Jul, 2013
New Zealand local government data
Local government datasets have huge social and economic significance, and are used by everyone from architects and data professionals to home-owners and ratepayers.Read MoreProduct
• 13 Aug, 2013
Useful (and nice looking) open data
Students at Auckland University, spent their summer investigating how open data could help two groups, home buyers and professionals, make better decisions.Read MoreProduct
• 13 Nov, 2013
LINZ Data Service and the Global Open Data Barometer
New Zealand has placed fourth on the Global Open Data Barometer.Read MoreProduct
• 24 Jul, 2015
New Zealand National Broadband Map
The National Broadband Map allows you to enter your address and get an instant view of the different internet connection types available (or planned) in your area.Read MoreProduct
• 29 Oct, 2015
Getting what you want - even when it's not in the box
With cloud technology changing the playing field of publishing and sharing data, more organisations are making the transition to new software.Read MoreCompany Update
• 28 Oct, 2015
The changing economics of data publishing and sharing
The business models and subsequent economics associated with making data available are also evolving.Read MoreProduct
• 04 May, 2016
Mapping an ocean reserve at Cloud Break
Koordinates Customer Experience Manager Anne Harper helped map out the new Kurukuru Ocean Reserve in FijiRead MoreProduct
• 25 Sep, 2023
Desktop App Public Beta
The Desktop App is now in public beta, meaning anyone can download and try it today!Read MoreProduct
• 24 Sep, 2023
Try the 3D Map Viewer
Add another dimension to your data.Read MoreCompany Update
• 23 Sep, 2023
ISO27001 Certification
Ready to discover the secure, enterprise-grade geospatial data management capabilities of Koordinates?Read MoreProduct
• 22 Sep, 2023
Web Plugin added to Giraffe
Our web plugin allows you to embed a Data Browser into any map-based web applicationRead More• 20 Sep, 2023
Explore v10 UI
Take a in-depth tour of our new user interface, v10.Read MoreProduct
• 26 Sep, 2023