Open data and big data privacy
Read about types of big data (open, shared and closed) and access a free white paper on open data privacy and security.
If your business is trying to stay current with what’s happening in technology and data science, one term that you should be familiar with is “open data.” Read this post for the answer to “What is open data?” and download a free white paper about open data and big data privacy and security.
What Is Open Data? How Can Businesses Release It Responsibly?
Open data is data that is available for anyone to “access, use or share” — without restriction or limitation. While open data is free from restriction on its use, audience or redistribution, it introduces the highest risk of exposure to external threats, in addition to providing the maximum potential value to the general public as a utility.
For quite some time, public institutions, local governments, and health agencies have been in the practice of publishing and providing their data for public use. Privately-run businesses are also starting to release open datasets as well, and this paper introduces the methodology that businesses should use for responsible data release.
Some examples of open datasets from the public sector:
- Voter registries
- Census information
- Health and safety statistics
- Environmental monitoring data
- Research information
The Data Spectrum: Open, Shared and Closed Data
Along with open data, there is also shared data and closed data.
Shared data is generally available to specific people or groups, but can also be made available publicly. In the case of shared data, the data provider is responsible setting controls and conditions on sharing and use, in the form of various agreements, licenses, and terms.
Closed data is the most tightly controlled category of data due to its sensitivity and it “can only be accessed by its subject, owner, or holder” as defined by the Open Data Institute. Because of its limited exposure, closed data has the lowest inherent privacy and security risks when handled properly.
All data (small and big) resides on a continuous spectrum going from closed on one side to open on the other. The Open Data Institute (ODI) illustrates this spectrum on their graph of big data release categories (shown below). Where a dataset falls on this spectrum is determined by a number of factors including access, audience, controls, and conditions of use.
Privacy and Security of Open Data
It is critical for any businesses considering open access and data sharing to take measures to properly secure and responsibly manage that data.
Geotab has published a white paper that provides an overview of shared and open data and proposes a framework for responsible data release (click below to read Open Data Privacy and Security by Emilie Corcoran and Eugene Kang).
View Report Now
More articles on big data:
If you liked this post, let us know!
Emilie Corcoran is a Python Developer for Geotab.
Geotab's blog posts are intended to provide information and encourage discussion on topics of interest to the telematics community at large. Geotab is not providing technical, professional or legal advice through these blog posts. While every effort has been made to ensure the information in this blog post is timely and accurate, errors and omissions may occur, and the information presented here may become out-of-date with the passage of time.
Geotab | Blog
Sign up for monthly news and tips from our award-winning fleet management blog. You can unsubscribe at any time.
Other posts you might like
Geotab Data Bootcamp November 2020 recap
Geotab Data Bootcamp 2020 was the first online version of this learning event.
November 23, 2020
6 steps for data cleaning and why it matters
Data cleaning is the process of ensuring that your data is correct, consistent and usable.
November 20, 2020
Transport Canada ELD mandate: What fleets should know
Learn the differences between U.S. and Canada ELD verification.
November 17, 2020