As a catch-all term, “big data” can be pretty nebulous, in the same way that the term “cloud” covers diverse technologies. Input data to big data systems could be chatter from social networks, web server logs, traffic flow sensors, satellite imagery, broadcast audio streams, banking transactions, MP3s of rock music, the content of web pages, scans of government documents, GPS trails, telemetry from automobiles, financial market data, the list goes on. Are these all really the same thing?
There is a gap between initiatives that are based on governments giving out things that they want to give out, and governments creating rights that mean that they give things out all the time that they maybe don’t want to give out. The Freedom of Information Act is a right, it gives people a right, whereas these data initiatives in America and Britain tend to be not rights, but more like gifts. And that doesn’t make these data websites a bad thing, not at all – it just makes them not as good as rights. Because there are two problems: firstly, they’re not necessarily as strong; and secondly, they assume to some degree that the government knows what the public wants.