some theses to be nailed onto the Church of Big Data

(1) Cristian S. Calude, Giuseppe Longo. "The Deluge of Spurious Correlations in Big Data.” Foundations of Science (March 2016): 1-18.

Abstract:
Very large databases are a major opportunity for science and data analytics is a re- markable new eld of investigation in computer science. The effectiveness of these tools is used to support a philosophy against the scientific method as developed throughout history. According to this view, computer-discovered correlations should replace under- standing and guide prediction and action. Consequently, there will be no need to give scientific meaning to phenomena, by proposing, say, causal relations, since regularities in very large databases are enough: with enough data, the numbers speak for themselves . The end of science is proclaimed. Using classical results from ergodic theory, Ramsey theory and algorithmic information theory, we show that this philosophy is wrong. For example, we prove that very large databases have to contain arbitrary correlations. These correlations appear only due to the size, not the nature, of data. They can be found in randomly generated, large enough databases, which as we will prove implies that most correlations are spurious. Too much information tends to behave like very little information. The scientific method can be enriched by computer mining in immense databases, but not replaced by it.


(2)  Sha Xin Wei, Gabriele Carotti-Sha.  "Big Data." AI & Society, Springer Online(April 2016): 1-4.
http://link.springer.com/article/10.1007/s00146-016-0662-7/fulltext.html

Abstract: 
Big Data is a term popular among administrators and business circles motivating quite a lot of investment today. Part of it is rebranding. But rather than old wine in new bottle, it may be more likely watered wine in new bottles branded to take old wine’s market. David Donoho, one of the foremost statisticians in the world and a visionary of data science, observes that much of what passes for Big Data is a bit of software engineering plus a bit of statistics. Knowledge is power. But a smidgen of knowledge plus poor judgment can do a lot of damage, especially when billions of dollars implicate billions of lives.

(3) Complessità, scienza e democrazia / interview with Giuseppe Longo

_______________________________________________________
_______________________________________________________