Although true in some aspects, the suggested characterization of today’s science as a dichotomy between traditional science and data-driven science misses some of the nuance, complexity, and possibility that exists between the two positions. Part of the problem is the claim that Data Science works without theories. There are many theories behind the data that are used in science. However, for data science, the only theories that matter are those in mathematics, statistics, and computer science. In this conceptual paper, we add two other philosophy of science tenets, experiments and data, to the discussion to create a more nuanced view of how data science uses theories. Following Ihde’s concept of technoscience and the incessant quest for more precision, magnification, and resolution, we argue that technology-driven science created a need for more technology-driven science, culminating in data science. Further, we adapt Hacking and Galison’s views on physics to argue that data science is also an experimental science, which uses data objects in experiments. Drawing from Heelan (The Journal of Philosophy 85:515–524, 1988), we called these objects “data-objects-for-knowing”. Finally, we conclude that data science is a science to study artificially created phenomena—a science to study the data manipulated by the equations and operations of AI. It disregards the connections between data and the real world that were carefully built by the theories from other sciences. In the experiments of data science, data are the world itself. The knowledge created by data science is purposely disconnected from any theory from other sciences; it is a knowledge for the sake of itself.
All Science Journal Classification (ASJC) codes
- Human-Computer Interaction
- Artificial Intelligence