Kimwomi, George and Ondimu, Kennedy (2025) Epistemic Risks of Big Data Analytics in Scientific Discovery: Analysis of the Reliability and Biases of Inductive Reasoning in Large-Scale Datasets. International Journal of Innovative Science and Research Technology, 10 (3): 25mar404. pp. 3288-3294. ISSN 2456-2165
![IJISRT25MAR404.pdf [thumbnail of IJISRT25MAR404.pdf]](https://eprint.ijisrt.org/style/images/fileicons/text.png)
IJISRT25MAR404.pdf - Published Version
Download (220kB)
Abstract
The advent of Big Data Analytics has transformed scientific research by enabling pattern recognition, hypothesis generation, and predictive analysis across disciplines. However, reliance on large datasets introduces epistemic risks, including data biases, algorithmic opacity, and challenges in inductive reasoning. This paper explores these risks, focusing on the interplay between data- and theory-driven methods, biases in inference, and methodological challenges in Big Data epistemology. Key concerns include data representativeness, spurious correlations, overfitting, and model interpretability. Case studies in biomedical research, climate science, social sciences, and AI-assisted discovery highlight these vulnerabilities. To mitigate these issues, this paper advocates for Bayesian reasoning, transparency initiatives, fairness-aware algorithms, and interdisciplinary collaboration. Additionally, policy recommendations such as stronger regulatory oversight and open science initiatives are proposed to ensure epistemic integrity in Big Data research, contributing to discussions in philosophy of science, data ethics, and statistical inference.
Item Type: | Article |
---|---|
Subjects: | T Technology > T Technology (General) |
Divisions: | Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science |
Depositing User: | Editor IJISRT Publication |
Date Deposited: | 06 May 2025 10:22 |
Last Modified: | 06 May 2025 10:22 |
URI: | https://eprint.ijisrt.org/id/eprint/725 |