Kimwomi, George and Ondimu, Kennedy (2025) Epistemic Risks of Big Data Analytics in Scientific Discovery: Analysis of the Reliability and Biases of Inductive Reasoning in Large-Scale Datasets. International Journal of Innovative Science and Research Technology, 10 (3): 25mar404. pp. 3288-3294. ISSN 2456-2165

[thumbnail of IJISRT25MAR404.pdf] Text
IJISRT25MAR404.pdf - Published Version

Download (220kB)

Abstract

The advent of Big Data Analytics has transformed scientific research by enabling pattern recognition, hypothesis generation, and predictive analysis across disciplines. However, reliance on large datasets introduces epistemic risks, including data biases, algorithmic opacity, and challenges in inductive reasoning. This paper explores these risks, focusing on the interplay between data- and theory-driven methods, biases in inference, and methodological challenges in Big Data epistemology. Key concerns include data representativeness, spurious correlations, overfitting, and model interpretability. Case studies in biomedical research, climate science, social sciences, and AI-assisted discovery highlight these vulnerabilities. To mitigate these issues, this paper advocates for Bayesian reasoning, transparency initiatives, fairness-aware algorithms, and interdisciplinary collaboration. Additionally, policy recommendations such as stronger regulatory oversight and open science initiatives are proposed to ensure epistemic integrity in Big Data research, contributing to discussions in philosophy of science, data ethics, and statistical inference.

Item Type: Article
Subjects: T Technology > T Technology (General)
Divisions: Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science
Depositing User: Editor IJISRT Publication
Date Deposited: 06 May 2025 10:22
Last Modified: 06 May 2025 10:22
URI: https://eprint.ijisrt.org/id/eprint/725

Actions (login required)

View Item
View Item