Goyal, Kapil Kumar (2025) Rethinking Model Evaluation: Weighted Scenarios for AI Use-Case Grading. International Journal of Innovative Science and Research Technology, 10 (5): 25may1773. pp. 2875-2879. ISSN 2456-2165
![IJISRT25MAY1773.pdf [thumbnail of IJISRT25MAY1773.pdf]](https://eprint.ijisrt.org/style/images/fileicons/text.png)
IJISRT25MAY1773.pdf - Published Version
Download (368kB)
Abstract
Performance metrics of AI models like accuracy, precision, and recall are often reported in a vacuum, detached from the real-world contexts in which the models are deployed. Yet increasingly, the criticality and sensitivity of model applications demand a more nuanced approach to their performance evaluation. This paper introduces a new framework— Contextual AI Evaluations—that allows teams to assess models with greater relevance to the conditions under which the models will be deployed. Contextual AI Evaluations assign weights to different deployment scenarios to reflect the operational risk, business impact, and user sensitivity associated with each scenario. The framework is applied to several models currently in use.
Item Type: | Article |
---|---|
Subjects: | T Technology > T Technology (General) |
Divisions: | Faculty of Engineering, Science and Mathematics > School of Electronics and Computer Science |
Depositing User: | Editor IJISRT Publication |
Date Deposited: | 19 Jun 2025 09:10 |
Last Modified: | 19 Jun 2025 09:10 |
URI: | https://eprint.ijisrt.org/id/eprint/1260 |