Visible to the public Model-Agnostic Scoring Methods for Artificial Intelligence Assurance

TitleModel-Agnostic Scoring Methods for Artificial Intelligence Assurance
Publication TypeConference Paper
Year of Publication2022
AuthorsSikder, Md Nazmul Kabir, Batarseh, Feras A., Wang, Pei, Gorentala, Nitish
Conference Name2022 IEEE 29th Annual Software Technology Conference (STC)
KeywordsAI Assurance, Deep Learning, equilibrium, Optimization, Pipelines, pubcrawl, resilience, Resiliency, Safety, Scalability, Scoring Methods, security, Software, Software algorithms, software assurance, Telecommunications, Training
AbstractState of the art Artificial Intelligence Assurance (AIA) methods validate AI systems based on predefined goals and standards, are applied within a given domain, and are designed for a specific AI algorithm. Existing works do not provide information on assuring subjective AI goals such as fairness and trustworthiness. Other assurance goals are frequently required in an intelligent deployment, including explainability, safety, and security. Accordingly, issues such as value loading, generalization, context, and scalability arise; however, achieving multiple assurance goals without major trade-offs is generally deemed an unattainable task. In this manuscript, we present two AIA pipelines that are model-agnostic, independent of the domain (such as: healthcare, energy, banking), and provide scores for AIA goals including explainability, safety, and security. The two pipelines: Adversarial Logging Scoring Pipeline (ALSP) and Requirements Feedback Scoring Pipeline (RFSP) are scalable and tested with multiple use cases, such as a water distribution network and a telecommunications network, to illustrate their benefits. ALSP optimizes models using a game theory approach and it also logs and scores the actions of an AI model to detect adversarial inputs, and assures the datasets used for training. RFSP identifies the best hyper-parameters using a Bayesian approach and provides assurance scores for subjective goals such as ethical AI using user inputs and statistical assurance measures. Each pipeline has three algorithms that enforce the final assurance scores and other outcomes. Unlike ALSP (which is a parallel process), RFSP is user-driven and its actions are sequential. Data are collected for experimentation; the results of both pipelines are presented and contrasted.
DOI10.1109/STC55697.2022.00011
Citation Keysikder_model-agnostic_2022