Assessing Species Susceptibility in Adverse Outcome Pathways: A Framework for Predictive Toxicology & Safer Drug Development

Naomi Price Jan 09, 2026 409

This article provides a comprehensive guide for researchers and drug development professionals on evaluating species-specific susceptibility within the Adverse Outcome Pathway (AOP) framework.

Assessing Species Susceptibility in Adverse Outcome Pathways: A Framework for Predictive Toxicology & Safer Drug Development

Abstract

This article provides a comprehensive guide for researchers and drug development professionals on evaluating species-specific susceptibility within the Adverse Outcome Pathway (AOP) framework. It covers foundational principles defining AOP components and the biological basis for interspecies differences. Methodological sections detail practical approaches for cross-species extrapolation, integrating 'omics' data, and utilizing curated databases like the AOP-DB. The guide also addresses common challenges in AOP application and explores advanced computational and artificial intelligence methods for optimization. Finally, it outlines validation strategies through weight-of-evidence assessment and quantitative AOP development, concluding with future directions for implementing these approaches in biomedical research and regulatory science to enhance chemical safety and drug development [citation:1][citation:3][citation:9].

Defining the Landscape: Core AOP Concepts and the Biological Basis of Species Susceptibility

The Adverse Outcome Pathway (AOP) framework is a structured toxicological knowledge assembly tool designed to describe a sequential chain of causally linked events at different levels of biological organization, beginning from a Molecular Initiating Event (MIE) and culminating in an Adverse Outcome (AO) relevant to risk assessment [1] [2]. As a conceptual model, an AOP outlines the progression from the initial interaction of a stressor (e.g., a chemical) with a biomolecule within an organism, through a series of intermediate, measurable Key Events (KEs), to an in vivo adverse effect at the organism or population level [3] [4].

A foundational principle of the AOP framework is that it is chemically agnostic. It describes biological pathway perturbations that can be initiated by any stressor capable of triggering the specific MIE [4]. This modularity allows AOPs to serve as a scaffold for organizing mechanistic information, facilitating the use of non-traditional data streams—such as high-throughput in vitro assays and in silico models—to predict adverse effects and support regulatory decision-making [5] [4]. The framework is central to the development of New Approach Methodologies (NAMs), which aim to increase the efficiency of chemical safety assessments while reducing reliance on traditional animal testing [2] [5].

A critical challenge in applying AOPs for predictive toxicology is understanding and accounting for interspecies susceptibility. Biological pathways are often conserved, but quantitative responses and compensatory mechanisms can vary significantly between species, leading to differences in sensitivity and final outcomes [1]. Assessing this susceptibility is therefore not merely an add-on but a core component of AOP development and application, ensuring that predictions derived from model systems or one species are relevant and accurately extrapolated to humans or other species of concern [6].

Core Components of an AOP

An AOP is composed of three core, causally linked elements: the Molecular Initiating Event, intermediate Key Events, and the Adverse Outcome, connected by defined Key Event Relationships.

Molecular Initiating Event (MIE): The MIE is the initial, specific biological interaction between a stressor and a molecular target within an organism that triggers the perturbation leading to the AOP [2] [3]. It is the most upstream KE in the pathway. Examples include a chemical binding to a specific receptor (e.g., aryl hydrocarbon receptor), inhibiting a key enzyme (e.g., acetylcholinesterase), or covalently binding to DNA [4] [7].

Key Events (KEs): KEs are measurable, essential changes in biological state that form the steps between the MIE and the AO. They represent critical checkpoints in the progression of toxicity and can occur at molecular, cellular, tissue, or organ levels [3]. To be useful in an AOP, a KE must be essential to the pathway's progression; if it is blocked, the AO should not occur.

Adverse Outcome (AO): The AO is an adverse effect of regulatory significance at the level of the whole organism (e.g., organ toxicity, cancer, impaired reproduction) or population (e.g., reduced population growth, extinction) [2] [8]. The AO is the downstream endpoint that the AOP aims to predict.

Key Event Relationships (KERs): KERs are the scientific descriptions of the causal linkages between an upstream KE and a downstream KE [3]. They provide the rationale for why one event is expected to lead to another, based on biological plausibility and empirical evidence. KERs describe the evidence supporting the relationship and, ideally, its quantitative nature, which is crucial for building predictive models.

Table 1: Core Definitions of the AOP Framework [2] [3]

Term	Abbreviation	Definition
Molecular Initiating Event	MIE	The initial interaction between a stressor and a biomolecule that starts the pathway.
Key Event	KE	A measurable, essential change in biological state at any level of organization.
Key Event Relationship	KER	A scientifically-based, causal link between two Key Events.
Adverse Outcome	AO	An adverse effect at the organism or population level relevant for risk assessment.

The AOP Development Process and Weight of Evidence

Developing a scientifically credible AOP follows a systematic workflow involving evidence assembly, description, and rigorous evaluation of causal certainty [3].

The initial phase involves assembling existing knowledge from the scientific literature to define a plausible sequence from a candidate MIE to an AO. Each KE and KER must be clearly described. Following this assembly, developers conduct a formal Weight of Evidence (WoE) assessment to evaluate the confidence in the overall AOP and its individual KERs [3].

This WoE assessment is typically based on modified Bradford-Hill considerations, focusing on three core areas:

Biological Plausibility: Is the relationship between KEs consistent with established biological knowledge?
Empirical Support: Does experimental evidence from studies that measure both the upstream and downstream KEs demonstrate a consistent, dose- and time-responsive relationship?
Essentiality: Does modulation (inhibition or augmentation) of the upstream KE alter the downstream KE and/or the AO? Evidence from knock-out, knock-down, or chemical inhibition studies is particularly valuable here [3] [7].

The outcome of this process is a transparent, peer-reviewed AOP description, which is often submitted to the AOP Knowledge Base (AOP-KB), a central repository managed by the Organisation for Economic Co-operation and Development (OECD) [1] [3].

Table 2: Status of AOPs in the OECD Knowledge Base (Representative Data) [1] [7]

Description	Figure	Notes
Total AOPs in AOP-KB (2018)	233	Illustrates the scale of ongoing collaborative development.
AOPs with OECD "Endorsed" Status	~20+	Represents pathways that have undergone formal review.
Example: AOP 281 (AChE Inhibition)	Under Development	Case study for quantitative AOP (qAOP) development [7].

Assessing Species Susceptibility Within the AOP Framework

A key strength of the AOP framework is its utility for structuring the assessment of interspecies differences, which is vital for human health and ecological risk assessments [6] [8].

A Conceptual Framework for Human Relevance Assessment

A pragmatic workflow for assessing the human relevance of toxicological pathways involves evaluating evidence for three main questions related to the AOP's components [6]:

Conservation of Pathway Steps: Are the MIE, KEs, and KERs (including the target molecules and their functions) qualitatively similar between the test species and humans?
Pathology Alignment: Does the adverse outcome in the test species correspond to a relevant human disease or syndrome with similar pathophysiology?
Quantitative Differences: What are the quantitative differences in kinetics (exposure, absorption, metabolism) and dynamics (receptor affinity, cellular response) between species?

The combined evidence from these questions is scored to provide a transparent judgment on the strength of support for human relevance, which directly informs the suitability of NAMs based on that AOP [6].

Case Study: Species Susceptibility in Lung Overload Toxicity

Research on chronic inhalation of poorly soluble particles (PSPs) provides a definitive example of stark interspecies differences within a shared AOP framework [1].

Shared Initiating Event: In rats, mice, hamsters, and primates, PSPs deposit in the deep lung, overwhelming clearance mechanisms and leading to impairment of pulmonary macrophage clearance.
Divergent Outcomes: Despite this shared MIE and subsequent persistent inflammation, the final AO differs dramatically. Rats develop epithelial hyperplasia, fibrosis, and ultimately lung tumors. In contrast, mice and hamsters show a muted inflammatory and proliferative response and do not develop tumors, while non-human primates and humans show normal physiological clearance responses with no neoplastic outcomes from occupational exposures [1].

This case underscores that the presence of early KEs (like inflammation) does not reliably predict the final AO across species. The differential response is attributed to species-specific factors such as gene expression profiles (e.g., pro-inflammatory vs. anti-inflammatory mediators) and particle clearance rates [1].

Quantitative AOPs (qAOPs) and Predictive Modeling

Qualitative AOPs establish causal linkages, but Quantitative AOPs (qAOPs) are required for predictive risk assessment. A qAOP incorporates mathematical relationships that describe the dose- and time-dependent transitions between KEs [7].

Development Methods: qAOPs can be built using several modeling approaches:

Response-Response Models: Statistical fits (e.g., regression) of empirical data linking the magnitude of one KE to another.
Biologically-Based Mathematical Models: Systems of ordinary differential equations representing underlying biological processes (e.g., receptor binding, feedback loops).
Probabilistic Networks: Bayesian Networks that can handle uncertainty and integrate data from multiple KEs, useful for complex pathways [7].

Case Study: AChE Inhibition Leading to Neurodegeneration (AOP 281) The development of a qAOP for acetylcholinesterase (AChE) inhibition illustrates the process and challenges [7].

MIE: Inhibition of AChE in the synapse.
KE Progression: Excess acetylcholine → overactivation of muscarinic receptors → focal seizures → glutamate release → excitotoxicity → status epilepticus → neuronal cell death → neurodegeneration.
qAOP Challenge: A major hurdle was the lack of studies measuring multiple KEs simultaneously. Developing quantitative, predictive models required extracting and integrating fragmented data from over 200 studies to parameterize the relationships between, for example, brain AChE inhibition levels and the onset of seizures [7].

Table 3: Experimental Protocols for Key AOP Case Studies

AOP / Endpoint	Core Experimental Methodology	Measurement Endpoints (KEs)
Skin Sensitization [4]	Direct Peptide Reactivity Assay (DPRA): Incubate test chemical with synthetic peptides.	Peptide depletion (MIE: protein binding).
	Keratinocyte Assays (e.g., LuSens): Expose immortalized keratinocytes to test chemical.	Induction of antioxidant/cytokine genes (KE: cellular response).
	Dendritic Cell Assays: Expose cell lines like U937 or THP-1.	Surface marker expression (e.g., CD86) (KE: activation).
Endocrine Disruption (Estrogen Receptor) [4]	In vitro Transcriptional Activation Assays: Use yeast or human cells engineered with estrogen receptor and a reporter gene (e.g., ERα CALUX).	Reporter gene activity (MIE: receptor binding/activation).
	High-Throughput Screening (Tox21/ToxCast): Quantitative screening of thousands of chemicals.	Bioactivity profiles for prioritization.
Lung Overload & Fibrosis [1]	Sub-chronic/Chronic Inhalation Studies: Rodents exposed to PSPs (e.g., TiO₂, carbon black) for up to 24 months.	Lung burden, inflammatory cytokines, histopathology (hyperplasia, fibrosis), tumor incidence.
	Comparative Morphometry: Analysis of particle distribution in lung compartments across species (rat vs. primate).	Particle retention patterns (alveolar vs. interstitial).

The Scientist's Toolkit: Essential Reagents and Methods for AOP Research

Building and testing AOPs requires a multidisciplinary toolkit spanning molecular biology, in vitro toxicology, and analytical chemistry.

Table 4: Research Reagent Solutions for AOP Development

Reagent / Material	Function in AOP Research	Example Application
Recombinant Receptors & Enzymes	Provide purified molecular targets for high-throughput screening of MIEs (e.g., binding, inhibition).	Screening chemicals for ER/AR binding or AChE inhibition [4] [7].
Reporter Gene Assay Kits	Measure the transcriptional activation of specific pathways following receptor-ligand interaction.	Quantifying estrogenic or aryl hydrocarbon receptor activity in cell lines [4].
Differentiated Cell Lines (e.g., HepaRG, neuronal progenitors)	Provide metabolically competent, human-relevant in vitro models for assessing KEs at the cellular level.	Studying hepatotoxicity, neurodevelopmental toxicity KEs.
Cytokine & Phosphoprotein Multiplex Assays	Quantify multiple cellular signaling molecules or phosphorylation events simultaneously from small sample volumes.	Profiling inflammatory responses (KE) in lung or immune cell models [1].
siRNA/shRNA Libraries	Enable gene knockdown to test the essentiality of a specific protein to a KE or pathway progression.	Validating the role of a specific kinase or receptor in a hypothesized KER [3].
Targeted Mass Spectrometry Kits	Precisely quantify metabolites, proteins, or post-translational modifications associated with KEs.	Measuring changes in key pathway metabolites (e.g., acetylcholine) or oxidative stress markers [7].

Visualizing AOPs and Species Comparisons

Diagram 1: Generalized Structure of an Adverse Outcome Pathway.

Diagram 2: Divergent AOPs for Lung Overload from PSPs in Rats vs. Primates/Humans.

Diagram 3: Workflow for Assessing Human Relevance of an AOP [6].

Why Species Susceptibility Matters in Toxicology and Drug Development

Understanding species-specific differences in biological response is a cornerstone of predictive toxicology and safe drug development. The fundamental challenge is that a chemical stressor—a pharmaceutical candidate, environmental contaminant, or industrial compound—does not affect all species equally. These differences in susceptibility arise from variations in genomic sequences, protein structures, metabolic pathways, and physiological systems [9]. Historically, this has been addressed by testing in animal models, but a critical translational gap persists; approximately 90% of drugs that appear safe and effective in traditional animal models fail to gain regulatory approval for human use, primarily due to safety or efficacy issues [10].

The contemporary shift in regulatory science underscores the urgency of this issue. In April 2025, the U.S. Food and Drug Administration (FDA) announced a plan to phase out conventional animal testing requirements, starting with monoclonal antibodies, within 3-5 years [11]. This landmark decision accelerates the adoption of New Approach Methodologies (NAMs), which include in vitro human-based systems, in silico models, and other non-animal approaches [11] [12]. In this new paradigm, accurately assessing species susceptibility transitions from an academic exercise to a critical, applied science. It is essential for extrapolating hazard data from new models to humans, for protecting ecological species in environmental risk assessment, and for building confidence in the NAMs that will underpin next-generation risk assessment (NGRA) [13] [14].

This whitepaper frames the assessment of species susceptibility within the context of Adverse Outcome Pathway (AOP) research. An AOP is a structured, mechanistic description of the sequence of events from a Molecular Initiating Event (MIE), through intermediate Key Events (KEs), to an Adverse Outcome (AO) relevant to risk assessment [2]. By providing a common framework to organize mechanistic knowledge, AOPs enable researchers to identify where and why susceptibility may differ between species—be it at the level of initial protein-chemical binding, a downstream cellular response, or an organ-level effect [13] [9]. This document provides a technical guide to the methodologies and tools for integrating species susceptibility into AOP-driven research, which is fundamental to developing safer drugs and chemicals with less reliance on animal testing.

Foundational Concepts: AOPs and the Mechanistic Basis for Susceptibility

The Adverse Outcome Pathway (AOP) Framework

An AOP is a conceptual construct that models the causal linkages between a measurable MIE and an AO. As defined by the U.S. EPA, it is akin to "a series of dominos," where a chemical exposure triggers a molecular interaction (the MIE), leading to a cascade of measurable KEs at cellular, tissue, and organ levels, ultimately resulting in an adverse effect for the organism or population [2]. This framework decouples toxicity from a specific chemical, focusing instead on modular biological pathways that can be perturbed by various stressors.

The core components are:

Molecular Initiating Event (MIE): The initial interaction between a stressor and a biological target (e.g., binding to a receptor, inhibition of an enzyme).
Key Events (KEs): Measurable biological changes at different levels of organization.
Key Event Relationships (KERs): The causal or mechanistic linkages describing how one KE leads to another.
Adverse Outcome (AO): A change of regulatory relevance, such as impaired survival, growth, reproduction, or organ function [2].

Integrating Species Susceptibility into the AOP Framework

Species susceptibility manifests at every level of an AOP. The most fundamental differences often occur at the MIE. For example, a chemical may bind with high affinity to a human protein target but poorly to its ortholog in a rat due to amino acid sequence variations in the binding pocket [9]. This differential binding can be predicted by comparing protein structures and modeling chemical interactions. Susceptibility can also vary at downstream KEs due to differences in cellular defense systems, tissue repair capacities, or compensatory physiological pathways.

The AOP framework is inherently conducive to this analysis because it demands biological plausibility and essentiality of each KE. When constructing or evaluating an AOP, a critical question is: "Is this key event conserved across the species of interest?" Answering this requires a suite of computational and experimental tools designed for cross-species comparison. The ultimate goal is to develop quantitative AOPs (qAOPs) that can predict the probability or severity of an AO based on the intensity of an earlier KE, with models parameterized for different species.

Figure: The AOP Framework and Points of Species Susceptibility Divergence

Figure: A generalized AOP showing primary points where species susceptibility can diverge. Differences in protein structure at the MIE are often the most critical, but variations in cellular and physiological responses at downstream KEs also determine overall vulnerability.

Methodologies for Assessing Susceptibility in AOP Research

Assessing species susceptibility requires a weight-of-evidence approach that combines computational predictions with targeted experimental data [9]. The following methodologies are central to modern, AOP-informed research.

Computational & In Silico Approaches

These methods use existing biological and chemical data to predict susceptibility, prioritizing resources for experimental validation.

Sequence- and Structure-Based Protein Conservation Analysis: Tools like the U.S. EPA's SeqAPASS (Sequence Alignment to Predict Across Species Susceptibility) estimate the conservation of a protein target across species. Starting with a reference protein sequence (e.g., human), it performs multi-level evaluations: primary sequence alignment (Level 1), functional domain conservation (Level 2), and conservation of specific amino acid residues known to be critical for function (Level 3). The latest versions incorporate protein structure prediction (Level 4) using algorithms like I-TASSER or AlphaFold to generate and compare 3D structural models, providing a more functional assessment of conservation [9].
Cross-Species Molecular Docking: This technique builds on structural predictions by simulating how a chemical interacts with protein orthologs from different species. As demonstrated in a 2024 case study with the Androgen Receptor (AR), a single chemical (e.g., a hormone or toxicant) is docked against a library of predicted protein structures for the same target across hundreds of species [9]. The resulting binding poses and affinity scores are compared to a known reference (e.g., human protein with experimentally determined structure). A combination of metrics—docking score, ligand root-mean-square deviation (RMSD), binding pocket similarity, and interaction fingerprint similarity—are used to predict relative susceptibility [9].
AOP Network Development with Text-Mining: For emerging stressors like microplastics, identifying relevant MIEs and KEs across species is challenging. Automated text-mining tools like AOP-helpFinder can systematically screen scientific literature to identify and score co-occurrences of stressor-event and event-event relationships [15]. This helps propose putative AOP networks anchored in specific tissues (gill, gut, liver) and provides a structured, evidence-based starting point for investigating species-specific effects [15].

Experimental & In Vitro Approaches

Computational predictions require validation through human-relevant biological systems.

Human-Based In Vitro Tissue Models: Advanced 3D reconstructed tissue models mimic human organ biology more accurately than traditional cell lines or animal tissues. For inhalation toxicology, models like MucilAir-HF (upper airway) and EpiAlveolar (lower lung) contain differentiated, functional human cells. They are used to measure KE-related bioactivity, such as cytokine release (for inflammation KE), cilia beating frequency (for mucociliary clearance KE), and transcriptomic changes [14]. The concentration at which a significant bioactivity change occurs is termed the in vitro Point of Departure (PoD), a key metric for risk assessment.
Alternative In Vivo Model Organisms: Non-mammalian models with high genetic conservation and ethical advantages are valuable for testing AOP components. They offer whole-organism complexity, including metabolism and multi-organ interactions, at a scale suitable for higher-throughput screening [10].
- Zebrafish (Danio rerio): Excellent for developmental toxicity and real-time observation of morphological and functional endpoints.
- Nematode (C. elegans): Used for neurotoxicity, reproductive toxicity, and high-throughput genetic screening.
- Fruit Fly (Drosophila melanogaster): A powerful model for neurotoxicity and developmental genetics [10].

Table 1: Comparison of Key Methodologies for Assessing Species Susceptibility

Methodology	Core Principle	Key Output(s)	Utility in AOP Context	Key Limitations
SeqAPASS [9]	Compares protein sequence & predicted structure across species.	Prediction of protein conservation & potential functional equivalence.	Identifying if the MIE target (protein) is conserved in a species of concern.	Relies on available sequence data; predicted structures may have inaccuracies.
Cross-Species Molecular Docking [9]	Simulates chemical binding to protein orthologs from different species.	Relative binding scores, poses, and interaction fingerprints.	Predicting differential potency at the MIE based on structural variations.	Docking scores alone poorly correlate with absolute affinity; requires a reference crystal structure.
Human 3D Tissue Models [14]	Measures bioactivity in engineered human tissues under chemical exposure.	In vitro Point of Departure (PoD), gene expression, functional biomarkers.	Quantifying KE responses in a human-relevant system; deriving Bioactivity Exposure Ratios (BER).	May not capture systemic, multi-organ, or chronic adaptive responses.
Alternative Model Organisms [10]	Tests toxicity in a whole, living non-mammalian organism.	Survival, development, reproduction, behavior, and tissue-level endpoints.	Investigating the progression of KEs in vivo in a tractable, conserved system.	Physiological differences from humans (e.g., absorption, distribution, metabolism).

Detailed Experimental Protocols

Protocol: Cross-Species Molecular Docking for MIE Assessment

This protocol, adapted from [9], details how to predict species susceptibility based on differential binding at a protein target.

1. Define the AOP and Target: Select an AOP where the MIE is well-defined as chemical binding to a specific protein (e.g., androgen receptor binding leading to reproductive toxicity).

2. Generate and Prepare Protein Structures: * Input: Obtain the amino acid sequence of the reference protein (e.g., human AR). * SeqAPASS Analysis: Use the SeqAPASS tool to identify orthologs in species of interest and generate predicted 3D structures for their ligand-binding domains using integrated I-TASSER [9]. * Structural Alignment: Use a tool like PyMOL to structurally align all predicted models to a reference crystal structure (e.g., PDB ID: 2AMA). Trim structures to the aligned region and add polar hydrogens and charges.

3. Prepare the Ligand: Obtain the 3D structure of the chemical of interest. Optimize its geometry and assign appropriate charges.

4. Perform Molecular Docking: * Software: Use a docking program like AutoDock Vina. * Grid Definition: Define the search space around the binding pocket of the reference structure. * Docking Run: Dock the ligand against each prepared protein ortholog. To account for minor structural uncertainties, implement limited flexible docking by allowing side-chains of key binding pocket residues to move [9].

5. Analyze and Interpret Results: * Metrics: For each species, calculate: (1) Docking Score (kcal/mol), (2) Ligand RMSD compared to its pose in the reference structure, (3) Binding pocket shape similarity, (4) Protein-Ligand Interaction Fingerprint (PLIF) similarity. * Classification: Use a machine learning classifier (e.g., k-Nearest Neighbors) trained on these metrics from known binders/non-binders to assign a "susceptible" or "not susceptible" call to each species [9].

Figure: Cross-Species Molecular Docking Workflow

Figure: Computational workflow for predicting species susceptibility based on differential chemical binding to a protein target, a key MIE in many AOPs.

Protocol: AOP-Informed In Vitro Testing for Key Event Evaluation

This protocol, based on [14], outlines the use of human tissue models to assess KEs and derive points of departure for risk assessment.

1. AOP-Informed Assay Selection: For a given toxicity endpoint (e.g., lung fibrosis), review relevant AOPs to identify measurable KEs (e.g., sustained inflammation, fibroblast proliferation). Select in vitro assays that correspond to these KEs.

2. Experimental Setup: * Model System: Culture 3D human tissue models (e.g., MucilAir-HF for airway). * Exposure Scheme: Use a 12-day repeated exposure regimen at the air-liquid interface to mimic realistic human exposure. * Test Chemicals: Include benchmark chemicals with known in vivo outcomes (both low and high risk).

3. Bioactivity Measurement: * Tissue Integrity: Measure transepithelial electrical resistance (TEER) and lactate dehydrogenase (LDH) release. * Functional KE Endpoints: Measure biomarkers mapped to KEs (e.g., pro-inflammatory cytokine secretion for inflammation KE; cilia beating frequency for mucociliary clearance KE). * Transcriptomics: Perform RNA sequencing to identify pathway-level changes supporting KE activation.

4. Data Analysis and Risk Contextualization: * Determine Point of Departure (PoD): Use a nonlinear state-space model to identify the Bioactivity PoD—the lowest concentration where a significant, sustained change in a KE-related biomarker occurs. * Calculate Bioactivity Exposure Ratio (BER): Combine the in vitro PoD with a human exposure estimate (e.g., predicted lung deposition from a spray product). BER = PoD / Human Exposure Estimate. A BER > 1 suggests a margin of safety, while BER ≤ 1 indicates potential risk [14].

The Scientist's Toolkit: Essential Reagents & Materials

Table 2: Research Reagent Solutions for Species Susceptibility & AOP Research

Item	Function/Application in Research	Example/Notes
SeqAPASS Tool [9]	Web-based tool for predicting protein conservation and susceptibility across species via sequence and structural analysis.	Used for Level 1-4 analysis. Critical for hypothesis generation before wet-lab experiments.
I-TASSER or AlphaFold	Protein structure prediction servers. Generate 3D models of protein targets for species where no crystal structure exists.	Integrated into SeqAPASS v7.0 [9]. Essential for enabling cross-species molecular docking.
AutoDock Vina [9]	Molecular docking software. Performs the virtual screening of a chemical against multiple protein structures.	Standard tool for simulating the MIE (chemical-protein binding).
3D Reconstructed Human Tissue Models	In vitro systems for measuring KE-related bioactivity in a human-relevant context.	MucilAir-HF (airway), EpiAlveolar (lung) [14]. Provide a functional human system for PoD derivation.
AOP-Wiki [13]	Central, collaborative repository for AOP development and dissemination.	The primary platform for sharing, reviewing, and accessing qualitative AOP knowledge.
AOP-helpFinder [15]	Text-mining software. Automates literature screening to identify stressor-event and event-event co-occurrences.	Supports systematic, evidence-based construction of AOP networks for novel stressors.
Multiple Path Particle Dosimetry (MPPD) Model	Computational dosimetry model. Estimates regional deposition of inhaled particles/aerosols in human and rodent airways.	Used to translate in vitro PoDs to human-relevant exposure contexts by estimating lung deposited dose [14].

The field is rapidly evolving toward greater integration and formalization. A major initiative is the FAIRification of AOP data—making AOPs Findable, Accessible, Interoperable, and Reusable through standardized metadata and computational tools [13]. This will transform AOPs from qualitative narratives into computable knowledge graphs that can be seamlessly integrated with species-specific omics data and in silico models, vastly improving susceptibility predictions.

Simultaneously, the regulatory landscape is being reshaped by the validation and acceptance of NAMs [16]. A unified framework for demonstrating the scientific validity of NAMs is needed to accelerate their use in regulatory decision-making. Case studies that successfully integrate AOP-based susceptibility assessments into NGRA, like the inhalation toolbox study [14], are critical for building this confidence.

In conclusion, understanding species susceptibility is not merely an academic detail but a fundamental requirement for accurate human health and ecological risk assessment. The AOP framework provides the essential mechanistic scaffolding for investigating susceptibility. By combining computational predictions of protein-chemical interactions, AOP-informed in vitro testing in human systems, and strategic use of alternative models, researchers can build a robust, evidence-based understanding of how and why effects differ across the tree of life. This integrated approach is the cornerstone of a more ethical, predictive, and human-relevant future for toxicology and drug development.

Assessing species susceptibility is a fundamental challenge in chemical safety assessment and translational toxicology. The Adverse Outcome Pathway (AOP) framework provides a structured model linking a Molecular Initiating Event (MIE) to an Adverse Outcome (AO) across biological levels of organization [17]. However, a critical limitation in its application for prediction is the inherent variability in susceptibility among different species, life stages, and genotypes [18]. This variability determines whether and to what severity a molecular perturbation progresses along an AOP.

Understanding these sources of variability is not merely an academic exercise but a practical necessity for refining risk assessment, designing safer chemicals, and developing targeted therapeutics. It bridges the gap between mechanistic toxicology and real-world outcomes, allowing researchers to extrapolate data from model systems to human populations and diverse ecological species. This guide provides a technical exploration of how comparative biology, life stage analysis, and genetic dissection methodologies can be systematically integrated to quantify and predict susceptibility within the AOP paradigm.

Foundational Concepts: The Comparative Method and the AOP Framework

The comparative method is a cornerstone of evolutionary biology, involving the systematic comparison of traits across different species to infer evolutionary processes and functional adaptations [19] [20]. In the context of AOPs, this method shifts from comparing benign traits to comparing susceptibility traits—the differential responses of biological pathways to perturbation.

AOPs are conceived as stressor-agnostic, but the progression and potency of key events are highly dependent on biological context. Comparative biology informs AOP development by:

Identifying Conserved Pathways: Pinpointing MIKEs and key events that are evolutionarily conserved across species, suggesting broader relevance [21].
Explaining Divergent Outcomes: Highlighting where and why pathways diverge due to differences in physiology, metabolism, or genetic makeup, explaining species-specific susceptibility [19].
Validating Alternative Methods: Providing the phylogenetic rationale for using data from one species (e.g., in vitro models, zebrafish) to predict outcomes in another (e.g., human) [17] [21].

Integrating this comparative perspective requires moving beyond a single model organism and embracing a phylogenetically informed approach to AOP construction and testing [20].

Quantitative Data on Susceptibility Factors

The following tables summarize key quantitative relationships and parameters relevant to assessing susceptibility from genetic, comparative, and risk assessment perspectives.

Table 1: Genetic Variants and Associated Susceptibility Parameters

Variant Type (Gene/SNP)	Phenotypic Association	Reported P-value	Effect Size (Odds Ratio/Beta)	Key Susceptibility Context	Source/Reference
CLN7 (MFSD8) SVA Insertion	Neuronal Ceroid Lipofuscinosis 7 (Batten Disease)	N/A (Causal mutation)	N/A	Leads to mis-splicing; complete loss of function is causative for severe childhood neurodegenerative disease.	Case study [22]
Hypothetical GWAS SNP (rsExample)	Increased serum biomarker 'X' post-exposure	5.0 x 10⁻⁹	Beta = 0.5 units per allele	Modulates inflammatory response to chemical stressor, altering progression along AOP for liver fibrosis.	GWAS standard [23]
CYP2C92/3 variants	Warfarin sensitivity	< 0.001	OR for bleeding ~3.5	Alters drug metabolism kinetics, dramatically shifting dose-response curve and risk of hemorrhage (AO).	Pharmacogenomics [24]
RYR1/IP3R activation SNPs	Altered calcium signaling flux	Varies	Modest effect per SNP	May potentiate MIE of "sustained intracellular Ca²⁺ increase" from neurotoxicants, influencing downstream key events.	AOP analysis [18]

Table 2: Comparative Susceptibility Metrics Across Species and Life Stages

Stressor	Species/Life Stage	Susceptibility Endpoint (AO)	Key Comparative Metric (e.g., LC₅₀, BMD₁₀)	Identified Source of Variability	Implication for AOP
Chlorpyrifos (Organophosphate)	Human (Prenatal)	IQ Reduction	No threshold; ~1-5 IQ point loss per exposure unit [18]	Immature blood-brain barrier; low paraoxonase-1 (detox enzyme) activity in fetus.	Life stage drastically increases potency for neurodevelopmental AOPs.
Chlorpyrifos	Adult Rat vs. Adult Bird	Cholinergic Toxicity (Death)	Rat LD₅₀: ~10-30 mg/kg; Bird LD₅₀: ~1-10 mg/kg [18]	Differences in acetylcholinesterase (AChE) sensitivity and metabolic detoxification rates.	MIE (AChE inhibition) conserved, but tissue sensitivity and recovery rates (key events) differ.
Polycyclic Aromatic Hydrocarbons	Zebrafish Embryo vs. Adult	Developmental Cardiotoxicity vs. Hepatic Tumors	EC₅₀ for pericardial edema (ng/L) vs. Tumor incidence after chronic exposure (mg/kg)	Differential expression of aryl hydrocarbon receptor (Ahr) isoforms and DNA repair capacity.	Life stage determines entirely different AOPs (developmental vs. cancer) from same MIE.
Reference Drug (e.g., Aspirin)	Human Adult vs. Pediatric	Reye's Syndrome (Rare AO)	Risk associated with viral infection and age < 19 years [25]	Immature mitochondrial fatty acid metabolism in children.	AOP is life stage-restricted; safe for one population, hazardous for another.

Table 3: AOP-Specific Susceptibility Modifiers and Assessment Windows

AOP Title (AOP-Wiki ID)	Critical Life Stage for Susceptibility	Key Genetic Modifier Pathways	Recommended Testing System for Variability	Susceptibility Readout
AChE Inhibition → Acute Mortality	Developing nervous system (all species) [18]	Paraoxonase (PON1) polymorphism status; AChE gene variants.	Comparative in vitro neurospheres from human iPSCs vs. rodent primary cells.	Functional AChE activity recovery rate post-inhibition.
Sustained RyR Activation → Neurodevelopmental Deficits	Early brain development (neuronal migration, synaptogenesis) [18]	Ryanodine receptor (RYR1/3) and IP3 receptor (ITPR1) polymorphisms.	High-content imaging of calcium flux in isogenic neural progenitor cell lines with edited SNPs.	Amplitude and frequency of aberrant calcium transients.
PPARα Activation → Rodent Hepatocarcinogenesis	Adult (not relevant to non-proliferating human hepatocytes)	Strength of PPARα-mediated proliferation signal; innate immune response.	Species-comparative transcriptomics in human vs. rat hepatocyte models.	Gene expression signature of proliferation vs. alternative pathways.
Binding to PDE4B → Lung Fibrosis	Pre-existing inflammatory lung conditions (all ages)	Variants in TGF-β signaling, inflammatory cytokine genes (IL-1β, TNF-α).	3D human lung microtissues with donor genetic variability, exposed to TGF-β.	Collagen deposition and pro-fibrotic marker secretion.

Phylogenetic Comparative Methods (PCMs)

PCMs are statistical techniques that account for the evolutionary relationships (phylogeny) among species when comparing their traits. This prevents false conclusions arising from shared ancestry rather than independent adaptation [19] [20].

Protocol: Using Phylogenetic Generalized Least Squares (PGLS) to Test for Correlated Evolution Between a Key Event and Susceptibility

Phylogeny Construction: Obtain or infer a time-calibrated phylogenetic tree for the study species (e.g., 10 mammalian species). Use sequence data (e.g., mitochondrial cytochrome b) and software like BEAST or MrBayes.
Trait Data Collection: Quantify two continuous traits for each species:
- Trait Y (Putative Key Event): e.g., In vitro potency (IC₅₀) for inhibiting a specific enzyme (the MIE).
- Trait X (Susceptibility Metric): e.g., In vivo toxicity threshold (LD₅₀) for the relevant stressor.
PGLS Model Fitting: Use the caper package in R. The model is: Trait_Y ~ Trait_X + Phylogeny. The phylogenetic correlation matrix is derived from the tree branch lengths.
Analysis & Interpretation: A significant positive correlation between X and Y after accounting for phylogeny suggests the key event's sensitivity is an evolutionary driver of whole-organism susceptibility across the clade. This supports the AOP's universality for those species.

Life Stage-Specific Profiling

Susceptibility is dynamic across an organism's lifespan due to ontogenetic changes in gene expression, metabolism, and organ function.

Protocol: Cross-Life Stage Transcriptomics for AOP Anchoring

Model Selection & Exposure: Use a small vertebrate model (e.g., zebrafish). Expose synchronized populations at defined life stages (embryo, larva, juvenile, adult) to a sub-toxic concentration of a stressor linked to a known AOP.
Sample Collection & Sequencing: Collect target tissue (e.g., liver) at a fixed time post-exposure. Perform RNA extraction, library prep, and bulk RNA-Seq (n=5 per life stage per treatment).
Bioinformatic Analysis:
- Map reads to the reference genome and quantify gene expression.
- Identify Differentially Expressed Genes (DEGs) for each exposed life stage vs. its own control.
- Perform pathway enrichment analysis (e.g., using GO, KEGG) on the DEG lists.
Life Stage Susceptibility Mapping: Overlap enriched pathways with the key event networks of the relevant AOP. The life stage showing the strongest and most coherent dysregulation of the AOP-defined network is identified as the peak susceptibility window. This can be validated with targeted functional assays.

Genome-Wide Association Studies (GWAS) for Susceptibility Loci

GWAS identifies genetic variants statistically associated with a trait or disease risk in a population [23] [26]. In toxicology, the "trait" can be a quantitative measure of susceptibility.

Protocol: Conducting a GWAS on a Toxicity-Related Phenotype

Cohort and Phenotyping: Assay a large, genetically diverse population (e.g., 300+ strains of mice from a Diversity Outbred panel, or human cell lines from many donors). Measure a quantitative susceptibility phenotype (e.g., IC₅₀ for cytotoxicity, AUC of a biomarker response, or binary outcome like significant hepatocyte apoptosis).
Genotyping & Quality Control: Perform high-density SNP genotyping or whole-genome sequencing. Apply strict QC filters: call rate >95%, minor allele frequency (MAF) >5%, Hardy-Weinberg equilibrium p > 1x10⁻⁶.
Association Analysis: Use a linear or logistic mixed model to test each SNP for association with the phenotype, correcting for population structure (kinship matrix). Tools: PLINK, GCTA [23] [26].
Significance & Validation: Set a genome-wide significance threshold (typically p < 5x10⁻⁸). Associated SNPs define quantitative trait loci (QTLs). Candidate genes within QTLs are prioritized for functional validation (e.g., CRISPR editing in cell models) to confirm their role in modulating the AOP.

Visualizing Pathways and Workflows

Diagram 1: Integrating Variability Sources into an AOP Framework. This diagram illustrates how genetic, life stage, and comparative biology factors modulate the progression and outcome of a molecular initiating event (MIE) along an adverse outcome pathway (AOP).

Diagram 2: Phylogenetic Comparative Method (PCM) Workflow for Trait Analysis. This workflow details the steps from data collection to statistical analysis, highlighting how evolutionary history is integrated to distinguish true correlation from phylogenetic relatedness.

Diagram 3: GWAS Workflow for Identifying Genetic Loci of Susceptibility. This chart outlines the process from cohort establishment and data generation through quality control, statistical analysis, result visualization, and final functional validation of candidate genes.

The Researcher's Toolkit: Key Reagents and Materials

Table 4: Essential Research Reagents and Platforms for Susceptibility Studies

Category	Item / Technology	Primary Function in Susceptibility Research	Example Application
Genomic Analysis	Whole Genome Sequencing (WGS) Platform	Identify causal mutations and structural variants underlying extreme susceptibility phenotypes.	Diagnosing rare genetic disorders like Batten disease via CLN7 mutation detection [22].
	High-Density SNP Array or Genotyping-by-Sequencing (GBS)	Genotype large, diverse populations for GWAS to find common variants associated with quantitative susceptibility traits [23].	Mapping loci associated with variable inflammatory response to a toxicant in mouse populations.
Transcriptomic & Metabolomic Profiling	RNA-Sequencing (Bulk or Single-Cell)	Profile gene expression changes across life stages, species, or genotypes to anchor Key Events and identify susceptibility networks [17] [18].	Comparing liver response to PPARα agonist in rat vs. human hepatocytes to explain species-specific AOPs.
	High-Resolution Mass Spectrometry (Metabolomics)	Detect global biochemical shifts, revealing altered metabolic pathways that predispose or indicate progression along an AOP [17].	Identifying early metabolic biomarkers of neurotoxicity that differ between susceptible and resistant genotypes.
Functional Validation	CRISPR-Cas9 Gene Editing System	Create isogenic cell lines with specific SNPs or knockouts to validate candidate susceptibility genes in a controlled background.	Testing if a GWAS-identified SNP in a receptor gene alters calcium signaling (a Key Event) upon exposure.
	Differentiated Induced Pluripotent Stem Cells (iPSCs)	Generate human cell types (neurons, hepatocytes) from donors of varying genetics to model population variability in vitro.	Assessing inter-individual differences in neurodevelopmental toxicity using iPSC-derived neurons [18].
Comparative & Phylogenetic Tools	Multi-Species Tissue Biobank or Cell Catalog	Source biological materials from phylogenetically diverse species for direct comparative experiments.	Measuring conservation of a protein target's binding affinity across 10 mammalian species.
	Phylogenetic Analysis Software (e.g., BEAST, PHYLIP)	Reconstruct evolutionary relationships and apply comparative statistical methods (PGLS) [20].	Analyzing whether susceptibility to a compound is correlated with a trait across a primate phylogeny.
Data Integration & Modeling	AOP-Knowledgebase (AOP-Wiki) & AOP-helpFinder	Access structured AOP information and use text-mining tools to discover novel chemical-event and event-event associations [18].	Building or expanding an AOP for neurodevelopmental toxicity by mining literature for molecular events.
	Systems Biology Modeling Software	Integrate multi-omics data to construct quantitative, predictive models of pathway perturbation that incorporate genetic parameters.	Simulating how a 50% reduction in enzyme activity (due to a polymorphism) affects flux through a toxicokinetic pathway.

Understanding variability in species susceptibility is the key to transforming the AOP framework from a descriptive schematic into a predictive, quantitative model. As demonstrated, this requires the integration of three disciplinary pillars:

Comparative Biology provides the evolutionary context, distinguishing conserved from divergent pathways and justifying interspecies extrapolation [19] [20].
Life Stage Analysis identifies critical windows of heightened sensitivity, ensuring toxicity evaluations are relevant for the entire lifespan, particularly vulnerable periods like development [18].
Genetic Dissection unravels the inherited components of variability, from rare monogenic drivers of extreme sensitivity to common polymorphisms that shift population-level dose-response curves [23] [22] [24].

The future of accurate chemical risk assessment and precision toxicology lies in mechanism-based stratification. By systematically applying the methodologies outlined—PCMs, life-stage omics, and GWAS—within the AOP structure, researchers can move towards predicting not just if a stressor will cause an adverse outcome, but who (which species, which population subgroup, at which life stage) will be most at risk and why at a molecular level. This refined understanding is essential for developing safer products, targeted therapies, and protective regulations that account for real-world biological diversity.

Assessing species susceptibility represents a pivotal challenge in modern toxicology and chemical safety assessment. The Adverse Outcome Pathway (AOP) framework provides a structured, modular approach to describe the sequence of biological events leading from a molecular perturbation to an adverse outcome relevant for regulatory decision-making [27]. A critical component of this framework is the taxonomic domain of applicability (tDOA), which defines the species and life stages for which an AOP is considered biologically plausible [3]. This technical guide examines the methodologies and tools required to establish and expand tDOAs, facilitating the translation of in vitro mechanistic data to in vivo predictions across species. Focusing on a cross-species AOP network for silver nanoparticle reproductive toxicity [28], we detail integrative approaches combining in vitro human data, in vivo ecotoxicology data, and in silico extrapolation tools to bridge human and ecological risk assessment under a One Health perspective.

Adverse Outcome Pathways are conceptual frameworks that organize mechanistic knowledge into a causal sequence of measurable Key Events (KEs), beginning with a Molecular Initiating Event (MIE) and culminating in an Adverse Outcome (AO) [3] [27]. AOPs are not stressor-specific; they depict generalizable biological sequences that can be initiated by any stressor triggering a specific MIE [27]. This modularity allows KEs and Key Event Relationships (KERs) to be shared across multiple AOPs, forming complex AOP networks (AOPNs) that better reflect biological reality [27].

A persistent uncertainty in applying AOPs for regulatory hazard assessment is cross-species extrapolation. An AOP developed in a model organism (e.g., Caenorhabditis elegans) or in human cell lines requires validation to determine its relevance to other untested species [28]. The tDOA is therefore not an inherent property but a hypothesis based on the conservation of biological pathways and processes across taxa [3]. Defining the tDOA is essential for employing AOPs within New Approach Methodologies (NAMs) aimed at reducing animal testing, as it allows data from one system to inform predictions for another [29] [28]. This guide details the systematic process for assessing and expanding tDOAs, using a contemporary case study to illustrate the integration of diverse data streams and computational tools.

Core Concepts and Definitions

Adverse Outcome Pathway (AOP): A conceptual construct that describes a sequential chain of causally linked biological events at different levels of biological organization, leading from an MIE to an AO [3] [27].
Molecular Initiating Event (MIE): The initial interaction between a stressor and a biomolecule within an organism that triggers the perturbation [3] [27].
Key Event (KE): A measurable, essential change in biological state that is a required element in the progression along the pathway [3].
Key Event Relationship (KER): A scientifically justified causal or predictive link between an upstream and a downstream KE [3]. KERs are supported by evidence of biological plausibility, empirical support, and ideally, quantitative understanding [27].
Taxonomic Domain of Applicability (tDOA): The taxonomic range (species, life stages, sex) for which the AOP is considered biologically plausible. It is informed by the conservation of the MIE, intermediate KEs, and the biological processes underpinning the KERs [3] [28].
AOP Network (AOPN): Multiple AOPs linked by shared KEs and KERs, providing a more comprehensive representation of biological complexity and interacting toxicity pathways [28] [27].

Methodological Framework for Establishing Taxonomic Applicability

The workflow for defining and extending the tDOA of an AOP involves a multi-step, integrative process. The following diagram outlines the generalized framework for cross-species AOP development and tDOA expansion.

AOP Development and tDOA Expansion Workflow

Case Study: A Cross-Species AOP Network for Silver Nanoparticle Reproductive Toxicity

A 2024 study provides a seminal example of tDOA expansion [28]. It began with AOP 207, which describes NADPH oxidase and p38 MAPK activation leading to reproductive failure in C. elegans after exposure to silver nanoparticles (AgNPs). The study's objective was to integrate data from in vitro human models and other in vivo ecotoxicology studies to create a cross-species AOPN and extrapolate its tDOA.

Experimental Protocol: Data Integration and Network Building [28]

Data Collection: A set of 25 mechanistic toxicity studies on AgNPs published between 2009-2019 was assembled. Studies included in vitro human cell line data, in vivo data from C. elegans and Drosophila melanogaster, and aquatic species data.
Endpoint Mapping: Measured endpoints from each study (e.g., reactive oxygen species (ROS) levels, gene expression changes, mortality, reproduction) were systematically mapped to standardized KE terms within the AOP-Wiki framework.
Qualitative AOPN Construction: Based on biological plausibility, a preliminary AOP network was constructed by linking the mapped KEs. This network integrated pathways from different species into a unified structure.

Experimental Protocol: Bayesian Network Modeling for Quantitative Confidence [28] To quantitatively assess the causal relationships (KERs) within the constructed AOPN, a probabilistic Bayesian Network (BN) approach was employed.

Model Structure: The BN structure was defined by the hypothesized AOPN, with nodes representing KEs and directed edges representing KERs.
Parameterization: Conditional probability tables for each node were parameterized using data extracted from the 25 collected studies. This step quantifies the likelihood of a downstream KE given the state of an upstream KE.
Validation & Inference: The BN model was used to run probabilistic inferences, predicting the occurrence of AOs based on upstream MIEs or KEs. The model's predictions were compared against independent experimental data to assess its validity and the confidence in the overall AOPN.

Key In Silico Tools for Taxonomic Extrapolation

Following quantitative assessment, in silico tools were used to systematically investigate and expand the tDOA.

Experimental Protocol: Sequence Alignment to Predict Across Species Susceptibility (SeqAPASS) [28] [27] SeqAPASS compares protein sequence, structure, and functional domain similarities across species.

Input: The amino acid sequence of the primary protein target associated with the MIE or a critical KE (e.g., the NADPH oxidase protein complex in AOP 207) is used as a query.
Comparison: The tool performs pairwise alignments against protein sequences in the National Center for Biotechnology Information (NCBI) database for a wide range of species.
Output: It generates a prediction of susceptibility for each species based on tiered weights assigned to sequence similarity, functional domain conservation, and amino acid identity. A high similarity score suggests conservation of the molecular target, supporting the biological plausibility of the AOP in that species.

Experimental Protocol: Genes-to-Pathways Species Conservation Analysis (G2P-SCAN) [28] G2P-SCAN evaluates the conservation of entire biological pathways or processes (ensembles of genes) across species.

Input: A list of genes (e.g., those involved in the p38 MAPK signaling cascade or oxidative stress response from the AOPN) is defined.
Analysis: The tool uses orthology data to assess the presence and completeness of the corresponding biological pathway in the genomes of species across broad taxonomic groups (mammals, fish, invertebrates, etc.).
Output: It provides a measure of pathway conservation, identifying which taxonomic groups are likely to possess the complete biological machinery necessary for the AOP to proceed. This supports the extrapolation of KERs, not just individual molecular targets.

The integrated application of SeqAPASS (focused on molecular target conservation) and G2P-SCAN (focused on pathway conservation) provides robust, complementary evidence for hypothesizing an expanded tDOA [28].

Data Presentation: Quantitative Findings and Taxonomic Extrapolation

The following tables summarize the core quantitative relationships from the case study and the results of the tDOA expansion.

Table 1: Key Event Relationships in the AgNP AOP Network [28]

Upstream Key Event (KE)	Downstream Key Event (KE)	Biological Organization Level	Quantitative Understanding / Bayesian Network Inference
MIE: AgNP dissolution & Ag⁺ release	KE1: Increased intracellular ROS	Cellular	Probabilistic link: High [Ag⁺] strongly predicts elevated ROS probability.
KE1: Increased intracellular ROS	KE2: Activation of p38 MAPK signaling	Cellular / Molecular	Conditional probability established; ROS level modulates p38 activation state.
KE2: Activation of p38 MAPK signaling	KE3: Activation of downstream transcription factors (e.g., SKN-1/Nrf2)	Molecular	Supported by gene expression data; modeled as a causal dependency in BN.
KE3: Altered transcription factor activity	KE4: Changes in gene expression (antioxidant, apoptosis, repair pathways)	Molecular	Empirical data from transcriptomics studies used to parameterize BN node.
KE4: Changes in gene expression	KE5: Cellular apoptosis & dysfunction	Cellular	BN inference shows this linkage is essential for progression to organ-level effects.
KE5: Cellular apoptosis & dysfunction	AO: Reproductive failure (reduced brood size)	Organism	The final predictive relationship of the BN; validated against in vivo reproduction data.

Table 2: Extended Taxonomic Domain of Applicability (tDOA) for the AgNP AOP Network [28]

Ecological Compartment	Initial tDOA (Data-Driven)	Extended tDOA (In Silico Prediction)	Number of Species/Groups in Extended tDOA
Terrestrial	Caenorhabditis elegans, Drosophila melanogaster, Homo sapiens (in vitro)	Fungi, Birds, Rodents, Reptiles, Nematodes (beyond C. elegans)	Fungi: 98, Birds: 28, Others: 3+
Aquatic	Chlamydomonas reinhardtii, Daphnia magna, Oryzias latipes	Additional fish species, Crustaceans, Mollusks	Multiple taxonomic groups (Total >100)
Basis for Extension	Direct experimental evidence from collected studies.	Integrated prediction from SeqAPASS (protein target conservation) and G2P-SCAN (oxidative stress & MAPK pathway conservation).	Over 100 taxonomic groups identified as plausibly susceptible.

This table lists critical tools and resources used in the featured methodologies for developing and extrapolating AOPs.

Table 3: Research Reagent Solutions for AOP Development and tDOA Analysis

Item / Resource	Function / Purpose	Example/Citation
AOP-Wiki (aopwiki.org)	The primary collaborative knowledgebase and repository for developing, sharing, and assessing AOPs. Provides the structured template and KE ontology.	[3]
SeqAPASS Tool	A web-based tool for comparing protein sequence and structural similarity across species to predict conservation of molecular targets (MIEs/KEs).	[28] [27]
G2P-SCAN R Package	An in silico pipeline for analyzing the conservation of biological pathways and processes (gene sets) across a broad range of species.	[28]
Bayesian Network Modeling Software (e.g., Netica, GeNIe, R packages like `bnlearn`)	Used to build, parameterize, and perform probabilistic inference on quantitative AOP networks, quantifying confidence in KERs.	[28]
Standardized Assays for Key Events	Reliable, reproducible bioassays to measure specific KEs (e.g., DCFH-DA assay for ROS, phospho-specific antibodies for p38 MAPK activation, brood size assessment for reproduction).	Essential for generating empirical data to support and quantify KERs [28].
Orthology Databases (e.g., NCBI, Ensembl)	Provide the foundational genetic data required for cross-species comparisons performed by SeqAPASS, G2P-SCAN, and similar tools.	[28]

Visualizing the AOP Framework and Pathways

The core AOP conceptual framework can be visualized as a linear cascade from molecular to organism level, while specific pathways form more detailed networks.

Core AOP Framework: From MIE to AO

AgNP-Induced Reproductive Toxicity AOP Pathway

The systematic assessment of taxonomic applicability domains is fundamental to realizing the translational potential of the AOP framework. By integrating in vitro and in vivo data within a structured network and leveraging in silico extrapolation tools like SeqAPASS and G2P-SCAN, researchers can formulate biologically justified hypotheses about species susceptibility [28]. This approach directly supports the goals of NAMs and the FAIR (Findable, Accessible, Interoperable, Reusable) principles for AOP data, which aim to enhance trustability and utility for 21st-century risk assessment [29].

Future advancements will depend on the continued development of quantitative AOPs (qAOPs) with explicit probabilistic or mathematical descriptions of KERs, further integration of high-throughput screening data, and the expansion of interoperable knowledgebases. As these efforts progress, the scientifically grounded definition of tDOAs will be crucial for confidently applying mechanistic data from alternative methods to protect both human and ecosystem health.

Practical Strategies for Cross-Species AOP Evaluation and Data Integration

In the evolving paradigm of 21st-century toxicology, the Adverse Outcome Pathway (AOP) framework has emerged as a critical scaffold for organizing mechanistic knowledge. It connects a Molecular Initiating Event (MIE) triggered by a stressor to an Adverse Outcome (AO) of regulatory concern through a causally linked sequence of measurable Key Events (KEs) [3]. For researchers focused on the core thesis of assessing species susceptibility, public AOP repositories are indispensable. They provide the structured, crowd-sourced biological knowledge required to identify and compare the molecular and cellular determinants of differential sensitivity across life stages, sexes, and taxa [30].

Two primary resources anchor this field: the AOP-Wiki, the OECD-supported central repository for AOP development, and the U.S. Environmental Protection Agency's Adverse Outcome Pathway Database (AOP-DB), a decision-support tool integrating AOP information with external biological and chemical data [31] [32]. While the AOP-Wiki focuses on the qualitative, collaborative development of pathway descriptions [33], the AOP-DB enhances this foundation by adding layers of quantitative, computable data—including gene associations, chemical stressors, and human population genetic variation [32]. A comparative analysis of these platforms reveals that the AOP-Wiki excels in capturing consensus-based biological pathways, whereas the AOP-DB enables the computational interrogation of those pathways for susceptibility factors, such as genetic polymorphisms in key event genes [30] [31]. This synergy is fundamental for transitioning from qualitative pathway description to quantitative susceptibility assessment, a requirement under modern chemical safety laws like the Lautenberg Act [30].

Comparative Analysis of Repository Structure, Content, and Utility

A direct comparison of the scope, data architecture, and primary function of the AOP-Wiki and EPA AOP-DB highlights their complementary roles in toxicological research and susceptibility analysis.

Table 1: Core Comparison of the AOP-Wiki and EPA AOP-DB

Feature	AOP-Wiki	EPA AOP-DB
Primary Purpose	Collaborative development and qualitative storage of AOPs [3].	Computational integration of AOPs with external data for analysis and decision-support [32].
Governance	OECD-led programme, part of the AOP Knowledge Base (AOP-KB) [34].	U.S. Environmental Protection Agency [31].
Core Content Units	AOPs, Key Events (KEs), Key Event Relationships (KERs), Stressors [3].	AOPs, Genes, Proteins, Stressors (chemicals), Diseases, Biological Pathways, SNPs [32].
Key Quantitative Metrics (as of 2023/2021)	403 AOPs (29 OECD-endorsed) [34].	261 AOPs linked to molecular targets; 1,029 unique AOP-chemical associations [32].
Data Structure	Wiki-based; increasingly ontology-annotated; available as XML [33].	Relational SQL database; also available as semantically linked RDF data [32] [33].
Unique Strengths for Susceptibility Research	Captures taxonomic applicability of KEs; repository for weight-of-evidence assessments [3].	Directly links AOP genes to population-level SNP frequency data (e.g., from 1000 Genomes) for susceptibility characterization [30] [32].

The content within these repositories reveals specific biological focal points and gaps. An analysis of the AOP-Wiki found that AOPs related to diseases of the genitourinary system, neoplasms, and developmental anomalies are the most frequently investigated [34]. This mapping identifies areas ripe for susceptibility research, as these outcomes inherently involve complex interactions with host genetic and physiological factors. The AOP-DB further enriches this context by linking these pathways to specific human disease phenotypes via integrated sources like DisGeNET, allowing researchers to explore the continuum from molecular perturbation to population-level disease burden [31] [32].

Methodological Framework: Utilizing Repositories for Susceptibility Analysis

Foundational Protocol for AOP Development and Curation

The OECD's AOP Developers' Handbook provides a standardized workflow for creating the robust AOPs that populate these repositories [3]. This process is essential for ensuring the mechanistic clarity required for susceptibility research.

Table 2: Key Stages in the AOP Development Workflow [3]

Stage	Key Actions	Output for Repository
1. Identification	Define the AO and MIE of interest; conduct literature review.	Scope of the AOP description.
2. Description	Define intervening KEs; describe each KE (measurement method, taxonomic applicability).	Individual KE wiki pages.
3. Relationship Building	Define and support Key Event Relationships (KERs) with biological plausibility and empirical evidence.	KER wiki pages with weight of evidence.
4. Weight of Evidence Assessment	Evaluate essentiality of KEs and overall confidence in the AOP using Bradford Hill considerations.	Summary assessment guiding regulatory applicability.

A best practice is to define KEs as measurable, essential changes at a specific level of biological organization (e.g., cellular, tissue, organ). This modularity allows a single KE (e.g., "Increased oxidative stress in hepatocytes") to be reused in multiple AOPs, creating networks that are crucial for understanding how susceptibility may alter outcomes across different pathways [35].

Experimental Protocol for Integrating Genetic Susceptibility Data with AOPs

The EPA AOP-DB enables a specific computational methodology to anchor susceptibility research within the AOP framework [30]. The following protocol outlines the key steps:

Step 1: AOP Selection and Gene Target Identification. Select an AOP of interest from the AOP-Wiki or AOP-DB. Using the AOP-DB, extract the official list of gene/protein identifiers associated with the KEs and MIEs of the selected AOP. This mapping is performed by the AOP-DB by parsing protein ontology terms from the AOP-Wiki's Key Event components [32].

Step 2: Acquisition of Population Genetic Variant Data. For each human gene identified in Step 1, query integrated population databases (e.g., 1000 Genomes, gnomAD) via the AOP-DB or directly to obtain single nucleotide polymorphism (SNP) data. Focus on functional variants (e.g., missense, regulatory) and record their global and sub-population allele frequencies [30] [32].

Step 3: Functional Impact and Pathway Analysis. Use bioinformatics tools to predict the functional impact of high-frequency variants (e.g., using SIFT, PolyPhen-2). Perform over-representation analysis linking the AOP gene set to biological pathways (via GO, KEGG) and disease phenotypes (via DisGeNET) to contextualize potential susceptibility [34] [30].

Step 4: In Silico and In Vitro Hypothesis Testing. Prioritize genes containing high-impact, high-frequency variants for experimental follow-up. This can involve using diversity panels of human cell lines or in silico toxicokinetic modeling to test for differential chemical sensitivity predicted by the genetic variant [30].

Diagram Title: Computational Workflow for AOP-Based Genetic Susceptibility Analysis

Effectively leveraging public AOP repositories requires a suite of integrated tools and databases. The following toolkit is essential for researchers conducting comparative analyses and susceptibility assessments.

Table 3: Research Reagent Solutions for AOP and Susceptibility Analysis

Tool/Resource Name	Type	Primary Function in Analysis
AOP-Wiki	Primary Repository	Foundational source for qualitative AOP descriptions, weight of evidence, and taxonomic applicability [34] [3].
EPA AOP-DB	Integrated Database	Core platform for linking AOPs to genes, chemicals, diseases, and population SNP data for computational analysis [31] [32].
AOP-helpFinder	Literature Mining Tool	AI-based tool to automatically screen literature for associations between stressors/genes and AOP components, aiding development [34].
DisGeNET	Disease-Gene Database	Integrated within AOP-DB to provide associations between AOP gene targets and human disease phenotypes, contextualizing AOs [34] [32].
1000 Genomes / gnomAD	Population Genomics Database	Source of allele frequency data for functional SNPs in AOP genes, used to characterize potential population-specific susceptibility [30] [32].
ConsensusPathDB / KEGG	Pathway Database	Used for over-representation analysis to determine if AOP gene sets are enriched in specific biological pathways [32].
AOP-Wiki RDF	Semantic Web Resource	Machine-readable, FAIR-compliant version of the AOP-Wiki that enables complex queries and integration with external linked data [33].

The integration between these tools is becoming more seamless. For instance, the AOP-Wiki RDF (Resource Description Framework) conversion semantically annotates AOP content using over 20 ontologies and creates >7,500 link-outs to gene and protein databases, directly addressing prior interoperability challenges [33]. This allows for federated SPARQL queries that can, for example, retrieve all measurement methods for key events leading to a specific adverse outcome across linked resources.

Visualization and Analysis of AOP Networks for Susceptibility

A key output of comparative repository analysis is the construction of AOP networks (AOPNs), which visualize how shared KEs connect different pathways. These networks are critical for identifying convergent points of susceptibility—where a single genetic or physiological factor could modulate multiple adverse outcomes.

Diagram Title: AOP Network Formed by Shared Key Event Hubs

In this network model, KE4 and KE5 represent shared, modular events. Genetic variation or targeted modulation at these hub events could influence susceptibility to multiple initiating stressors and alter the progression to several adverse outcomes [35]. Comparative analysis using the AOP-DB can quantify the gene associations at these hubs and link them to population variation data, moving the network from a qualitative map to a quantitative susceptibility model.

The synergistic use of the AOP-Wiki and the EPA AOP-DB provides a powerful, structured approach for advancing the assessment of species and population susceptibility. The AOP-Wiki serves as the definitive source for consensus-driven biological pathways, while the AOP-DB acts as the computational engine that overlays genetic, chemical, and disease data to interrogate those pathways for points of differential sensitivity [34] [32]. The methodology outlined—from systematic AOP development to the integration of population genomics—establishes a reproducible framework for transforming qualitative pathway descriptions into testable susceptibility hypotheses.

Future development will focus on enhancing the quantitative AOP (qAOP) capabilities of these repositories. This involves integrating more kinetic and dynamic data into KERs to allow for predictive modeling of perturbation thresholds [32]. Furthermore, improving the semantic interoperability between the AOP-Wiki RDF and other biomedical linked data will enable fully automated, cross-domain queries essential for large-scale susceptibility screening [33]. Ultimately, the continued evolution of these public repositories toward more FAIR (Findable, Accessible, Interoperable, Reusable) principles is fundamental to fulfilling the regulatory mandate for protecting susceptible populations and advancing animal-free chemical safety assessment [34] [30].

Assessing species susceptibility is a fundamental challenge in chemical safety and translational toxicology. The Adverse Outcome Pathway (AOP) framework has emerged as a powerful conceptual tool for organizing mechanistic knowledge, depicting a sequential chain of causally linked biological events from a Molecular Initiating Event (MIE) to an Adverse Outcome (AO) relevant to risk assessment [27]. However, a critical uncertainty in applying AOPs lies in cross-species extrapolation: determining whether a pathway characterized in one model species (e.g., rat, zebrafish) is conserved and operative in another species of concern (e.g., a human, or an ecologically relevant endangered species) [27].

This technical guide posits that robust molecular annotation of Key Events (KEs) is the cornerstone for reliably evaluating species susceptibility within the AOP framework. By precisely defining KEs at the level of genes, proteins, and functional pathways, and by systematically comparing these molecular networks across species, researchers can move from qualitative assumption to quantitative, evidence-driven prediction. The thesis herein is that integrating orthology analysis, functional genomics, and cross-species bioinformatics into AOP development transforms the framework into a predictive tool for species susceptibility, ultimately supporting next-generation, animal-free safety assessments [36].

The AOP Framework and the Imperative for Molecular Annotation

An AOP is a modular assembly of knowledge, structured as a series of "biological dominos." It begins with a stressor (e.g., a chemical) interacting with a specific biological molecule, the MIE. This triggers a sequence of measurable KEs at increasing levels of biological organization (cellular, tissue, organ, organism), culminating in an AO relevant for regulatory decision-making [27]. Key Event Relationships (KERs) describe the causal linkages between KEs and are supported by evidence of biological plausibility, empirical support, and quantitative understanding [27].

A foundational principle is that AOPs are not stressor-specific; they depict a generalized sequence of biological effects that can be triggered by any stressor causing the same MIE [27]. This generality makes them ideal for cross-species evaluation, but only if the molecular components of the pathway are well-defined. The pathway itself, represented as nodes (KEs) and edges (KERs), can be linked with other AOPs to form AOP networks, capturing the complexity of real biological systems [27].

Molecular annotation provides the granular definition for each KE. Instead of a KE described as "oxidative stress," molecular annotation specifies the involved genes (HMOX1, SOD2), proteins, and the precise pathway (e.g., Nrf2-mediated oxidative stress response). This precise definition enables the interrogation of evolutionary conservation and facilitates the use of in vitro or computational data to predict outcomes in vivo.

Table 1: Core AOP Terminology and the Role of Molecular Annotation [27].

Term	Definition	Role of Molecular Annotation
Molecular Initiating Event (MIE)	The initial interaction between a stressor and a biological target.	Defines the specific molecular target (e.g., protein, DNA, receptor) and its variants.
Key Event (KE)	A measurable biological change essential to the progression to the AO.	Identifies the genes, proteins, and biomarkers that constitute the measurable change.
Key Event Relationship (KER)	A scientifically supported causal link between two KEs.	Describes the molecular signaling or regulatory mechanism linking the annotated events.
Adverse Outcome (AO)	A biological change of regulatory significance at the organism or population level.	Connects molecular/cellular pathway perturbations to higher-order biology.

Strategies for Molecular Annotation and Cross-Species Analysis

Defining the Molecular Landscape of Key Events

A KE can be annotated at multiple, complementary levels:

Gene/Protein Level: Identifying the specific gene(s) and protein(s) whose expression, activity, or state is altered (e.g., CYP19A1 aromatase inhibition, TP53 activation).
Pathway/Process Level: Mapping the event to a canonical biological pathway (e.g., KEGG, Reactome) or Gene Ontology (GO) biological process (e.g., "apoptotic signaling pathway," "steroid biosynthesis") [37].
Network Level: Placing the event within a protein-protein interaction (PPI) network or gene co-expression module, providing context on functional partnerships and regulatory neighborhoods [37].

Advanced approaches integrate genome-wide association study (GWAS) data with pathway and network databases. For instance, a study on thyroid carcinoma susceptibility combined GWAS data with PPI networks and tissue-specific gene expression to identify gene subnetworks related to glycogen metabolism and insulin signaling as being enriched for association signals, providing novel mechanistic insights beyond single-gene associations [37].

Computational Pipelines for Conservation Analysis: G2P-SCAN

Systematic cross-species analysis requires tools that synthesize data from diverse biological databases. The Genes-to-Pathways Species Conservation Analysis (G2P-SCAN) pipeline is a dedicated R package for this purpose [36]. It extracts, synthesizes, and structures data on gene orthologs, protein families, and pathway reactions across multiple species.

The pipeline operates by taking a set of human genes associated with a pathway or KE as input. It then:

Identifies orthologs across a set of relevant model species.
Analyzes conservation at the level of protein functional families.
Assesses the overall conservation of the entire pathway or biological process in the target species.

G2P-SCAN enables the quantification of conservation and susceptibility at the pathway level, moving beyond simple "present/absent" orthology calls to a functional assessment of whether the entire mechanistic module is likely conserved [36].

Diagram 1: The G2P-SCAN Pipeline for Cross-Species Pathway Analysis [36].

Integrating Single-Cell Genomics and Cross-Species Alignment

The rise of single-cell RNA-sequencing (scRNA-seq) atlases across species offers unprecedented resolution for comparing cell-type-specific pathway conservation. However, integrating data across species presents a "species effect," where global transcriptional differences can obscure the identification of homologous cell types [38].

A comprehensive benchmark (BENGAL) evaluated 28 strategies combining gene homology mapping methods and integration algorithms (e.g., SeuratV4, Harmony, scVI, SAMap) [38]. Key findings for AOP annotation include:

Balancing Mixing and Conservation: Top-performing algorithms (e.g., scANVI, scVI, SeuratV4) successfully mix homologous cell types from different species while preserving biologically distinct, species-specific cell populations [38].
Homology Mapping is Critical: For evolutionarily distant species, using homology mappings that include in-paralogs (species-specific gene duplications) can be more informative than strict one-to-one ortholog lists [38].
Specialized Tools for Distant Species: For highly divergent species with poor genome annotation, tools like SAMap, which iteratively builds gene-gene and cell-cell mapping graphs de novo, can outperform standard pipelines [38].

Table 2: Benchmarking Cross-Species scRNA-seq Integration Strategies (Summary) [38].

Integration Goal	Recommended Strategy	Rationale for AOP Context
General-purpose integration of 2-4 species	SeuratV4 (CCA or RPCA) or scVI	Reliable mixing of homologous cell types; good computational scalability. Enables comparison of KE responses in specific conserved cell types.
Integration of evolutionarily distant species	SAMap or methods using in-paralog-inclusive homology	Overcomes challenges with poor orthology annotation. Essential for ecological AOPs comparing, e.g., fish and invertebrate models.
Maximizing conservation of rare or species-specific cell types	Methods with high "biology conservation" scores (e.g., Harmony)	Prevents over-correction that could obscure unique, potentially susceptible cell populations in one species.

Experimental Protocols for Cross-Species Pathway Investigation

Transcriptomic Profiling for Conserved "Toolkit" Discovery

A systems-level approach to identify evolutionarily conserved genetic modules involves comparative transcriptomics across multiple species subjected to an analogous biological challenge [39].

Protocol: Identifying a Conserved Neurogenomic Response to Social Challenge [39]

Experimental Design:
- Species: Select phylogenetically diverse model organisms with sequenced genomes and defined behaviors (e.g., honey bee, three-spined stickleback fish, mouse).
- Stimulus: Apply a standardized, analogous challenge (e.g., territorial intrusion by a conspecific) to experimental groups. Use a non-social, novel object as a control.
- Tissue & Timing: Dissect brain regions homologous in function (e.g., involved in social behavior) at multiple time points post-challenge (e.g., 30, 60, 120 min) to capture dynamic responses.

RNA-seq & Differential Expression:
- Perform bulk RNA-sequencing on challenged vs. control tissues for each species and time point.
- Generate lists of differentially expressed genes (DEGs) for each comparison.
Cross-Species Computational Analysis:
- Orthology Mapping: Map DEGs from all species to a common orthology framework (e.g., OrthoDB).
- Gene Set Analysis: Test for enrichment of orthologous gene groups among DEG lists from different species.
- Module Discovery: Use co-expression network analysis (WGCNA) within each species to identify gene modules correlated with the challenge state. Then, test for preservation of these module-trait relationships across species.
- Functional Enrichment: Analyze conserved gene sets/modules for enrichment in GO terms, KEGG, or Reactome pathways.
Outcome: This protocol identified a conserved "toolkit" involving nuclear receptors, chaperones, and transcriptional regulators of mitochondrial metabolism and synaptic function in response to social challenge across three divergent species [39].

In Vitro to In Vivo Extrapolation Using an AOP

This protocol uses a defined AOP to design an in vitro testing strategy that predicts an in vivo AO across species.

Protocol: Assessing Estrogen Receptor Pathway Disruption [27]

AOP Selection: Use the well-established AOP linking Estrogen Receptor Agonism (MIE) to Population Decline via Reduced Fecundity (AO) in fish.
Molecular Annotation:
- KE1 (MIE): Agonism of the nuclear estrogen receptor (ERα/ESR1).
- KE2: Altered vitellogenin (VTG) gene expression in liver.
- KE3: Reduced plasma estradiol (E2).
- KE4: Inhibition of oocyte maturation.
- AO: Reduced fecundity.
Cross-Species Investigation:
- Use a tool like the EPA's SeqAPASS to compare the protein sequence and predicted structure of the ESR1 ligand-binding domain between a test species (e.g., fathead minnow) and a species of concern (e.g., an endangered darter).
- If high sequence similarity is confirmed, hypothesize that the AOP is conserved.
Testing:
- In Vitro: Test the chemical for ESR1 agonist activity in a human or fish ER reporter gene assay.
- In Vivo Prediction: Using the AOP and the confirmed cross-species ESR1 conservation, predict that concentrations activating the in vitro assay will also cause vitellogenin induction and reduced fecundity in the endangered species.
- Targeted In Vivo Confirmation (if needed): If regulatory certainty requires, conduct a focused in vivo study on the test species measuring the intermediate KEs (VTG, E2) to confirm pathway activation.

Table 3: Essential Research Reagent Solutions for Molecular Annotation and Cross-Species Analysis.

Resource Category	Specific Tool / Database	Primary Function in Molecular Annotation	Relevant Search Result
AOP Knowledge Base	AOP-Wiki (aopwiki.org)	Central repository for collaborative AOP development, sharing, and review. Provides structured templates for defining KEs and KERs.	[27]
Orthology & Protein Family	OrthoDB, Ensembl Compara, InterPro	Identify orthologous genes and conserved protein domains/families across species. Foundational for G2P-SCAN and homology mapping.	[36] [38]
Pathway & Network Databases	KEGG, Reactome, GO, STRING	Provide canonical pathway maps, molecular interactions, and functional gene sets for annotating KEs and performing enrichment analysis.	[37]
Cross-Species Sequence Analysis	SeqAPASS (EPA)	Predicts chemical susceptibility based on the conservation of molecular target sequences (e.g., protein, DNA) across species.	[27]
AI-Driven Literature Mining	AOP-helpFinder	Uses natural language processing to rapidly scan scientific literature (PubMed) and extract potential stressor-event and event-event pairs to support AOP development.	[40]
scRNA-seq Integration Algorithms	SeuratV4, scANVI, Harmony, SAMap	Computational tools to integrate single-cell genomics data across species, identifying conserved and divergent cell types and expression programs.	[38]
Conservation Analysis Pipeline	G2P-SCAN (R package)	Synthesizes orthology and pathway data to analyze and visualize the conservation of biological pathways across user-defined species sets.	[36]

Case Study and Data Integration

Case: Assessing Susceptibility to Radiation-Induced Microcephaly A study within the EU H2020 RadoNorm project demonstrates the integrated application of molecular annotation and AI tools for AOP development in a cross-species context [40].

Problem: Investigate the AOP for radiation-induced microcephaly (small head size), a neurodevelopmental defect.
Molecular Annotation via AI: Researchers used AOP-helpFinder to screen PubMed abstracts for co-occurrence of stressors ("ionizing radiation") and outcomes ("small head size") [40]. The tool prioritized relevant biological events from the literature.
Pathway Building & Knowledge Integration: The AI-generated leads were evaluated by experts. Information from complementary toxicogenomics and systems biology databases was integrated to identify and flesh out candidate KEs (e.g., "Neural progenitor cell apoptosis," "Disruption of cortical neuron migration") [40].
Cross-Species Consideration: The resulting AOP (AOP 441) provides a structured, mechanistically defined pathway. The molecular annotation of its KEs (e.g., specific genes regulating apoptosis or migration) allows for the subsequent use of orthology and pathway conservation tools (e.g., G2P-SCAN, SeqAPASS) to evaluate which species are likely susceptible based on the conservation of these critical biological modules.

Diagram 2: Molecular Annotation Informs an AOP for Radiation-Induced Microcephaly [40].

Molecular annotation transforms KEs from descriptive biological concepts into defined, investigable molecular entities. This precision enables the application of a powerful suite of bioinformatic and experimental tools to tackle the core challenge of cross-species extrapolation in toxicology and risk assessment. By determining whether the genes, proteins, and pathways that constitute an AOP are conserved in a species of concern, we can move from default uncertainty to evidence-based predictions of susceptibility.

The integration of evolutionarily aware pipelines like G2P-SCAN, advanced data integration algorithms for single-cell genomics, and AI-assisted knowledge mining tools represents the cutting edge of this field. As these approaches mature and are systematically applied, the AOP framework will increasingly fulfill its promise as a robust, mechanistic basis for predicting adverse outcomes across the tree of life, ultimately supporting more efficient and protective safety decisions for both human health and the environment.

The assessment of chemical hazards, particularly for complex endpoints like endocrine disruption (ED), requires a paradigm shift from observational toxicology to mechanism-based prediction. The Adverse Outcome Pathway (AOP) framework organizes mechanistic knowledge into a sequence of measurable biological events, from a Molecular Initiating Event (MIE) to an Adverse Outcome (AO) of regulatory concern [27]. For endpoints governed by interconnected biological pathways—such as those affecting the estrogen, androgen, thyroid, and steroidogenesis (EATS) modalities—single, linear AOPs are insufficient. AOP Networks (AOPNs) are the functional unit of prediction, capturing this complexity by linking multiple AOPs through shared Key Events (KEs) [41] [27].

This technical guide is framed within a critical thesis: assessing species susceptibility in toxicological research is not merely an endpoint consideration but a foundational element that must be integrated into the very architecture of AOP development and network construction. Species susceptibility arises from differences in the conservation of molecular targets, tissue-specific expression of proteins, metabolic capacity, and compensatory physiological responses. Therefore, a robust, data-driven approach to building AOPNs must systematically incorporate taxonomic and life-stage applicability from its inception. This approach enables more credible cross-species extrapolation, helping to identify susceptible populations and replace uncertain assessment factors with mechanistic, species-specific data [27].

Foundational Principles of AOP Networks

An AOP describes a causal sequence linking a MIE (e.g., receptor binding) through a series of intermediate KEs to an AO (e.g., population decline) [42]. Key Event Relationships (KERs) define the causal links between KEs based on biological plausibility, empirical evidence, and quantitative understanding [27]. The modularity of AOP components—where KEs and KERs are discrete units—allows for their recombination [42]. An AOPN emerges when multiple AOPs share common KEs, creating a web of potential toxicological pathways emanating from different stressors or converging on common adverse outcomes [41] [27].

Table 1: Core Definitions of the AOP Framework [27] [42]

Term	Abbreviation	Definition
Molecular Initiating Event	MIE	The initial interaction between a stressor and a biomolecule that perturbs biological function.
Key Event	KE	A measurable, essential change in biological state at any level of organization (e.g., cellular, tissue, organ).
Adverse Outcome	AO	An adverse effect of direct regulatory or protective concern, considered a specialized KE.
Key Event Relationship	KER	A scientifically supported, causal relationship describing how an upstream KE leads to a downstream KE.
AOP Network	AOPN	An interconnected web of AOPs linked through shared KEs and/or KERs.

For endocrine disruption, AOPNs are particularly valuable. The EU criteria for identifying an ED require establishing a causal link between an endocrine mode of action and an adverse effect [41]. AOPNs can map the multitude of potential pathways through which a chemical might perturb the endocrine system (e.g., via estrogen or thyroid hormone receptors) and link these to various AOs (e.g., developmental neurotoxicity, reproductive impairment), thereby directly supporting the weight-of-evidence assessment for causality [43].

A Data-Driven Workflow for AOP Network Construction

The manual, expert-driven construction of AOPNs becomes impractical as the number of AOPs in repositories like the AOP-Wiki grows. A reproducible, data-driven workflow is essential [41]. The following protocol outlines a generalized, five-stage process adaptable to different problem formulations, with explicit considerations for species susceptibility at each stage.

Table 2: Five-Stage Workflow for Data-Driven AOP Network Generation [41]

Stage	Core Activity	Output	Species Susceptibility Integration
1. Problem Formulation	Define scope, stressors, and AOs of interest.	A precise research or regulatory question.	Explicitly state the taxa, life stages, and sexes of concern.
2. AOP Identification & Curation	Execute structured searches in the AOP-Wiki; manually curate results.	A list of relevant, high-confidence AOPs and their components.	Filter AOPs/KEs by taxonomic applicability domains documented in the AOP-Wiki.
3. Data Extraction & Processing	Use computational scripts to extract and clean data for the curated AOP list.	Structured data files (e.g., CSV, JSON) of KEs and KERs.	Extract and preserve metadata on species, life stage, and sex for each supporting evidence item.
4. Network Assembly & Visualization	Programmatically generate network graphs linking shared KEs.	Visual AOPN maps (e.g., Graphviz, Cytoscape).	Use node/edge color or shape to denote the level of evidence for different species.
5. Application & Hypothesis Testing	Use the AOPN to guide testing, data integration, or WoE assessment.	A mechanism-based hazard assessment or testing strategy.	Identify conserved vs. divergent pathways to prioritize testing in relevant models.

Experimental Protocol: Generating an EATS-Focused AOPN

The following detailed methodology is adapted from a published case study on building an AOPN for EATS modalities [41].

1. Materials and Data Sources:

Primary Data Source: The AOP-Wiki (aopwiki.org), the crowd-sourced knowledgebase for AOPs [41] [42].
Computational Environment: R statistical programming environment.
Key Software/Packages: rWikiPathways or custom R scripts for API access to the AOP-Wiki; igraph, visNetwork, or Graphviz for network analysis and visualization [41].

2. Procedure:

Step 1 – Develop Search Strategy: Derive search terms from regulatory guidance documents specific to the endpoint. For the EATS case study, terms were extracted and simplified from the ECHA/EFSA Guidance on Identifying Endocrine Disruptors [41]. Example terms include "estrogen receptor agonism," "androgen synthesis inhibition," "thyroid hormone disruption," and "steroidogenesis."
Step 2 – Execute and Curate Search: Perform a full-text search of AOP pages in the AOP-Wiki using the prepared terms. Manually screen all returned AOPs to exclude those irrelevant to the specific problem formulation (e.g., AOs outside the scope of EATS-mediated adversity). This step requires expert toxicological judgment.
Step 3 – Extract Structured Data: For the final list of relevant AOPs, use a computational workflow (e.g., the provided R-script [41]) to query the AOP-Wiki Application Programming Interface (API). Extract structured data for all KEs and KERs within the selected AOPs, including their unique identifiers, names, descriptions, and taxonomic applicability information.
Step 4 – Construct the Network: Process the extracted data to identify shared KEs across different AOPs. Programmatically generate a network where nodes represent KEs (MIEs, intermediate KEs, AOs) and edges represent the KERs linking them. The shared KEs become the connecting hubs of the network.
Step 5 – Visualize and Filter: Render the network using visualization software. Implement filters to view specific sub-networks (e.g., only thyroid-related pathways, only pathways with high weight of evidence for a specific species like human or zebrafish).

Diagram 1: Conceptual EATS AOP Network (72 chars)

Case Study Application: PFOS as a Putative Thyroid Endocrine Disruptor

A 2024 case study on Perfluorooctane sulfonic acid (PFOS) demonstrates the application of an AOPN for a mechanism-based ED assessment [43]. The workflow mirrors the general protocol but is directed at a specific chemical.

Experimental Protocol:

Network Generation: A targeted AOPN for the thyroid modality leading to developmental neurotoxicity (DNT) was constructed [43].
Evidence Gathering: A systematic literature search for PFOS was performed, with findings mapped back to the KEs in the pre-defined thyroid-DNT AOPN.
Weight-of-Evidence (WoE) Assessment: The strength, consistency, and biological relevance of evidence for each KE and KER in the network were evaluated. The AOPN served as an organizing framework for this assessment.
Conclusion & Gap Analysis: The study concluded that while PFOS fulfilled ED criteria in a standard assessment, the mechanism-based assessment via the AOPN was hindered by a lack of quantitative data linking thyroid perturbation to the DNT AO in the context of PFOS exposure. It also highlighted potential alternative, non-endocrine neurotoxic pathways [43].

Table 3: Key Findings from the PFOS AOPN Case Study [43]

Assessment Component	Standard ED Assessment	Mechanism-Based (AOPN) Assessment	Implication for Species Susceptibility
Overall Conclusion on PFOS	Fulfilled EU criteria for an endocrine disruptor.	Could not be conclusively identified as an ED via the thyroid-DNT pathway.	Highlights that a positive apical test in one species may not confirm a specific, conserved MoA.
Critical Data Gap Identified	Lack of in vivo data on specific endpoints.	Lack of quantitative AOPs (qAOPs) defining response-response relationships.	Without quantitative understanding, predicting effect thresholds across species is highly uncertain.
Alternative MoA Identified	Not typically considered.	Direct neurotoxicity pathway was plausible based on in vitro data.	Different species may express varying susceptibility to endocrine vs. direct neurotoxic effects.

The Scientist's Toolkit: Research Reagent Solutions

Constructing and applying AOPNs relies on both digital and wet-lab tools. The following table details essential resources.

Table 4: Research Reagent Solutions for AOP Network Development [41] [27] [42]

Item / Resource	Category	Function in AOPN Workflow	Example / Source
AOP-Wiki API	Data Source	Provides programmatic access to extract structured data on AOPs, KEs, and KERs for automated network building.	https://aopwiki.org/api [41]
Computational Scripts (R/Python)	Software	Automate data extraction, processing, network assembly, and visualization; essential for reproducible research.	Published R-script for AOPN generation [41]
Network Visualization Tool	Software	Enables interactive exploration, filtering, and communication of complex AOP networks.	Cytoscape, Graphviz, R packages (`visNetwork`, `igraph`) [41]
SeqAPASS Tool	Bioinformatics Tool	Informs species susceptibility by comparing protein sequence similarity to identify taxonomically conserved molecular targets (MIEs).	U.S. EPA's Sequence Alignment to Predict Across Species Susceptibility tool [27]
New Approach Methodologies (NAMs) Battery	Experimental Assays	Generate mechanistic data to populate and test KEs within an AOPN, especially for data-poor chemicals.	In vitro high-throughput screens, transcriptomics, high-content imaging [43]
OECD AOP Developers' Handbook	Guidance Document	Provides standardized practices for developing and documenting AOP components, ensuring network consistency and reliability.	OECD Handbook (v2.7, 2024) [42]

Visualizing Cross-Species Susceptibility within an AOPN

Integrating species-specific information transforms a static AOPN into a dynamic tool for susceptibility analysis. The following diagram concept illustrates how taxonomic applicability can be embedded into the network's architecture.

Diagram 2: AOPN with Species Applicability Layer (58 chars)

The data-driven construction of AOP Networks represents a cornerstone of next-generation risk assessment. By providing a structured, visual map of mechanistic knowledge, AOPNs enable hypothesis-driven testing, integration of New Approach Methodologies (NAMs), and a systematic weight-of-evidence evaluation for complex endpoints like endocrine disruption [41] [43]. Crucially, when built with species susceptibility as a core design principle—by explicitly documenting taxonomic applicability for each KE and KER—these networks become powerful tools for credible cross-species extrapolation. They move the field beyond default assessment factors and toward predictions grounded in comparative biology.

The primary challenges moving forward are the development of Quantitative AOPs (qAOPs) to define predictive, numerical relationships between KEs [43], and the expansion of the taxonomic scope of evidence within the AOP Knowledge Base. Overcoming these challenges will fulfill the thesis promise of AOPNs: transforming the assessment of species susceptibility from an exercise in uncertainty management into a discovery-driven investigation of shared and divergent vulnerabilities across the tree of life.

The paradigm in toxicology and risk assessment is decisively shifting from observational animal studies toward mechanistic, human-relevant New Approach Methodologies (NAMs) [44]. Central to this evolution is the Adverse Outcome Pathway (AOP) framework, which organizes knowledge into a sequence of measurable Key Events (KEs), from a Molecular Initiating Event (MIE) to an adverse outcome at the organism or population level [29] [31]. This structured, modular framework is particularly powerful for investigating inter-species and inter-individual susceptibility, as differences can be mapped to specific KEs and their relationships.

Assessing susceptibility is not merely an academic exercise but a practical imperative for improving the predictive accuracy of safety assessments and enabling precision toxicology [44]. This guide positions the assessment of susceptibility within a broader thesis: that by systematically quantifying and comparing the dynamics of AOPs across species and genetic backgrounds, we can identify critical nodes of susceptibility, develop targeted testing strategies, and ultimately refine chemical and drug safety decisions to protect vulnerable sub-populations.

Drug-Induced Liver Injury (DILI) serves as an ideal case study for this thesis. The liver is a prime target for xenobiotics, and DILI remains a leading cause of drug attrition and post-market withdrawal [44]. DILI AOPs for steatosis, cholestasis, fibrosis, and cancer have been described, providing a mature scaffold for susceptibility analysis [45]. By dissecting these pathways, we can move from a general understanding of hazard to a precise evaluation of risk that accounts for variability.

Foundational Concepts: AOP Networks and the Quantification of Susceptibility

An individual AOP is a linear simplification of a biological process. In reality, toxicity often arises from the interplay of multiple pathways. AOP networks are constructed by connecting individual AOPs that share one or more KEs, providing a more realistic representation of complex toxicological outcomes like DILI [44]. Within a network, susceptibility can manifest as:

Differential KE Sensitivity: A KE (e.g., mitochondrial dysfunction) may be triggered at a lower stressor dose in a susceptible species or individual.
Alternative Path Engagement: A stressor may engage a different, more severe network branch in a susceptible population.
Altered Network Resilience: Compensatory or repair mechanisms within the network may be less efficient, allowing progression to adverse outcomes.

The FAIR (Findable, Accessible, Interoperable, Reusable) principles are critical for advancing susceptibility research [29]. Machine-actionable AOP data, standardized annotations, and interoperable tools allow for the integration of diverse data streams—including high-throughput in vitro screening (ToxCast), genomic variants, and clinical data—which is essential for identifying and validating susceptibility factors [29] [31].

Case Study Core: Key DILI AOPs and Susceptibility Modulators

DILI encompasses a spectrum of pathologies, each with distinct but sometimes overlapping AOPs. The table below summarizes key AOPs and the biological factors that confer susceptibility at different levels of organization.

Table 1: Primary DILI AOPs and Associated Susceptibility Factors

DILI Pathology	Molecular Initiating Event (MIE)	Critical Key Events (KEs)	Identified Susceptibility Modulators
Cholestasis	Inhibition of the Bile Salt Export Pump (BSEP) [44]	1. Intracellular bile acid accumulation2. Impaired bile flow3. Hepatocyte injury & inflammation	Genetic polymorphisms in ABCB11 (BSEP), ABCC2 (MRP2); Comedication; Inflammatory cytokines [44]
Steatosis	Inhibition of mitochondrial fatty acid β-oxidation [45]	1. Increased de novo lipogenesis2. Triglyceride accumulation3. Lipotoxicity & oxidative stress	Genetic variants in PNPLA3; Metabolic state (obesity, diabetes); Diet [45]
Fibrosis	Chronic hepatocyte apoptosis/necrosis [45]	1. Activation of hepatic stellate cells (HSCs)2. Excessive extracellular matrix (ECM) deposition3. Tissue remodeling & dysfunction	Polymorphisms in immune response genes; Pre-existing liver disease; Gut microbiota composition [44]
Carcinogenesis	DNA adduct formation & genotoxicity [45]	1. Sustained cellular proliferation2. Altered cell death (inhibited apoptosis)3. Neoplastic transformation	Efficiency of DNA repair mechanisms; Tumor suppressor gene status; Chronic injury milieu [45]

Susceptibility can be quantified by measuring differences in KE response thresholds, kinetic rates between KEs, or the overall network perturbation burden required to trigger the adverse outcome. This quantitative approach transforms the AOP from a qualitative description into a predictive model for susceptibility.

Methodological Framework: Experimental & Computational Protocols for Susceptibility Assessment

Assessing susceptibility within an AOP framework requires a tiered, integrated strategy that moves from in silico identification of susceptibility factors to in vitro and ex vivo validation.

Computational Identification of Susceptibility Nodes

Objective: To mine integrated biological data and identify potential genetic and molecular determinants of susceptibility for a given AOP. Protocol:

AOP Network Query: Using resources like the EPA AOP-DB, identify genes, proteins, and pathways formally linked to the DILI AOP of interest (e.g., steatosis) [31]. Export associated gene lists.
Variant Data Integration: Cross-reference the AOP gene list with population genomics databases (e.g., gnomAD, dbSNP) to identify functional variants (missense, loss-of-function) with varying allele frequencies across populations [31].
In Silico Prioritization: Use tools to predict the impact of identified variants on protein function (e.g., SIFT, PolyPhen-2). Prioritize variants in genes that are hub nodes within the AOP network or that encode proteins at MIEs.
Confidence Scoring: Apply a framework (e.g., from DisGeNET) to assign confidence scores to gene-disease-variant associations based on the weight of evidence from multiple sources [31].

In Vitro Cross-Species & Genotype-Specific KE Profiling

Objective: To empirically test differential KE activation in models representing varying susceptibility. Protocol:

Model System Selection: Use primary hepatocytes or induced pluripotent stem cell (iPSC)-derived hepatocytes from multiple species (e.g., rat, human) or from human donors with different genotypes at a susceptibility locus (e.g., PNPLA3 I148M variant for steatosis) [44].
Tiered KE Assessment: Expose models to a concentration range of the stressor.
- Transcriptional Tier: Use qRT-PCR or RNA-seq to measure KE-related gene expression (e.g., ACOX1 for β-oxidation).
- Translational/Functional Tier: Employ high-content imaging or biochemical assays to measure functional KEs (e.g., intracellular triglyceride accumulation via Oil Red O staining, mitochondrial membrane potential via JC-1 dye) [44].
Dose-Response Modeling: Fit dose-response curves (e.g., using a 4-parameter logistic model) for each KE in each model. Calculate Benchmark Doses (BMDs) or EC₁₀/EC₅₀ values.
Susceptibility Quantification: Compare BMDs/EC values across models. A statistically significant lower BMD in one model indicates greater susceptibility at that specific KE. This data can be used for Quantitative AOP (qAOP) modeling.

A 10-step workflow for assessing susceptibility in DILI AOPs from hypothesis to model.

Table 2: Research Toolkit for AOP-Based Susceptibility Assessment

Tool/Resource Category	Specific Item	Function in Susceptibility Assessment
AOP Knowledge Bases	AOP-Wiki (AOP-KB) [31], EPA AOP-DB [31]	Central repositories for curated AOP information, enabling the identification of genes and KEs for focus. The AOP-DB allows integration with chemical, gene, and disease data [31].
In Vitro Model Systems	Primary hepatocytes (multiple species) [44], Genotype-specific iPSC-derived hepatocytes [44]	Provide biologically relevant platforms for cross-species and inter-individual comparison of KE responses under controlled conditions.
KE-Specific Assays	High-content imaging kits (e.g., for lipid accumulation, ROS), qRT-PCR panels for AOP-relevant genes, Functional assays (e.g., BSEP inhibition assays) [44]	Enable quantitative, tiered measurement of KEs at transcriptional, translational, and functional levels to generate dose-response data.
Bioinformatics Tools	DisGeNET [31], Population genomics databases (gnomAD), Variant effect predictors (SIFT, PolyPhen-2)	Facilitate the identification and prioritization of genetic variants linked to AOP genes and DILI phenotypes for hypothesis generation.
Data Analysis & Modeling	Benchmark Dose (BMD) software, Statistical modeling (R/Python), qAOP modeling platforms	Allow quantification of differential sensitivity (via BMD comparison) and the integration of data into predictive quantitative models.

Visualizing Susceptibility within an AOP Network: A Cholestasis Example

The diagram below illustrates how susceptibility factors can be mapped onto an AOP network for drug-induced cholestasis. This visualization makes critical susceptibility nodes and their modulating factors explicit.

An AOP network for drug-induced cholestasis with mapped susceptibility modulators.

The systematic assessment of susceptibility within DILI AOPs, as outlined, provides a actionable blueprint for advancing toxicological science. This approach directly supports the development and acceptance of NAMs by providing a mechanistic basis to interpret differences in in vitro and in silico model responses and to extrapolate findings to human populations [29] [44].

The future of this field lies in the continued FAIRification of AOP data and the development of standardized, quantitative descriptors for KE relationships [29]. Integrating rich data on human genetic diversity, tissue-specific expression, and system dynamics into AOP networks will enable probabilistic risk assessment that moves beyond "average" responses to explicitly model population variability and identify the most vulnerable [44]. For DILI and beyond, embedding susceptibility into the core of AOP research is the critical next step toward precise, predictive, and protective safety sciences.

Overcoming Challenges and Advancing AOPs with Computational and AI Tools

Addressing Knowledge Gaps and Inconsistent Annotations in Existing AOPs

The Adverse Outcome Pathway (AOP) framework is a conceptual structure that organizes biological knowledge into a sequence of causally linked events, from a Molecular Initiating Event (MIE) triggered by a stressor to an Adverse Outcome (AO) of regulatory relevance [27]. This framework provides a systematic approach for translating mechanistic data into predictions of toxicity, supporting chemical risk assessment and the use of New Approach Methodologies (NAMs) [46].

A central challenge in applying AOPs for predictive toxicology is assessing species susceptibility. An AOP developed in one species (e.g., rat or zebrafish) must be extrapolated to others (e.g., human or an untested wildlife species) for regulatory decisions. This extrapolation relies on the conservation of key biological pathways across taxa. However, knowledge gaps and inconsistent annotations within existing AOPs in knowledge bases like the AOP-Wiki create significant uncertainty in these cross-species predictions [46] [27]. Inconsistent annotation of Key Events (KEs) and Key Event Relationships (KERs)—such as varying levels of biological organization, ambiguous definitions, or incomplete empirical support—undermines the confident identification of conserved pathways and quantitative differences in response.

This guide, framed within the thesis context of assessing species susceptibility, details technical strategies for identifying, characterizing, and mitigating these knowledge gaps and inconsistencies. It provides a roadmap for transforming qualitative AOPs into quantitative, cross-species predictive models (qAOPs) suitable for robust chemical safety assessment.

The Current Landscape: Identifying Gaps and Inconsistencies

The public AOP Knowledge Base (AOP-KB), overseen by the Organisation for Economic Co-operation and Development (OECD), is the central repository for AOP development [47]. Despite holding over 300 proposed AOPs, its utility for quantitative, cross-species prediction is limited by several systemic issues [47].

Primary Categories of Knowledge Gaps and Inconsistencies:

Qualitative vs. Quantitative Descriptions: Most AOPs are qualitative narratives. The transition to Quantitative AOPs (qAOPs), which define the mathematical relationships between KEs, is essential for prediction but remains rare [47] [48].
Incomplete Empirical Support for KERs: Many proposed KERs are based on biological plausibility rather than empirical evidence. The strength, dose-response, and temporal dynamics of these relationships are often poorly characterized [27].
Taxonomic Ambiguity and Uncertainty: KEs and KERs are frequently described for a single model species without annotation of their applicability to other taxa. This creates a major hurdle for cross-species extrapolation [27].
Dynamic Complexity: Most AOPs are depicted as linear, static pathways. They fail to capture network interactions (where multiple MIEs converge on a common KE or AO) or the temporal dynamics of biological responses, which are critical for modeling repeated or chronic exposures [47] [48].
Annotation Inconsistency: The same biological event may be described at different levels of organization (e.g., molecular, cellular, tissue) across different AOPs, complicating computational integration and network building.

Table 1: Status of AOP Development in the OECD AOP Knowledge Base (Illustrative Summary)

Development Status	Typical Characteristics	Major Gaps/Inconsistencies	Utility for Species Susceptibility Assessment
Under Development	Preliminary structure; limited evidence.	Missing KER evidence; unspecified taxonomic domain.	Very low. Cannot support extrapolation.
Proposed / Draft	Causal structure defined; some supporting evidence.	Qualitative KERs; empirical data may be from a single species.	Low to moderate. Can form a hypothesis for testing conservation.
Endorsed / OECD-Accepted	Strong weight of evidence; reviewed by OECD.	Well-defined but often still qualitative; taxonomic applicability may be broad but not quantified.	High for hazard identification. Moderate for quantitative extrapolation without a qAOP.
Quantified (qAOP)	Includes mathematical models for KERs (e.g., Bayesian networks).	Data-intensive; models may be species-specific.	Very high. Provides a basis for modeling interspecies differences in sensitivity.

Core Methodology: Quantitative Frameworks for Bridging Gaps

To address qualitative gaps and inconsistent annotations, a shift towards quantitative and probabilistic modeling is essential. These methods formalize assumptions, integrate diverse data, and explicitly characterize uncertainty.

Bayesian Networks for AOP Quantification and Uncertainty

Bayesian Networks (BNs) are probabilistic graphical models consisting of nodes (variables, like KEs) connected by directed edges (causal relationships, like KERs) [47]. They are a natural fit for quantifying AOPs because they:

Accommodate different data types (continuous, discrete).
Propagate uncertainty quantitatively through the network.
Can be run in multiple directions: forward (predictive) from stressor to AO, or backward (diagnostic) from an observed AO to infer probable causes [47].
Can integrate species-specific parameterization to model susceptibility differences.

A BN is quantified using Conditional Probability Tables (CPTs) that define the probability of a child node's state given the states of its parent nodes [47].

A Workflow for Building a Quantitative AOP (qAOP)

The following integrated workflow, synthesizing approaches from recent research, transforms a qualitative AOP into a quantitative, probabilistic model suitable for assessing species susceptibility [47] [48].

dot AOP Quantification and Species Extrapolation Workflow

Step 1: Taxonomic Applicability Assessment

Objective: Determine the taxonomic domain of each KE and KER.
Protocol: Use bioinformatics tools (e.g., EPA's SeqAPASS or orthology databases) to assess the conservation of the molecular targets (MIEs) and functional pathways underlying KEs across species of interest [27]. Annotate the AOP with confidence levels for each KE's applicability to different taxa.

Step 2: Data Collation and Harmonization

Objective: Gather all relevant empirical dose-response and response-response data for the KERs, noting the source species.
Protocol: Systematically review literature for each KER. Extract or curate data on the magnitude, timing, and conditions under which a change in an upstream KE leads to a change in a downstream KE. Standardize units and biological measurements across studies.

Step 3: Bayesian Regression for KER Quantification

Objective: Fit mathematical functions to describe each individual KER.
Detailed Protocol [47]:
- For each KER, select an appropriate dose-response function (e.g., log-logistic, Hill model).
- Implement Bayesian regression to fit the function to the collated data. This yields a posterior distribution for each model parameter (e.g., EC₅₀, slope), explicitly representing uncertainty.
- Use the fitted model to simulate 10,000 response values across a gradient of the upstream KE.
- Species-specific variant: Perform this regression separately for datasets from different species, generating species-specific parameter distributions.

Step 4: Bayesian Network Parameterization

Objective: Integrate the quantified KERs into a cohesive BN model.
Protocol: Use the simulated response values from Step 3 to populate the Conditional Probability Tables (CPTs) for each node in the BN. If species is included as a parent node for a KE, the CPT will encode different probability distributions for the child KE's state based on both the upstream KE's state and the species [47].

Step 5: Model Validation and Uncertainty Analysis

Objective: Evaluate the predictive performance and identify major uncertainty drivers.
Protocol:
- Internal Validation: Use k-fold cross-validation to assess prediction accuracy when running the network from intermediate KEs [47].
- Sensitivity Analysis: Identify which model parameters or inputs (including the "species" variable) most strongly influence the predicted probability of the AO.

Table 2: Comparison of Quantitative AOP Modeling Approaches

Modeling Approach	Key Features	Data Requirements	Strengths for Addressing Gaps	Limitations
Bayesian Network (BN)	Probabilistic; handles uncertainty; bidirectional inference.	Moderate (can work with smaller datasets).	Excellent for integrating diverse evidence and quantifying confidence in KERs. Ideal for probabilistic species comparisons.	Network structure (causality) must be defined a priori.
Dynamic BN (DBN)	Incorporates time slices; models sequences of exposures.	High (requires longitudinal/time-series data).	Directly addresses the gap in modeling chronic/repeated exposure outcomes [48].	More complex to parameterize and compute.
Systems Biology (ODE Models)	Mechanistic; based on biochemical reaction networks.	Very high (requires detailed kinetic parameters).	Provides deep biological insight and can predict perturbations outside training data.	Not feasible for most AOPs due to parameter scarcity; often species-specific.
Bayesian Regression + BN (Hybrid)	Quantifies KERs with regression, integrates via BN.	Low to Moderate (leverages standard ecotoxicological models).	Proof-of-concept shows effectiveness even with data-poor AOPs [47]. Practical for initial quantification.	Relies on the quality and relevance of the fitted regression models.

Advanced Protocol: Dynamic Modeling for Repeated Exposures

Chronic toxicity from repeated, low-dose exposure represents a significant knowledge gap in AOPs. A Dynamic Bayesian Network (DBN) extends the BN framework to model changes over time and can reveal how AOP topology itself may evolve with repeated insults [48].

Detailed DBN Protocol for Chronic Toxicity AOPs [48]:

Define Time-Sliced Network Structure: Replicate the AOP network structure across discrete time points (e.g., exposure events 1, 2, ... E). Connect corresponding KEs across time slices to represent persistence or accumulation of effect.
Generate/Use Longitudinal Data: Utilize data from in vitro or in vivo repeated exposure studies. Critical parameters include donor-to-donor variability in the timing of chronic KE manifestation.
Parameterize the DBN: Learn CPTs for both intra-slice (causal) and inter-slice (temporal) connections from the longitudinal data. This can reveal, for instance, that a KE at time t depends on its own state and the state of its parents at time t-1.
Data-Driven AOP Pruning: Employ statistical techniques (e.g., lasso-based subset selection) on the longitudinal data to identify which causal links (KERs) are strongly supported at different exposure time points. This can show that the causal structure of an AOP is dynamic, with certain pathways becoming more or less relevant over the course of chronic exposure [48].

dot Dynamic Bayesian Network for Repeated Exposure

The Scientist's Toolkit: Essential Research Reagent Solutions

Building and quantifying AOPs requires a suite of conceptual and computational tools.

Table 3: Key Research Reagents and Tools for AOP Development

Tool/Reagent Category	Specific Examples & Functions	Role in Addressing Gaps/Inconsistencies
Bioinformatics & Conservation Analysis	SeqAPASS: Predicts protein sequence and functional domain conservation across species. Orthology Databases (Ensembl, OrthoDB): Identify orthologous genes.	Resolves taxonomic ambiguity. Provides evidence for whether an MIE or KE is likely conserved in an untested species, critical for cross-species extrapolation [27].
Computational Modeling Platforms	R packages (`bnlearn`, `BayesNet`), Netica, GeNIe: Software for constructing, parameterizing, and reasoning with Bayesian Networks.	Enables qAOP development. Transforms qualitative pathways into testable probabilistic models that quantify uncertainty and species differences [47].
Data Curation & Integration Tools	AOP-Wiki Interface, Systematic Review Protocols, Data Harmonization Scripts (Python/R).	Mitigates annotation inconsistency. Forces structured data entry and enables the aggregation and standardization of empirical evidence from disparate sources.
In Vitro New Approach Methodologies (NAMs)	High-throughput screening assays, multiplexed biomarker panels, transcriptomics, CRISPR-modified cell lines.	Generates empirical data for KERs. Provides mechanistic, human-relevant data to fill evidence gaps, especially for chronic or repeated exposure scenarios [48].
Visualization & Communication Standards	Graphviz (DOT language), Cytoscape, Consistent Color Palettes (WCAG compliant).	Improves clarity and accessibility. Standardized, clear visualizations (like those in this document) reduce misinterpretation of AOP structure and relationships.

Visualization Standards for AOP Diagrams

Effective, accessible visualizations are crucial for communicating AOP structures and modeled relationships clearly and without ambiguity.

Color and Contrast Specifications: All diagrams must adhere to the WCAG 2.1 (Level AA) guidelines for contrast [49]. This ensures readability for all users, including those with low vision or color perception deficiencies.

Text Contrast: For any node containing text, the fontcolor must be explicitly set to achieve a contrast ratio of at least 4.5:1 against the node's fillcolor [49] [50].
Arrow/Symbol Contrast: All foreground elements (arrows, lines, symbols) must have sufficient contrast against the background color.
Palette: Use a defined, accessible palette. The diagrams in this document use a modified Google-inspired palette (#4285F4, #EA4335, #FBBC05, #34A853, #FFFFFF, #F1F3F4, #202124, #5F6368) [51] [52] with #F1F3F4 as a default node background and #202124 as default text for high contrast.

Implementation with Graphviz (DOT): The DOT language provides precise control for generating reproducible AOP diagrams. Key principles include:

Explicitly set fontcolor and fillcolor for all styled nodes.
Use labelloc="t" (top) for clear graph titles.
Utilize rankdir (TB for top-bottom, LR for left-right) to control layout direction.
Employ subgraphs (subgraph cluster) to group related nodes, such as time slices in a DBN or modules within an AOP network.

The assessment of species susceptibility represents a critical frontier in modern toxicology and drug safety science. The Adverse Outcome Pathway (AOP) framework provides a structured, mechanistic model for tracing the sequence of biological events from a Molecular Initiating Event (MIE) to an adverse outcome at the organism or population level [53]. A key advancement in this field is the integration of human genetic variability data, particularly Single Nucleotide Polymorphisms (SNPs), to quantitatively inform differential susceptibility within and between populations [30]. This integration directly addresses regulatory mandates, such as those in the Frank R. Lautenberg Chemical Safety Act, which require the explicit consideration of susceptible populations in chemical risk assessment [32] [30].

This technical guide explores the methodologies for incorporating genetic susceptibility data into AOP research. It details how SNP associations derived from genome-wide studies and population genetics can be mapped onto AOP key events, transforming qualitative pathways into quantitative AOPs (qAOPs) that account for human variability. By anchoring genetic evidence within a mechanistic AOP framework, researchers can move beyond descriptive associations to build predictive models of susceptibility, ultimately enhancing the precision of human health risk assessment and the safety profile evaluation of novel therapeutics [53] [54].

Foundational Concepts: SNPs, Population Genetics, and the AOP-DB

Single Nucleotide Polymorphisms (SNPs) as Markers of Susceptibility

SNPs are the most common form of genetic variation in the human genome, involving a substitution of a single nucleotide at a specific locus [55]. In susceptibility research, SNPs can be functionally classified:

Causal Variants: Directly alter gene function (e.g., non-synonymous SNPs in coding regions, SNPs affecting splice sites).
Tagging Variants: In Linkage Disequilibrium (LD) with a causal variant, serving as proxies for association studies [55].

The effect of a SNP on a phenotype is modeled in association studies, where the genotype (often coded as 0, 1, 2 representing the number of minor alleles) is tested for correlation with a trait or disease status [56]. Key metrics for interpreting SNP associations are summarized below.

Table 1: Key Metrics for Interpreting SNP-Trait Associations

Metric	Definition	Interpretation in Susceptibility Context	Typical Threshold
Odds Ratio (OR) / Beta Coefficient (β)	Effect size measuring the change in disease odds (OR) or trait mean (β) per allele copy [55].	Quantifies the magnitude of increased or decreased risk conferred by a genetic variant.	OR > 1 indicates risk allele; β indicates direction and scale of effect.
P-value	Probability of observing the association by chance under the null hypothesis.	Determines statistical significance; must be corrected for genome-wide multiple testing [55].	< 5×10⁻⁸ for genome-wide significance.
Minor Allele Frequency (MAF)	Frequency of the less common allele in a given population [55].	Determines the population impact; common variants (MAF >5%) may explain more population-attributable risk.	Often filtered (e.g., MAF > 1%) in GWAS for power.
Linkage Disequilibrium (LD)	Non-random association of alleles at different loci [55].	Critical for fine-mapping; a significant SNP may tag a region containing the true causal variant.	Measured by r²; high LD complicates causal inference.

Population Structure as a Confounding Factor

Population structure—systematic genetic differences due to ancestry—is a major source of confounding in genetic association studies. It can induce spurious associations when both allele frequency and disease prevalence differ across subpopulations within a cohort [56]. For example, a SNP more common in a population with a higher baseline disease rate may appear associated without being causal. Methods to control for this include:

Genomic Control / Principal Component Analysis (PCA): Using genetic data to calculate ancestry principal components as covariates in regression models [56] [55].
Mixed Models: Incorporating a genetic relatedness matrix to account for both population structure and cryptic relatedness [56].

The Adverse Outcome Pathway Database (AOP-DB): A Central Resource

The U.S. EPA’s AOP-Database (AOP-DB) is an indispensable tool for integrating genetic susceptibility into AOPs [32] [31]. It serves as a translational hub by programmatically linking:

Mechanistic AOP Knowledge: Curated from the AOP-Wiki, detailing MIEs, Key Events (KEs), and Adverse Outcomes (AOs).
Genetic Variant Data: Integrated from sources like GWAS catalogs, GTEx (eQTLs), and the 1000 Genomes Project [32].
Chemical, Disease, and Pathway Information: Creating a networked knowledge base.

This allows researchers to query, for instance, all SNPs associated with genes involved in a "Liver Fibrosis" AOP and retrieve their population-specific allele frequencies, thereby bridging mechanism and variability [53] [31].

Diagram 1: The AOP-DB as an Integrative Hub for Genetic Susceptibility Data

Methodological Integration: From GWAS to qAOPs

Core Experimental Protocol: Genome-Wide Association Study (GWAS)

GWAS is the primary method for identifying SNP-trait associations. A standardized protocol ensures robustness [55]:

Quality Control (QC): Filter samples and SNPs based on:
- Call Rate: Remove samples/SNPs with >5% missing genotypes.
- Minor Allele Frequency (MAF): Exclude rare SNPs (e.g., MAF < 1%).
- Hardy-Weinberg Equilibrium (HWE): Remove SNPs with extreme HWE violation in controls (P < 1×10⁻⁶).
- Relatedness: Identify and remove cryptically related individuals (e.g., PI_HAT > 0.2).
Population Stratification Adjustment: Calculate principal components (PCs) from genotype data and include top PCs as covariates.
Association Testing: Perform logistic (for case-control) or linear (for quantitative traits) regression for each SNP. A standard model is: Phenotype = β₀ + β₁(SNP genotype) + β₂(Covariate₁) + ... + βₙ(PC₁) + ... + ε
Multiple Testing Correction: Apply a genome-wide significance threshold (typically P < 5×10⁻⁸) to account for testing millions of SNPs.
Replication: Validate significant associations in an independent cohort.

Mapping Genetic Associations onto AOPs: The HIBACHI Approach

A novel computational approach leverages artificial intelligence to explore SNP interactions within AOPs [53]. The workflow for a case study on liver cancer (LC) susceptibility is as follows:

AOP and SNP Identification: Query the AOP-DB for all AOPs related to liver cancer (e.g., using "liver" and "hepatocellular" terms). Extract all genes annotated to Key Events in these AOPs, then retrieve all SNPs associated with those genes from integrated databases.
Cohort and Genotype Preparation: From a resource like the UK Biobank (UKBB), define LC case and control cohorts. Use propensity score matching on demographics and comorbidities to balance cohorts. Obtain genotypes for the AOP-derived SNP list for all cohort individuals.
Generative Modeling with HIBACHI: Use the HIBACHI (Heuristic Identification of Biological Architectures for simulating Complex Hierarchical genetic Interactions) software [53].
- Input: Real genotype data (SNP matrix) for the LC cohort.
- Process: HIBACHI uses a genetic programming-based evolutionary algorithm to generate synthetic datasets that mimic the complex, non-linear interactions (e.g., epistasis) in the real data.
- Output: An interpretable generative model (a tree of mathematical operations) that reveals which SNP features and interactions are most critical for replicating the data structure.
Interpretation and Validation: Introspect the best HIBACHI model to identify prominent SNPs and their interactions. This in-silico prioritization can highlight putative risk factors, such as interactions between SNPs in the AHR and ABCB11 genes for LC [53], which can then be validated biologically.

Diagram 2: Workflow for AI-Driven SNP-AOP Integration (HIBACHI Approach)

Accounting for Population Variability in Risk Estimation

To translate SNP associations into population-adjusted risk estimates within an AOP, several quantitative measures are employed:

Polygenic Risk Scores (PRS): Aggregate the effects of many risk-associated SNPs across the genome into a single individual-level score. PRS = Σ (βᵢ * Gᵢ), where βᵢ is the effect size of SNP i from a discovery GWAS, and Gᵢ is the individual's genotype [55].
Population Attributable Risk (PAR): Estimates the proportion of disease incidence in a population attributable to a specific risk allele, factoring in its effect size (OR) and frequency (MAF): PAR = [Pₐ(OR - 1)] / [1 + Pₐ(OR - 1)], where Pₐ is the risk allele frequency [30].
LOD Score: Used primarily in family-based linkage studies, the LOD score (Logarithm of the Odds) assesses the probability of linkage between a genetic marker and a disease locus. A LOD score >3 is considered significant evidence for linkage [57].

Practical Application in Drug Development and Safety Assessment

Genetic Biomarkers for Susceptibility and Safety

Validated genetic susceptibility markers become biomarkers with specific Contexts of Use (COU) in drug development [58]. The FDA's BEST resource categorizes them as:

Susceptibility/Risk Biomarkers: Identify individuals at increased risk of developing a disease (e.g., BRCA mutations for cancer) [58].
Predictive Biomarkers: Identify individuals more or less likely to respond to a drug (e.g., HLA-B57:01 allele predicting abacavir hypersensitivity).
Safety Biomarkers: Indicate potential for adverse effects [58].

Table 2: Genetic Biomarker Categories and Validation Requirements [58]

Biomarker Category	Primary Use in AOP/Drug Dev	Key Validation Requirements
Susceptibility/Risk	Identify subpopulations with higher baseline risk for an AO in an AOP.	Epidemiological evidence, biological plausibility, established causality.
Predictive	Stratify patients in clinical trials based on likelihood of efficacy or toxicity.	High sensitivity/specificity, strong mechanistic link to drug's mode of action.
Safety/Pharmacodynamic	Monitor early key events signaling potential for an adverse outcome.	Demonstrated consistent response to drug action across populations.

The Side Effect Genetic Priority Score (SE-GPS): A Translational Tool

A direct application in drug safety is the Side Effect Genetic Priority Score (SE-GPS), which systematically prioritizes drug targets based on human genetic evidence linked to potential side effects [54].

Data Integration: Consolidates evidence from nine genetic sources (e.g., ClinVar, GWAS, eQTLs) into four features: Clinical Variant, Single Variant, Gene Burden, and GWA Trait.
Model Training: Uses a multivariable regression model on known drug-side effect pairs (e.g., from FDA Adverse Event Reporting System) to weight the contribution of each genetic feature.
Score Calculation: For any gene-phecode (side effect) pair, the SE-GPS is the weighted sum of present genetic evidence. A higher score indicates stronger genetic support for a causal link between modulating that gene and the adverse outcome.

Table 3: SE-GPS Validation Performance for Predicting Drug Side Effects [54]

Validation Metric	Finding	Implication for AOP/Susceptibility
Enrichment of Genetic Features	All four genetic features showed significant association with drug side effects.	Multiple lines of genetic evidence strengthen causality for an AOP key event.
Risk Increase (≥2 lines of evidence)	2.3 to 2.5-fold increased risk of side effects.	Supports using a minimum evidence threshold to prioritize susceptibility factors.
Direction of Effect (SE-GPS-DOE)	Incorporating directionality (risk vs. protective) improved predictive value.	Critical for AOPs: determines if a variant increases or decreases probability of a key event.

Table 4: Key Research Reagent Solutions for Genetic Susceptibility in AOP Research

Item / Resource	Function / Purpose	Example / Source
AOP-Database (AOP-DB)	Central repository linking AOPs, genes, SNPs, chemicals, and diseases for integrative queries.	U.S. EPA (https://aopdb.epa.gov/) [32] [31]
Genome-Wide SNP Array	High-throughput genotyping platform to assay hundreds of thousands to millions of SNPs per individual.	Illumina Global Screening Array, Affymetrix Axiom arrays
Quality Control (QC) Software	Performs essential QC steps on genotype data: missingness, HWE, relatedness, population stratification.	PLINK, R packages (e.g., SNPRelate) [55]
Genetic Association Software	Conducts regression-based association testing, correcting for covariates and relatedness.	PLINK, SAIGE, GCTA [56] [55]
HIBACHI Software	AI-based tool using genetic programming to model complex SNP interactions within biological frameworks.	Command-line utility for generative modeling [53]
Polygenic Risk Score Software	Calculates individual-level PRS from GWAS summary statistics and individual genotype data.	PRSice, LDpred2, PLINK [55]
Population Genetic Reference Data	Provides LD structure and allele frequencies across diverse populations for imputation and analysis.	1000 Genomes Project, gnomAD, HapMap [55]
Biomarker Validation Assay	Analytically validates a discovered genetic biomarker (e.g., a specific SNP) for clinical use.	TaqMan PCR, Sanger Sequencing, NGS panels

The integration of genetic susceptibility into AOP science is rapidly evolving. Key future directions include the development of dynamic, probabilistic qAOP network models that incorporate population-specific SNP frequencies and effect sizes to predict a distribution of adverse outcomes rather than a binary outcome [30]. Furthermore, the expansion of diverse, non-European biobanks will dramatically improve the understanding of population variability and reduce health disparities in risk assessment [56]. The application of advanced AI, as demonstrated by HIBACHI, will be crucial for deciphering the high-order genetic interactions that underlie complex susceptibility [53].

In conclusion, the systematic exploration of SNP associations and population variability within the AOP framework provides a powerful, mechanistic strategy for assessing species susceptibility. By transforming qualitative pathways into quantitative models anchored in human genetic data, this approach meets the pressing need for more precise human health risk assessments and contributes to the development of safer, more targeted therapeutics. The methodologies and resources outlined herein provide a foundational toolkit for researchers to advance this critical intersection of genetics, toxicology, and regulatory science.

The Adverse Outcome Pathway (AOP) framework has emerged as a pivotal, mechanistic tool in modern toxicology and chemical risk assessment. It organizes biological knowledge into a sequential chain of measurable events, from a Molecular Initiating Event (MIE) triggered by a stressor (e.g., a chemical) to an Adverse Outcome (AO) relevant to regulatory decision-making [27]. This structured, modular approach—conceptualized as a network of "biological dominos"—provides the necessary foundation for developing animal-free New Approach Methodologies (NAMs) [27]. However, the manual construction and validation of AOPs from the vast, fragmented biomedical literature remain a formidable, time-consuming bottleneck [40]. Furthermore, a central challenge within the framework is understanding and predicting species susceptibility—why certain species or populations exhibit heightened sensitivity to a stressor, while others are resilient.

This is where Artificial Intelligence (AI), particularly generative modeling and graph data science, is poised to cause a paradigm shift. The international research community has explicitly identified the acceleration of AOP development through AI as a critical goal, evidenced by initiatives like the joint European Commission Joint Research Centre (JRC) and Swiss Centre for Applied Human Toxicology (SCAHT) challenge [59]. Concurrently, the push to make AOP data FAIR (Findable, Accessible, Interoperable, and Reusable) is creating the standardized, machine-actionable data infrastructure essential for AI applications [29] [13].

This whitepaper posits that the integration of AI, specifically Graph Neural Networks (GNNs) and Natural Language Processing (NLP), is not merely an efficiency tool but a transformative methodology for AOP development. It enables the systematic exploration of complex biological networks, the generation of novel mechanistic hypotheses, and—most critically for our thesis—a data-driven, quantitative approach to assessing and extrapolating species susceptibility across the AOP framework.

Core Concepts: AOPs as Networks and AI as the Navigator

The AOP as a Modular, Graph-Based Construct

An AOP is fundamentally a knowledge graph. Its core components are [27]:

Nodes (Key Events, KEs): Measurable biological changes at different levels of organization (molecular, cellular, tissue, organ, organism).
Edges (Key Event Relationships, KERs): Causal or correlative links between KEs, supported by evidence of biological plausibility, empirical support, and quantitative understanding.

This modularity allows individual AOPs to interconnect via shared KEs, forming expansive AOP networks that more accurately reflect the complexity of biological systems [27]. This graph-based representation is inherently compatible with graph data science and machine learning.

AI and Graph Data Science: A Synergistic Pairing for AOPs

Two branches of AI are particularly relevant:

Natural Language Processing (NLP) & Generative AI: These tools can "read" and synthesize the millions of scientific publications in databases like PubMed to identify potential stressor-event and event-event pairs, dramatically accelerating the initial evidence-gathering phase of AOP development [40].
Graph Neural Networks (GNNs): As a leading AI trend, GNNs are specialized for learning from graph-structured data [60]. In the context of AOPs, GNNs can be applied to:
- Predict novel connections within or between AOP networks.
- Prioritize KERs based on inferred strength or biological conservation.
- Integrate multi-omics data (transcriptomics, proteomics) directly onto the AOP graph to predict tissue or species-specific pathway perturbations.
- Generate in-silico simulations of network dynamics under different stressor conditions.

Table 1: Core AOP Components and Corresponding AI Integration Areas

AOP Component	Description	AI/Graph Science Application
Molecular Initiating Event (MIE)	Initial interaction between stressor and biomolecule [27].	NLP for literature mining; Generative models for predicting novel stressor-target interactions.
Key Event (KE)	Measurable, essential biological change [27].	GNNs for node (KE) classification and embedding within a broader biological network.
Key Event Relationship (KER)	Causal link describing how one KE leads to another [27].	GNNs for link prediction; AI for quantifying evidence strength and uncertainty.
Adverse Outcome (AO)	Regulatory-relevant harm at organism or population level [27].	Predictive modeling using upstream KE data to forecast AOs.
AOP Network	Interlinked AOPs sharing common KEs/KERs [27].	GNNs for network analysis, identifying critical hubs, and predicting cross-pathway effects.

Technical Implementation: Methodologies and Protocols

Protocol: AI-Driven Literature Mining with AOP-helpFinder

AOP-helpFinder is a prototypical AI tool that demonstrates the practical application of NLP and graph theory to AOP development [40].

Objective: To systematically mine PubMed abstracts for co-occurrences of user-defined keywords (e.g., a stressor and a biological event) to rapidly generate evidence-based hypotheses for AOP construction.

Workflow:

Query Formulation: The user defines a list of terms for two concepts (e.g., "ionizing radiation" as a stressor and "small head size" as an adverse outcome).
Automated Literature Search & Processing: The tool searches PubMed, retrieves abstracts, and performs text normalization (lemmatization).
Graph-Based Analysis: It constructs a co-occurrence network from the abstracts, identifying and ranking associations between the query terms and other biomedical entities.
Hypothesis Generation & Visualization: The tool outputs prioritized stressor-event and event-event pairs. The latest version can automatically design and visualize interactive AOP networks based on the results [40].
Expert Curation: The final, critical step involves domain experts evaluating the AI-generated associations by reviewing the full-text publications to validate and refine the proposed AOP (e.g., AOP 441 for radiation-induced microcephaly) [40].

Table 2: Quantitative Analysis of AI Tool Performance in AOP Contexts

Tool / Model	Application Area	Reported Performance Metric	Source / Benchmark
AOP-helpFinder	Literature mining for AOP KE/KER discovery	Successful deployment in EU projects (OBERON, PARC, RadoNorm); enabled proposal of AOP 441 [40].	Case studies in toxicological research
GraphSAGE	General graph representation learning (analogous to AOP networks)	Outperformed baselines by avg. 51% on classification F1-score; 100x faster inference than DeepWalk [60].	Benchmark citation graphs
PinSage (GNN)	Recommendation system on large graph (≈18B edges)	150% improvement in hit-rate, 60% improvement in MRR over prior model [60].	Pinterest content graph
GNoME (GNN)	Materials discovery at atomic scale (graph of atoms)	Discovered 2.2 million new stable crystal structures [60].	Density Functional Theory validation

Protocol: Implementing the FAIR AOP Roadmap for AI Readiness

The FAIR AOP Roadmap for 2025 outlines the technical and collaborative steps required to make AOP data AI-ready [29] [13].

Objective: To establish standardized, machine-actionable formats for AOP data and metadata, enabling reliable integration, reuse, and computational analysis by AI tools.

Key Methodological Steps:

Development of FAIR Enabling Resources: This involves creating tools and platforms that implement standardized data models for AOPs. The goal is to move beyond the AOP-Wiki as a document repository toward a structured knowledge graph [29].
Consensus on Bioinformatic Methods: International collaboration (e.g., through the FAIR AOP Cluster Workgroup) to define how diverse data types (omics, in vitro assay results, legacy toxicology data) are mapped to AOP KEs and KERs [29] [13].
Creation of an AOP "Data Domain": Treating AOPs as a distinct, well-defined data domain with clear semantics. This allows for precise linkage to other biomedical ontologies and databases, which is crucial for training AI models on integrated datasets [13].
Piloting AI Applications: Initiatives like the JRC/SCAHT challenge are operationalizing this roadmap by tasking researchers with applying state-of-the-art AI tools (e.g., LLMs, agentic AI) to defined AOP development problems, thereby creating best practice protocols [59].

Table 3: FAIR AOP Roadmap Phases and Milestones (2025-2026)

Phase	Primary Objective	Key Activities & Outputs	Stakeholders
Standardization & Tooling	Make existing AOP data machine-actionable.	Develop FAIR Implementation Profiles; enhance AOP-Wiki backend; create mapping tools for assay data [29].	FAIR AOP Cluster, OECD, AOP-Wiki developers
Community Challenge & Pilot	Explore and validate AI's utility for AOPs.	Run the "Boosting AOP with AI" challenge; document workflows and best practices [59].	JRC, SCAHT, AI/tox researchers
Integration & Scale	Embed AI-tools into mainstream AOP development.	Integrate successful AI prototypes into community platforms; update AOP-Wiki to version 3.0 with AI features [29] [59].	AOP-KB, regulatory agencies, research consortia

AI-Aided Assessment of Species Susceptibility: A Core Thesis Application

Assessing species susceptibility within an AOP context involves determining if and how the pathway is conserved across species, and identifying the quantitative differences in response (e.g., varying sensitivity of a molecular target, differential metabolic capacity) that lead to altered outcomes [27]. AI transforms this from a manual, comparison-heavy task into a predictive, systems-level analysis.

Workflow for AI-Aided Susceptibility Assessment:

Data Integration: Construct a cross-species AOP knowledge graph. Nodes (KEs) are annotated with data from multiple species (e.g., gene sequences, protein expression, physiological baselines). Edges (KERs) are weighted with evidence strength and, where available, quantitative parameters.
Conservation Analysis with GNNs: Apply GNNs to this graph to learn species-invariant and species-specific patterns. For example, a tool like SeqAPASS can analyze protein sequence similarity to infer conservation of MIEs (e.g., an estrogen receptor) [27]. GNNs can extend this concept to entire pathway modules.
Hypothesis Generation on Sensitive Nodes: The AI model can identify "susceptibility nodes"—KEs where small interspecies variations are predicted to cause large divergences in the downstream network outcome. This pinpoints where to focus experimental validation.
Quantitative Extrapolation Modeling: Using generative or probabilistic graph models, researchers can simulate the AOP network's behavior in an under-studied "target" species (e.g., an endangered fish) based on data from a well-studied "source" species (e.g., a laboratory model), providing a quantitative estimate of differential susceptibility.

Table 4: Research Reagent Solutions for AI-Enhanced AOP Development

Item / Tool	Primary Function	Relevance to Species Susceptibility Assessment
AOP-helpFinder [40]	NLP-powered literature mining to identify potential KEs and KERs from PubMed.	Discovers reported species-specific adverse effects or mechanistic differences.
Graph Neural Network (GNN) Frameworks (e.g., PyTorch Geometric, DGL)	Libraries for building and training models on graph-structured data [60].	Core technology for analyzing the cross-species AOP network and predicting susceptibility factors.
FAIR AOP-Wiki & Associated Tools [29] [13]	Central repository for curated, machine-readable AOP data following FAIR principles.	Provides the essential, standardized knowledge graph upon which susceptibility models are built.
SeqAPASS [27]	In silico protein sequence comparison tool to assess conservation of molecular targets (MIEs).	Directly evaluates the foundational conservation of an AOP's initial step across species.
Multi-omics Databases (e.g., GenBank, UniProt, GEO)	Sources of species-specific molecular data (genomic, transcriptomic, proteomic).	Provides the data to annotate AOP KEs with species-specific information for quantitative comparison.
Agentic AI / LLM Platforms (e.g., Claude Code, specialized research agents) [61]	AI systems capable of planning and executing multi-step research workflows, including code generation and data analysis.	Automates the integration of data from diverse sources and the execution of complex modeling pipelines for high-throughput susceptibility screening.

The integration of generative AI and graph data science into AOP development marks a decisive move from a descriptive, manual framework to a predictive, computational discovery engine. Tools like AOP-helpFinder demonstrate the tangible gains in efficiency, while the FAIR AOP roadmap and GNN-based breakthroughs in adjacent fields like materials science chart the course for transformative future applications [60] [29] [40].

For the core thesis of assessing species susceptibility, this AI-driven paradigm offers a powerful, mechanistic alternative to traditional extrapolation factors. By treating AOPs as computable cross-species knowledge graphs, researchers can move beyond qualitative statements of conservation to generate quantitative, testable predictions about which species are most at risk and why. The ongoing work to FAIRify AOP data and the exploration spurred by initiatives like the JRC/SCAHT challenge are critical steps in building the community-wide infrastructure and expertise needed to realize this future, ultimately leading to more precise and protective chemical risk assessments for all species [59] [13].

Optimizing AOPs for New Approach Methodologies (NAMs) and Defined Approaches for Testing

The assessment of species susceptibility represents a critical challenge in modern toxicology and drug development. Adverse Outcome Pathways (AOPs) have emerged as a powerful conceptual framework to address this challenge by providing a structured representation of the sequence of biological events leading from a molecular initiating event (MIE) to an adverse outcome (AO) [2]. An AOP functions like a series of biological dominos, where a stressor triggers a molecular interaction, initiating a cascade of measurable key events (KEs) at increasing levels of biological organization [27]. This framework is inherently modular and generalizable, meaning it is not specific to a single stressor or species, but instead depicts a sequence of effects expected for any stressor that triggers a particular MIE [27].

Optimizing AOPs for New Approach Methodologies (NAMs) involves refining these pathways to maximize their utility in human-relevant, non-animal testing strategies. NAMs encompass a diverse suite of in vitro, in silico, and in chemico methods designed to improve chemical safety assessment [62] [63]. Defined Approaches (DAs), which are specific combinations of NAMs with fixed data interpretation procedures, represent a mature application of this optimized framework for regulatory decision-making [62] [64]. The central thesis of this guide is that the systematic optimization of AOPs—through quantification, confidence assessment, and network assembly—provides the mechanistic backbone necessary to reliably extrapolate toxicological findings across species and identify genetic and biological determinants of susceptibility. This forms the cornerstone of a new paradigm in predictive toxicology.

Core Principles for AOP Optimization

Optimizing an AOP requires moving from a qualitative description to a quantitative, reliable, and well-characterized tool suitable for integration with NAMs. This process involves several key steps.

From Qualitative Pathways to Quantitative Models

A qualitative AOP outlines the sequence of KEs but lacks predictive power. Optimization demands establishing quantitative key event relationships (qKERs). These are mathematical descriptions of how a change in the magnitude or timing of an upstream KE influences a downstream KE [65]. Quantitative understanding is one of the three pillars of evidence required to define a KER, alongside biological plausibility and empirical support [27]. Methods for quantification include dose-response modeling, time-course analyses, and the development of computational models such as ordinary differential equations or Bayesian networks [44]. Quantitative AOPs (qAOPs) are essential for determining points of departure for risk assessment and for making quantitative predictions about adversity based on NAM data.

Assessing Confidence and Essentiality

The utility of an AOP depends on the scientific confidence in its components and structure. Confidence is evaluated through two complementary approaches, as summarized in Table 1.

Table 1: Frameworks for AOP Confidence Assessment

Assessment Framework	Core Focus	Key Evaluation Criteria
Bradford-Hill Criteria [65]	Weight-of-evidence for causal linkages	Concordance of dose-response relationships; temporal concordance among KEs; strength, consistency, and specificity of association; biological plausibility; consideration of alternative mechanisms.
OECD Key Questions [65]	Overall confidence and applicability	How well characterized is the AOP? How well are MIE and KEs causally linked to the AO? What are the limitations in the evidence? Is the AOP specific to certain tissues or life stages? Are the MIE and KEs conserved across taxa?

A critical related concept is essentiality, which posits that if a KE is prevented, the AO will not occur [27]. Demonstrating essentiality, often through genetic or pharmacological inhibition studies, significantly strengthens the AOP and its value for targeted testing with NAMs.

Building Modular Networks and Ensuring FAIR Compliance

Individual AOPs are simplifications of biology. Real-world outcomes often arise from interconnected pathways. AOP networks are created by linking individual AOPs through shared KEs or MIEs, providing a more realistic and comprehensive model of toxicological complexity [44] [27]. This network approach is crucial for understanding mixture toxicity and species-specific pathway variations.

To maximize their impact, optimized AOPs must be Findable, Accessible, Interoperable, and Reusable (FAIR) [66]. FAIR principles ensure AOPs are machine-actionable, facilitating their integration into computational workflows, AI/ML analyses, and regulatory decision-support systems. This involves using standardized ontologies, rich metadata, and persistent identifiers within repositories like the AOP-Wiki [66].

Diagram: Core Architecture of an Optimized AOP Network

Integration with New Approach Methodologies (NAMs)

An optimized AOP provides the mechanistic map that guides the development, selection, and interpretation of NAMs. A NAM designed for hazard identification can be defined as an in vitro system coupled with a battery of assays that measure KEs within a relevant AOP network [44].

Aligning NAMs with AOP Key Events

Different NAM technologies are suited to measuring KEs at specific levels of biological organization:

Molecular/Cellular KEs: Measured using high-content screening, transcriptomics (e.g., TempO-Seq), and targeted in chemico assays like the Direct Peptide Reactivity Assay (DPRA) for skin sensitization [62] [63].
Tissue/Organ KEs: Assessed using advanced in vitro models such as 3D spheroids, organoids, and microphysiological systems (MPS or Organ-on-a-Chip) [63] [44]. For example, a liver-on-a-chip model can measure KE like triglyceride accumulation (steatosis) or secretion of inflammatory cytokines.
Organism-Level Prediction: Achieved by integrating NAM data with Physiologically Based Kinetic (PBK) models and quantitative in vitro to in vivo extrapolation (QIVIVE) within the AOP framework to predict systemic exposure and AO likelihood [62] [63].

Defined Approaches (DAs) as a Regulatory Application

A Defined Approach is a formally validated integration of NAMs within an AOP context. It specifies a fixed set of information sources (e.g., specific in chemico and in vitro tests) and a prescribed data interpretation procedure to classify a chemical for a given endpoint, without expert judgment [62]. Successful DAs have been adopted by the OECD for skin sensitization (TG 497) and eye damage/irritation (TG 467) [62] [64]. The 2025 OECD Test Guideline updates further expanded these DAs, for example, by allowing new in vitro methods as alternate data sources in TG 497 and extending the applicability of TG 467 to surfactants [64]. DAs exemplify how optimized AOPs can be operationalized for consistent, animal-free regulatory decision-making.

Diagram: AOP-Informed NAM Integration & Testing Workflow

Defined Approaches and Experimental Protocols for Hepatotoxicity

Liver toxicity serves as an exemplary case study for applying optimized AOPs within NAM-based testing strategies [65] [44]. Well-characterized AOP networks exist for steatosis, fibrosis, and cholestasis.

AOP-Informed Defined Approach for Screening Hepatotoxicants

A proposed tiered testing strategy anchored in liver AOP networks is outlined in Table 2. This strategy progresses from high-throughput, lower-complexity assays to lower-throughput, high-fidelity human in vitro models [44].

Table 2: Tiered NAM Testing Strategy for Liver Toxicity Based on AOP Networks

Tier	AOP Network KE Targeted	Recommended NAM Assays/Models	Measured Endpoint (Example)	Throughput
Tier 1: Initial Screening	Molecular/Cellular KEs (e.g., Nuclear receptor activation, Oxidative stress)	High-content imaging in 2D hepatocytes; Transcriptomic screening (TempO-Seq); Cell viability assays (ATP content).	Translocation of transcription factors; Gene expression signatures of stress; Cell viability IC50.	High
Tier 2: Mechanistic Evaluation	Cellular/Tissue KEs (e.g., Lipid accumulation, Mitochondrial dysfunction, Bile acid transport inhibition)	3D hepatocyte spheroids; Stem cell-derived hepatocyte-like cells (iPSC-HLCs); Transporter inhibition assays.	Intracellular triglyceride accumulation (Oil Red O staining); Oxygen consumption rate (Seahorse); Bile acid accumulation.	Medium
Tier 3: Advanced Phenotypic Confirmation	Tissue/Organ KEs (e.g., Steatosis, Cholestasis, Inflammation)	Microphysiological Systems (MPS / Liver-on-a-chip) with non-parenchymal cells; Precision-cut liver slices (PCLS).	Albumin/urea secretion; release of inflammatory cytokines (IL-6, IL-8); histopathological markers.	Low

Detailed Experimental Protocol: Measuring a KE in a Liver Steatosis AOP

The following protocol details the measurement of a pivotal KE—intracellular triglyceride accumulation—within an AOP for drug-induced liver steatosis, using a 3D human hepatocyte spheroid model [65] [44].

Objective: To quantify lipid accumulation in hepatocyte spheroids following exposure to a test compound, as a measure of the KE "Increased triglyceride accumulation in hepatocytes" leading to the AO of liver steatosis.

Materials:

Cellular Model: Cryopreserved primary human hepatocytes (PHHs) or induced pluripotent stem cell-derived hepatocyte-like cells (iPSC-HLCs).
Culture Platform: Ultra-low attachment U-bottom 96-well spheroid formation plates.
Culture Medium: Hepatocyte maintenance medium, supplemented as per provider's instructions.
Test Compound & Controls: Compound of interest dissolved in DMSO (final concentration ≤0.1%). Positive Control: 500 μM sodium valproate. Vehicle Control: 0.1% DMSO.
Staining Reagents: 4% formaldehyde (fixative), 60% isopropanol, Oil Red O working solution (0.3% in 60% isopropanol), Hematoxylin for counterstaining.
Analysis Equipment: Automated brightfield and fluorescence microscope or high-content imaging system. Plate reader for spectrophotometric quantification.

Procedure:

Spheroid Formation: Seed PHHs or iPSC-HLCs at 1,000-1,500 cells/well in 100 μL of maintenance medium into U-bottom plates. Centrifuge plates at 300 x g for 3 minutes to aggregate cells. Incubate at 37°C, 5% CO₂ for 72-96 hours to form compact spheroids.
Compound Exposure: After spheroid formation, treat with the test compound, positive control, and vehicle control in fresh medium. Incubate for 72 hours, with medium change at 48 hours.
Fixation and Staining: a. Aspirate medium and wash spheroids once with PBS. b. Fix with 4% formaldehyde for 30 minutes at room temperature. c. Wash twice with PBS. d. Permeabilize and condition with 60% isopropanol for 5 minutes. e. Stain with Oil Red O working solution for 20 minutes at room temperature. f. Destain with 60% isopropanol until background is clear (2-3 quick washes). g. Wash twice with distilled water. h. Optionally, counterstain nuclei with hematoxylin for 1-2 minutes and wash.
Image Acquisition & Analysis: a. Acquire high-resolution brightfield images of entire spheroids using a 10x objective. b. For Quantitative Analysis: Elute the Oil Red O dye from the spheroids by adding 100 μL of 100% isopropanol per well with gentle shaking for 10 minutes. c. Transfer the eluate to a clear flat-bottom 96-well plate and measure the absorbance at 510 nm using a plate reader.

Data Interpretation: A statistically significant increase in Oil Red O staining intensity (via image analysis) or absorbance (≥1.5-fold over vehicle control) indicates a positive response for this KE. This data should be integrated with results from other KEs (e.g., gene expression of lipogenic enzymes, mitochondrial function assays) within the AOP network to build a weight-of-evidence for steatotic potential.

The Scientist's Toolkit: Essential Reagents & Platforms

The implementation of AOP-optimized NAMs relies on specific, validated tools.

Table 3: Research Reagent Solutions for AOP-Driven Hepatotoxicity Testing

Item / Solution	Function in AOP/NAM Context	Example & Key Application
Primary Human Hepatocytes (PHHs)	Gold-standard in vitro model providing human-relevant metabolic and transcriptional responses. Essential for measuring KEs related to xenobiotic metabolism and toxicity.	Thermo Fisher Scientific, BioIVT. Used in 2D, 3D spheroid, and MPS formats to assess KEs like transporter inhibition, CYP450 induction, and triglyceride accumulation.
iPSC-Derived Hepatocyte-Like Cells (iPSC-HLCs)	Provide a scalable, genetically diverse, and human-relevant cell source. Useful for studying genetic susceptibility factors within an AOP [67].	Cellular Dynamics International (Fujifilm), Stemcell Technologies. Enable population-based studies and long-term repeated dose testing in spheroid or organoid formats.
Liver-on-a-Chip (Microphysiological System)	Physiologically advanced model incorporating fluid flow, multiple cell types (hepatocytes, Kupffer, stellate cells), and mechanical cues. Models tissue-level KEs.	Emulate, Inc., Mimetas, CN Bio. Used in Tier 3 testing to confirm phenotypic AOs like steatosis, cholestasis, and inflammation in a human-relevant system [63].
Direct Peptide Reactivity Assay (DPRA)	In chemico assay that measures the covalent binding of chemicals to peptides, representing the MIE for the skin sensitization AOP. A component of OECD TG 497 Defined Approaches [62] [64].	Standardized OECD TG 442C. Used to predict the skin sensitization potential of chemicals without animal testing, based on a defined AOP.
TempO-Seq Platform	Targeted transcriptomics platform enabling high-throughput gene expression analysis. Measures molecular KEs (altered gene expression) across hundreds of samples.	BioClio. Used for high-throughput screening against AOP-derived gene expression signatures for pathways like oxidative stress, endoplasmic reticulum stress, and nuclear receptor activation.
AOP-KB and AOP-Wiki	Central repository and collaborative platform for AOP development, sharing, and FAIRification. Provides the essential knowledge backbone for NAM design [65] [66] [27].	https://aopwiki.org/ The foundational resource for accessing established AOPs, building new ones, and ensuring interoperability of mechanistic data.

Future Directions: AI, Genetic Susceptibility, and Regulatory Harmonization

The future of optimized AOPs and NAMs lies in embracing computational advances and resolving translational barriers. A primary research frontier is the systematic integration of genetic susceptibility data into AOPs. For instance, genetic variants in genes like AHR (aryl hydrocarbon receptor) and ABCB11 (bile salt export pump) have been identified as modifiers of individual susceptibility to liver cancer and cholestatic injury within their respective AOPs [67]. Future AOPs will incorporate such susceptibility-modifying factors as annotatable nodes, enabling NAMs to be tailored using genetically diverse cell lines (e.g., from iPSC biobanks) to assess population variability.

Artificial Intelligence (AI) and Graph Data Science are powerful tools for this optimization. AI can analyze complex AOP networks to predict novel KERs, identify critical susceptibility nodes, and generate synthetic data to explore genetic interactions [67]. Machine learning models can also integrate multimodal NAM data (transcriptomics, imaging, functional endpoints) mapped onto an AOP network to improve AO prediction.

Finally, regulatory harmonization remains critical. While OECD test guidelines for DAs are a major success [64], a unified, cross-industry framework for validating more complex NAMs for systemic toxicity is needed [16]. This framework must be based on fitness-for-purpose and human biological relevance, rather than solely on correlation with historical animal data, to fully realize the potential of AOP-optimized, human-centric safety assessment.

Establishing Confidence: Weight-of-Evidence and Quantitative Approaches for Reliable Predictions

Applying Bradford-Hill Criteria for Weight-of-Evidence Assessment of AOPs

The Adverse Outcome Pathway (AOP) framework has emerged as a critical organizing principle in modern toxicology and chemical risk assessment. An AOP describes a conceptual sequence of measurable key events (KEs), beginning with a Molecular Initiating Event (MIE)—the initial interaction of a stressor with a biological target—and progressing through linked biological changes to an Adverse Outcome (AO) relevant to risk assessment [4] [2]. As a chemically agnostic construct, an AOP provides a structured way to translate data from New Approach Methodologies (NAMs)—including in silico models, high-throughput in vitro assays, and targeted in vivo tests—into predictions of apical outcomes for human health and ecological species [4].

A central challenge is establishing scientific confidence in the postulated causal linkages within an AOP. Confidence determines the utility of an AOP for regulatory decision-making, such as prioritizing chemicals for testing, supporting integrated testing strategies, or informing quantitative risk assessments [68] [69]. The Weight of Evidence (WoE) assessment provides the methodological rigor for this evaluation. Internationally, guidance from the Organisation for Economic Co-operation and Development (OECD) advocates for applying tailored Bradford-Hill (BH) considerations to systematically assess the confidence in an AOP’s Key Event Relationships (KERs) and the essentiality of its KEs [68] [70].

This guide details the application of these tailored BH criteria within the specific context of assessing species susceptibility. A core thesis in AOP research is that susceptibility—the differential response to a stressor across species, populations, or life stages—can be understood and predicted by examining the conservation and modulation of AOP components. A robust WoE assessment grounded in BH principles is therefore not merely an academic exercise; it is a practical tool for defining the applicability domain of an AOP, bridging knowledge from model organisms to humans or across ecological species, and ultimately strengthening the scientific basis for protecting susceptible populations [71].

Core Methodology: Tailored Bradford-Hill Considerations for AOPs

The traditional Bradford-Hill criteria for epidemiological causation have been adapted by the OECD into a focused set of considerations for evaluating the mechanistic plausibility and empirical support within an AOP [68] [70]. These are applied primarily at the level of Key Event Relationships and the essentiality of Key Events.

Table: Tailored Bradford-Hill Considerations for AOP Weight-of-Evidence Assessment

Bradford-Hill Consideration	Definition in AOP Context	Key Questions for Assessment	Type of Evidence Sought
Biological Plausibility	The extent to which a KER is consistent with established biological knowledge.	Is there a well-understood biological mechanism linking the upstream and downstream KE? Is the relationship consistent across different biological contexts?	Evidence from established pathway databases (e.g., KEGG, Reactome), scientific literature on mechanism, computational models of biological networks [68] [32].
Empirical Support	The strength, consistency, and specificity of experimental or observational data supporting the KER.
• Dose-Response	Demonstration that the intensity or frequency of the upstream KE influences the downstream KE.	Do concurrent, graded changes occur in both KEs across a range of stressor doses?	Data from studies measuring both KEs in the same system across multiple concentrations [68] [69].
• Temporality	Demonstration that the upstream KE occurs before the downstream KE.	Does the upstream KE reliably precede the downstream KE in time-course studies?	Data from longitudinal or time-series experiments tracking the progression of KEs [68].
• Incidence/Consistency	The reproducibility of the KER across different studies, systems, and stressors.	Is the relationship observed consistently in independent studies, using different models or chemicals that share the same MIE?	Evidence from multiple independent laboratories, various model organisms, or a suite of structurally similar stressors [68] [72].
Essentiality	Evidence that modulating a KE (e.g., inhibiting, enhancing, or blocking it) prevents or alters the progression to the downstream KE and/or the AO.	If the KE is prevented or its function is restored, is the progression of the pathway halted or mitigated?	Data from genetic knockout/knockdown models, pharmacological inhibitors, or rescue experiments [68] [70].

Implementing the Assessment: A Tiered Workflow

A systematic WoE assessment follows a logical workflow, progressing from assembling evidence to synthesizing a confidence conclusion. The following diagram illustrates this process, integrating the BH considerations with data sourcing and confidence grading.

Workflow Description:

Evidence Assembly: The process begins with a defined AOP. Evidence is gathered from structured sources like the AOP Knowledge Base (AOP-KB) and primary literature, often facilitated by computational tools like AOP-helpFinder [72] [73].
BH Application: Each KER is evaluated against the three pillars: Biological Plausibility, Empirical Support (with its sub-components), and the Essentiality of the upstream KE [68] [70].
WoE Synthesis: The strength of evidence across all criteria is integrated to assign an overall confidence level (e.g., High, Moderate, Low) for the KER and the overall AOP. Quantitative Multi-Criteria Decision Analysis (MCDA) models can formalize this synthesis [68].
Outcome Definition: The confidence conclusion informs the AOP's suitable applications and highlights critical data gaps for future research [72].

Quantitative Extensions: From Qualitative to Quantitative AOPs

While BH assessments often start qualitatively, they enable the development of Quantitative AOPs (qAOPs). A qAOP incorporates mathematical relationships describing the dose-response or response-response dynamics between KEs, which is essential for predicting the point of departure for an AO [69].

Table: Methodologies for Quantifying Key Event Relationships

Methodology	Description	Application in AOP Development	Example Reference
Benchmark Dose (BMD) Modeling	A statistical method to determine the dose (BMD) that produces a predetermined change in response (BMR) compared to background.	Used to derive points of departure (PoDs) for individual KEs, allowing sensitivity comparison across events in a pathway [72].	Applied to identify the most sensitive KE in a gamma radiation AOP network for Lemna minor [72].
Structural Equation Modeling (SEM)	A multivariate statistical analysis technique that tests and estimates causal relationships using a combination of statistical data and qualitative causal assumptions.	Models multiple KERs simultaneously within an AOP network, estimating the strength of direct and indirect pathways leading to the AO [72].	Used to analyze the interconnected KERs in an ionizing radiation AOP network [72].
Multiple Nonlinear Regression Modeling (MNLRM)	Fits nonlinear functions to describe the relationship between an independent variable (e.g., upstream KE) and a dependent variable (e.g., downstream KE).	Quantifies specific, potentially nonlinear, response-response relationships between a pair of KEs [72].	Employed to model the relationship between chlorophyll content and growth rate in a plant AOP [72].
Ordinary Differential Equation (ODE) Models	A system of equations that describe the time-dependent rates of change of biological variables within a system.	Captures dynamic, feedback-regulated biological processes (e.g., endocrine axes) for high-fidelity qAOPs [69].	Used in a qAOP for aromatase inhibition leading to reproductive dysfunction in fish [68] [69].

Integrating Species Susceptibility into the WoE Assessment

A primary objective in contemporary AOP research is to contextualize pathways relative to species-specific susceptibility. A WoE assessment must therefore evaluate not only if a pathway exists, but also how its function and sensitivity might differ across species of interest (e.g., from rodent to human, or across aquatic taxa). This is addressed through a dedicated Human and Species Relevance Assessment workflow [71].

A Workflow for Assessing Cross-Species Relevance

The following workflow refines established frameworks (e.g., the WHO/IPCS Mode of Action framework) to systematically assess the relevance of an AOP developed in one species for application to another (typically human) [71].

Workflow Logic:

Qualitative Assessment: The first step determines if the fundamental biology underpinning the AOP's MIE and KEs is present in the target species. This involves examining the conservation of molecular targets (e.g., receptors, enzymes), signaling pathways, and tissue/organ functions [71] [32].
Empirical Assessment: The second step seeks direct evidence that the pathway can be activated in the target species. This includes epidemiological associations, in vivo data, or in vitro data using relevant cells or tissues from the target species [71].
Evidence Integration: Biological and empirical lines of evidence are combined. Strong evidence in both areas supports high confidence in cross-species relevance. If direct empirical data is lacking, a judgment based on strong biological conservation and evolutionary principle may still support plausible relevance, albeit with lower confidence and a defined data gap [71].

Conducting this assessment relies on specific bioinformatics and experimental resources that provide comparative biological data.

Table: Key Data Sources for Cross-Species AOP Relevance Assessment

Resource / Tool	Primary Function	Utility in Susceptibility/WoE Assessment
EPA AOP Database (AOP-DB) [32]	Integrates AOP information with gene, chemical, disease, and population genetics data.	Links AOP KEs to specific gene targets and identifies known functional genetic variants (SNPs) in human populations that may confer susceptibility [32].
AOP-Wiki [4] [2]	The central, collaborative repository for AOP development and sharing.	Provides the canonical description of an AOP, including associated stressors, evidence supporting KERs, and links to relevant assays—the starting point for any assessment.
Homologene/Orthology Databases [32]	Provide mappings of orthologous genes across species.	Essential for determining if the molecular target of an MIE or a KE is conserved between the source and target species.
Tissue-specific Expression Atlases (e.g., Human Protein Atlas, GTEx) [71] [32]	Provide data on gene and protein expression levels across different tissues and cell types.	Informs whether the molecular machinery of an AOP is expressed in the relevant target tissue of the species of concern (e.g., human liver vs. rat liver).
AOP-helpFinder [72] [73]	A text-mining tool that uses natural language processing to scan scientific literature for associations between stressors and KEs.	Accelerates the gathering of empirical evidence to support or refute KERs across species by systematically mining published literature.

Practical Implementation and the Research Toolkit

Successfully applying BH criteria requires a combination of structured guidance, computational tools, and practical reagents. The following toolkit synthesizes the essential components for researchers.

Category	Item / Reagent Solution	Function & Explanation
Guidance Documents	OECD Users' Handbook Supplement for Developing and Assessing AOPs [68] [70]	Provides the formal international guidance on applying tailored Bradford-Hill considerations for WoE assessment.
Knowledge Bases	AOP Knowledge Base (AOP-KB) & AOP-Wiki [4] [2]	Foundational platforms for accessing, developing, and sharing AOPs. The AOP-Wiki is the primary entry point for existing AOP descriptions and evidence.
Computational Evidence Tools	AOP-helpFinder [72] [73]	An AI-informed text-mining tool that scans PubMed abstracts to identify potential connections between stressors/chemicals and key events, streamlining literature review for empirical support.
Data Integration Platforms	EPA AOP Database (AOP-DB) [32]	A resource that connects AOP components to extensive external data (genes, chemicals, pathways, population variants), crucial for understanding mechanistic conservation and human susceptibility.
Quantitative Modeling Software	R/BMD Software, SEM Software (e.g., lavaan), ODE Solvers (e.g., R, MATLAB)	Statistical and computational environments for performing Benchmark Dose modeling, Structural Equation Modeling, and dynamic systems modeling to build qAOPs.
Comparative Biology Databases	Homologene [32], Ensembl, UCSC Genome Browser	Databases essential for finding gene orthologs and comparing genomic contexts across species, a key step in relevance assessment.
Assay & Reagent Platforms	Commercial in vitro assay kits (e.g., for cytotoxicity, receptor activation, oxidative stress); CRISPR-Cas9 reagents.	Standardized biochemical or cell-based assays to measure specific KEs (e.g., caspase-3 activity for apoptosis). CRISPR tools are critical for generating genetic models to test KE essentiality.

Case Study Application: Aromatase Inhibition to Reproductive Dysfunction

A canonical example demonstrating the integrated WoE assessment is the AOP for aromatase inhibition leading to reproductive dysfunction in fish. This AOP is well-supported and has been used to develop a prototype quantitative MCDA model for WoE [68] [69].

Biological Plausibility: Strong. The role of aromatase in converting androgens to estrogens, and estrogen's role in vitellogenin production and oocyte maturation, is deeply established in endocrinology.
Empirical Support:
- Dose-Response: Graded inhibition of brain/ovarian aromatase activity correlates with reduced plasma estradiol and vitellogenin, leading to decreased egg production [68].
- Temporality: Inhibition of aromatase and drop in estradiol precede the failure of oocyte maturation and spawning.
- Incidence: Observed consistently across multiple fish species (fathead minnow, zebrafish, medaka) exposed to various aromatase inhibitors (e.g., fadrozole, prochloraz).
Essentiality: Demonstrated by multiple lines of evidence. Pharmacological inhibition or genetic knockdown of aromatase recapitulates the adverse outcome. Furthermore, co-exposure with estradiol can rescue the reproductive failure, confirming the essential role of estrogen synthesis [69].
Quantification: This AOP has been modeled using ordinary differential equations that incorporate feedback in the hypothalamic-pituitary-gonadal axis, creating a predictive qAOP capable of simulating reproductive outcomes under various exposure scenarios [69].

This case illustrates how a robust WoE assessment, following the tailored BH criteria, transforms a mechanistic hypothesis into a trusted, quantifiable model that can be applied in ecological risk assessment and extended, with appropriate relevance assessment, to inform considerations for other vertebrate species.

Developing Quantitative AOPs (qAOPs) for Dose-Response and Species Extrapolation

The adverse outcome pathway (AOP) framework has emerged as a powerful conceptual tool for organizing mechanistic knowledge, describing a sequential chain of causally linked biological events from a molecular initiating event (MIE) to an adverse outcome (AO) relevant to risk assessment [74] [27]. While qualitative AOPs excel at hazard identification and hypothesis generation, their utility in quantitative risk assessment is limited. They describe what can happen but not when or at what dose it will occur [74] [75]. This gap is addressed by developing quantitative AOPs (qAOPs)—mathematical constructs that model the dose-response and temporal relationships between key events (KEs) [74].

The transition to qAOPs is driven by the need to extrapolate effects across biological scales (e.g., from in vitro to in vivo) and across species [74]. This is central to assessing species susceptibility, a core challenge in ecological and human health risk assessment. A qAOP provides a formal, computable structure to compare how differences in species biology—such as receptor affinity, metabolic rates, or compensatory feedback loops—modulate the severity and probability of an adverse outcome following a specific perturbation [75] [27]. By integrating toxicokinetic (what the body does to the chemical) and toxicodynamic (what the chemical does to the body) models, qAOPs enable prediction of the external exposure conditions required to trigger a pathway and the resultant magnitude of effect in diverse species [74].

Core Components and Definitions of a Quantitative AOP

A qAOP is built upon the modular architecture of a qualitative AOP but incorporates quantitative descriptors for its components. The level of quantification can vary, forming a continuum from a single quantitative Key Event Relationship (KER) to a fully developed qAOP model [74].

Table 1: Core Components of a Quantitative AOP (qAOP)

Component	Qualitative AOP Description	Quantitative qAOP Enhancement	Primary Function in Species Extrapolation
Molecular Initiating Event (MIE)	Identification of the initial chemical-biological interaction.	Quantification of binding affinity, inhibition potency (e.g., IC50, Ki), or reaction kinetics.	Species-specific MIEs (e.g., receptor homology, expression levels) define initial susceptibility.
Key Event (KE)	A measurable, essential change in biological state.	Definition as a continuous or discrete variable with a measurable baseline and dynamic range.	Identification of conserved vs. divergent KEs across species informs extrapolation confidence.
Key Event Relationship (KER)	A described causal linkage between two KEs.	A mathematical function (e.g., regression, differential equation, Bayesian probability) describing how the state of an upstream KE determines the state of a downstream KE.	Captures species-specific system dynamics (feedback, adaptation) that alter response progression.
Adverse Outcome (AO)	An adverse effect of regulatory relevance.	Defined as a probabilistic outcome or a continuous severity metric with a defined threshold.	Establishes the apical endpoint for cross-species comparison of pathway sensitivity.
Toxicokinetic (TK) Linkage	Often implicit or absent.	Explicit model linking external dose to internal concentration at the site of the MIE.	Critical for extrapolating from in vitro bioactivity concentrations to in vivo doses across species with different ADME (Absorption, Distribution, Metabolism, Excretion).

A full qAOP mathematically links the MIE to the AO, while a partial qAOP quantifies more than one KER, and a quantitative KER focuses on a single relationship [74]. For species extrapolation, the quantitative understanding embedded in KERs and TK models is paramount, as it allows for the parameterization of species-specific biological differences [75].

Methodological Framework for qAOP Development

Developing a qAOP is an iterative process that begins with a well-defined qualitative AOP and a clear problem formulation [74]. The following workflow and experimental protocols outline the key steps.

Diagram: Workflow for Quantitative AOP (qAOP) Development [74]

Experimental and Computational Protocols

The choice of methodology for quantifying KERs depends on the biological complexity, available data, and the assessment question. Below are detailed protocols for prominent approaches cited in recent literature.

Table 2: Detailed Experimental Protocols for Key qAOP Modeling Approaches

Modeling Approach	Core Protocol Description	Key Parameters & Data Requirements	Application in Species Extrapolation
Dynamic Bayesian Network (DBN) for Chronic Toxicity [48]	1. Define Network Structure: Map AOP KEs as nodes in a directed acyclic graph (DAG).2. Parameterize Conditional Probabilities: Use experimental data (dose- and time-series) to define probability distributions for each node given its parent nodes.3. Model Temporal Dynamics: Extend to DBN by linking time-sliced networks to capture delayed effects and feedback from repeated exposures.4. Inference & Prediction: Use probabilistic inference (e.g., Markov chain Monte Carlo) to compute the likelihood of the AO given evidence on upstream KEs.	- Time-series in vitro data (e.g., from repeated dosing assays).- Prior probability distributions for nodes.- Conditional probability tables for discrete data or linear Gaussian equations for continuous data.	Allows probabilistic prediction of AO in un-tested species by updating node parameters with species-specific data (e.g., different KE sensitivity). The causal structure can be tested for conservation.
Systems Biology ODE Model (e.g., HPG Axis) [75]	1. Diagram Biological Mechanism: Create detailed schematic of feedback loops (e.g., hypothalamic-pituitary-gonadal axis).2. Formulate ODEs: Write mass-action or Michaelis-Menten based equations for each biochemical process (e.g., hormone synthesis, degradation, receptor binding).3. Parameter Estimation: Calibrate model parameters (rate constants, IC50) using time-course data from controlled exposures (e.g., fish exposed to aromatase inhibitor).4. Model Coupling: Link the ODE model output (e.g., plasma vitellogenin) to downstream individual or population-level models.	- High-resolution time-course data for multiple biological variables (hormones, proteins).- Species-specific physiological parameters (e.g., metabolic clearance rates, organ volumes).- Chemical-specific toxicodynamic parameters.	Species extrapolation is achieved by replacing model parameters (e.g., enzyme abundance, feedback loop sensitivity) with values specific to the target species. This explicitly tests how system architecture modulates outcome.
Probabilistic Dose-Response using Activation Functions [76]	1. Select Activation Function: Choose a flexible mathematical function (e.g., sigmoid, hyperbolic tangent) to describe the KE response curve.2. Probabilistic Parameterization: Treat function parameters as distributions (not point estimates) to propagate uncertainty.3. Fit to Shorter-Duration Data: Calibrate the model using subacute/subchronic in vivo data.4. Extrapolate to Chronic POD: Use the fitted model to predict the point of departure (POD) for chronic exposure, comparing to regulatory benchmarks.	- Dose-response data from studies of varying duration.- Prior distributions for model parameters based on chemical class or MOA.- Definition of a biologically based adversity threshold for the KE.	Enables the use of short-term test data from a surrogate species to predict chronic toxicity values for a protected species, formalizing uncertainty in both duration and species sensitivity.

Quantitative Framework for Assessing Species Susceptibility

Assessing species susceptibility within the qAOP framework involves a systematic comparison of the quantitative parameters that govern pathway dynamics. The goal is to determine whether a pathway is conserved and, if so, how quantitative differences alter the dose-response relationship.

Diagram: Integrated TK/TD Framework for Species Extrapolation Using qAOPs [74] [75] [27]

The process involves two integrated layers:

Toxicokinetic Extrapolation: A physiologically based toxicokinetic (PBTK) model translates an external exposure or in vitro concentration into a tissue-specific internal dose at the MIE target. Species-specific physiological parameters (e.g., blood flow rates, tissue volumes, metabolic enzyme expression) are critical inputs [74].
Toxicodynamic Extrapolation: The internal dose drives the qAOP. Extrapolation requires evaluating the conservation of each KE and KER. Tools like the SeqAPASS (Sequence Alignment to Predict Across Species Susceptibility) can inform on the structural conservation of MIE targets like proteins [27]. For conserved KERs, the mathematical functions may remain the same, but their parameters (e.g., reaction rates, sensitivity thresholds) must be updated with data from the target species. AOP networks are particularly useful here, as shared KEs between pathways can reveal compensatory mechanisms that differ by species and modulate net susceptibility [27].

The Scientist's Toolkit: Essential Research Reagents and Materials

Building and validating qAOPs requires a combination of biological, analytical, and computational tools.

Table 3: Key Research Reagent Solutions for qAOP Development

Category / Item	Function in qAOP Development	Application in Species Extrapolation
In Vitro High-Throughput Screening Assays	Generate quantitative dose-response data for MIEs and early cellular KEs (e.g., receptor binding, gene expression). Provide the initial data for parameterizing upstream KERs.	Assays using human-derived cells (e.g., hepatocytes, cardiomyocytes) provide direct TD data for human health assessment. Cross-species comparisons require assays built from cells of relevant ecological species.
'Omics Platforms (Transcriptomics, Proteomics)	Measure system-wide molecular changes to identify KEs, validate KERs, and uncover adaptive or compensatory networks not captured in the simplified AOP.	Comparative 'omics across species exposed to similar internal doses can identify conserved "bioactivity signatures" and species-specific pathway deviations [48].
Physiologically Based Toxicokinetic (PBTK) Modeling Software (e.g., GastroPlus, Simcyp, open-source tools)	Computational platforms to build, simulate, and validate TK models. Essential for bridging in vitro bioactivity to in vivo dose and extrapolating across species.	Core tool for TK extrapolation. Libraries of species-specific physiological parameters are required to customize models for non-standard test species.
Bayesian Network / Statistical Analysis Software (e.g., R with `bnlearn`, `Stan`, `JAGS`)	Implement probabilistic graphical models and fit complex statistical models to quantify KERs under uncertainty. Enables DBN development for chronic toxicity [48].	Allows propagation of uncertainty in both source data and species-specific parameter estimates, yielding a probabilistic prediction of susceptibility for the target species.
AOP-KB and Supporting Databases (AOP-Wiki, Effectopedia, CompTox Chemicals Dashboard)	Central repositories for qualitative AOP knowledge, supporting evidence, and chemical properties. Provide the essential structural scaffold and literature basis for qAOP development.	The AOP-Wiki captures known domains of applicability for AOPs (e.g., taxonomic scope). SeqAPASS integration helps predict MIE conservation across species [27].
Cryopreserved Primary Cells from Multiple Species	Biologically relevant test systems for generating TD data. Primary cells maintain species-specific metabolic and regulatory functions better than immortalized cell lines.	Direct comparison of KE responses (e.g., cytotoxicity, biomarker release) between primary cells from a test species (e.g., rat) and a protected species (e.g., human or endangered fish) under controlled conditions.

The Adverse Outcome Pathway (AOP) framework is a conceptual model designed to organize biological knowledge into a structured sequence of causally linked events, from a molecular perturbation to an adverse outcome relevant to risk assessment [27]. An AOP is initiated by a Molecular Initiating Event (MIE), which is the direct interaction between a stressor (e.g., a chemical) and a biological target [27]. This trigger sets off a series of measurable Key Events (KEs) at different levels of biological organization (cellular, tissue, organ, organism). The causal linkages between a KE and its successor are defined as Key Event Relationships (KERs) [27].

The utility of an AOP for decision-making, particularly in extrapolating hazard across species, hinges on the confidence in its constituent KERs. Confidence is not a monolithic judgment but is built by systematically evaluating two pillars of evidence: biological plausibility and empirical support [27] [77]. Biological plausibility asks whether the proposed relationship is consistent with established biological knowledge, while empirical support examines the strength of experimental data demonstrating that a change in the upstream KE causes a change in the downstream KE [27]. A third pillar, quantitative understanding, further refines confidence by defining the conditions (e.g., dose, timing) under which the relationship holds [27].

Assessing species susceptibility within the AOP framework requires evaluating the conservation of these KERs across taxa. A KER with strong, well-understood evidence in a model organism may not operate identically in a species of concern if the underlying biological processes differ [27]. Therefore, a rigorous confidence assessment of KERs is the foundational step for reliable cross-species extrapolation and accurate species susceptibility assessment.

Evaluating Biological Plausibility

Biological plausibility establishes a reasoned, mechanistic argument for why a Key Event Relationship should exist, based on prior biological knowledge. It moves beyond mere statistical correlation to explain how the upstream event is capable of leading to the downstream event [77]. This assessment draws on existing understanding of biochemistry, physiology, and pathology.

Core Criteria for Assessment

Evaluation of biological plausibility involves applying modified Bradford Hill considerations to the KER [78]. The key criteria are outlined in the table below.

Table 1: Criteria for Assessing Biological Plausibility of a Key Event Relationship

Assessment Criterion	Description	Key Questions for Evaluation
Temporal Sequence	The cause (upstream KE) must precede the effect (downstream KE).	Is the onset of KE1 consistently observed before KE2 across studies?
Biological Analogy	Similar KERs are established in other well-characterized pathways or systems.	Are there analogous pathways in related biological processes or species?
Consistency with Known Biology	The KER aligns with fundamental principles of cell biology, biochemistry, and systems biology.	Is the relationship supported by literature on the involved molecules, pathways, and organ system functions?
Essentiality (Inferred)	Early, theoretical consideration of whether blocking KE1 is expected to prevent KE2.	Based on current knowledge, does KE1 appear to be a necessary precursor for KE2?

A robust plausibility assessment is a literature-driven, narrative synthesis. The process typically involves:

Pathway Mapping: Defining the precise biological components and processes that connect the two KEs. This often involves creating a detailed sub-pathway diagram.
Evidence Gathering: Conducting systematic searches for primary literature and authoritative reviews on the relevant biology. Omics data (transcriptomics, proteomics) can be particularly valuable for identifying coordinated changes in pathways that link KEs [79].
Narrative Development & Weight of Evidence: Constructing a coherent story that explains the relationship, explicitly citing supporting evidence. The assessor must weigh the breadth, quality, and consistency of the gathered information to judge the strength of plausibility [77].

Diagram 1: Workflow for Biological Plausibility Assessment

Evaluating Empirical Support

Empirical support provides the direct experimental evidence that alterations in the upstream KE actually lead to the downstream KE. It tests the predictions generated by the biological plausibility narrative.

Hierarchy and Types of Empirical Evidence

Not all experimental evidence carries equal weight. The strength of empirical support is judged by the study design's ability to establish causality.

Table 2: Hierarchy and Types of Evidence for Empirical Support

Evidence Tier	Study Type	Strength for Causality	Example in AOP Context
Direct Manipulation (Strongest)	Experimental modulation (inhibition, activation, knockout) of the upstream KE.	High. Demonstrates essentiality and sufficiency.	Using a specific inhibitor to block KE1 and showing KE2 is prevented [4].
Co-exposure / Modulation	Administering the stressor alongside a modulator of the pathway.	Moderate-High. Supports the role of a specific pathway.	Exposing to a chemical while also administering an antioxidant to mitigate oxidative stress (a KE) [80].
Dose-Response Concordance	Measuring parallel changes in both KEs across a range of stressor doses/concentrations.	Moderate. Shows association and biological gradient.	Demonstrating that increasing concentrations of cadmium cause a proportional increase in oxidative damage (KE1) and a decrease in lysosomal stability (KE2) [80].
Temporal Concordance	Measuring the onset and progression of KEs over time.	Moderate. Establishes correct temporal sequence.	Showing DNA binding (MIE) occurs before increased mutation frequency (KE), which precedes tumor formation (AO).
Association (Weakest)	Observing correlation between KEs in a single test group.	Low. Suggests a link but cannot prove causation.	Measuring both reduced hormone levels and developmental defects in exposed animals.

Quantitative Weight of Evidence (QWoE) Scoring

A structured, semi-quantitative approach can be used to integrate evaluations across multiple Bradford Hill criteria (like biological plausibility, essentiality, dose-response) into an overall confidence score for a KER [78]. This Quantitative Weight of Evidence (QWoE) method involves:

Defining Scoring Scales: Creating consistent scales (e.g., 0-3) for each criterion.
Scoring Individual Lines of Evidence: Independently scoring biological plausibility, empirical support for essentiality, dose-response concordance, etc.
Integrating Scores: Combining individual scores, often with predefined weights, to generate an overall confidence score for the KER. This QWoE score allows for the comparative evaluation of confidence across different KERs or between alternative AOPs for the same outcome [78].

Experimental Validation and Protocol Considerations

Building high-confidence AOPs requires data from well-designed experiments. The following protocol exemplifies an integrated approach to generate empirical support for multiple KERs within an AOP.

Case Study Protocol: Cadmium Toxicity in Blue Mussels

A 2024 study on Mytilus edulis provides a template for AOP-informed experimental validation [80].

Objective: To validate a putative AOP linking cadmium (Cd) exposure to impaired organismal function and growth, and to identify a minimal suite of predictive biomarkers.
Stressor & Model: Cadmium chloride (200-600 µg/L) exposed to blue mussels for 14 days [80].
Measured Endpoints (Key Events):
- MIE/KE1: Cd uptake (tissue burden).
- Cellular KEs: Oxidative stress (antioxidant capacity, lipid peroxidation), lysosomal damage, immune parameter alterations (hemocyte mortality, enzyme activity).
- Organismic KEs: Clearance (filtration) rate, scope for growth (energy budget), byssus production.
- Adverse Outcome: Reduced fitness and survival [80].
Workflow & Analysis:
- Measure the full suite of biomarkers and organismal endpoints.
- Establish dose-response and temporal relationships between KEs.
- Use multivariate statistics (e.g., Random Forest Analysis) to identify the smallest combination of biomarkers that accurately predicts the adverse outcome or critical organismal KE [80]. In this study, six key biomarkers achieved >90% accuracy in distinguishing exposed from control groups.

Diagram 2: Key Event Relationships in a Cadmium AOP

The Scientist's Toolkit for Confidence Assessment

A suite of resources and reagents is essential for developing and evaluating high-confidence AOPs.

Table 3: Research Reagent Solutions for AOP Development & Confidence Assessment

Tool / Reagent Category	Specific Examples	Function in Confidence Assessment
Knowledgebase & Curation Platforms	AOP-Wiki (OECD), AOP-KB [27] [4]	Central repository for published AOPs, allowing comparison, community review, and identification of evidence gaps.
In Vitro/New Approach Methodologies (NAMs)	High-throughput screening assays, reporter gene assays, omics platforms (transcriptomics, proteomics) [79] [4].	Generate empirical data for MIEs and early KEs; used for dose-response and essentiality testing (via chemical/genetic perturbation).
Bioinformatics & Cross-Species Tools	SeqAPASS, BLAST, orthology databases [27].	Assess biological plausibility across species by evaluating the conservation of protein targets, pathways, and sequence homology for MIEs/KEs.
Biomarker Assay Kits	Commercial kits for oxidative stress (e.g., lipid peroxidation, antioxidant capacity), cytotoxicity, specific enzyme activities [80].	Provide standardized, reproducible methods for quantifying KEs in experimental validation studies.
Statistical & Modeling Software	R/Bioconductor packages for dose-response analysis, multivariate statistics (e.g., Random Forest), pathway analysis, QWoE scoring frameworks [78] [80].	Analyze empirical data for concordance, identify predictive biomarker suites, and perform quantitative weight-of-evidence integration.

Framework for Species Susceptibility Assessment

Confidence in KERs directly enables the assessment of species susceptibility. The process involves determining if and how an AOP operational in a tested species (e.g., lab rat, fathead minnow) applies to a species of concern (e.g., a threatened fish or human).

Stepwise Assessment Process

Deconstruct the AOP: Break down the AOP of interest into its fundamental KERs.
Evaluate KER Conservation: For each KER, assess the conservation of the underlying biological process in the target species. Key questions include:
- Are the molecular targets (MIE) and proteins involved in KEs structurally and functionally conserved? (Tool: SeqAPASS [27]).
- Are the connecting pathways (signaling, metabolic) present and similar?
- Are compensating or mitigating pathways different?
Identify Critical Differences: Flag KERs where conservation is weak or unknown. These are points of uncertainty and potential susceptibility modifiers. A species may be more or less susceptible based on these differences.
Integrate into a Susceptibility Conclusion: A species is predicted to be susceptible if the AOP's critical KERs are conserved and exposure is possible. Reduced confidence in conservation lowers the certainty of this prediction.

Diagram 3: Species Susceptibility Assessment Workflow

Quantitative Decision Matrix

Findings from the stepwise assessment can be synthesized into a decision matrix to guide conclusions.

Table 4: Cross-Species Susceptibility Assessment Matrix

Conservation Status of AOP KERs	Empirical Support in Target Species	Confidence in Susceptibility Prediction	Recommended Action
High Conservation (Molecular targets & pathways confirmed).	Strong (Direct experimental evidence available).	High.	AOP is applicable for risk assessment; quantitative extrapolation may be possible.
High Conservation.	Limited or Indirect (e.g., via omics or strong analogy).	Moderate to High.	AOP is likely applicable; targeted research to fill empirical gaps is recommended.
Uncertain or Mixed Conservation (Some KERs conserved, others not).	Variable or Unknown.	Low to Moderate.	Apply significant uncertainty factors; AOP may require species-specific modification. Research on divergent KERs is critical.
Low Conservation (Critical targets/pathways absent or divergent).	Contradictory or Demonstrating Non-susceptibility.	High (for Non-Susceptibility).	AOP is not applicable; the species is likely not susceptible via this pathway.

Confidence assessment of Key Event Relationships is a systematic, evidence-driven process that forms the scientific bedrock of the AOP framework. By rigorously evaluating biological plausibility and empirical support, researchers can assign levels of confidence to AOPs, determining their suitability for use in decision-making contexts like chemical prioritization and species susceptibility assessment. The integration of Quantitative Weight of Evidence approaches provides a transparent and structured method for synthesizing this judgment [78].

The future of high-confidence AOP development is tied to several key advancements:

FAIR Data Principles: Making AOP data Findable, Accessible, Interoperable, and Reusable is a major international focus. Initiatives like the FAIR AOP Roadmap for 2025 aim to standardize annotation and improve machine-actionability, which will streamline evidence integration and confidence scoring [29].
High-Content Omics Integration: Omics technologies will play an increasing role in establishing biological plausibility across species and providing dense empirical data for KERs, moving beyond single biomarkers to network-based understanding [79].
Quantitative AOP (qAOP) Models: The transition from qualitative to quantitative AOPs that describe dynamic, dose-responsive relationships between KEs will dramatically increase predictive power and refine species extrapolation [4].

Ultimately, a robust confidence assessment protocol ensures that AOPs used to extrapolate hazard across species are not merely hypothetical constructs but are firmly grounded in biological reality and empirical proof, enabling scientifically defensible predictions of species susceptibility.

The assessment of chemical hazards has historically relied on traditional in vivo toxicity tests, which measure adverse apical outcomes—such as mortality, organ weight changes, or tumor formation—in animal models following exposure [81]. These studies yield a critical effect level, which, after applying uncertainty factors, is used to establish protective human exposure limits [81]. However, these endpoints are often non-specific; systemic toxicity can precede or co-occur with specific effects, making it challenging to distinguish the underlying mechanistic cause [81]. Consequently, the data is used to set broadly protective limits without necessarily predicting the precise toxicological response in humans [81].

The Adverse Outcome Pathway (AOP) framework presents a paradigm shift towards mechanism-based prediction. An AOP is a conceptual construct that describes a sequential chain of causally linked events at different levels of biological organization, from a molecular initiating event (MIE) to an adverse outcome (AO) relevant to risk assessment [2] [27]. It functions like a series of biological dominos, where a stressor triggers an MIE (e.g., chemical binding to a receptor), leading to measurable key events (KEs) at cellular, tissue, and organ levels, ultimately culminating in the AO [27]. This framework is chemical-agnostic and modular, allowing for the assembly of individual pathways into complex AOP networks [27] [4].

This whitepaper provides a comparative analysis of these two paradigms, framing the discussion within the critical challenge of assessing species susceptibility. It examines how the mechanistic understanding embedded within AOPs, particularly through the concept of quantitative AOPs (qAOPs) and AOP networks, can be benchmarked against traditional toxicity data to improve cross-species extrapolation and strengthen predictive toxicology.

Table 1: Foundational Comparison of Traditional Toxicity Testing and the AOP Framework

Aspect	Traditional Toxicity Testing	AOP-Based Framework
Core Principle	Empirical observation of apical outcomes in whole organisms [81].	Mechanistic description of causally linked key events from molecular initiation to adverse outcome [2] [27].
Primary Data Source	In vivo animal studies (e.g., rodents, fish) [81] [82].	Integrated data from in silico, in vitro, and targeted in vivo studies mapped to KEs [4] [83].
Nature of Endpoint	Often non-specific (e.g., body weight change, mortality); used to set protective exposure levels [81].	Specific, measurable key events at molecular, cellular, and tissue levels leading to a defined AO [2] [27].
Role in Risk Assessment	Directly provides points of departure for safety assessment using uncertainty factors [81].	Informs hazard identification, supports New Approach Methodologies (NAMs), and guides testing strategies; requires translation to risk [27] [4].
Handling of Species Differences	Relies on allometric scaling and uncertainty factors; extrapolation is often empirical [81].	Enables cross-species extrapolation by evaluating conservation of MIEs, KEs, and pathways (e.g., using SeqAPASS tool) [27].
Temporal and Resource Scale	Long duration (months to years), high cost, high animal use [84].	Rapid, higher-throughput, aims to reduce and replace animal testing [5] [84].

Methodological Comparison: Data Generation and Application

Traditional Toxicity Testing Protocols

Standard guideline studies (e.g., OECD, EPA) form the backbone of traditional assessment. A typical sub-chronic rodent toxicity study involves exposing groups of animals (e.g., rats) to graduated doses of a test chemical via a relevant route (diet, gavage, inhalation) for 90 days. Endpoints include daily clinical observations, weekly body weight and food consumption, hematology and clinical chemistry at termination, and extensive histopathological examination of all major organs [82]. The lowest-observed-adverse-effect level (LOAEL) and no-observed-adverse-effect level (NOAEL) are derived from the most sensitive relevant endpoint, which may be a non-specific measure like reduced body weight gain [81]. These studies are designed to be protective but may not elucidate the specific mechanism of action, complicating extrapolation to humans or other species.

AOP Development and Quantitative Workflow

In contrast, AOP-based prediction follows a structured, modular workflow.

AOP Development: Knowledge is collated from the scientific literature to define a qualitative AOP. This involves identifying a plausible MIE, a series of essential KEs, and the key event relationships (KERs) linking them [27] [85]. Confidence in each KER is assessed based on biological plausibility, empirical support, and quantitative understanding [27]. This information is formally captured in the OECD's AOP Wiki [85].
Quantification (qAOP): A qualitative AOP is transformed into a predictive quantitative model (qAOP). This involves gathering quantitative data for each KER to define the magnitude and timing of a change in an upstream KE required to trigger a change in a downstream KE [83]. Data sources include in vitro concentration-response, in vivo dose-response, and toxicokinetic models to translate external dose to target site concentration [83].
Application for Prediction: The qAOP serves as a scaffold for integrating data from New Approach Methodologies (NAMs). For a novel chemical, in vitro assays measuring early KEs (e.g., receptor binding, gene expression) provide input data. The qAOP model then simulates the propagation of this perturbation to predict the probability or severity of the AO in vivo [4] [83]. This process is central to defined approaches for regulatory safety assessment [5].

The AOP Footprinting Approach for Mixtures and Species Comparison

A novel methodology for direct comparison is AOP "footprinting." This approach involves the systematic profiling and comparison of all AOPs involved in a toxicological effect at the level of KEs [86]. The goal is to identify the KE(s) most proximal to the adverse outcome within each AOP where similarity between chemicals—or across species—can be confidently determined [86]. This "footprint" becomes a common basis for comparison. For species susceptibility, footprinting can identify whether the conservation of a pathway breaks down at a specific KE (e.g., a compensatory mechanism present in one species but not another), thereby pinpointing the source of differential sensitivity [86].

AOP Framework and Data Integration

Benchmarking Performance: Strengths, Limitations, and Validation

Benchmarking AOP predictions against traditional data requires evaluating predictive accuracy, uncertainty, and applicability domains. Traditional studies provide the empirical benchmark—the observed apical outcome—against which AOP-based in vitro to in vivo predictions are measured [4].

Table 2: Validation Metrics for Benchmarking AOP Predictions

Validation Metric	Description	Application in Benchmarking
Predictive Accuracy	Concordance between AOP-predicted outcome and observed traditional in vivo outcome.	Measured by sensitivity, specificity, and overall concordance for categorical endpoints (e.g., sensitizer/non-sensitizer) [4].
Quantitative Concordance	Comparison of predicted versus observed point-of-departure (e.g., ED/EC50, LOAEL).	Critical for qAOPs; assesses how well the quantitative relationship between early KEs predicts the in vivo dose-response [83].
Uncertainty Characterization	Assessment of confidence in individual KERs and overall AOP, and quantification of parametric uncertainty in qAOP models.	AOPs require transparent weight-of-evidence assessment for each KER [27]. qAOPs use sensitivity analysis to identify major uncertainty sources [83].
Applicability Domain	The chemical, biological, and experimental conditions over which the AOP is considered reliable.	Defined by the MIE and KE specificities. Traditional data validates the domain but may also reveal limitations for novel chemistries [86].

Key Strengths of AOP-Based Approaches:

Mechanistic Insight & Cross-Species Extrapolation: By defining the essential biological pathway, AOPs facilitate evaluation of KE conservation across species using genomic and functional tools (e.g., EPA's SeqAPASS) [27]. This provides a biologically reasoned basis for extrapolating toxicity, moving beyond allometric default factors.
Efficiency and Animal Sparing: AOPs enable the use of rapid, high-throughput in vitro assays to screen and prioritize chemicals, directly addressing the "too many chemicals to test" challenge [5] [4].
Assessment of Mixtures and Combined Exposures: AOP networks and the footprinting approach provide a framework for predicting mixture effects based on shared KEs or AOs, helping to identify chemicals with dose-additive potential [86].

Persistent Challenges and Limitations:

Translational Gaps: A critical challenge is quantitatively linking the magnitude of an early KE perturbation (measured in vitro) to the probability of an AO manifesting in a whole organism with its intact homeostatic and compensatory mechanisms [4] [83].
Incomplete Pathway Coverage: Many toxicological outcomes, especially chronic ones, involve complex, non-linear AOP networks. Existing AOPs are often simplifications, and confidence may be low for novel MIEs or under-studied KEs [27] [86].
Regulatory Acceptance: While AOPs are gaining traction (e.g., for skin sensitization), their use in formal risk assessment to replace traditional data often requires extensive validation and performance benchmarking to meet regulatory standards for certainty [4].

Case Studies in Predictive Application

Table 3: Case Studies of AOP Application and Benchmarking

Case Study	Traditional Approach	AOP-Based Approach	Performance & Insight
Skin Sensitization	In vivo guinea pig maximization test or murine local lymph node assay (LLNA) [4].	An OECD-validated AOP (AOP 40) linking covalent binding to proteins (MIE) to T-cell proliferation and allergic response [4]. A defined approach using in vitro and in chemico KE assays (DPRA, KeratinoSens, h-CLAT) with a Bayesian network prediction model.	Successfully implemented in EU regulation to replace animal testing. Demonstrates high predictive accuracy (>85% concordance with LLNA) and showcases a full replacement paradigm [4].
Endocrine Disruption (Estrogenic Effects)	In vivo uterotrophic assay (rats) or fish reproduction test [4].	AOP network linking estrogen receptor activation (MIE) to adverse outcomes like altered reproduction. Used with high-throughput screening (ToxCast ER model) to prioritize ~10,000 chemicals for testing [2] [4].	Effectively triages and prioritizes chemicals for higher-tier testing. Provides mechanistic understanding for cross-species extrapolation (e.g., from fish to amphibians) [27] [4].
Developmental Neurotoxicity via Thyroid Disruption	Resource-intensive multi-generational rodent studies [2].	AOP linking reduction in maternal thyroid hormone (KE) to impaired neurodevelopment in offspring (AO) [2] [27].	Enables use of shorter-term assays measuring thyroid hormone levels or related gene expression as surrogate predictive endpoints, reducing animal use and study duration [2].

Comparative Analysis and Benchmarking Workflow

The Scientist's Toolkit: Essential Research Reagent Solutions

Transitioning from traditional methods to AOP-driven research requires specialized tools and reagents.

qAOP Modeling Software & Platforms: Tools like R/Bioconductor packages for dose-response modeling, computational dynamic modeling environments (e.g., Copasi, MATLAB/SimBiology), and AOP-specific platforms (e.g., AOP-Wiki, AOP-KB) are essential for developing and sharing quantitative pathway models [85] [83].
KE-Specific Assay Kits: Reliable, validated kits for measuring defined KEs are crucial. Examples include:
- High-Content Screening Assays for nuclear receptor translocation (e.g., ER, AR, AhR).
- qPCR Arrays or RNA-Seq Panels for gene expression signatures tied to specific pathway perturbations.
- Phospho-Specific Antibody Panels for multiplexed measurement of key signaling pathway activation.
Advanced In Vitro Model Systems: To capture tissue-level KEs, 3D tissue models (spheroids, organoids), microphysiological systems ("organs-on-chips"), and ex vivo tissue cultures provide more physiologically relevant contexts than 2D cell cultures [84].
Cross-Species Comparison Tools: Bioinformatics resources like the EPA's SeqAPASS tool allow for the computational assessment of protein sequence, functional domain, and pathway conservation across species, directly informing species susceptibility assessments within an AOP context [27].

The comparative analysis reveals that traditional toxicity data and the AOP framework are not mutually exclusive but complementary. Traditional studies provide the essential in vivo apical endpoint data required to anchor and validate quantitative AOP predictions. Conversely, AOPs provide the mechanistic understanding needed to interpret traditional data, extrapolate across species, design smarter testing strategies, and ultimately reduce reliance on animal-intensive methods.

The future of predictive toxicology lies in the systematic integration of these paradigms. Key frontiers include:

Expanding qAOP Development: Building robust, tissue-scale qAOP models that incorporate systems biology feedback and toxicokinetics to close the in vitro to in vivo extrapolation gap [83].
Leveraging AOP Networks for Complex Outcomes: Moving beyond linear pathways to model interacting AOP networks that underlie chronic toxicity, immunosuppression, or chemical mixture effects [27] [86].
Standardizing Benchmarking Practices: Establishing community-wide protocols for performance assessment of AOP-based predictions against high-quality traditional datasets to build regulatory confidence [4].

For the core thesis on assessing species susceptibility, the AOP framework offers a powerful, structured methodology. By benchmarking AOP predictions—built on human or model species molecular data—against traditional toxicity data across multiple species, researchers can systematically identify conserved versus divergent KEs and KERs. This process moves species extrapolation from a default uncertainty factor to a data-driven, hypothesis-based investigation of the fundamental biological determinants of susceptibility.

Conclusion

Assessing species susceptibility is a critical, integrative component of the AOP framework that transforms it from a qualitative map into a predictive, quantitative tool for human-relevant risk assessment. By systematically applying the strategies outlined—from foundational biology and curated data integration to advanced computational troubleshooting and rigorous validation—researchers can build greater confidence in extrapolating toxicological findings across species. The future of biomedical and clinical research lies in leveraging these refined, susceptibility-aware AOPs to develop more predictive New Approach Methodologies, prioritize chemicals and drug candidates with greater accuracy, and ultimately design safer therapeutics by explicitly accounting for human variability and potential vulnerable populations [citation:1][citation:2][citation:7].