core.SimulationStudy

SimulationStudy()

A workflow manager for simulation reliability assessment.

Attributes

Name	Type	Description
inputs	List[str]	List of input variable names.
outcome	str	Name of the outcome variable.
data	pd.DataFrame	The raw simulation data.
clean_data	pd.DataFrame	Data that has passed validation.
sufficiency_results	pd.DataFrame	The latest diagnostic results.
pod_results	Dict	Results from the latest PoD analysis.
plots	Dict	Stores the latest generated figures.
linear_pod_results	Dict	Results from the classical linear analysis.
linear_plots	Dict	Stores figures from the classical linear analysis.
models_cache	Dict	Cached mean models to prevent redundant fitting.
variance_cache	Dict	Cached variance models (residuals/bandwidth).
pod_curves_cache	Dict	Cached integrated PoD curves.
threshold_spectrum_cache	Dict	Cached threshold spectrum.

Examples

from digiqual.core import SimulationStudy

# 1. Clean Initialization
study = SimulationStudy()

# 2. Add data (automatically infers input columns)
study.add_data(df, outcome_col='Signal')

# 3. Run diagnostics with custom strict thresholds
results = study.diagnose(max_gap_ratio=0.10, min_r2_score=0.75)

Methods

Name	Description
add_data	Ingests raw simulation data, configures columns, and manages the cache.
compute_pod_spectrum	Pre-calculates a spectrum of PoD curves across a range of signal thresholds.
diagnose	Runs statistical diagnostics to evaluate if the current sample size is sufficient.
estimate_compute_time	Physics-aware heuristic to estimate PoD computation time in seconds.
get_data_summary	Calculates the minimum, maximum, and median values for a given column.
get_unassigned_parameters	Calculates which input variables are not currently assigned as a Parameter
linear_pod	Runs the classical linear a-hat vs a PoD analysis with bootstrapped bounds.
optimise	Runs the automated Active Learning loop (Initialize -> Execute -> Diagnose -> Refine).
plot_collinearity	Plots a correlation matrix heatmap of the input variables to inspect collinearity.
plot_pod_vs_threshold	Generates a plot of Probability of Detection (y-axis) vs. Detection Threshold (x-axis)
pod	Runs the generalized Probability of Detection (PoD) analysis.
refine	Identifies gaps or high-variance regions and suggests new simulation points.
update_slice	Updates the evaluated slice for the PoD surface using Layer 3 Caching.
visualise	Generates and displays diagnostic plots (Signal Model and PoD Curve).