diagnostics.sample_sufficiency

sample_sufficiency(df, input_cols, outcome_col)

Performs statistical tests on sampling sufficiency.

Runs 3 checks

  1. Input Space Coverage (Gaps)
  2. Model Fit Stability (CV Score)
  3. Bootstrap Convergence (Coefficient of Variation)

Parameters

Name Type Description Default
df pd.DataFrame The simulation data. Will be validated via validate_simulation internally. required
input_cols List[str] List of input variable names. required
outcome_col str Name of the outcome variable. required

Returns

Name Type Description
pd.DataFrame pd.DataFrame: A table containing pass/fail metrics for each test, including the threshold values evaluated against.