diagnostics.sample_sufficiency
sample_sufficiency(df, input_cols, outcome_col)
Performs statistical tests on sampling sufficiency.
Runs 3 checks
- Input Space Coverage (Gaps)
- Model Fit Stability (CV Score)
- Bootstrap Convergence (Coefficient of Variation)
Parameters
| df |
pd.DataFrame |
The simulation data. Will be validated via validate_simulation internally. |
required |
| input_cols |
List[str] |
List of input variable names. |
required |
| outcome_col |
str |
Name of the outcome variable. |
required |
Returns
|
pd.DataFrame |
pd.DataFrame: A table containing pass/fail metrics for each test, including the threshold values evaluated against. |