validation
validation
¶
Data validation utilities.
validate_data
¶
Validate that dataframe has required columns.
| PARAMETER | DESCRIPTION |
|---|---|
df
|
Dataframe to validate
TYPE:
|
required_columns
|
Required column names
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
bool
|
True if valid |
Source code in fplx/utils/validation.py
check_data_quality
¶
Check data quality and report issues.
| PARAMETER | DESCRIPTION |
|---|---|
df
|
Data to check
TYPE:
|
max_missing_pct
|
Maximum acceptable percentage of missing values
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
Dict[str, float]
|
Quality metrics |
Source code in fplx/utils/validation.py
impute_missing
¶
Impute missing values.
| PARAMETER | DESCRIPTION |
|---|---|
df
|
Data with missing values
TYPE:
|
strategy
|
Imputation strategy: 'mean', 'median', 'forward_fill', 'zero'
TYPE:
|
| RETURNS | DESCRIPTION |
|---|---|
DataFrame
|
Data with imputed values |