Data Validation Scripts
Data Validation Scripts
This folder contains reusable scripts to validate datasets in .csv
, .sav
, and .dta
formats. Checks include:
- Duplicate ID detection
- Variable naming conventions
- Value range checks
- Label inspection
Languages used:
- Python (pandas)
- SPSS Syntax (.sps)
- Stata (.do)
Each script is documented inside with usage examples.