Data Validation Scripts

This folder contains reusable scripts to validate datasets in .csv, .sav, and .dta formats. Checks include:

  • Duplicate ID detection
  • Variable naming conventions
  • Value range checks
  • Label inspection

Languages used:

  • Python (pandas)
  • SPSS Syntax (.sps)
  • Stata (.do)

Each script is documented inside with usage examples.