Data Flow Documentation
This file explains the typical flow of data through the InsightStack toolkit.
Typical Workflow:
- Raw Data Entry
  - Source: external surveys, XLSForms, CSV, Stata/SPSS
  - Tools: survey_to_codebook/, data_validation/
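As a rough illustration of the entry step for CSV sources, the sketch below reads rows into plain dictionaries using only the Python standard library. The function name `read_rows` is hypothetical and is not part of the toolkit; Stata and SPSS files would need a different reader.

```python
import csv
import io


def read_rows(handle):
    """Read CSV rows from an open text handle into a list of dicts.

    Hypothetical helper for illustration; every value is kept as a string.
    """
    return list(csv.DictReader(handle))


# Example with an in-memory file standing in for a raw survey export.
sample = io.StringIO("id,age\n1,34\n2,\n")
rows = read_rows(sample)
```

Keeping values as strings at this stage leaves type handling to the validation step.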
- Data Validation
  - Scripts check for:
    - Missing values
    - Out-of-range entries
    - Duplicates
  - Folder: data_validation/
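The three checks listed for the validation step could look roughly like the following. This is a minimal sketch under stated assumptions, not the scripts in data_validation/: the function `validate_rows`, its parameters, and the row-as-dict layout are all hypothetical.

```python
from collections import Counter


def validate_rows(rows, id_field, ranges):
    """Report missing values, out-of-range entries, and duplicate IDs.

    rows:     list of dicts (one per record); layout is an assumption
    id_field: name of the column that should be unique
    ranges:   {column: (low, high)} inclusive numeric bounds
    """
    problems = {"missing": [], "out_of_range": [], "duplicates": []}
    for i, row in enumerate(rows):
        # Missing values: None or blank strings in any column.
        for field, value in row.items():
            if value is None or str(value).strip() == "":
                problems["missing"].append((i, field))
        # Out-of-range entries: only for columns with declared bounds.
        for field, (low, high) in ranges.items():
            value = row.get(field)
            if value is None or str(value).strip() == "":
                continue  # already reported as missing
            try:
                number = float(value)
            except (TypeError, ValueError):
                problems["out_of_range"].append((i, field))
                continue
            if not low <= number <= high:
                problems["out_of_range"].append((i, field))
    # Duplicates: IDs that occur more than once, in first-seen order.
    counts = Counter(row.get(id_field) for row in rows)
    problems["duplicates"] = [
        key for key, n in counts.items() if n > 1 and key not in (None, "")
    ]
    return problems


rows = [
    {"id": "1", "age": "34"},
    {"id": "2", "age": ""},
    {"id": "2", "age": "150"},
]
report = validate_rows(rows, "id", {"age": (0, 120)})
```

Returning a report rather than raising on the first problem lets all issues in a file be reviewed together.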
- Labeling Variables
  - Apply human-readable labels to variables
  - Folder: label_variables/
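One simple way to picture the labeling step is a lookup from variable names to labels. The sketch below assumes rows are dictionaries and that labels come from a plain mapping; `label_columns` and the example variable names are hypothetical, not the contents of label_variables/.

```python
def label_columns(rows, labels):
    """Rename row keys using a {variable_name: label} mapping.

    Variables without a label keep their original name.
    """
    return [
        {labels.get(name, name): value for name, value in row.items()}
        for row in rows
    ]


labeled = label_columns(
    [{"q1_age": 34, "hh_id": 7}],
    {"q1_age": "Respondent age (years)"},
)
```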
- Exploration & Analysis
  - With validated, labeled data you can run:
    - Summary statistics
    - Regressions
    - Visualizations
  - Folder: replication/ (or use external tools)
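For the summary statistics mentioned in this step, a small standard-library sketch might look like this. The helper `summarize` is hypothetical; it assumes blanks have already been flagged during validation and simply skips them here.

```python
import statistics


def summarize(values):
    """Return basic summary statistics for one numeric column.

    Blank strings and None are skipped; remaining values are cast to float.
    """
    clean = [float(v) for v in values if v not in (None, "")]
    if not clean:
        return {"n": 0}
    return {
        "n": len(clean),
        "mean": statistics.mean(clean),
        "median": statistics.median(clean),
        "min": min(clean),
        "max": max(clean),
    }


summary = summarize(["1", "2", "", "6"])
```

Regressions and visualizations usually call for external tools, which is consistent with the note that such tools may be used at this stage.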
- Documentation
  - Generate Markdown codebooks from XLSForms
  - Folder: survey_to_codebook/
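To illustrate what generating a Markdown codebook can involve, the sketch below renders rows from an XLSForm `survey` sheet as a Markdown table. It assumes the sheet has already been loaded into dictionaries with the standard `type`, `name`, and `label` columns; the function `codebook_markdown` and the output layout are hypothetical and may differ from what survey_to_codebook/ produces.

```python
def codebook_markdown(survey_rows):
    """Render XLSForm-style survey rows as a Markdown table."""
    lines = ["| Variable | Type | Label |", "| --- | --- | --- |"]
    for row in survey_rows:
        name = str(row.get("name") or "").strip()
        if not name:
            continue  # skip rows without a variable name, such as blank rows
        qtype = str(row.get("type") or "").strip()
        # Escape pipes so a label cannot break the table layout.
        label = str(row.get("label") or "").strip().replace("|", "\\|")
        lines.append(f"| {name} | {qtype} | {label} |")
    return "\n".join(lines)


markdown = codebook_markdown(
    [
        {"type": "integer", "name": "age", "label": "Age in years"},
        {"type": "", "name": "", "label": ""},
    ]
)
```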
- Archival & Replication
  - Structure the analysis for reuse or publication
  - Folder: replication/