Data Quality Checklist
Ensure all examples follow consistent format
Remove duplicate or near-duplicate examples
Have a domain expert validate labels (on at least a 10% sample)
Balance class distribution (for classification)
Include edge cases and corner cases
Document data collection methodology
Version-control the dataset and its splits
Log metrics on independent test set
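
Several of the items above can be partly automated. The following is a minimal sketch, not a definitive implementation. It assumes each example is a dict with "text" and "label" keys; that schema and all function names are hypothetical. The duplicate check only catches copies that differ in case or whitespace, so true near-duplicate detection would need a fuzzier method.

```python
import hashlib
import math
import random
from collections import Counter

# Assumed (hypothetical) schema: every example is a dict with these keys.
REQUIRED_KEYS = ("text", "label")


def find_malformed(examples):
    """Return indices of examples that are missing a required key."""
    return [
        i for i, ex in enumerate(examples)
        if not all(key in ex for key in REQUIRED_KEYS)
    ]


def normalize(text):
    """Lowercase and collapse whitespace so trivially different copies match."""
    return " ".join(text.lower().split())


def deduplicate(examples):
    """Keep the first occurrence of each normalized text."""
    seen = set()
    unique = []
    for ex in examples:
        digest = hashlib.sha256(normalize(ex["text"]).encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(ex)
    return unique


def class_distribution(examples):
    """Return the fraction of examples per label."""
    counts = Counter(ex["label"] for ex in examples)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def review_sample(examples, fraction=0.10, seed=0):
    """Draw a reproducible random sample of at least `fraction` of the data."""
    if not examples:
        return []
    k = min(len(examples), max(1, math.ceil(len(examples) * fraction)))
    return random.Random(seed).sample(examples, k)


if __name__ == "__main__":
    data = [
        {"text": "Hello world", "label": "pos"},
        {"text": "hello   World", "label": "pos"},  # duplicate after normalization
        {"text": "Bad", "label": "neg"},
        {"text": "Fine", "label": "pos"},
    ]
    print("malformed:", find_malformed(data))
    unique = deduplicate(data)
    print("kept:", len(unique), "of", len(data))
    print("distribution:", class_distribution(unique))
    print("for expert review:", review_sample(unique))
```

The fixed seed keeps the review sample reproducible, which fits the item on versioning the dataset and splits: the same dataset version yields the same sample for the expert.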