Resource: salty – Turn Clean Data into Messy Data
From the resource: When teaching students how to clean data, it helps to have data that isn’t too clean already. salty is a new package that offers functions for “salting” clean data with problems often found in datasets in the wild, such as: pseudo-OCR errors inconsistent capitalization and spelling unpredictable punctuation in numeric fields missing […]