From the post:
Digital work in and around the Humanities often involves moving data from one system or format to another. That data often involves complex textual materials in multiple languages and writing systems. One commonly used format is the “Comma-Separated Values” text file. It’s not uncommon to find that characters not used in English get garbled when exported from a spreadsheet program like Microsoft Excel to CSV (or imported from CSV into such a program). What’s going on and how do you make it stop?
Source: Preserving Accented and Non-Roman Characters in CSV Workflows
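
The excerpt poses the question without giving the linked post's answer. As a rough illustration only, and not necessarily that post's explanation, here is a minimal Python sketch of one common cause: text saved as UTF-8 but read back as a legacy Windows code page. The sample rows and the helper names `to_csv_bytes` and `from_csv_bytes` are made up for this example. It assumes that a spreadsheet program is more likely to detect UTF-8 when the file starts with a byte order mark, which Python's `utf-8-sig` codec writes. That holds for many Excel versions but is worth checking against your own setup.

```python
import csv
import io

# Hypothetical sample data with accented and non-Roman characters.
rows = [["name", "city"], ["Zoë", "Kraków"], ["Αθηνά", "東京"]]


def to_csv_bytes(rows, encoding):
    """Serialize rows to CSV text, then encode with the given codec."""
    buf = io.StringIO(newline="")
    csv.writer(buf).writerows(rows)
    return buf.getvalue().encode(encoding)


def from_csv_bytes(data, encoding):
    """Decode CSV bytes with the given codec and parse them back into rows."""
    text = data.decode(encoding)
    return list(csv.reader(io.StringIO(text, newline="")))


# "utf-8-sig" prepends the UTF-8 byte order mark (EF BB BF) when encoding
# and strips it again when decoding.
data = to_csv_bytes(rows, "utf-8-sig")
has_bom = data.startswith(b"\xef\xbb\xbf")

# Reading with the same codec returns the original text unchanged.
round_trip = from_csv_bytes(data, "utf-8-sig")

# The garbling itself: UTF-8 bytes misread as Windows-1252.
garbled = "Zoë".encode("utf-8").decode("cp1252")

print(has_bom, round_trip == rows, garbled)
```

To write a real file rather than bytes in memory, the same codec can be passed to `open(path, "w", newline="", encoding="utf-8-sig")` before handing the file object to `csv.writer`.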