We should group the test files in pandas/tests/io/data into subdirectories by format, e.g. stata, excel, html, csv, etc.
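
A rough sketch of how the regrouping could be scripted, in case it helps the discussion. The extension-to-directory mapping and the `group_by_extension` helper are hypothetical and not part of pandas. The real data directory may hold other extensions or files that need manual sorting. In the repository itself `git mv` would probably be preferable to keep history, and any tests that reference the old paths would need updating as well.

```python
import shutil
from pathlib import Path

# Hypothetical mapping from file extension to target subdirectory.
# The actual set of formats in the data directory may differ.
EXT_TO_DIR = {
    ".dta": "stata",
    ".xls": "excel",
    ".xlsx": "excel",
    ".html": "html",
    ".csv": "csv",
}


def group_by_extension(data_dir, mapping=EXT_TO_DIR, dry_run=True):
    """Plan (and optionally perform) moves of files into per-format subdirectories.

    Returns a list of (source, destination) Path pairs. Files are only
    moved when dry_run is False; unrecognised extensions are left alone.
    """
    data_dir = Path(data_dir)
    moves = []
    # sorted() materialises the listing first, so creating subdirectories
    # during the loop does not affect what is iterated over.
    for path in sorted(data_dir.iterdir()):
        if not path.is_file():
            continue
        subdir = mapping.get(path.suffix.lower())
        if subdir is None:
            continue
        dest = data_dir / subdir / path.name
        moves.append((path, dest))
        if not dry_run:
            dest.parent.mkdir(exist_ok=True)
            shutil.move(str(path), str(dest))
    return moves
```

Running it first with `dry_run=True` on `pandas/tests/io/data` would list the planned moves without touching anything, which makes it easy to review the grouping before committing to it.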