WARN read_table with infer_datetime_format doesn't show FutureWarning #51017

MarcoGorelli · 2023-01-27T10:05:28Z

Running

import pandas as pd
import io

timestamp_format = '%Y-%d-%m %H:%M:%S'
date_index = pd.date_range(start='1900', end='2000')
dates_df = date_index.strftime(timestamp_format).to_frame(name='ts_col')
data = dates_df.to_csv()
df = pd.read_csv(
    io.StringIO(data),
    date_parser=lambda x: pd.to_datetime(x, format=timestamp_format),
    parse_dates=['ts_col'],
    infer_datetime_format=True,
    sep=',',
)

results in

t.py:34: UserWarning: The argument 'infer_datetime_format' is deprecated and will be removed in a future version. A strict version of it is now the default, see https://pandas.pydata.org/pdeps/0004-consistent-to-datetime-parsing.html. You can safely remove this argument.
  df = pd.read_csv(

However,

df = pd.read_table(
    io.StringIO(data),
    date_parser=lambda x: pd.to_datetime(x, format=timestamp_format),
    parse_dates=['ts_col'],
    infer_datetime_format=True,
    sep=',',
)

shows no warning

Task is just to add a warning to this function

pandas/pandas/io/parsers/readers.py

Lines 1151 to 1209 in 1951b51

    
           def read_table( 
        
               filepath_or_buffer: FilePath | ReadCsvBuffer[bytes] | ReadCsvBuffer[str], 
        
               *, 
        
               sep: str | None | lib.NoDefault = lib.no_default, 
        
               delimiter: str | None | lib.NoDefault = None, 
        
               # Column and Index Locations and Names 
        
               header: int | Sequence[int] | None | Literal["infer"] = "infer", 
        
               names: Sequence[Hashable] | None | lib.NoDefault = lib.no_default, 
        
               index_col: IndexLabel | Literal[False] | None = None, 
        
               usecols=None, 
        
               # General Parsing Configuration 
        
               dtype: DtypeArg | None = None, 
        
               engine: CSVEngine | None = None, 
        
               converters=None, 
        
               true_values=None, 
        
               false_values=None, 
        
               skipinitialspace: bool = False, 
        
               skiprows=None, 
        
               skipfooter: int = 0, 
        
               nrows: int | None = None, 
        
               # NA and Missing Data Handling 
        
               na_values=None, 
        
               keep_default_na: bool = True, 
        
               na_filter: bool = True, 
        
               verbose: bool = False, 
        
               skip_blank_lines: bool = True, 
        
               # Datetime Handling 
        
               parse_dates: bool | Sequence[Hashable] = False, 
        
               infer_datetime_format: bool | lib.NoDefault = lib.no_default, 
        
               keep_date_col: bool = False, 
        
               date_parser=None, 
        
               dayfirst: bool = False, 
        
               cache_dates: bool = True, 
        
               # Iteration 
        
               iterator: bool = False, 
        
               chunksize: int | None = None, 
        
               # Quoting, Compression, and File Format 
        
               compression: CompressionOptions = "infer", 
        
               thousands: str | None = None, 
        
               decimal: str = ".", 
        
               lineterminator: str | None = None, 
        
               quotechar: str = '"', 
        
               quoting: int = csv.QUOTE_MINIMAL, 
        
               doublequote: bool = True, 
        
               escapechar: str | None = None, 
        
               comment: str | None = None, 
        
               encoding: str | None = None, 
        
               encoding_errors: str | None = "strict", 
        
               dialect: str | csv.Dialect | None = None, 
        
               # Error Handling 
        
               on_bad_lines: str = "error", 
        
               # Internal 
        
               delim_whitespace: bool = False, 
        
               low_memory=_c_parser_defaults["low_memory"], 
        
               memory_map: bool = False, 
        
               float_precision: str | None = None, 
        
               storage_options: StorageOptions = None, 
        
               use_nullable_dtypes: bool | lib.NoDefault = lib.no_default, 
        
           ) -> DataFrame | TextFileReader:

similarly to how is already done here:

pandas/pandas/io/parsers/readers.py

Lines 887 to 895 in 1951b51

    
           if infer_datetime_format is not lib.no_default: 
        
               warnings.warn( 
        
                   "The argument 'infer_datetime_format' is deprecated and will " 
        
                   "be removed in a future version. " 
        
                   "A strict version of it is now the default, see " 
        
                   "https://pandas.pydata.org/pdeps/0004-consistent-to-datetime-parsing.html. " 
        
                   "You can safely remove this argument.", 
        
                   stacklevel=find_stack_level(), 
        
               )

Finally, you'll need to add a test: you can duplicate

pandas/pandas/tests/io/parser/test_parse_dates.py

Lines 1293 to 1303 in 9991d5e

    
           def test_parse_dates_infer_datetime_format_warning(all_parsers): 
        
               # GH 49024 
        
               parser = all_parsers 
        
               data = "Date,test\n2012-01-01,1\n,2" 
        
               parser.read_csv_check_warnings( 
        
                   UserWarning, 
        
                   "The argument 'infer_datetime_format' is deprecated", 
        
                   StringIO(data), 
        
                   parse_dates=["Date"], 
        
                   infer_datetime_format=True, 
        
               )

but for read_table (or parametrise over parser.read_csv_check_warnings and parser.read_table_check_warnings). Note that you'll need to add sep=','

The text was updated successfully, but these errors were encountered:

MarcoGorelli · 2023-01-27T10:13:36Z

To parametrise, you might want to use

@pytest.mark.parametrize( 'reader', [ 'read_csv_check_warnings', 'read_table_check_warnings' ])

and then use getattr(parser, reader) instead of parser.read_csv_check_warnings

MarcoGorelli · 2023-01-27T10:49:24Z

Also, this should be a FutureWarning

kathleenhang · 2023-01-27T18:41:01Z

take

kathleenhang · 2023-01-27T20:14:43Z

Running your first code snippet which should have produced a FutureWarning, I did not receive any warning.

I tried figuring out how to enable warnings in pandas by running pd.reset_option('all').

This resulted in the appearance of other FutureWarnings but still, it didn't produce the same FutureWarning like the one you received: "FutureWarning: The argument 'date_parser' is deprecated..."

I'm using pandas v1.5.3. Do you know why this is the case?

For now, I'll work on adding in the warning for the second code snippet and seeing if that one appears properly.

MarcoGorelli · 2023-01-27T20:18:04Z

hey @kathleenhang ,

sorry, my bad, I was running that from a branch. updating now, sorry for the confusion, thanks for having asked

MarcoGorelli · 2023-01-27T20:19:30Z

@kathleenhang I've updated the example in the issue - do you receive a warning now if you run it?

kathleenhang · 2023-01-27T20:28:27Z

@MarcoGorelli Hey there, I still don't receive any warning. I'm using Python 3.9.1 if that helps.

MarcoGorelli · 2023-01-27T20:53:55Z

Does it work if you run

pytest pandas/tests/io/parser/test_parse_dates.py -k test_parse_dates_infer_datetime_format_warning

?

kathleenhang · 2023-01-27T21:14:54Z

I ran it twice in two different folders. Here are the results:

pandas-kathleenhang is my forked pandas repo
pandas-dev just contains a file called t.py

MarcoGorelli · 2023-01-27T21:15:51Z

looks like you need to rebuild the C extensions: https://pandas.pydata.org/docs/dev/development/contributing_environment.html#step-3-build-and-install-pandas

kathleenhang · 2023-01-27T21:44:25Z

I also was not inside of my Docker virtual environment. I just set that up yesterday, and I am still understanding how it works, its purpose, and when I should have it activated.

I had built my C extensions inside of the Docker virtual environment, but since I was working outside of the virtual environment, it didn't have the updated C extensions.

I see the warning now. Thanks @MarcoGorelli !

MarcoGorelli added Warnings Warnings that appear or should be added to pandas good first issue labels Jan 27, 2023

MarcoGorelli added this to the 2.0 milestone Jan 27, 2023

github-actions bot assigned kathleenhang Jan 27, 2023

kathleenhang mentioned this issue Jan 28, 2023

ENH, TST: Add FutureWarning to read_table #51048

Merged

1 task

MarcoGorelli closed this as completed in #51048 Jan 30, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WARN read_table with infer_datetime_format doesn't show FutureWarning #51017

WARN read_table with infer_datetime_format doesn't show FutureWarning #51017

MarcoGorelli commented Jan 27, 2023 •

edited

Loading

MarcoGorelli commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

WARN read_table with infer_datetime_format doesn't show FutureWarning #51017

WARN read_table with infer_datetime_format doesn't show FutureWarning #51017

Comments

MarcoGorelli commented Jan 27, 2023 • edited Loading

MarcoGorelli commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023

kathleenhang commented Jan 27, 2023

MarcoGorelli commented Jan 27, 2023 •

edited

Loading