TST: Copy on Write for filter #50589

lithomas1 · 2023-01-05T17:27:40Z

xref ENH / CoW: Use the "lazy copy" (with Copy-on-Write) optimization in more methods where appropriate #49473 (Replace xxxx with the GitHub issue number)
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.

whoiskatrin · 2023-01-05T18:12:04Z

pandas/tests/copy_view/test_methods.py

+    df_orig = df.copy()
+    df2 = df.filter(**filter_kwargs)
+    if using_copy_on_write:
+        assert np.shares_memory(get_array(df2, "a"), get_array(df, "a"))


Out of curiosity, do you think that numpy.may_share_memory() function can be faster than using np.shares_memory()? I compared it once, and it seems like it could be the case, but not super important.

This doesn't really matter for tests, but seems like it could be faster

lithomas1 · 2023-01-05T18:31:32Z

pandas/tests/copy_view/test_methods.py

+
+    # mutating df2 triggers a copy-on-write for that column/block
+    if using_copy_on_write:
+        df2.iloc[0, 0] = 0


This was raising a SettingWithCopy error(reasonable, since filter uses loc) so I just moved it inside the CoW case.

Yeah had the same problem somewhere else

phofl · 2023-01-06T21:43:48Z

thx @lithomas1

TST: Copy on Write for filter

8e60d77

lithomas1 mentioned this pull request Jan 5, 2023

ENH / CoW: Use the "lazy copy" (with Copy-on-Write) optimization in more methods where appropriate #49473

Closed

73 tasks

lithomas1 added the Copy / view semantics label Jan 5, 2023

lithomas1 marked this pull request as draft January 5, 2023 18:11

whoiskatrin reviewed Jan 5, 2023

View reviewed changes

Move iloc inside CoW case

378b61f

lithomas1 commented Jan 5, 2023

View reviewed changes

lithomas1 marked this pull request as ready for review January 5, 2023 22:13

Merge branch 'main' into test-filter-cow

02bb978

phofl approved these changes Jan 6, 2023

View reviewed changes

phofl added this to the 2.0 milestone Jan 6, 2023

phofl merged commit 4520f84 into pandas-dev:main Jan 6, 2023

lithomas1 deleted the test-filter-cow branch January 6, 2023 21:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TST: Copy on Write for filter #50589

TST: Copy on Write for filter #50589

lithomas1 commented Jan 5, 2023

whoiskatrin Jan 5, 2023

phofl Jan 5, 2023

lithomas1 Jan 5, 2023

phofl Jan 5, 2023

phofl commented Jan 6, 2023

TST: Copy on Write for filter #50589

TST: Copy on Write for filter #50589

Conversation

lithomas1 commented Jan 5, 2023

whoiskatrin Jan 5, 2023

Choose a reason for hiding this comment

phofl Jan 5, 2023

Choose a reason for hiding this comment

lithomas1 Jan 5, 2023

Choose a reason for hiding this comment

phofl Jan 5, 2023

Choose a reason for hiding this comment

phofl commented Jan 6, 2023