gh-107265: Fix code_hash for ENTER_EXECUTOR case #108188

Merged
merged 7 commits on Aug 21, 2023

Conversation

@corona10 corona10 (Member) commented Aug 21, 2023

@corona10 corona10 (Member, Author) commented Aug 21, 2023

@gvanrossum cc @markshannon

I updated code_richcompare so that it no longer modifies the code object at all.

@gvanrossum gvanrossum (Member) left a comment

@markshannon Can you please confirm that this addresses your concern? (I'm sorry it slipped by me when I reviewed the previous PR.)

Two inline review threads on Objects/codeobject.c (outdated; resolved)
@markshannon (Member)

Would it be better to ignore co_code_adaptive and use co_code for comparisons?
It will definitely be slower, but it will be correct.

Ultimately we will want to compare by identity, I think.
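
(Editorial note: a rough Python-level sketch of the distinction under discussion, not part of this PR. It assumes CPython 3.12, where the private _co_code_adaptive attribute exposes the raw adaptive bytecode while co_code exposes the normalized, de-specialized form.)

```python
def f(x):
    return x + 1

def g(x):
    return x + 1

# Warm f up so the specializing interpreter may rewrite its adaptive bytecode.
for i in range(10_000):
    f(i)

co_f, co_g = f.__code__, g.__code__

# The raw adaptive bytes of the hot function may now differ from g's...
print(co_f._co_code_adaptive == co_g._co_code_adaptive)
# ...but co_code maps specialized opcodes back to their base form, so the
# two compile-time-identical bodies still match.
print(co_f.co_code == co_g.co_code)   # True
```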

@gvanrossum (Member)

Ultimately we will want to compare by identity, I think.

I'm not so sure; I expect that would cause too much breakage.

@gvanrossum (Member)

Would it be better to ignore co_code_adaptive and use co_code for comparisons?

Or factor out the normalization code involved in constructing co_code so it can be reused by compare and hash.

@gvanrossum (Member)

Would it be better to ignore co_code_adaptive and use co_code for comparisons?

Or factor out the normalization code involved in constructing co_code so it can be reused by compare and hash.

Never mind, that's already factored out (deopt_code()) but it modifies the bytecode array in place.
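
(Editorial note: a minimal Python-flavored sketch of the non-mutating variant being suggested; the real code is C in Objects/codeobject.c, and DEOPT_MAP, deopt_copy, code_eq and code_hash below are hypothetical names used only for illustration.)

```python
# Idea: produce a normalized *copy* of the adaptive bytecode instead of
# rewriting the code object's array in place, so compare and hash can
# share one helper without side effects.

DEOPT_MAP = {}  # stand-in for the interpreter's specialized->base opcode table

def deopt_copy(adaptive: bytes) -> bytes:
    """Return a de-specialized copy; the original buffer is left untouched.

    A faithful version would also zero inline cache entries and unwrap
    ENTER_EXECUTOR back to the original instruction; omitted here.
    """
    out = bytearray(adaptive)
    for i in range(0, len(out), 2):            # 2-byte (opcode, oparg) units
        out[i] = DEOPT_MAP.get(out[i], out[i])
    return bytes(out)

# Only the bytecode part of comparison and hashing is sketched here.
def code_eq(a, b) -> bool:
    return deopt_copy(a._co_code_adaptive) == deopt_copy(b._co_code_adaptive)

def code_hash(a) -> int:
    return hash(deopt_copy(a._co_code_adaptive))
```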

@markshannon (Member)

We keep changing the hash and equality functions, so I don't really see how another change will break anything, apart from assumptions in the compiler.

@corona10 corona10 (Member, Author) commented Aug 21, 2023

Would it be better to ignore co_code_adaptive and use co_code for comparisons?
It will definitely be slower, but it will be correct.

IIUC, we would need to update _PyCode_CODE for comparisons or add a new macro. It's worth experimenting with.
I would like to do that in a separate PR, including a performance overhead comparison.
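
(Editorial note: one rough way to get a first number for that overhead from pure Python is to time equality routed through the normalized co_code bytes against plain code-object equality; this only approximates the C-level cost, and results depend on version and code size.)

```python
import timeit

src = "x = 1\nfor i in range(3):\n    x += i\n"
a = compile(src, "<bench>", "exec")
b = compile(src, "<bench>", "exec")

# Equality routed through the normalized co_code bytes.
print(timeit.timeit(lambda: a.co_code == b.co_code, number=100_000))
# Current code-object equality, for comparison.
print(timeit.timeit(lambda: a == b, number=100_000))
```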

@gvanrossum gvanrossum (Member) left a comment

LGTM.

@gvanrossum (Member)

We keep changing the hash and equality functions, so I don't really see how another change will break anything, apart from assumptions in the compiler.

It seems pretty fundamental that co == co.replace(), which comparing by identity would break (and we rely in many places on .replace() always creating a new code object, with no specializations or executors, and all caches reset). IMO any field that can be changed through code.replace(xxx=yyy) should be included in equality, and no others. The hash should use a pragmatic subset of these that satisfies the required relationship between hash and equality and can be computed quickly.
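
(Editorial note: the invariant described above, demonstrated with the public API only.)

```python
def f(x):
    return x + 1

co = f.__code__
clone = co.replace()              # no arguments: every field kept as-is

print(co == clone)                # True  - equality is field-based
print(co is clone)                # False - replace() always builds a new object
print(hash(co) == hash(clone))    # True  - hash agrees with equality

# A field settable through replace() is reflected in equality.
print(co == co.replace(co_name="g"))   # False
```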

@carljm carljm (Member) commented Aug 21, 2023

In #101346 I tried to change code objects to compare by identity, and in the process I reached the same conclusion as @gvanrossum.

Making co != co.replace() (or, similarly, compile(source_string, ...) != compile(source_string, ...)) is a much bigger change than any tweaks to the details of code object comparison that have happened up until now.
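
(Editorial note: the compile() case mentioned above, as it behaves today.)

```python
src = "x = 1"
a = compile(src, "<test>", "exec")
b = compile(src, "<test>", "exec")

print(a == b)   # True  - two separate compilations compare equal today
print(a is b)   # False - they are distinct objects, so identity-based
                #         comparison would make them unequal
```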

@gvanrossum gvanrossum merged commit e6db23f into python:main Aug 21, 2023
17 checks passed