bpo-44338: Port LOAD_GLOBAL to PEP 659 adaptive interpreter #26638
Conversation
🤖 New build scheduled with the buildbot fleet by @markshannon for commit 8ad3e55 🤖 If you want to schedule another build, you need to add the ":hammer: test-with-buildbots" label again.
There are a few spots where things are a little unclear, but mostly I was able to follow the change.
Python/ceval.c
Outdated
STAT_INC(loadglobal_deferred);
cache->adaptive.counter--;
oparg = cache->adaptive.original_oparg;
JUMP_TO_INSTRUCTION(LOAD_GLOBAL);
This is also virtually identical, right? Will that be the case for all the XXXX_ADAPTIVE opcodes?
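The XXXX_ADAPTIVE pattern the reviewer is asking about can be simulated in a few lines of Python. This is a hypothetical sketch, not CPython's actual C structures: `AdaptiveCache` and `adaptive_step` are illustrative names standing in for `cache->adaptive` and the instruction body shown in the diff above.

```python
# Hypothetical sketch of the shared XXXX_ADAPTIVE dispatch pattern:
# count the cache down, then ask the specializer for help.
class AdaptiveCache:
    def __init__(self, counter, original_oparg):
        self.counter = counter                # countdown until (re)specialization
        self.original_oparg = original_oparg  # oparg saved when the cache was set up

def adaptive_step(cache):
    """Return the oparg for the generic instruction while the counter
    runs down; return None once it is time to (re)specialize."""
    if cache.counter > 0:
        cache.counter -= 1
        return cache.original_oparg  # i.e. JUMP_TO_INSTRUCTION(LOAD_GLOBAL)
    return None                      # i.e. call the specializer

cache = AdaptiveCache(counter=2, original_oparg=7)
assert adaptive_step(cache) == 7
assert adaptive_step(cache) == 7
assert adaptive_step(cache) is None  # counter exhausted: try to specialize
```

Because only the opcode name in the jump differs per instruction, the body is indeed nearly identical across the adaptive opcodes.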
Python/specialize.c
Outdated
goto fail;
}
cache1->module_keys_version = keys_version;
cache0->index = index;
Yeah, I was going to ask about this (`Py_ssize_t` -> `uint16_t`), but it looks like GHA beat me to it. :) Does this limit us on the size of the builtins/globals dicts, or are dicts already constrained to 2^16 entries?
We just can't optimize access to variables with index > 64k.
If your module has more than 64k variables, then you have plenty of other problems 🙂
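The consequence of the `uint16_t` cache slot can be sketched directly. This is a hypothetical illustration (the name `try_cache_index` is invented); the real check lives in the C specializer, which takes the `goto fail` path for out-of-range indices:

```python
# Hypothetical sketch: the specializer can only cache indices that fit
# in the 16-bit cache slot; larger indices simply fail to specialize,
# and the instruction stays on the generic LOAD_GLOBAL path.
UINT16_MAX = 0xFFFF

def try_cache_index(index):
    if not (0 <= index <= UINT16_MAX):
        return None  # "goto fail": leave the instruction unspecialized
    return index     # value stored in cache0->index (a uint16_t)

assert try_cache_index(10) == 10
assert try_cache_index(70_000) is None  # > 64k variables: no specialization
```

So dicts themselves are not limited; only the optimization gives up beyond 64k entries.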
SpecializedCacheEntry *caches = GET_CACHE();
_PyAdaptiveEntry *cache0 = &caches[0].adaptive;
_PyLoadGlobalCache *cache1 = &caches[-1].load_global;
DEOPT_IF(dict->ma_keys->dk_version != cache1->module_keys_version, LOAD_GLOBAL);
Do we also need to check the case where it's a different globals dict? It can't be done from Python (except if someone creates a new function using the code object from another function, or passes the code object to `exec()`), but it can easily be done from C. There is the remote chance that (in that already unlikely case) the keys version is the same.
If the keys version is the same, then it has the same keys in the same order and is the same kind of dict.
In which case it doesn't matter if it is a different dictionary, because we cache the index, not the value.
As an aside, you can get different dictionaries with the same keys as module dicts at different times.
from types import ModuleType

class C: pass
d1 = C().__dict__
d2 = C().__dict__
# d1 and d2 should share keys
m = ModuleType("m")
m.__dict__ = d1  # illustrative only; a module's __dict__ is a read-only attribute
# Specialize
m.__dict__ = d2
# globals in m would see the same keys as when specialized
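The guard's behavior can be sketched in Python. This is a hypothetical model (the function and parameter names are illustrative, not CPython's C structures): the cache records a keys version and an index, and the guard compares only versions, so a *different* dict with the same keys version is still served correctly from the cached index.

```python
# Hypothetical sketch of the LOAD_GLOBAL_MODULE guard. The "dict" is
# modeled as (dk_version, values): a keys version plus a value array
# indexed in key-insertion order.
def specialized_load(dk_version, values, cached_version, cached_index):
    if dk_version != cached_version:
        return None  # DEOPT_IF(...): fall back to the generic instruction
    return values[cached_index]

# Two dicts with the same keys version but different values:
assert specialized_load(42, ["a", "b"], 42, 1) == "b"
assert specialized_load(42, ["x", "y"], 42, 1) == "y"   # different dict, same index works
assert specialized_load(43, ["a", "b"], 42, 1) is None  # version mismatch -> deopt
```

This is why caching the index (rather than the value) makes the identity of the dict irrelevant.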
…-26638)

* Add specializations of LOAD_GLOBAL.
* Add more stats.
* Remove old opcache; it is no longer used.
* Add NEWS
Adds two specializations of LOAD_GLOBAL:

* LOAD_GLOBAL_MODULE for module variables
* LOAD_GLOBAL_BUILTIN for builtins

Removes the old "opcache" mechanism.
Adds a few more stats (hidden behind a compile-time flag).
https://bugs.python.org/issue44338
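To see which instructions this PR affects, the `dis` module shows where the compiler emits LOAD_GLOBAL (here for the builtin `len`); the adaptive specialization then happens at runtime, invisibly to `dis` output by default:

```python
import dis

def f():
    # 'len' is neither a local nor a module-level name here,
    # so the compiler emits LOAD_GLOBAL for it.
    return len("abc")

ops = [ins.opname for ins in dis.get_instructions(f)]
assert "LOAD_GLOBAL" in ops
```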