
Performance enhancements #557

Merged 2 commits into eliben:main on Apr 23, 2024
Conversation

@sevaa (Contributor) commented Apr 22, 2024

I made the following enhancements, in rough order of impact:

  • Rewrote the LEB128 parsers to be less idiomatic but more streamlined (see the sketch after this list)
  • Instantiated the handful of scalar parsers that are used in all the one-off struct_parse calls, and replaced all points of usage
  • Inlined some code on the hot path
  • Removed the unneeded stream_preserve for streams in auxiliary sections
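
For context, the kind of hand-rolled ULEB128 loop the first bullet refers to looks roughly like this (an illustrative sketch, not the exact code in this PR):

```python
def read_uleb128(stream):
    """Decode one unsigned LEB128 number from a file-like stream."""
    result = 0
    shift = 0
    while True:
        b = stream.read(1)[0]          # one byte as an int; IndexError at EOF
        result |= (b & 0x7F) << shift  # the low 7 bits carry the payload
        if b & 0x80 == 0:              # a clear high bit marks the last byte
            return result
        shift += 7
```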

Review comments on elftools/common/construct_utils.py and elftools/dwarf/die.py (resolved)
@eliben merged commit 75f0f98 into eliben:main on Apr 23, 2024. 4 checks passed.
@sevaa (Contributor, Author) commented Apr 26, 2024

@eliben

On the topic of performance, there is a disconnect between ELF proper parsing and DWARF parsing. ELF parsing, at least in the typical use case, happens against a file stream. DWARF parsing is all in memory (barring exotic scenarios where users monkeypatch or subclass ELFFile), and yet we still treat DWARF sections as streams rather than the in-memory byte arrays they really are. Instead of calling read() (and constructing bytes objects) all the time, the parser could work with byte values and slices. The workhorse function for parsing primitive datatypes, struct.unpack_from(), takes a buffer and an offset.
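
To illustrate with a hypothetical snippet (not actual pyelftools code): the stream style allocates a fresh bytes object on every primitive read, while the buffer style unpacks in place:

```python
import struct

# Stream style: every read() constructs a new bytes object before unpacking.
def read_u32_stream(stream):
    return struct.unpack('<I', stream.read(4))[0]

# Buffer style: struct.unpack_from() takes the buffer and an offset, no copy.
def read_u32_buffer(buf, offset):
    return struct.unpack_from('<I', buf, offset)[0]
```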

But that would pretty much mean abandoning construct for the DWARF part. As far as I understand, construct was meant to work the way protobuf does, against wire and file streams.

@eliben (Owner) commented Apr 29, 2024

Yes, I'd rather keep this capability. A more interesting direction would be to enable more incremental DWARF parsing, without slurping whole sections into memory.

@sevaa (Contributor, Author) commented Apr 29, 2024

Reliance on construct is not a capability per se; it's more of an implementation detail. The DWARF parser mostly spits out native Python structures: lists, namedtuples and such. Off the top of my head, the only Construct objects that we get in the public API are CU headers. And the interface of Construct, where x["a"] and x.a are equivalent, is hardly magical.
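
That equivalence is a few lines of plain Python (a hypothetical toy class, just to make the point, not a proposal for the actual API):

```python
class AttrDict(dict):
    """A dict whose keys can also be read as attributes, like Construct's containers."""
    def __getattr__(self, name):
        try:
            return self[name]
        except KeyError:
            raise AttributeError(name)

x = AttrDict(a=1)
assert x["a"] == x.a == 1
```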

OBTW, construct technically has a parse-from-buffer method. Only it works by constructing a BytesIO around the buffer and then parsing the stream :) Overhead-wise, not a net win.

Anyway, were I to implement buffer-style parsing, I wouldn't be getting rid of construct altogether; compound datatypes can stay. I'd implement a buffer+position object that walks like a stream but doesn't quite quack like one, and I'd teach the primitive type parsers (there is just a handful) to recognize it. The compound parsers (Struct, Array) can stay as they are.
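
A rough sketch of what such a buffer+position object could look like (names and details are illustrative, not a design commitment):

```python
class BufferReader:
    """Walks like a stream (tell/seek/read) over an in-memory buffer.
    Primitive parsers that recognize it could unpack in place via .buf and .pos."""
    def __init__(self, buf, pos=0):
        self.buf = buf
        self.pos = pos

    def tell(self):
        return self.pos

    def seek(self, offset, whence=0):
        if whence == 0:    # absolute
            self.pos = offset
        elif whence == 1:  # relative to the current position
            self.pos += offset
        else:              # relative to the end
            self.pos = len(self.buf) + offset
        return self.pos

    def read(self, n=-1):
        end = len(self.buf) if n < 0 else min(self.pos + n, len(self.buf))
        data = self.buf[self.pos:end]
        self.pos = end
        return data
```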


I gave some thought to incremental parsing of DWARF. One big obstacle would be the transforms that DWARF sections undergo: two decompression hooks and relocations (also the phantom-bytes thing, but that's very much an edge case). Compressed streams are not seekable. Also, I'm assuming we are talking about support for extra-large binaries here; were we to implement a no-slurp mode, I'm afraid it would have to be accompanied by a no-cache mode. At least no CU/DIE cache; dropping the abbrev cache would be too costly.

But let's assume there are no transforms. I had three possible designs in mind:

  • Implement a "file window" stream class, one that wraps a proper file stream, maintains its own position/length, and seeks the underlying stream as needed (a rough sketch follows below)
  • Open the file several times so that there is a file stream per section (dup won't help; dupped handles share their current position)
  • Memory-map the file, construct BytesIO objects around sections, and trust the OS to do the right thing

Which one do you think sounds the least crazy?
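
For reference, the first option could look roughly like this (hypothetical sketch, untested):

```python
import io

class FileWindow:
    """Exposes the [start, start + size) slice of an underlying file stream
    as an independent, seekable, read-only stream-alike (read/seek/tell)."""
    def __init__(self, fileobj, start, size):
        self.fileobj = fileobj
        self.start = start
        self.size = size
        self.pos = 0

    def tell(self):
        return self.pos

    def seek(self, offset, whence=io.SEEK_SET):
        if whence == io.SEEK_SET:
            self.pos = offset
        elif whence == io.SEEK_CUR:
            self.pos += offset
        else:  # io.SEEK_END
            self.pos = self.size + offset
        return self.pos

    def read(self, n=-1):
        remaining = max(self.size - self.pos, 0)
        if n < 0 or n > remaining:
            n = remaining
        self.fileobj.seek(self.start + self.pos)
        data = self.fileobj.read(n)
        self.pos += len(data)
        return data
```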

@eliben (Owner) commented May 1, 2024

The mmap idea sounds intriguing

@sevaa (Contributor, Author) commented May 1, 2024

I've basically already done this in #481 :)
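
For flavor, the idea boils down to something like this (an illustration only; not necessarily how #481 implements it):

```python
import io
import mmap

# Hypothetical usage; the filename and the section offset/size stand in for real values.
f = open('binary.elf', 'rb')
mapped = mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)

def section_stream(sh_offset, sh_size):
    # Slicing the mapping materializes only this section's bytes;
    # the rest of the file is never read into the process.
    return io.BytesIO(mapped[sh_offset:sh_offset + sh_size])
```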
