bpo-27397: Make email module properly handle invalid-length base64 strings #7583

taleinat · 2018-06-10T07:00:26Z

The output of base64 encoding can never have a length of 1 more than a multiple of 4 (not including line breaks and padding).

Attempting to decode base64-encoded email payloads of such length would result in an AssertionError. This fix properly detects and handles such cases, introducing a new type of defect to be returned in this case. As decided in the b.p.o. issue's comments, in this case the encoded data is returned as-is.

Note: This also includes a change to decode_b() in _encoded_words.py, where instead of trying padding with 0-3 '=' characters, it just tries using zero or two. Using 3 will never help since 2 is the maximum possibly needed. Since extra padding is ignored by binascii.a2b_base64() and hence by base64.b64decode(), trying with 2 added padding characters will work whether 1 or 2 are needed. I've added a comment to this effect as well as a test.

https://bugs.python.org/issue27397

This defines a new InvalidBase64LengthDefect defect, which is returned with the encoded string when attempting to base64-decode a string of invalid length (1 mod 4).

bitdancer · 2018-06-10T18:47:42Z

Lib/email/_encoded_words.py

-            raise AssertionError("unexpected binascii.Error")
+                # This only happens when the encoded string's length is 1 more
+                # than a multiple of 4, which is invalid.
+                defects = [errors.InvalidBase64LengthDefect()]


This replacement of the defect list bothers me, it makes the code more fragile.

I think we should probably instead drop the 'pad_err' computation, since we're going to figure that out in the try/except, and then switch to computing the (value, defects) tuple and have a single return at the end of the method. That way the PaddingDefect can be added in the else clause of that try/except, and we'll always be appending to the defect list.

@bitdancer, I've tried to address the code's fragility and opaqueness with some restructuring and improved comments.

Adding missing padding in the first attempt, which uses validate=True, is required to pass the validation in the case where all of the characters are valid but just padding is missing. This is different than the following attempts, where it is assumed that there are invalid characters, and decoding is attempted first with and then without adding padding only in order to be able to tell whether padding was missing.

bedevere-bot · 2018-06-10T19:01:14Z

When you're done making the requested changes, leave the comment: I have made the requested changes; please review again.

taleinat · 2018-06-11T06:10:53Z

I have made the requested changes; please review again.

bedevere-bot · 2018-06-11T06:10:55Z

Thanks for making the requested changes!

@bitdancer: please review the changes made to this pull request.

bitdancer

Yes, that looks good. Consistency in the handling of the defects was what I wanted, and your improvement of the comments about exactly what we are doing and why is great.

ned-deily · 2018-06-11T20:14:34Z

Removing "backport to 3.6" until https://bugs.python.org/issue27397#msg319338 is resolved.

ned-deily

Shouldn't Doc/library/emails.errors.rst be updated, too?

bedevere-bot · 2018-06-11T20:20:45Z

When you're done making the requested changes, leave the comment: I have made the requested changes; please review again.

taleinat · 2018-06-12T05:56:06Z

I have made the requested changes; please review again.

bedevere-bot · 2018-06-12T05:56:08Z

Thanks for making the requested changes!

@bitdancer, @ned-deily: please review the changes made to this pull request.

bedevere-bot · 2018-06-12T12:46:24Z

@taleinat: Please replace # with GH- in the commit message next time. Thanks!

miss-islington · 2018-06-12T12:46:25Z

Thanks @taleinat for the PR 🌮🎉.. I'm working now to backport this PR to: 3.6, 3.7.
🐍🍒⛏🤖

bedevere-bot · 2018-06-12T12:47:40Z

GH-7664 is a backport of this pull request to the 3.7 branch.

…rings (pythonGH-7583) When attempting to base64-decode a payload of invalid length (1 mod 4), properly recognize and handle it. The given data will be returned as-is, i.e. not decoded, along with a new defect, InvalidBase64LengthDefect. (cherry picked from commit c3f55be) Co-authored-by: Tal Einat <[email protected]>

bedevere-bot · 2018-06-12T12:48:34Z

GH-7665 is a backport of this pull request to the 3.6 branch.

…rings (pythonGH-7583) When attempting to base64-decode a payload of invalid length (1 mod 4), properly recognize and handle it. The given data will be returned as-is, i.e. not decoded, along with a new defect, InvalidBase64LengthDefect. (cherry picked from commit c3f55be) Co-authored-by: Tal Einat <[email protected]>

…rings (GH-7583) (GH-7664) When attempting to base64-decode a payload of invalid length (1 mod 4), properly recognize and handle it. The given data will be returned as-is, i.e. not decoded, along with a new defect, InvalidBase64LengthDefect. (cherry picked from commit c3f55be) Co-authored-by: Tal Einat <[email protected]>

…rings (GH-7583) (GH-7665) When attempting to base64-decode a payload of invalid length (1 mod 4), properly recognize and handle it. The given data will be returned as-is, i.e. not decoded, along with a new defect, InvalidBase64LengthDefect. (cherry picked from commit c3f55be) Co-authored-by: Tal Einat <[email protected]>

taleinat added 3 commits June 10, 2018 09:17

bpo-27397: email module properly handle invalid-length base64 strings

a70626c

This defines a new InvalidBase64LengthDefect defect, which is returned with the encoded string when attempting to base64-decode a string of invalid length (1 mod 4).

bpo-27397: add NEWS entry

db375e2

bpo-27397: test all possible numbers of missing padding characters

6bf1b73

taleinat added needs backport to 3.6 labels Jun 10, 2018

taleinat requested a review from a team as a code owner June 10, 2018 07:00

the-knights-who-say-ni added the CLA signed label Jun 10, 2018

bedevere-bot added the awaiting merge label Jun 10, 2018

bitdancer requested changes Jun 10, 2018

View reviewed changes

bedevere-bot added awaiting changes and removed awaiting merge labels Jun 10, 2018

taleinat added 2 commits June 11, 2018 09:00

bpo-27397: refactor decode_b() code for robustness and clarity

2dd86e5

bpo-27397: additional comment improvements

754bdd4

bedevere-bot added awaiting change review and removed awaiting changes labels Jun 11, 2018

bitdancer approved these changes Jun 11, 2018

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting change review labels Jun 11, 2018

ned-deily removed the needs backport to 3.6 label Jun 11, 2018

ned-deily requested changes Jun 11, 2018

View reviewed changes

bedevere-bot added awaiting changes and removed awaiting merge labels Jun 11, 2018

bpo-27397: add mention of the new defect in the appropraite docs

8a5bbce

bedevere-bot removed the awaiting changes label Jun 12, 2018

bedevere-bot added the awaiting change review label Jun 12, 2018

taleinat added the needs backport to 3.6 label Jun 12, 2018

ned-deily approved these changes Jun 12, 2018

View reviewed changes

bedevere-bot added awaiting merge and removed awaiting change review labels Jun 12, 2018

taleinat merged commit c3f55be into python:master Jun 12, 2018

bedevere-bot removed the awaiting merge label Jun 12, 2018

bedevere-bot removed the needs backport to 3.7 label Jun 12, 2018

bedevere-bot removed the needs backport to 3.6 label Jun 12, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bpo-27397: Make email module properly handle invalid-length base64 strings #7583

bpo-27397: Make email module properly handle invalid-length base64 strings #7583

taleinat commented Jun 10, 2018 •

edited by bedevere-bot

Loading

bitdancer Jun 10, 2018

taleinat Jun 11, 2018 •

edited

Loading

bedevere-bot commented Jun 10, 2018

taleinat commented Jun 11, 2018

bedevere-bot commented Jun 11, 2018

bitdancer left a comment

ned-deily commented Jun 11, 2018

ned-deily left a comment

bedevere-bot commented Jun 11, 2018

taleinat commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

miss-islington commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

bpo-27397: Make email module properly handle invalid-length base64 strings #7583

bpo-27397: Make email module properly handle invalid-length base64 strings #7583

Conversation

taleinat commented Jun 10, 2018 • edited by bedevere-bot Loading

bitdancer Jun 10, 2018

Choose a reason for hiding this comment

taleinat Jun 11, 2018 • edited Loading

Choose a reason for hiding this comment

bedevere-bot commented Jun 10, 2018

taleinat commented Jun 11, 2018

bedevere-bot commented Jun 11, 2018

bitdancer left a comment

Choose a reason for hiding this comment

ned-deily commented Jun 11, 2018

ned-deily left a comment

Choose a reason for hiding this comment

bedevere-bot commented Jun 11, 2018

taleinat commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

miss-islington commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

bedevere-bot commented Jun 12, 2018

taleinat commented Jun 10, 2018 •

edited by bedevere-bot

Loading

taleinat Jun 11, 2018 •

edited

Loading