in group then group #1735

DenisYaroshevskiy · 2024-01-28T17:15:42Z

in group then group.

The common group of shuffles is: shuffle in groups, then shuffle groups.
Example is avx2 shuffle 128 byte lanes and then intermix them somehow.

There is a question: what to do first: do you shuffle big groups first or in big groups first.
Techincally: big groups first is strictly more powerful.

However, can lead to better 0ing.

It's not an obvious thing at all and I suspect we will come back to this one again.

Now I implemented "in groups then groups" to solve avx2 slide (partially.

asimd

for asimd we used to have purely vextq_u16 based solution.
Now this will prefer shift_n when that's avaliable, becasue no constants (horray - the system works).

For neon it's not enabled, because there is a bug in level computation: emulation should always return 0 but sometimes it returns 1. Which messes with neon, double.

* in group + asimd * disable accidental sve

DenisYaroshevskiy requested a review from jfalcou January 28, 2024 17:16

DenisYaroshevskiy force-pushed the slide_2 branch 2 times, most recently from 7a7a503 to d5aa556 Compare January 29, 2024 00:36

in group + asimd

08e2605

DenisYaroshevskiy force-pushed the slide_2 branch from d5aa556 to 08e2605 Compare January 30, 2024 20:00

disable accidental sve

bdfa98e

jfalcou approved these changes Jan 31, 2024

View reviewed changes

DenisYaroshevskiy merged commit 51216c0 into main Jan 31, 2024
36 checks passed

DenisYaroshevskiy deleted the slide_2 branch January 31, 2024 18:38

jtlap pushed a commit that referenced this pull request May 12, 2024

in group then group (#1735)

872d4c5

* in group + asimd * disable accidental sve

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

in group then group #1735

in group then group #1735

DenisYaroshevskiy commented Jan 28, 2024 •

edited

Loading

in group then group #1735

in group then group #1735

Conversation

DenisYaroshevskiy commented Jan 28, 2024 • edited Loading

DenisYaroshevskiy commented Jan 28, 2024 •

edited

Loading