[SPARK-48817][SQL] Eagerly execute union multi commands together #47224

wForget · 2024-07-05T07:09:50Z

What changes were proposed in this pull request?

Eagerly execute union multi commands together.

Why are the changes needed?

MultiInsert is split to multiple sql executions, resulting in no exchange reuse.

Reproduce sql:

create table wangzhen_t1(c1 int);
create table wangzhen_t2(c1 int);
create table wangzhen_t3(c1 int);
insert into wangzhen_t1 values (1), (2), (3);

from (select /*+ REPARTITION(3) */ c1 from wangzhen_t1)
insert overwrite table wangzhen_t2 select c1
insert overwrite table wangzhen_t3 select c1;

In Spark 3.1, there is only one SQL execution and there is a reuse exchange.

However, in Spark 3.5, it was split to multiple executions and there was no ReuseExchange.

Does this PR introduce any user-facing change?

yes, multi inserts will executed in one execution.

How was this patch tested?

added unit test

Was this patch authored or co-authored using generative AI tooling?

No

wForget · 2024-07-05T07:16:42Z

It seems to be caused by #32513

wForget · 2024-07-05T08:36:11Z

@cloud-fan @beliefer Could you please take a look?

ulysses-you

lgtm except some minor comments

sql/core/src/main/scala/org/apache/spark/sql/execution/QueryExecution.scala

ulysses-you · 2024-07-10T01:11:47Z

thanks, merged to master

cloud-fan · 2024-07-10T12:26:39Z

late LGTM

### What changes were proposed in this pull request? Eagerly execute union multi commands together. ### Why are the changes needed? MultiInsert is split to multiple sql executions, resulting in no exchange reuse. Reproduce sql: ``` create table wangzhen_t1(c1 int); create table wangzhen_t2(c1 int); create table wangzhen_t3(c1 int); insert into wangzhen_t1 values (1), (2), (3); from (select /*+ REPARTITION(3) */ c1 from wangzhen_t1) insert overwrite table wangzhen_t2 select c1 insert overwrite table wangzhen_t3 select c1; ``` In Spark 3.1, there is only one SQL execution and there is a reuse exchange. ![image](https://github.com/apache/spark/assets/17894939/5ff68392-aaa8-4e6b-8cac-1687880796b9) However, in Spark 3.5, it was split to multiple executions and there was no ReuseExchange. ![image](https://github.com/apache/spark/assets/17894939/afdb14b6-5007-4923-802d-535149974ecf) ![image](https://github.com/apache/spark/assets/17894939/0d60e8db-9da7-4906-8d07-2b622b55e6ab) ### Does this PR introduce _any_ user-facing change? yes, multi inserts will executed in one execution. ### How was this patch tested? added unit test ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#47224 from wForget/SPARK-48817. Authored-by: wforget <[email protected]> Signed-off-by: youxiduo <[email protected]>

[SPARK-48817][SQL] Eagerly execute union multi commands together

49ad250

github-actions bot added the SQL label Jul 5, 2024

ulysses-you reviewed Jul 8, 2024

View reviewed changes

address comments

01f3463

ulysses-you approved these changes Jul 8, 2024

View reviewed changes

ulysses-you closed this in b5f3e1e Jul 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-48817][SQL] Eagerly execute union multi commands together #47224

[SPARK-48817][SQL] Eagerly execute union multi commands together #47224

wForget commented Jul 5, 2024 •

edited

Loading

wForget commented Jul 5, 2024

wForget commented Jul 5, 2024

ulysses-you left a comment

ulysses-you commented Jul 10, 2024

cloud-fan commented Jul 10, 2024

[SPARK-48817][SQL] Eagerly execute union multi commands together #47224

[SPARK-48817][SQL] Eagerly execute union multi commands together #47224

Conversation

wForget commented Jul 5, 2024 • edited Loading

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

wForget commented Jul 5, 2024

wForget commented Jul 5, 2024

ulysses-you left a comment

Choose a reason for hiding this comment

ulysses-you commented Jul 10, 2024

cloud-fan commented Jul 10, 2024

wForget commented Jul 5, 2024 •

edited

Loading