Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HIVE-25583: Support parallel load for HastTables - Interfaces #2999

Merged
merged 1 commit into from
Feb 10, 2022

Conversation

pgaref
Copy link
Contributor

@pgaref pgaref commented Feb 4, 2022

What changes were proposed in this pull request?

Support parallel load for HastTables - Interfaces

  • Introducing VectorMapJoinFastHashTableContainerBase class that implements VectorMapJoinHashTable
  • Each VectorMapJoinFastStringHashMapContainer is a singleton that contains an array of HashTables (1 or more)
  • VectorMapJoinFastTableContainer now initializes VectorMapJoinFastHashTableContainers instead of HTs directly

Why are the changes needed?

Changes needed to introduced parallel HT loading used by HashTableLoader for Tez to constructs the hashtable from records read from a broadcast edge

Does this PR introduce any user-facing change?

No

How was this patch tested?

Existing tests for correctness while HIVE-25146 aims to test performance

@pgaref
Copy link
Contributor Author

pgaref commented Feb 4, 2022

@ramesh0201 can you please take a look on this one?
Intentionally left the HashTable loader untouched to add it as part of HIVE-25149 for FAST implementations

@pgaref
Copy link
Contributor Author

pgaref commented Feb 5, 2022

Hey @abstractdog do you mind taking a look on this one?

@ramesh0201
Copy link
Contributor

@pgaref Looks good to me. +1

* Introducing VectorMapJoinFastHashTableContainerBase class that implements VectorMapJoinHashTable
* Each VectorMapJoinFastStringHashMapContainer is a singleton that contains an array of HashTables (1 or more)
* VectorMapJoinFastTableContainer now initializes VectorMapJoinFastHashTableContainers instead of HTs directly

Change-Id: I14f71dab3859b75d822914966151548793998c39
@pgaref pgaref merged commit a96c697 into apache:master Feb 10, 2022
DongWei-4 pushed a commit to DongWei-4/hive that referenced this pull request Oct 28, 2022
…#2999) (Panagiotis Garefalakis reviewed by Ramesh Kumar)

* Introducing VectorMapJoinFastHashTableContainerBase class that implements VectorMapJoinHashTable
* Each VectorMapJoinFastStringHashMapContainer is a singleton that contains an array of HashTables (1 or more)
* VectorMapJoinFastTableContainer now initializes VectorMapJoinFastHashTableContainers instead of HTs directly

Change-Id: I14f71dab3859b75d822914966151548793998c39
dengzhhu653 pushed a commit to dengzhhu653/hive that referenced this pull request Dec 15, 2022
…#2999) (Panagiotis Garefalakis reviewed by Ramesh Kumar)

* Introducing VectorMapJoinFastHashTableContainerBase class that implements VectorMapJoinHashTable
* Each VectorMapJoinFastStringHashMapContainer is a singleton that contains an array of HashTables (1 or more)
* VectorMapJoinFastTableContainer now initializes VectorMapJoinFastHashTableContainers instead of HTs directly

Change-Id: I14f71dab3859b75d822914966151548793998c39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants