Clone Tools
  • last updated 19 mins ago
Constraints: committers
Constraints: files
Constraints: dates
[NO ISSUE][COMP][RT] Enable multiway similarity joins

- Enable the FuzzyJoinRule that transforms

a nested-loop-similarity-join plan to a three-stage-similarity join.

- Modify FuzzyJoinRuleCollections.

- Add the ExtractCommonExpressionRule to extract common expressions

in the star-like multiple similarity join substitutions.

- Add the InlineSubplanInputForNestedTupleSourceRule to translate

the generated subplan from the similarity function-derived

substitution into join in case of nested schemas.

- Use similarity-jaccard-prefix to enable the pp+ join strategy.

- Use the right side to build the heavy hash join on

the prefix tokens from both sides.

- Add RemoveAssign/Variables/AggRules to iteratively remove unused

assign/vars once FuzzyJoinRule is applied in each round.

- Add three new optimization cases for multiway similarity joins.

- link-like multiway similarity joins

- star-like multiway similarity joins

- hybrid multiway similarity joins with the both styles of similarity joins.

- Add a check whether a similarity function is on

a select over an existing similarity join.

- Change the inverted-index-based similarity join to the three-stage-similarity join

due to efficiency considerations.

Change-Id: I8736f104905eeda763d39709e002c2b9629278cc


Sonar-Qube: Jenkins <>

Tested-by: Jenkins <>

Contrib: Jenkins <>

Integration-Tests: Jenkins <>

Reviewed-by: Dmitry Lychagin <>

Reviewed-by: Taewoo Kim <>

    • -31
    • +147
  1. … 260 more files in changeset.
Changed the physical tag of ReplicatePOperator (SPLIT -> REPLICATE)

Change-Id: Ic298f90c5bc9875cea1017aff17a524214596b1e


Sonar-Qube: Jenkins <>

Tested-by: Jenkins <>

Integration-Tests: Jenkins <>

Reviewed-by: Till Westmann <>

    • -2
    • +2
  1. … 93 more files in changeset.