drill

Clone Tools
  • last updated 21 mins ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
DRILL-2309: Fix reduce aggregate rule to create new aggregate calls based on input row type

    • -0
    • +3
    /exec/java-exec/src/test/resources/jsoninput/nullable3.json
DRILL-2488: Return DEFAULT as supported encoding for MergeJoin since it does not currently handle SV2 or SV4.

DRILL-2389: Remove timestamp with time zone

  1. … 21 more files in changeset.
DRILL-2491: Fix use of injectable QueryDateTimeInfo in localtimestamp function

DRILL-2342: Store the nullability property of column in view persistence store.

DRILL-2731: Repeat realloc until enough space

Repeat realloc for variable length vectors as well

Merge branch 'gh-pages-master' into gh-pages

DRILL-2728: Merge spill files when number gets too large

DRILL-2758: Fix writer cleanup

    • -17
    • +17
    /_docs/data-sources/001-hive-types.md
    • -20
    • +50
    /_docs/sql-ref/001-data-types.md
    • -113
    • +93
    /_docs/sql-ref/data-types/001-date.md
DRILL-2442: Initial implementation of C++ client support for impersonation.

DRILL-2180: Star column is enabled to work along with complex expression

    • -0
    • +26
    /exec/java-exec/src/test/resources/store/text/sample.json
DRILL-2466: Fix "<a>.VARCHAR -> <b>.NVARCHAR" to "-> <b>.VARCHAR" (Types.h).

- Fixed mapping from TypeProtos.MinorType.VARCHAR to java.sql.Types.NVARCHAR

to be to java.sql.Types.VARCHAR.

- Also renamed getSqlType to getJdbcType, getSqlTypeName to getSqlTypeName.

DRILL-2453: Handle the case where incoming has no schema in PartitionSender.

DRILL-2311: In ProjectRecordBatch, even if a column from incoming recordbatch does not need to be classified, the output name for this column is still ensured to be unique

DRILL-2730: Use different paths for ExternalSort spills

DRILL-2467: Fix "datatype" to "datetype" for test Hive DATE column.

DRILL-1957: Support nested loop join planning in order to enable NOT-IN, Inequality, Cartesian, uncorrelated EXISTS planning.

Add support for nested loop join planning where right input is scalar and is broadcast.

Add check for scalar subquery for NLJ. Add support for creating a Filter-NLJ plan.

Rebase on the branch with Jinfeng's Calcite rebasing work.

Conflicts:

exec/java-exec/src/main/java/org/apache/drill/exec/physical/impl/join/JoinUtils.java

Add unit tests for NLJoin.

Added test for inequality join.

Tests with BroadcastExchange, with HJ/MJ disabled.

Fix filter push down for NL joins by modifying row count computation for joins with always true conditions. Rebase on master. Refactor unit tests.

Improved checking of preconditions for NL join.

Handle the case where scalar aggregate is a child of Filter.

DRILL-1957: Support nested loop join planning in order to enable NOT-IN, Inequality, EXISTS planning.

Better checks for cartesian and inequality joins. Rebase on latest master.

Refactor costing for logical join. Add tests. Enable more TPC-H tests.

Remove the check for cartesian join from DrillJoinRel constructor.

Clear left and right keys before calling splitJoinCondition.

Address review comments: Remove redundant call to getJoinCategory. Added comment in DrillRuleSet.

  1. … 5 more files in changeset.
DRILL-2106: Fix SplitUpComplexExpression rule to correctly detect last used column reference in the project expression

    • -0
    • +12
    /exec/java-exec/src/test/resources/flatten/drill-2106-result.json
DRILL-3200: Add Window functions: ROW_NUMBER, RANK, PERCENT_RANK, DENSE_RANK and CUME_DIST

- enum WindowFrameRecordBatch.WindowFunction to handle supported window function and their corresponding output MajorType

- renamed WindowFrameTemplate -> DefaultFrameTemplate, cleaned the template to handle the default frame efficiently:

. a batch can be processed as soon as we find the last peer row of it's last row

. once a batch is processed it can be safely released => we can transfer it's value vectors to the container instead of copying them

- DefaultFrameTemplate.Partition tracks the current window frame and computes the following window functions automatically: row_number, rank, dense_rank, percent_rank, cume_dist. It doesn't need to aggregate the value vectors to compute these window functions

- updated TestWindowFrame to check the results of row_number, rank, dense_rank, percent_rank and cume_dist in various cases

. added a debug config option to MSorter to control the size of batches. This is needed by TestWindowFrame so it can use small test data files (20 rows per batch)

. removed contrib/data/window-test-data

- WindowFrameRecordBatch properly releases saved batches if the query stops prematurely

- GenerateTestData can be used to generate test data for the window function unit tests [it's a work in progress and can be either improved to make it developer friendly or removed from the final patch]

- using newly created WindowDataBatch in place of RecordDataBatch, to expose FragmentContext and VectorAccessible (fixes DRILL-3218)

- window.enable is true by default

    • -64
    • +0
    /contrib/data/window-test-data/pom.xml
  1. … 32 more files in changeset.
DRILL-2448: Enable standard implicit cast between Varchar and Varbinary rather than outdated special case in softEquals.

This is necessary to allow the interpreted expression system to evaluate these functions in the same manner as the code-generation based expression evaluation system does today.

DRILL-2446: Improvement in finding Drill log dir

DRILL-2446: Improvement in finding Drill log dir

DRILL-2441: For outer-join, if there is any inequality condition, Cartesian-Join exception will be thrown

DRILL-2128.1: Preparatory changes: Labeled result cols.; formatted SQL. [MetaImpl]

DRILL-2128.2: Fixed DatabaseMetaData.getColumns's DATA_TYPE, TYPE_NAME.

- Created basic test for DATA_TYPE and TYPE_NAME.

- Fixed DATA_TYPE: Added mapping from type name/descriptor strings from

INFORMATION_SCHEMA.COLUMNS.DATA_TYPE to java.sql.Types.* integer type codes

for DatabaseMetaData.getColumns's DATA_TYPE.

- Fixed TYPE_NAME: Added TYPE_NAME returning type name/descriptor strings from

INFORMATION_SCHEMA.COLUMNS.DATA_TYPE

- Added FIXMEs for some missing/misnamed/wrong fields. (See DRILL-2420.)

DRILL-2397, new data types doc, misc other fixes

    • -16
    • +23
    /_docs/connect/002-plugin-conf.md
    • -20
    • +11
    /_docs/data-sources/001-hive-types.md
    • -6
    • +6
    /_docs/data-sources/003-parquet-ref.md
    • binary
    /_docs/img/connect-plugin.png
    • -60
    • +103
    /_docs/sql-ref/001-data-types.md
    • -12
    • +16
    /_docs/sql-ref/002-lexical-structure.md
    • -0
    • +77
    /_docs/sql-ref/data-types/002-disparate-data-types.md
DRILL-2225: Fix missing PartitionSenderRootExec stats.

DRILL-2413: FileSystemPlugin refactoring: avoid sharing DrillFileSystem across schemas

  1. … 19 more files in changeset.
DRILL-2414: Give proper error message if Union-All is applied on schema-less tables