Clone Tools
  • last updated 13 mins ago
Constraints: committers
Constraints: files
Constraints: dates
DRILL-7337: Add vararg UDFs support

  1. … 37 more files in changeset.
DRILL-7155: Create a standard logging message for batch sizes generated by individual operators. This is needed for QA verification of the Batch Size feature DRILL-6238. closes #1716

  1. … 10 more files in changeset.
DRILL-7011: Support schema in scan framework

* Adds schema support to the row set-based scan framework and to the "V3" text reader based on that framework.

* Adding the schema made clear that passing options as a long list of constructor arguments was not sustainable. Refactored code to use a builder pattern instead.

* Added support for default values in the "null column loader", which required adding a "setValue" method to the column accessors.

* Added unit tests for all new or changed functionality. See TestCsvWithSchema for the overall test of the entire integrated mechanism.

* Added tests for explicit projection with schema

* Better handling of date/time in column accessors

* Converted recent column metadata work from Java 8 date/time to Joda.

* Added more CSV-with-schema unit tests

* Removed the ID fields from "resolved columns", used "instanceof" instead.

* Added wildcard projection with an output schema. Handles both "lenient" and "strict" schemas.

* Tagged projection columns with their output schema, when available.

* Scan projection added modes for wildcard with an output schema. The reader projection added support for merging reader and output schemas.

* Includes refactoring of scan operator tests (the test file grew too large.)

* Renamed some classes to avoid confusing reader schemas with output schemas.

* Added unit tests for the new functionality.

* Added "lenient" wildcard with schema test for CSV

* Added more type conversions: string-to-bit, many-to-string

* Fixed bug in column writer for VarDecimal

* Added missing unit tests, and fixed bugs, in Bit column reader/writer

* Cleaned up a number of unneded "SuppressWarnings"

closes #1711

  1. … 222 more files in changeset.
DRILL-6888: Move nested classes outside HashAggTemplate to allow for plain java compile option closes #1569

    • -0
    • +51
    • -0
    • +44
DRILL-6804: Simplify usage of OperatorPhase in HashAgg.

  1. … 7 more files in changeset.
DRILL-1328: Support table statistics

    • -0
    • +223
    • -0
    • +38
  1. … 49 more files in changeset.
DRILL-6766: Lateral Unnest query : IllegalStateException - rowId in right batch of lateral is smaller than rowId in left batch being processed Note: Issue was in StreamingAgg where if output from one or multiple input batch was splitting into multiple output batch, then remaining input records were discarded after producing first output batch closes #1490

  1. … 4 more files in changeset.
DRILL-6763: Codegen optimization of SQL functions with constant values(#1481)

closes #1481

  1. … 17 more files in changeset.
DRILL-6724: Dump operator context to logs when error occurs during query execution

closes #1455

  1. … 98 more files in changeset.
DRILL-6719: Separate spilling queue logic from HashJoin and HashAgg.

  1. … 6 more files in changeset.
DRILL-6422: Replace guava imports with shaded ones

  1. … 981 more files in changeset.
DRILL-6709: Extended the batch stats utility to other operators

closes #1444

  1. … 14 more files in changeset.
DRILL-6566: Reduce Hash Agg Batch size and estimate when low available memory (#1438)

DRILL-6566: Reduce Hash Agg Batch size and estimate when mem available is low

  1. … 9 more files in changeset.
DRILL-6656: Disallow extra semicolons and multiple statements on the same line.

closes #1415

  1. … 141 more files in changeset.
DRILL-6654: Data verification failure with lateral unnest query having filter in and order by

closes #1418

  1. … 1 more file in changeset.
DRILL-6622: Fixed a NullPointerException in a query with Union

closes #1391

  1. … 1 more file in changeset.
DRILL-6631: Streaming agg causes queries with Lateral and Unnest to return incorrect results.

This commit fixes issues with handling straight aggregates (no group by)

with empty batches received between EMIT(s).

closes #1399

  1. … 1 more file in changeset.
DRILL-6310: limit batch size for hash aggregate

closes #1324

  1. … 7 more files in changeset.
DRILL-6512: Remove unnecessary processing overhead from RecordBatchSizer

closes #1341

  1. … 11 more files in changeset.
DRILL-6516: EMIT support in streaming agg

This closes #1358

    • -141
    • +284
  1. … 2 more files in changeset.
DRILL-6479: Support EMIT for the Hash Aggr

closes #1311

  1. … 5 more files in changeset.
DRILL-6386: Remove unused imports and star imports.

  1. … 230 more files in changeset.
DRILL-6027: - Added fallback option for HashJoin. - No copy of incoming for single partition, and avoid HT resize. - Fix memory leak when cancelling while spill file is read - get correct schema when probe side is empty - Re-create the HashJoinProbe

  1. … 42 more files in changeset.
DRILL-6320: Fixed license headers.

closes #1207

  1. … 2062 more files in changeset.
DRILL-6375 : Support for ANY_VALUE aggregate function

closes #1256

  1. … 36 more files in changeset.
DRILL-6027: Initial implementation of HashJoin spill, without memory limits checks yet

  1. … 29 more files in changeset.
DRILL-6027: - Added memory claculator - Added unit tests and docs. - Fixed IOB caused by output vector allocation. - Don't double count records that were spilled in HashJoin

  1. … 55 more files in changeset.
DRILL-6138: Move RecordBatchSizer to org.apache.drill.exec.record package

This closes #1115

  1. … 10 more files in changeset.
DRILL-3993: Changes after review:

Moved checks for OOM into spillIfNeeded method to avoid excessive call of chooseAPartitionToFlush method.

Ignored unit tests until DRILL-6018 is fixed.

These failures appears because new Calcite tries to simplify expressions from the query when applying ReduceExpressionsRule and fails with NFE.

Similar problem, but with old version of Calcite was described in DRILL-6018, therefore these unit tests are marked as ignored until DRILL-6018 is fixed.

  1. … 2 more files in changeset.
DRILL-5730: Mock testing improvements and interface improvements

closes #1045

  1. … 216 more files in changeset.