DRILL-7696: EVF v2 scan schema resolution

Provides the mechanism to resolve the scan schema from a
projection list, provided schema, early reader schema and
actual reader schema.

    • -0
    • +933
  1. … 74 more files in changeset.
DRILL-7665: Add UNION to schema parser

  1. … 3 more files in changeset.
DRILL-7630: Add additional types into SchemaParser for Parquet

closes #2019

  1. … 3 more files in changeset.
DRILL-7361: Support MAP (DICT) type in schema file

closes #1967

  1. … 7 more files in changeset.
DRILL-7393: Revisit Drill tests to ensure that patching is executed before any test run

- Added BaseTest with patchers and extended all tests from it.

- Added a test to java-exec module to ensure that all tests there are inherited from BaseTest.

- Revised exception handling in the patchers; it is now individual for each patching method.

closes #1910

    • -2
    • +2
  1. … 135 more files in changeset.
DRILL-7441: Fix issues with fillEmpties, offset vectors

Fixes subtle issues with offset vectors and "fill empties" logic.

Drill has an informal standard that if a batch has no rows, then
offset vectors within that batch should have zero size. Contrast
this with batches of size 1 that should have offset vectors of
size 2. Changed to enforce this rule throughout.
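
The sizing rule above can be illustrated with a minimal sketch. It is not
Drill's code: OffsetVectorSketch, buildOffsets and hasValidOffsetLength are
hypothetical names, and a plain int[] stands in for a real offset vector.
It assumes the usual layout in which a batch of n > 0 variable-width values
carries n + 1 offsets, which matches the "size 1 gives 2 offsets" case in
the message.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

// Hypothetical illustration only; not part of the Drill code base.
final class OffsetVectorSketch {

    /** Builds the offsets for a batch of variable-width (string) values. */
    static int[] buildOffsets(List<String> values) {
        if (values.isEmpty()) {
            // Zero rows: the offset vector has zero size (no leading 0 entry).
            return new int[0];
        }
        // n rows: n + 1 offsets; offsets[0] is 0 and offsets[i + 1] is the
        // end position of value i in the data buffer.
        int[] offsets = new int[values.size() + 1];
        for (int i = 0; i < values.size(); i++) {
            int length = values.get(i).getBytes(StandardCharsets.UTF_8).length;
            offsets[i + 1] = offsets[i] + length;
        }
        return offsets;
    }

    /** The check a batch validator could apply under the rule above. */
    static boolean hasValidOffsetLength(int rowCount, int offsetLength) {
        int expected = (rowCount == 0) ? 0 : rowCount + 1;
        return offsetLength == expected;
    }

    public static void main(String[] args) {
        System.out.println(buildOffsets(Collections.<String>emptyList()).length); // 0
        System.out.println(Arrays.toString(buildOffsets(Arrays.asList("a"))));    // [0, 1]
    }
}
```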

Nullable, repeated and variable-width vectors have "fill empties"
logic that is used in two places: when setting the value count and
when preparing to write a new value. The current logic is not
quite right for either case. Added tests and fixed the code to
properly handle each case.
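
As a rough illustration of the two "fill empties" call sites, the sketch
below back-fills offsets for skipped rows both before a write and when the
value count is set. It is an assumption-laden toy, not the vector code
changed by this commit: FillEmptiesSketch and its methods are hypothetical,
rows are assumed to be written in increasing order, and only value lengths
are tracked rather than real data.

```java
import java.util.Arrays;

// Hypothetical illustration only; not part of the Drill code base.
final class FillEmptiesSketch {
    private int[] offsets = new int[0];
    private int lastSet = -1;    // index of the last row that has an end offset
    private int dataLength = 0;  // total bytes "written" so far

    private void ensureOffsets(int size) {
        if (offsets.length < size) {
            offsets = Arrays.copyOf(offsets, size); // new entries start at 0
        }
    }

    /** Gives every unwritten row before upTo an empty (zero-length) value. */
    private void fillEmpties(int upTo) {
        for (int row = lastSet + 1; row < upTo; row++) {
            offsets[row + 1] = dataLength; // start == end, so the value is empty
        }
        lastSet = Math.max(lastSet, upTo - 1);
    }

    /** Case 1: preparing to write a new value of the given length at row. */
    void set(int row, int valueLength) {
        ensureOffsets(row + 2);
        fillEmpties(row);
        dataLength += valueLength;
        offsets[row + 1] = dataLength;
        lastSet = row;
    }

    /** Case 2: setting the value count; returns the final offsets. */
    int[] setValueCount(int count) {
        if (count == 0) {
            return new int[0]; // zero-row batch: zero-size offset vector
        }
        ensureOffsets(count + 1);
        fillEmpties(count);
        return Arrays.copyOf(offsets, count + 1);
    }

    public static void main(String[] args) {
        FillEmptiesSketch writer = new FillEmptiesSketch();
        writer.set(0, 3); // row 0 has 3 bytes
        writer.set(3, 2); // rows 1 and 2 are skipped, row 3 has 2 bytes
        // Prints [0, 3, 3, 3, 5, 5, 5]
        System.out.println(Arrays.toString(writer.setValueCount(6)));
    }
}
```

In this sketch, rows 1, 2, 4 and 5 are never written, yet each ends up with
a valid zero-length entry, which is the outcome "fill empties" is meant to
guarantee.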

Revised the batch validator to enforce the rule that offset vectors
have length 0 for 0-sized batches. The result was much simpler code.

Added tools to easily print a batch, restoring some code that
was recently lost when the RowSet classes were moved.

Code cleanup in all files touched.

Added logic to "dirty" allocated buffers when testing to ensure
logic is not sensitive to the "pristine" state of new buffers.
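
The idea of "dirtying" a buffer can be sketched as follows. This is only an
illustration under stated assumptions: java.nio.ByteBuffer stands in for
Drill's own buffer type, and DirtyBufferSketch, allocateDirty and the fill
byte are hypothetical choices, not what the commit uses.

```java
import java.nio.ByteBuffer;

// Hypothetical illustration only; not part of the Drill code base.
final class DirtyBufferSketch {
    // Arbitrary non-zero pattern so stale reads are easy to spot.
    static final byte DIRTY = (byte) 0xDE;

    /** Allocates a buffer and overwrites its zeroed contents with a pattern. */
    static ByteBuffer allocateDirty(int capacity) {
        ByteBuffer buffer = ByteBuffer.allocate(capacity);
        for (int i = 0; i < capacity; i++) {
            buffer.put(i, DIRTY); // absolute put: position stays at 0
        }
        return buffer;
    }

    public static void main(String[] args) {
        ByteBuffer buffer = allocateDirty(8);
        // Code under test must write every byte it later reads; anything that
        // relied on the buffer starting out as zeros now sees 0xDE instead.
        System.out.println(buffer.get(0) == DIRTY); // true
    }
}
```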

Added logic to the column writers to enforce the zero-size-batch rule
for offset vectors. Added unit tests for this case.

Fixed the column writers to set the "lastSet" mutator value for
nullable types since other code relies on this value.

Removed the "setCount" field in nullable vectors: turns out
it is not actually used.

closes #1896

    • -24
    • +28
    • -0
    • +623
  1. … 42 more files in changeset.
DRILL-7359: Add support for DICT type in RowSet Framework

closes #1870

  1. … 82 more files in changeset.
DRILL-7341: Vector reAlloc may fail after exchange

closes #1838

    • -0
    • +24
  1. … 3 more files in changeset.
DRILL-7332: Allow parsing empty schema

closes #1828

  1. … 2 more files in changeset.
DRILL-7310: Move schema-related classes from exec module to be able to use them in metastore module

closes #1816

    • -0
    • +209
    • -0
    • +155
    • -0
    • +355
  1. … 100 more files in changeset.
DRILL-6320: Fixed license headers.

closes #1207

    • -1
    • +1
  1. … 2066 more files in changeset.
DRILL-6234: Improved documentation for VariableWidthVector mutators, and added simple unit tests demonstrating mutator behavior.

close apache/drill#1164

    • -0
    • +141
  1. … 3 more files in changeset.