Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
DRILL-7724: Refactor metadata controller batch

Also changed for (;;) infinite loops to

while (true) as preferred by IntelliJ.

  1. … 25 more files in changeset.
DRILL-7702: Update ZooKeeper and Curator, exclude org.codehaus.jackson

  1. … 12 more files in changeset.
DRILL-7702: Update httpclient, libthrift, httpdlog-parser, jetty, derby, exclude and ban log4j, jasper-runtime, commons-httpclient

closes #2057

  1. … 5 more files in changeset.
DRILL-6604: Upgrade Drill Hive client to Hive3.1 version

closes #2038

    • -0
    • +25
    ./src/main/codegen/configHive3.fmpp
    • -0
    • +38
    ./src/main/codegen/data/Hive2DateTypes.tdd
    • -0
    • +38
    ./src/main/codegen/data/Hive3DateTypes.tdd
    • -14
    • +0
    ./src/main/codegen/data/HiveTypes.tdd
    • -11
    • +26
    ./src/main/codegen/templates/ObjectInspectors.java
    • -0
    • +33
    ./src/main/java/org/apache/logging/log4j/util/Strings.java
  1. … 5 more files in changeset.
DRILL-7330: Implement metadata usage for all format plugins

  1. … 58 more files in changeset.
DRILL-7429: Wrong column order when selecting complex data using Hive storage plugin

  1. … 4 more files in changeset.
DRILL-7620: Fix plugin mutability issues

A recent commit made the plugin registry more strict about

the rule that, once a plugin is registered, it must be

immutable. A flaw enforcing that rule in the UI put the

registry in an inconsistent state.

Also

* Registry-specific errors

* Push more operations from UI layer into registry

* Clean up semantics of "resolve" for plugins

* Add more unit tests

* Better handling of "bad" plugins

* Force plugin names to lower case

* Fix comparison bugs in some format plugins

  1. … 93 more files in changeset.
DRILL-7592: Add missing licenses and update plugins exclusion list and fix licenses

closes #1989

    • -3
    • +21
    ./src/main/resources/drill-module.conf
    • -3
    • +21
    ./src/test/resources/drill-module.conf
  1. … 82 more files in changeset.
DRILL-7590: Refactor plugin registry

Major cleanup of the plugin registry to split it into components

in preparation for a proper plugin API.

Better coordinates the named and ephemeral plugin caches.

Cleans up the registry API. Sharpens rules for modifying

plugin configs.

closes #1988

  1. … 161 more files in changeset.
DRILL-7634: Rollup of code cleanup changes

Collection of code cleanup changes. The most significant

is to create constants for function names.

closes #2020

  1. … 128 more files in changeset.
DRILL-7530: Fix class names in loggers

1. Fix incorrect class names for loggers.

2. Minor code cleanup.

closes #1957

  1. … 52 more files in changeset.
DRILL-7502: Invalid codegen for typeof() with UNION

Also fixes DRILL-6362: typeof() reports NULL for primitive

columns with a NULL value.

typeof() is meant to return "NULL" if a UNION has a NULL

value, but the column type when known, such as for non-UNION

columns.

Also fixes DRILL-7499: sqltypeof() function with an array returns

"ARRAY", not type. This was due to treating REPEATED like LIST.

Handling of the Union vector in code gen is problematic

with about three special cases. Existing code handled two

of the cases. This change handles the third case.

Figuring out the change required poking around quite a bit

of unclear code. Added comments and restructuring to make

that code a bit more clear.

The fix modified code gen for the Union Holder. It can now

"go back in time" to add the union reader at the point we

need it.

closes #1945

  1. … 53 more files in changeset.
[maven-release-plugin] prepare for next development iteration

  1. … 36 more files in changeset.
[maven-release-plugin] prepare release drill-1.17.0

  1. … 36 more files in changeset.
DRILL-7483: Add support for 12 and 13 java versions

closes #1935

    • -0
    • +32
    ./src/test/java/org/apache/drill/exec/hive/HiveClusterTest.java
  1. … 13 more files in changeset.
DRILL-7463: Apache license is not added to the generated classes

closes #1916

    • -0
    • +17
    ./src/main/codegen/includes/license.ftl
  1. … 2 more files in changeset.
DRILL-7393: Revisit Drill tests to ensure that patching is executed before any test run

- Added BaseTest with patchers and extended all tests from it.

- Added a test to java-exec module to ensure that all tests there are inherited from BaseTest.

- Revised exception handling in the patchers, now it's individual for each patching method.

closes #1910

  1. … 136 more files in changeset.
DRILL-7441: Fix issues with fillEmpties, offset vectors

Fixes subtle issues with offset vectors and "fill empties"

logic.

Drill has an informal standard that if a batch has no rows, then

offset vectors within that batch should have zero size. Contrast

this with batches of size 1 that should have offset vectors of

size 2. Changed to enforce this rule throughout.

Nullable, repeated and variable-width vectors have "fill empties"

logic that is used in two places: when setting the value count and

when preparing to write a new value. The current logic is not

quite right for either case. Added tests and fixed the code to

properly handle each case.

Revised the batch validator to enforce the offset-vector length of 0 for

0-sized batches rule. The result was much simpler code.

Added tools to easily print a batch, restoring some code that

was recently lost when the RowSet classes were moved.

Code cleanup in all files touched.

Added logic to "dirty" allocated buffers when testing to ensure

logic is not sensitive to the "pristine" state of new buffers.

Added logic to the column writers to enforce the zero-size-batch rule

for offset vectors. Added unit tests for this case.

Fixed the column writers to set the "lastSet" mutator value for

nullable types since other code relies on this value.

Removed the "setCount" field in nullable vectors: turns out

it is not actually used.

closes #1896

  1. … 43 more files in changeset.
DRILL-7440: Failure during loading of RepeatedCount functions

closes #1894

  1. … 3 more files in changeset.
DRILL-7418: MetadataDirectGroupScan improvements

1. Replaced files listing with selection root information to reduce query plan size in MetadataDirectGroupScan.

2. Fixed MetadataDirectGroupScan ser / de issues.

3. Added PlanMatcher to QueryBuilder for more convenient plan matching.

4. Re-written TestConvertCountToDirectScan to use ClusterTest.

5. Refactoring and code clean up.

  1. … 12 more files in changeset.
DRILL-7406: Update Calcite to 1.21.0

1. DRILL-7386 - added tests to TestHiveStructs.

2. DRILL-4527 - the DrillAvgVarianceConvertlet can't be removed without test failures.

3. DRILL-6215 - switched to prepared statement in JdbcRecordReader.

4. DRILL-6905 - added test into TestExampleQueries.

5. DRILL-7415 - Fixed jdbc show tables when 2 tables with same name are present in different schemas.

6. DRILL-7340 - Fixed jdbc filter pushdown when few jdbc datasources enabled.

7. Split SqlConverter into multiple source files.

8. Minor refactorings for jdbc and other places.

closes #1940

  1. … 53 more files in changeset.
DRILL-7359: Add support for DICT type in RowSet Framework

closes #1870

  1. … 82 more files in changeset.
DRILL-7254: Read Hive union w/o nulls

    • -0
    • +3
    ./src/test/resources/complex_types/array/union_array.txt
    • binary
    ./src/test/resources/complex_types/map/map_union_tbl.avro
  1. … 11 more files in changeset.
DRILL-7387: Failed to get value by int key from map nested into struct

  1. … 1 more file in changeset.
DRILL-7380: Query of a field inside of an array of structs returns null

1. Fixed parquet reader projection for Logical lists (DrillParquetReader.java)

2. Fixed projection pushdown for RexFieldAccess (ProjectFieldsVisitor.java)

3. DrillParquetReader.getProjection(...) splitted into few methods

4. Added javadocs for PathSegment and SchemaPath

  1. … 4 more files in changeset.
DRILL-7357: Expose Drill Metastore data through information_schema

1. Add additional columns to TABLES and COLUMNS tables.

2. Add PARTITIONS table.

3. General refactoring to adjust information_schema data retrieval from multiple sources.

closes #1860

  1. … 33 more files in changeset.
DRILL-7376: Drill ignores Hive schema for MaprDB tables when group scan has star column

  1. … 3 more files in changeset.
DRILL-7252: Read Hive map using Dict<K,V> vector

    • -0
    • +3
    ./src/test/resources/complex_types/array/map_array.json
    • -0
    • +3
    ./src/test/resources/complex_types/map/map_complex_tbl.json
  1. … 5 more files in changeset.
DRILL-4517: Support reading empty Parquet files

1. Modified flat and complex parquet readers to output schema only when requested number of records to read is 0. In this case readers are not initialized to improve performance.

2. Allowed reading requested number of rows instead of all rows in the row group (DRILL-6528).

3. Fixed issue with nulls number determination in the row group (fixed IsPredicate#isAllNulls method).

4. Allowed reading empty parquet files via adding empty / fake row group.

5. General refactoring and unit tests.

6. Parquet tests categorization.

closes #1839

    • binary
    ./src/test/resources/empty.parquet
  1. … 44 more files in changeset.
DRILL-7337: Add vararg UDFs support

  1. … 35 more files in changeset.