DRILL-7358: Fix COUNT(*) for empty text files

Fixes a subtle error when a text file has a header (and so has a schema), but is in a COUNT(*) query, so that no columns are projected. Ensures that, in this case, an empty schema is treated as a valid result set.

Tests: updated CSV tests to include this case.

closes #1867
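
The case described above can be pictured with a minimal sketch. It is not Drill's reader code: the class `CountStarSketch`, its nested `Batch` type, and both methods are hypothetical names chosen for illustration. It assumes a headered text file scanned for COUNT(*), where the header line is consumed, no columns are projected, and the resulting empty schema is still accepted as a valid result.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.List;

/** Hypothetical illustration only; these are not Drill's reader or batch classes. */
public final class CountStarSketch {

    /** A result batch: the projected column names plus a row count. */
    public static final class Batch {
        public final List<String> columns;
        public final int rowCount;

        public Batch(List<String> columns, int rowCount) {
            this.columns = columns;
            this.rowCount = rowCount;
        }
    }

    /**
     * Scans the lines of a headered text file for COUNT(*): the first line is
     * treated as the header, no columns are projected, and only the data
     * lines are counted.
     */
    public static Batch scanForCount(List<String> lines) {
        int dataRows = Math.max(0, lines.size() - 1); // first line is the header
        return new Batch(Collections.emptyList(), dataRows);
    }

    /** An empty schema is accepted; only a missing batch is rejected. */
    public static boolean isValidResult(Batch batch) {
        return batch != null && batch.rowCount >= 0;
    }

    public static void main(String[] args) {
        // A file that contains a header line and no data rows.
        Batch headerOnly = scanForCount(Arrays.asList("a,b,c"));
        System.out.println(isValidResult(headerOnly) + " rows=" + headerOnly.rowCount);
    }
}
```

In this sketch validity depends only on whether a batch exists, not on whether it carries columns, which is the property the commit message describes.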

  1. … 2 more files in changeset.
DRILL-5983: Add missing nullable Parquet readers for INT and UINT logical types

closes #1866

DRILL-7170: Ignore uninitialized vector containers for OOM error messages

    • -2
    • +5
    ./exec/physical/impl/spill/SpillSet.java
DRILL-7254: Read Hive union w/o nulls

    • -30
    • +54
    ./exec/vector/complex/MapUtility.java
  1. … 19 more files in changeset.
DRILL-7387: Failed to get value by int key from map nested into struct

  1. … 2 more files in changeset.
DRILL-7380: Query of a field inside of an array of structs returns null

1. Fixed parquet reader projection for Logical lists (DrillParquetReader.java)

2. Fixed projection pushdown for RexFieldAccess (ProjectFieldsVisitor.java)

3. Split DrillParquetReader.getProjection(...) into a few methods

4. Added javadocs for PathSegment and SchemaPath

    • -0
    • +8
    ./exec/planner/common/DrillRelOptUtil.java
    • -57
    • +105
    ./exec/store/parquet2/DrillParquetReader.java
  1. … 5 more files in changeset.
DRILL-7357: Expose Drill Metastore data through information_schema

1. Add additional columns to TABLES and COLUMNS tables.

2. Add PARTITIONS table.

3. General refactoring to adjust information_schema data retrieval from multiple sources.

closes #1860

    • -0
    • +216
    ./exec/store/ischema/FilterEvaluator.java
    • -1
    • +0
    ./exec/store/ischema/InfoSchemaConfig.java
    • -64
    • +74
    ./exec/store/ischema/InfoSchemaConstants.java
    • -15
    • +15
    ./exec/store/ischema/InfoSchemaFilter.java
    • -14
    • +14
    ./exec/store/ischema/InfoSchemaFilterBuilder.java
    • -17
    • +12
    ./exec/store/ischema/InfoSchemaGroupScan.java
    • -13
    • +12
    ./exec/store/ischema/InfoSchemaPushFilterIntoRecordGenerator.java
    • -12
    • +11
    ./exec/store/ischema/InfoSchemaStoragePlugin.java
    • -134
    • +188
    ./exec/store/ischema/InfoSchemaTable.java
    • -0
    • +413
    ./exec/store/ischema/RecordCollector.java
  1. … 19 more files in changeset.
DRILL-7373: Fix problems involving reading from DICT type

- fixed FieldIdUtil to resolve reading from DICT for some complex cases;

- optimized reading from DICT given a key by passing an appropriate Object type to the DictReader#find(...) and DictReader#read(...) methods when the schema is known (e.g. when reading from Hive tables) instead of generating it on the fly based on int or String path and key type;

- fixed an error when accessing a value by a nonexistent key in an Avro table.

    • -22
    • +119
    ./exec/expr/EvaluationVisitor.java
    • -67
    • +186
    ./exec/planner/logical/DrillOptiq.java
    • -1
    • +1
    ./exec/store/avro/AvroRecordReader.java
    • -8
    • +8
    ./exec/vector/complex/FieldIdUtil.java
  1. … 7 more files in changeset.
DRILL-7376: Drill ignores Hive schema for MaprDB tables when group scan has star column

    • -38
    • +149
    ./exec/vector/complex/fn/JsonReaderUtils.java
  1. … 3 more files in changeset.
DRILL-7368: Fix Iceberg Metastore failure when filter column contains nulls

    • -1
    • +3
    ./exec/ops/ExecutorFragmentContext.java
    • -19
    • +24
    ./exec/ops/FragmentContextImpl.java
  1. … 6 more files in changeset.
DRILL-7369: Schema for MaprDB tables is not used for the case when several fields are queried

closes #1852

DRILL-7252: Read Hive map using Dict<K,V> vector

    • -340
    • +66
    ./exec/vector/complex/MapUtility.java
  1. … 15 more files in changeset.
DRILL-7367: Remove Server details from response headers

closes #1851

    • -13
    • +18
    ./exec/server/rest/WebServer.java
  1. … 1 more file in changeset.
DRILL-7168: Implement ALTER SCHEMA ADD / REMOVE commands

    • -6
    • +176
    ./exec/planner/sql/handlers/SchemaHandler.java
    • -34
    • +194
    ./exec/planner/sql/parser/SqlSchema.java
    • -0
    • +71
    ./exec/record/metadata/schema/StorageProperties.java
  1. … 9 more files in changeset.
DRILL-7362: COUNT(*) on JSON with outer list results in JsonParse error

closes #1849

    • -0
    • +167
    ./exec/store/easy/json/reader/BaseJsonReader.java
    • -116
    • +10
    ./exec/vector/complex/fn/JsonReader.java
  1. … 1 more file in changeset.
DRILL-7360: Refactor WatchService in Drillbit class and fix concurrency issues

DRILL-7222: Visualize estimated and actual row counts for a query

With statistics in place, it is useful to show the estimated row count alongside the actual row count in the query profile's operator overview. A toggle button enables this; the estimated rows are hidden by default.

We can extract this from the Physical Plan section of the profile.

Added a toggle-ready table-column header

closes #1779

    • -0
    • +2
    ./exec/server/rest/profile/HtmlAttribute.java
    • -2
    • +6
    ./exec/server/rest/profile/OperatorWrapper.java
    • -0
    • +6
    ./exec/server/rest/profile/ProfileWrapper.java
  1. … 2 more files in changeset.
DRILL-7356: Introduce session options for the Drill Metastore

closes #1846

  1. … 1 more file in changeset.
DRILL-7326: Support repeated lists for CTAS parquet format

closes #1844

    • -6
    • +130
    ./exec/store/parquet/ParquetRecordWriter.java
  1. … 4 more files in changeset.
DRILL-7350: Move RowSet related classes from test folder

  1. … 278 more files in changeset.
DRILL-7338: REST API calls to Drill fail due to insufficient heap memory

This PR makes the 85% threshold customizable; a value of 0 disables it.

closes #1837
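
A configurable threshold of this kind can be sketched as follows. This is a minimal illustration, not the code from the PR: the class name `HeapThresholdSketch` and the method `exceeds` are hypothetical, and the sketch assumes the threshold is a fraction of maximum heap compared against current usage. Only the 85% default and the meaning of 0 come from the commit message.

```java
/** Hypothetical illustration only; not the check implemented in Drill. */
public final class HeapThresholdSketch {

    private HeapThresholdSketch() {
    }

    /**
     * Returns true when used heap exceeds the given fraction of max heap.
     * A threshold of 0 disables the check, so the result is always false.
     */
    public static boolean exceeds(long usedBytes, long maxBytes, double threshold) {
        if (threshold <= 0.0 || maxBytes <= 0) {
            return false; // check disabled, or max heap size unknown
        }
        return (double) usedBytes / maxBytes > threshold;
    }

    public static void main(String[] args) {
        Runtime rt = Runtime.getRuntime();
        long used = rt.totalMemory() - rt.freeMemory();
        // 0.85 mirrors the 85% value mentioned in the commit message.
        System.out.println(exceeds(used, rt.maxMemory(), 0.85));
    }
}
```

Treating 0 as "disabled" rather than "always over the limit" keeps a single numeric option sufficient for both tuning and switching the check off.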

  1. … 1 more file in changeset.
DRILL-4517: Support reading empty Parquet files

1. Modified flat and complex Parquet readers to output only the schema when the requested number of records to read is 0. In this case the readers are not initialized, which improves performance.

2. Allowed reading the requested number of rows instead of all rows in the row group (DRILL-6528).

3. Fixed determination of the number of nulls in the row group (fixed the IsPredicate#isAllNulls method).

4. Allowed reading empty Parquet files by adding an empty / fake row group.

5. General refactoring and unit tests.

6. Parquet tests categorization.

closes #1839
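
Items 1 and 2 above can be pictured with a small sketch. It is not the Drill implementation: the class `RowGroupReadPlan` and its `plan` method are hypothetical, and treating a negative request as "no limit" is an assumption made only for this example. The sketch caps the rows to read at the requested number and skips reader initialization when nothing has to be read, so that only the schema would be produced.

```java
/** Hypothetical illustration only; not a class from Drill's Parquet readers. */
public final class RowGroupReadPlan {

    /** Number of rows that will actually be read from the row group. */
    public final long rowsToRead;

    /** False when only the schema is needed, so column readers can be skipped. */
    public final boolean initializeReaders;

    private RowGroupReadPlan(long rowsToRead, boolean initializeReaders) {
        this.rowsToRead = rowsToRead;
        this.initializeReaders = initializeReaders;
    }

    /**
     * @param requestedRows rows requested by the query; a negative value means
     *                      "no limit" (an assumption of this sketch)
     * @param rowGroupRows  rows available in the row group; 0 for an empty one
     */
    public static RowGroupReadPlan plan(long requestedRows, long rowGroupRows) {
        long rows = requestedRows < 0
            ? rowGroupRows
            : Math.min(requestedRows, rowGroupRows);
        // With zero rows to read, only the schema is output and readers stay uninitialized.
        return new RowGroupReadPlan(rows, rows > 0);
    }

    public static void main(String[] args) {
        RowGroupReadPlan schemaOnly = plan(0, 1_000);
        System.out.println(schemaOnly.rowsToRead + " " + schemaOnly.initializeReaders);
    }
}
```

An empty row group (`rowGroupRows` of 0) takes the same schema-only path in this sketch, which is one plausible way to read item 4.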

    • -3
    • +30
    ./exec/store/CommonParquetRecordReader.java
    • -14
    • +13
    ./exec/store/parquet/FilterEvaluatorUtils.java
    • -2
    • +2
    ./exec/store/parquet/columnreaders/PageReader.java
    • -50
    • +41
    ./exec/store/parquet/columnreaders/ReadState.java
    • -0
    • +275
    ./exec/store/parquet/metadata/FileMetadataCollector.java
    • -201
    • +42
    ./exec/store/parquet/metadata/Metadata.java
  1. … 34 more files in changeset.
DRILL-7337: Add vararg UDFs support

    • -14
    • +14
    ./exec/expr/ExpressionTreeMaterializer.java
    • -17
    • +2
    ./exec/expr/fn/AbstractFuncHolder.java
    • -13
    • +15
    ./exec/expr/fn/DrillAggFuncHolder.java
    • -29
    • +117
    ./exec/expr/fn/DrillFuncHolder.java
    • -2
    • +2
    ./exec/expr/fn/DrillSimpleFuncHolder.java
    • -0
    • +4
    ./exec/expr/fn/FunctionAttributes.java
    • -5
    • +21
    ./exec/expr/fn/FunctionConverter.java
    • -0
    • +49
    ./exec/expr/fn/impl/CollectToListFunction.java
  1. … 23 more files in changeset.
DRILL-7335: Fix error when reading csv file with headers only

closes #1834

    • -3
    • +7
    ./exec/store/easy/text/reader/TextInput.java
  1. … 1 more file in changeset.
DRILL-7331: Drill Iceberg Metastore metadata expiration

closes #1831

  1. … 15 more files in changeset.
DRILL-7327: Log Regex Plugin Won't Recognize Schema

The previous commit revised the plugin config classes to work with table functions. That caused Jackson to stop working for the classes. Fixed those issues and added unit tests.

closes #1827

    • -1
    • +19
    ./exec/store/log/LogFormatField.java
  1. … 3 more files in changeset.
DRILL-6961: Handle exceptions during queries to information_schema

closes #1833

  1. … 6 more files in changeset.
DRILL-7205: Drill fails to start when authentication is disabled

closes #1824

  1. … 1 more file in changeset.
DRILL-7314: Use TupleMetadata instead of concrete implementation

1. Add ser / de implementation for TupleMetadata interface based on types.

2. Replace TupleSchema usage where possible.

3. Move patcher classes into commons.

4. Upgrade some dependencies and general refactoring.

    • -3
    • +3
    ./exec/physical/rowSet/model/BaseTupleModel.java
    • -3
    • +1
    ./exec/store/dfs/easy/EasyGroupScan.java
    • -2
    • +1
    ./exec/store/dfs/easy/EasySubScan.java
    • -1
    • +0
    ./exec/store/parquet/AbstractParquetGroupScan.java
    • -5
    • +5
    ./exec/store/parquet/ParquetGroupScan.java
  1. … 26 more files in changeset.
DRILL-7317: Close ClassLoaders used for udf jars uploading when closing FunctionImplementationRegistry

- Fix issue with caching DrillMergeProjectRule and FunctionImplementationRegistry when different drillbits are started within the same JVM

    • -1
    • +2
    ./exec/expr/fn/FunctionImplementationRegistry.java