DRILL-7200: Update Calcite to 1.19.0 / 1.20.0

DRILL-7049: Return VARBINARY as a string with escaped non-printable bytes

DRILL-7045: UDF string_binary java.lang.IndexOutOfBoundsException

UDF string_binary was not reallocating the Drill buffer, so the buffer would fill up and throw an out-of-bounds exception
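
The failure mode described above (a fixed-size output buffer that overflows when escaping expands the input) can be illustrated with a minimal sketch. It is not Drill's code: a plain `java.nio.ByteBuffer` stands in for the Drill buffer, the class and method names are hypothetical, and the `\xNN` escape format and the choice to escape the backslash itself are assumptions of this sketch.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical illustration; ByteBuffer stands in for Drill's own buffer type.
class EscapedBinarySketch {

  // Returns a buffer with room for `extra` more bytes, growing (doubling)
  // and copying the bytes written so far when the current one is too small.
  static ByteBuffer ensureCapacity(ByteBuffer buf, int extra) {
    if (buf.remaining() >= extra) {
      return buf;
    }
    int newCapacity = Math.max(1, buf.capacity());
    while (newCapacity - buf.position() < extra) {
      newCapacity *= 2;
    }
    ByteBuffer bigger = ByteBuffer.allocate(newCapacity);
    buf.flip();       // switch the old buffer to read mode
    bigger.put(buf);  // copy everything written so far
    return bigger;
  }

  // Printable ASCII bytes are copied as-is; every other byte (and the
  // backslash, an assumption of this sketch) becomes a 4-character \xNN escape.
  static String toEscapedString(byte[] input, int initialCapacity) {
    ByteBuffer out = ByteBuffer.allocate(initialCapacity);
    for (byte b : input) {
      int v = b & 0xFF;
      if (v >= 0x20 && v <= 0x7E && v != '\\') {
        out = ensureCapacity(out, 1);
        out.put(b);
      } else {
        out = ensureCapacity(out, 4);
        out.put(String.format("\\x%02X", v).getBytes(StandardCharsets.US_ASCII));
      }
    }
    return new String(out.array(), 0, out.position(), StandardCharsets.US_ASCII);
  }
}
```

Without the `ensureCapacity` call before each write, an input consisting mostly of non-printable bytes would overrun any buffer sized to the input length, since each such byte expands to four output bytes.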

edit convert_from example to show correct results

DRILL-7070: Fix deb and rpm issues on apache drill master branch

Apache Drill failed to generate Debian and RPM packages. The issues are fixed as follows:

- Add apache as prefix

- Remove lib folder while packaging, because the lib folder does not exist.

- Add git.properties, KEYS, LICENSE, NOTICE, README.md and the sample-data dir to /opt/drill/.

- Update plugin version to 2.2.0

- Update Maven org.vafer:jdeb plugin version to 1.7

- Change the name of the generated rpm by editing the version and name.

- Remove two files under distribution/src/deb/control/, i.e. conffiles and control.

- The `<copyright>` tag isn't supported anymore for this plugin. Hence use `<license>` and project variable content instead of `2013 ASF`.

Signed-off-by: Naresh Bhat <naresh.bhat@linaro.org>

DRILL-6855: Do not load schema if there is an IOException

closes #1626

DRILL-5603: Replace String file paths with Hadoop Path

- replaced all String path representations with org.apache.hadoop.fs.Path

- added PathSerDe.Se JSON serializer

- refactored DFSPartitionLocation code by leveraging existing listPartitionValues() functionality

closes #1657

DRILL-6582: SYSLOG (RFC-5424) Format Plugin

closes #1530

DRILL-6734: JDBC storage plugin returns null for fields without aliases

- Add output column names to JdbcRecordReader and use them for storing the results, since column names in the result set may differ when aliases aren't specified

closes #1642

DRILL-4858: REPEATED_COUNT on an array of maps and an array of arrays is not implemented

- Implemented 'repeated_count' function for repeated MAP and repeated LIST;

- Updated RepeatedListReader and RepeatedMapReader implementations to return correct value from size() method

- Moved repeated_count to freemarker template and added support for more repeated types for the function

closes #1641

DRILL-7117: Support creation of equi-depth histogram for selected data types.

Support int/bigint/float4/float8, time/timestamp/date and boolean.

Build the histogram from the t-digest byte array and serialize as JSON string.

More changes for serialization/deserialization.

Add code-gen stubs (empty) for VarChar/VarBinary types.

Address review comments (part 1). Add unit test.

Address review comments (part 2) for sampling.

close apache/drill#1715
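
To make the term concrete: an equi-depth histogram picks bucket boundaries so that each bucket holds roughly the same number of values. The sketch below is only an illustration of that idea. It computes boundaries by exact sorting rather than from a t-digest as the commit describes, and the class and method names are hypothetical.

```java
import java.util.Arrays;

// Hypothetical illustration; exact sorting replaces the t-digest used by the commit.
class EquiDepthSketch {

  // Returns numBuckets + 1 boundaries. Bucket i covers
  // [boundaries[i], boundaries[i + 1]) and holds about n / numBuckets values;
  // the last bucket also includes its upper boundary.
  static double[] boundaries(double[] values, int numBuckets) {
    if (values.length == 0 || numBuckets <= 0) {
      throw new IllegalArgumentException("need at least one value and one bucket");
    }
    double[] sorted = values.clone();
    Arrays.sort(sorted);
    int n = sorted.length;
    double[] result = new double[numBuckets + 1];
    for (int i = 0; i <= numBuckets; i++) {
      // Index of the i-th quantile, clamped so the last boundary is the maximum.
      int idx = (int) Math.min((long) i * n / numBuckets, n - 1);
      result[i] = sorted[idx];
    }
    return result;
  }
}
```

For the eight values 1 through 8 and four buckets, this yields the boundaries 1, 3, 5, 7, 8, so each bucket contains two values.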

change httpd storage plugin to format plugin

edit httpd page - change to format plugin vs storage

DRILL-7058: Refresh command to support subset of columns

closes #1666

DRILL-7038: Queries on partitioned columns scan the entire datasets

- Added a new optimizer rule which checks whether the query references directory columns only and has a DISTINCT or GROUP BY operation. If the condition holds, instead of scanning the full file set, the following is performed:

1) if there is a metadata cache file, the directories are read from it,

2) otherwise the directories are gathered from the selection object (PartitionLocation).

In the end, the Scan node is transformed to DrillValuesRel (containing constant literals) with the gathered values, so no scan is performed.

closes #1640
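
The idea behind the rule can be sketched outside the planner. The code below is a simplified illustration, not the actual rule: it works on plain path strings instead of PartitionLocation objects, and every name in it is hypothetical. It shows the applicability check and how distinct directory values can be collected from file paths alone, without opening any file.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.SortedSet;
import java.util.TreeSet;

// Hypothetical illustration of the directory-columns-only shortcut.
class DirectoryValuesSketch {

  // The shortcut applies only when the query has DISTINCT or GROUP BY and
  // every referenced column is a directory column (dir0, dir1, ...).
  static boolean canSkipScan(Set<String> referencedColumns,
                             Set<String> directoryColumns,
                             boolean hasDistinctOrGroupBy) {
    return hasDistinctOrGroupBy
        && !referencedColumns.isEmpty()
        && directoryColumns.containsAll(referencedColumns);
  }

  // Collects the distinct directory names found at the given nesting level
  // below root (level 0 corresponds to dir0), using only the file paths.
  static List<String> directoryValues(String root, List<String> filePaths, int level) {
    SortedSet<String> values = new TreeSet<>();
    String prefix = root + "/";
    for (String path : filePaths) {
      if (!path.startsWith(prefix)) {
        continue; // not under the table root
      }
      String[] parts = path.substring(prefix.length()).split("/");
      // The last element is the file name, so directories end at length - 1.
      if (level < parts.length - 1) {
        values.add(parts[level]);
      }
    }
    return new ArrayList<>(values);
  }
}
```

The collected values play the role of the constant literals that replace the scan: once they are known, the query can be answered from them directly.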

DRILL-7022: Partition pruning is not happening the first time after the metadata auto-refresh

closes #1638

DRILL-7046: Support for loading and parsing new RM config file

closes #1652

edit cgroup doc

DRILL-7031: Add Travis job that runs protobuf generation command and checks if all protobufs are up-to-date

closes #1636

DRILL-6989: Upgrade to SqlLine 1.7

closes #1717

edits to cgroup doc

DRILL-6780: Caching dependencies for CircleCI

closes #1632


DRILL-7018: Fixed Parquet buffer overflow when reading timestamp column

close apache/drill#1630

DRILL-7019: Add check for redundant imports

close apache/drill#1629

DRILL-7016: Wrong query result with RuntimeFilter enabled when order of join and filter condition is swapped

close apache/drill#1628

DRILL-7024: Refactor ColumnWriter to simplify type-conversion shim

DRILL-7006 added a type conversion "shim" within the row set framework. Basically, we insert a "shim" column writer that takes data in one form (String, say), and does reader-specific conversions to a target format (INT, say).

The code works fine, but the shim class ends up needing to override a bunch of methods which it then passes along to the base writer. This PR refactors the code so that the conversion shim is simpler.

closes #1633
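
The shape of such a refactoring can be sketched with a few small types. This is an illustration only, with hypothetical names that are not Drill's row set classes: an abstract base forwards every call to the wrapped writer, so a concrete shim overrides only the one method it converts.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a scalar column writer interface.
interface ScalarWriterSketch {
  void setInt(int value);
  void setString(String value);
}

// Stand-in for a writer backed by an INT vector: it accepts only ints.
class IntColumnWriterSketch implements ScalarWriterSketch {
  final List<Integer> values = new ArrayList<>();

  @Override
  public void setInt(int value) {
    values.add(value);
  }

  @Override
  public void setString(String value) {
    throw new UnsupportedOperationException("INT column cannot store a String");
  }
}

// Base class that forwards everything to the wrapped writer, so subclasses
// do not have to repeat the pass-through methods.
abstract class ConvertingWriterSketch implements ScalarWriterSketch {
  protected final ScalarWriterSketch base;

  ConvertingWriterSketch(ScalarWriterSketch base) {
    this.base = base;
  }

  @Override
  public void setInt(int value) {
    base.setInt(value);
  }

  @Override
  public void setString(String value) {
    base.setString(value);
  }
}

// The shim itself: takes a String, parses it, and writes an int.
class StringToIntShimSketch extends ConvertingWriterSketch {
  StringToIntShimSketch(ScalarWriterSketch base) {
    super(base);
  }

  @Override
  public void setString(String value) {
    base.setInt(Integer.parseInt(value.trim()));
  }
}
```

A reader that produces text would write through `new StringToIntShimSketch(intWriter)` and call `setString("42")`; the wrapped writer then receives the int 42.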

DRILL-7008: Drillbits: clear stale shutdown hooks

ShutdownThread is no longer required when Drillbit#close() is called.

mvn install for Drill project consumed 600MiB (there were 160 shutdown hooks)

close apache/drill#1625
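
The general pattern behind this fix can be sketched with the standard `Runtime` API. The class below is a hypothetical stand-in, not Drillbit code: it registers a shutdown hook on construction and removes it again in `close()`, so that repeatedly creating and closing instances (as a test suite does) does not leave hooks, and the objects they reference, registered with the JVM.

```java
// Hypothetical illustration; not the Drillbit implementation.
class HookedServiceSketch implements AutoCloseable {
  private final Thread shutdownHook;
  private boolean closed;
  boolean hookRemoved; // exposed so callers can observe the outcome

  HookedServiceSketch() {
    // The hook keeps a reference to this instance until it is removed.
    shutdownHook = new Thread(this::releaseResources, "sketch-shutdown-hook");
    Runtime.getRuntime().addShutdownHook(shutdownHook);
  }

  private synchronized void releaseResources() {
    // Real cleanup work would go here.
  }

  @Override
  public synchronized void close() {
    if (closed) {
      return; // a repeated close is a no-op
    }
    closed = true;
    releaseResources();
    try {
      // Once closed explicitly, the hook has nothing left to do.
      hookRemoved = Runtime.getRuntime().removeShutdownHook(shutdownHook);
    } catch (IllegalStateException e) {
      // The JVM is already shutting down; hooks can no longer be removed.
      hookRemoved = false;
    }
  }
}
```

`Runtime.removeShutdownHook` returns `true` when the hook was registered and has now been removed, which is the expected result for a normal `close()` call.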

DRILL-7007: Use verify method in row set tests

Many of the early RowSet-based tests used the pattern:

new RowSetComparison(expected)


Revise this to use the simplified form:

RowSetUtilities.verify(expected, result);

The original form is retained when tests use additional functionality, such as the ability to perform multiple verifications on the same expected batch.

closes #1624

DRILL-7006: Add type conversion to row writers

Modifies the column metadata and writer abstractions to allow a type conversion "shim" to be specified as part of the schema, then inserted as part of the row set writer. Allows, say, setting an Int or Date from a string, parsing the string to obtain the proper data type to store in the vector.

Type conversion is not yet supported in the result set loader: some additional complexity needs to be resolved.

Adds unit tests for this functionality. Refactors some existing tests to remove rough edges.

closes #1623