drill

[maven-release-plugin] prepare release drill-1.3.0

    • -3
    • +2
    /contrib/data/tpch-sample-data/pom.xml
  1. … 6 more files in changeset.
DRILL-4063: Add s3a dependency jars and example core-site.xml

    • -0
    • +31
    /distribution/src/resources/core-site.xml
DRILL-4103: add drill.version to parquet metadata

DRILL-4056: Fix corruption bug reading string data out of Avro

- Fix issue where we are reading a byte array without considering length

- Removed use of unnecessary Holder objects.

- Added restriction on batch size produced by a single call to next.

- Add some basic result verification to avro tests.

DRILL-4065: Maintain a cache of created UserGroupInformation objects so that we don't accidentally generate thousands of FileSystem objects & threads.
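The caching idea in DRILL-4065 can be sketched as a map from user name to a single shared instance. This is only an illustration: `PerUserCache` and its factory parameter are hypothetical names, not Drill's actual classes, and the value type stands in for `UserGroupInformation`.

```java
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.ConcurrentMap;
import java.util.function.Function;

// Hypothetical per-user cache; the real patch caches UserGroupInformation objects.
final class PerUserCache<V> {
    private final ConcurrentMap<String, V> cache = new ConcurrentHashMap<>();
    private final Function<String, V> factory;

    PerUserCache(Function<String, V> factory) {
        this.factory = factory;
    }

    // Returns the cached instance for the user, creating it at most once per user name.
    V get(String userName) {
        return cache.computeIfAbsent(userName, factory);
    }

    int size() {
        return cache.size();
    }
}
```

Repeated lookups for the same user then return the same object, so anything keyed on that object is not recreated on every call.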

DRILL-4048: Fix reading required dictionary encoded varbinary data in parquet files after recent update

The fix was small; this update is a little larger than necessary because I was hoping to create a unit test by modifying the one I had added in the earlier patch with the version upgrade. Unfortunately we don't have a good way to generate Parquet files with required columns from unit tests right now, so I just added a smaller subset of the binary file that was posted on the JIRA issue. The refactoring of the earlier test was still useful for readability, so I kept it in.

Add synchronization to DrillBuf.retain() to avoid contention issues.

Add assertion around QueryDataBatch release to ensure that we're not releasing the buffers twice.
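A double-release guard of the kind described above might look like the following sketch. `ReleasableBatch` is a hypothetical stand-in, not `QueryDataBatch` itself, and it throws an exception rather than using a Java `assert` so the check is active regardless of JVM flags.

```java
import java.util.concurrent.atomic.AtomicBoolean;

// Hypothetical batch wrapper that refuses to release its buffers twice.
final class ReleasableBatch {
    private final AtomicBoolean released = new AtomicBoolean(false);

    void release() {
        // compareAndSet succeeds only for the first caller, even under concurrency.
        if (!released.compareAndSet(false, true)) {
            throw new IllegalStateException("batch released twice");
        }
        // The underlying buffers would be freed here.
    }

    boolean isReleased() {
        return released.get();
    }
}
```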

DRILL-4049: Terminate StatusThread when WorkManager is closed.

Avoid excessive growth in threads with long-lived test JVMs.

DRILL-4046: Upgrade Joda to later version (avoid contention issues)

DRILL-4041 & DRILL-4057: Disable RPC thread offload until concurrency bug is found.

DRILL-4042: Unable to run sqlline in embedded mode on Windows

Use newer version of hadoop-winutils

DRILL-4040: Build failure on master

This closes #244

DRILL-4032: Check if a new vector was created to determine if allocation is necessary in MapWriter

closes #241

DRILL-4031: Log warning and ignore columns returned from JDBC source that are unsupported.

This closes #240

DRILL-951: Add support for csv header row parsing

This closes #232

    • -0
    • +6
    /exec/java-exec/src/test/resources/store/text/data/cars.csvh
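Header-row parsing as in DRILL-951 amounts to treating the first line as column names and keying each later row by those names. The sketch below assumes plain comma-separated values with no quoting or escaping; `CsvHeaderSketch` is a hypothetical helper and does not reflect Drill's text reader.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper: the first line supplies column names for all later lines.
final class CsvHeaderSketch {
    static List<Map<String, String>> parse(List<String> lines) {
        List<Map<String, String>> rows = new ArrayList<>();
        if (lines.isEmpty()) {
            return rows;
        }
        // A limit of -1 keeps trailing empty fields instead of dropping them.
        String[] header = lines.get(0).split(",", -1);
        for (String line : lines.subList(1, lines.size())) {
            String[] values = line.split(",", -1);
            Map<String, String> row = new LinkedHashMap<>();
            for (int i = 0; i < header.length; i++) {
                // Short rows get null for the missing columns.
                row.put(header[i].trim(), i < values.length ? values[i].trim() : null);
            }
            rows.add(row);
        }
        return rows;
    }
}
```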
DRILL-4025: Create FileSelection object with the appropriate file statuses

DRILL-3810: Add Avro schema support

Also updates the FileFormatPlugin to support providing schema.

DRILL-3956: Add support for MySQL text type

DRILL-4006: Reallocate offset vector in repeated vectors when index is beyond the current capacity

Author: Steven Phillips <smp@apache.org>

This closes #243, #242

DRILL-4000: Ensure storage plugins are not needlessly created. Add start and close capability to storage plugins.

This closes #227

Add a new configuration based map so any configurations that are passed to a node can leverage existing storage plugins.

Update FileSystemConfig to correctly implement hashCode()

Update StoragePlugin interface to extend AutoCloseable and add start() method.

Update Mongo plugin to close clients when closing plugin.
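The pieces of DRILL-4000 fit together roughly as follows: a configuration-keyed map can only reuse plugins if equal configurations hash alike, and a plugin with start and close methods can be managed by that map. All names here (`SketchPlugin`, `SketchFsConfig`, `SketchPluginRegistry`) are hypothetical and only illustrate the pattern, not Drill's code.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Objects;
import java.util.function.Supplier;

// Hypothetical plugin lifecycle: start() before use, close() when done.
interface SketchPlugin extends AutoCloseable {
    void start();

    @Override
    void close(); // narrowed so callers need not handle a checked exception
}

// Hypothetical config; equals() and hashCode() use the same field so equal configs share a map slot.
final class SketchFsConfig {
    private final String connection;

    SketchFsConfig(String connection) {
        this.connection = connection;
    }

    @Override
    public boolean equals(Object o) {
        return o instanceof SketchFsConfig && Objects.equals(connection, ((SketchFsConfig) o).connection);
    }

    @Override
    public int hashCode() {
        return Objects.hashCode(connection);
    }
}

// Hypothetical registry that creates a plugin only when no equal config has one yet.
final class SketchPluginRegistry {
    private final Map<SketchFsConfig, SketchPlugin> byConfig = new HashMap<>();

    synchronized SketchPlugin getOrCreate(SketchFsConfig config, Supplier<SketchPlugin> factory) {
        SketchPlugin plugin = byConfig.get(config);
        if (plugin == null) {
            plugin = factory.get();
            plugin.start();
            byConfig.put(config, plugin);
        }
        return plugin;
    }

    synchronized void closeAll() {
        for (SketchPlugin plugin : byConfig.values()) {
            plugin.close();
        }
        byConfig.clear();
    }
}
```

If `hashCode()` ignored the fields that `equals()` compares, two equal configurations could land in different buckets and the lookup would miss, creating a plugin needlessly.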

DRILL-3994: make classpath scanning work on windows

DRILL-3634: Add number of partitions information in the plan for hive scans

closes #231

DRILL-3992: Add/fix support for JDBC schemas (tested against oracle and derby)

This closes #225

DRILL-3983: Small test improvements

- improve error message on SQL parsing errors

- add a simple test to Parquet writer

- make errors verbose by default in tests

This closes #221

Also includes:

- make the checkstyle conf not tied to the execution to allow running mvn checkstyle:check from the command line

- fix verbose output in tests

DRILL-3921: Initialize the underlying record reader lazily in HiveRecordReader

This closes #197

+ OperatorContextImpl has a lazily initialized executor service that is a decorator around the worker pool. This service is used through #runCallableAs to run a callable task as the given proxy user.

+ Unit tests use partition student table with chained impersonation to exercise running multiple delegate callables (creates multiple readers, a reader per partition).

+ Remove unused WorkManager#shutdownAndAwaitTermination method

DRILL-3937: Handle the case where min/max columns in metadata cache file are string or binary values.

Simplify serialization and check for nulls. Remove byte array comparison for min/max since it is not needed anymore.

close apache/drill#220

DRILL-3340: Part 2: Reverting 1a589ab and committing latest patch

Add operator metrics registry for metric definitions

+ Display metrics as a table within an operator profile panel

+ Rename FragmentStats#getOperatorStats to newOperatorStats

DRILL-3975: Make sure to strip scheme and authority from partition location.

This closes #217
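Stripping the scheme and authority from a location leaves only the path component. A minimal sketch using `java.net.URI` is shown below; `LocationSketch` and the example locations are made up for illustration, and the actual patch may do this differently.

```java
import java.net.URI;

// Hypothetical helper: keeps only the path part of a location string.
final class LocationSketch {
    static String stripSchemeAndAuthority(String location) {
        // URI.getPath() drops the scheme ("hdfs:") and authority ("//host:port").
        return URI.create(location).getPath();
    }
}
```

For example, `hdfs://namenode:8020/warehouse/t/p=1` becomes `/warehouse/t/p=1`, while a location that is already a bare path is returned unchanged.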

DRILL-3941: Add timing instrumentation in different phases of partition pruning

Add basic timing instrumentation while initializing FileSelection and reading the metadata cache file

closes #230
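Per-phase timing of the kind DRILL-3941 describes can be sketched with `System.nanoTime()` around each phase. `PhaseTimer` and the phase names are hypothetical; how Drill records and logs these timings is not shown in this log.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.concurrent.TimeUnit;
import java.util.function.Supplier;

// Hypothetical timer that accumulates elapsed microseconds per named phase.
final class PhaseTimer {
    private final Map<String, Long> elapsedMicros = new LinkedHashMap<>();

    // Runs the work, records its duration under the phase name, and returns its result.
    <T> T time(String phase, Supplier<T> work) {
        long start = System.nanoTime();
        try {
            return work.get();
        } finally {
            long micros = TimeUnit.NANOSECONDS.toMicros(System.nanoTime() - start);
            elapsedMicros.merge(phase, micros, Long::sum);
        }
    }

    Map<String, Long> report() {
        return elapsedMicros;
    }
}
```

Recording in a `finally` block means a phase that throws is still timed.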