Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
DRILL-7350: Move RowSet related classes from test folder

    • -184
    • +0
    ./single/BuildVectorsFromMetadata.java
    • -113
    • +0
    ./single/SingleSchemaInference.java
  1. … 278 more files in changeset.
DRILL-7314: Use TupleMetadata instead of concrete implementation

1. Add ser / de implementation for TupleMetadata interface based on types.

2. Replace TupleSchema usage where possible.

3. Move patcher classes into commons.

4. Upgrade some dependencies and general refactoring.

  1. … 39 more files in changeset.
DRILL-7086: Output schema for row set mechanism

Enhances the row set mechanism to take an "output schema" that describes the vectors to

create. The "input schema" describes the type that the reader would like to write. A

conversion mechanism inserts a conversion shim to convert from the input to output type.

Provides a set of implicit type conversions, including string-to-date/time conversions

which use the new format property stored in column metadata. Includes unit tests for

the new functionality.

closes #1690

  1. … 64 more files in changeset.
DRILL-6903: SchemaBuilder code improvements

1. ColumnBuilder: setPrecisionAndScale method

2. SchemaContainer: addColumn method parameter AbstractColumnMetadata was changed to ColumnMetadata

3. MapBuilder / RepeatedListBuilder / UnionBuilder: added constructors without parent, made buildColumn method public

4. TupleMetadata: added toMetadataList method

5. Other refactoring

  1. … 24 more files in changeset.
DRILL-6809: Handle repeated map in schema inference

It turns out that the RowSet utilities build a repeated map without including the hidden $offsets$ vector in the metadata for the map. But, other parts in Drill do include this vector.

The RowSet behavior might be a bug which can be addressed in another PR.

This PR:

* Adds unit tests for map accessors at the row set level. Looks like these were never added originally. They are a simplified form of the ResultSetLoader map tests.

* Verified that the schema inference can infer a schema from a repeated map (using the RowSet style.)

* Added a test to reproduce the case from the bug.

* Made a tweak to the RowSetBuilder to allow access to the RowSetWriter which is needed by the new tests.

* Could of minor clean-ups.

closes #1513

  1. … 5 more files in changeset.
DRILL-6676: Add Union, List and Repeated List types to Result Set Loader

Adds required functionalty to the list and repeated list vectors.

Row set accessor changes

Adds a "variant" type to model both unions and (non-repeated) lists (which can act as a repeated union, among other things.)

Adds union, list and repeated list support to the result set loader and associated classes.

Copied much of the general documentation from my private Wiki into mark-down files.

closes #1429

    • -10
    • +120
    ./single/BaseReaderBuilder.java
    • -11
    • +106
    ./single/BaseWriterBuilder.java
    • -4
    • +84
    ./single/BuildVectorsFromMetadata.java
    • -5
    • +55
    ./single/SingleSchemaInference.java
  1. … 61 more files in changeset.
DRILL-6386: Remove unused imports and star imports.

  1. … 230 more files in changeset.
DRILL-6389: Fixed building javadocs - Added documentation about how to build javadocs - Fixed some of the javadoc warnings

closes #1276

  1. … 64 more files in changeset.
DRILL-6373:

- Adds code to return the proper vector type given the actual vector, adjusting metadata as needed.

- Refactor result set loader

- Revised projection & vector cache

closes #1244

    • -17
    • +24
    ./single/BuildVectorsFromMetadata.java
  1. … 38 more files in changeset.
DRILL-6335: Column accessor refactoring

closes #1218

  1. … 43 more files in changeset.
DRILL-6320: Fixed license headers.

closes #1207

  1. … 2065 more files in changeset.
DRILL-6230: Extend row set readers to handle hyper vectors

closes #1161

    • -0
    • +45
    ./AbstractReaderBuilder.java
    • -0
    • +72
    ./hyper/HyperSchemaInference.java
    • -0
    • +40
    ./single/DirectRowIndex.java
    • -0
    • +65
    ./single/SingleSchemaInference.java
  1. … 57 more files in changeset.
DRILL-6138: Move RecordBatchSizer to org.apache.drill.exec.record package

This closes #1115

  1. … 10 more files in changeset.
DRILL-6114: Metadata revisions

Support for union vectors, list vectors, repeated list vectors. Refactored metadata classes.

closes #1112

    • -2
    • +2
    ./single/BuildVectorsFromMetadata.java
  1. … 68 more files in changeset.
DRILL-5657: Size-aware vector writer structure

- Vector and accessor layer

- Row Set layer

- Tuple and column models

- Revised write-time metadata

- "Result set loader" layer

this closes #914

    • -0
    • +117
    ./BaseTupleModel.java
    • -0
    • +115
    ./ContainerVisitor.java
    • -0
    • +93
    ./MetadataProvider.java
    • -0
    • +61
    ./SchemaInference.java
    • -0
    • +117
    ./TupleModel.java
    • -0
    • +149
    ./hyper/BaseReaderBuilder.java
    • -0
    • +30
    ./hyper/package-info.java
    • -0
    • +89
    ./single/BaseReaderBuilder.java
    • -0
    • +72
    ./single/BaseWriterBuilder.java
    • -0
    • +97
    ./single/BuildVectorsFromMetadata.java
    • -0
    • +112
    ./single/VectorAllocator.java
    • -0
    • +28
    ./single/package-info.java
  1. … 174 more files in changeset.