Clone Tools
  • last updated 25 mins ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
moved files for the merge

  1. … 175 more files in changeset.
Move AsterixDB to Subfolder

    • -5
    • +0
    ./adm-load/double_end_record_token.adm
    • -6005
    • +0
    ./big-object/lineitem.tbl.big
    • -1500
    • +0
    ./big-object/order.tbl.verylong.big
  1. … 44144 more files in changeset.
ASTERIXDB-865: fix query compilation for if-else expression.

Change-Id: I80e7995e814180fe567818263c79493901af284c

Reviewed-on: https://asterix-gerrit.ics.uci.edu/703

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

  1. … 21 more files in changeset.
ASTERIXDB-1322: fix the AQL translator and add tests.

Change-Id: I994bb69ca989c8c9e2ce417d62739a859e5ae94b

Reviewed-on: https://asterix-gerrit.ics.uci.edu/663

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

  1. … 13 more files in changeset.
Enabled Feed Tests and Added External Library tests

Feed tests had been switched off for a while due to having too many

sporadic failures. Now, we are switching them back on.

In addition, a new set of tests have been added to test that external

library works as expected.

Change-Id: Idd1fccd136fa2645b2707bbf7c04e60991ae8d4a

Reviewed-on: https://asterix-gerrit.ics.uci.edu/625

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: abdullah alamoudi <bamousaa@gmail.com>

    • -0
    • +1106
    ./external-parser/August16-20-long.txt
    • -0
    • +12869
    ./external-parser/dropbox/jobads1.txt
    • -0
    • +12869
    ./external-parser/dropbox/jobads2.txt
    • -0
    • +12869
    ./external-parser/jobads.new
    • -0
    • +1106
    ./external-parser/jobads.old
    • -0
    • +12869
    ./external-parser/jobads.txt
    • -0
    • +684032
    ./external-parser/testdata.txt
  1. … 198 more files in changeset.
Merge asterix-experiments to master

Change-Id: I79b91e045bd056e62adb6e48795a6625baee0062

Reviewed-on: https://asterix-gerrit.ics.uci.edu/642

Reviewed-by: Ian Maxon <imaxon@apache.org>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

  1. … 254 more files in changeset.
ASTERIXDB-1157: Pushdown Limit

- Limit Pushdown into an Order (ExternalSort) operator.

- ConstantFoldingRule is not applied on the record-related operations.

Change-Id: I19aa4ce402b1834d9f68320acb72d7635a41a837

Reviewed-on: https://asterix-gerrit.ics.uci.edu/617

Reviewed-by: Yingyi Bu <buyingyi@gmail.com>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

    • -0
    • +60
    ./spatial/spatialData2.json
  1. … 37 more files in changeset.
patch 5

  1. … 19 more files in changeset.
patch 4

  1. … 95 more files in changeset.
ASTERIXDB-1281 - Interval format update to AQL and ADM

The new interval format takes a more generic approach to representing intervals.

Here is an example for a date interval:

interval(date("2012-01-01”), date(”2013-04-01”))

Note that the interval type is defined by the arguments to the interval expression.

Currently only date, time, and datetime types are supported for intervals. The new

format is used for ADM and AQL.

In addition to the format change, the internal byte structure of an interval has been

updated. The format looks like the following:

byte tag, T start, T end (where T is a date, time or datetime type)

Note how the tag has been moved to the front. Also with the new sturcture, an

interval is variable length, not fixed length as before.

Change-Id: I009c71b7a445d141e228ba15d56d0b6cf3c8a3f5

Reviewed-on: https://asterix-gerrit.ics.uci.edu/602

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

  1. … 70 more files in changeset.
Add Support for Upsert Operation

This change allows users to execute upsert commands which

couples delete if found with insert. It locks the primary

keys before doing the search ensuring consistency.

Change-Id: I8999000331795a5949d621d2dd003903e057a521

Reviewed-on: https://asterix-gerrit.ics.uci.edu/477

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

    • -0
    • +13
    ./dblp-small/more-dblp-small-id.txt
    • -0
    • +10
    ./more-fbm-with-send-time.adm
    • -0
    • +4
    ./semistructured/tiny01/more-customer.adm
    • -0
    • +9
    ./spatial/moreSpatialData.json
    • -0
    • +14
    ./tpch0.001/other-orders.tbl
    • -0
    • +4
    ./upsert/raw-data/more-data.txt
    • -0
    • +5
    ./upsert/raw-data/overlapping.data
    • -0
    • +4
    ./upsert/raw-data/test-data.txt
  1. … 244 more files in changeset.
ASTERIXDB-1005, ASTERIXDB-1263: Clean up subplan flattening: 1. Inline NestedTupleSource and remove SubplanOperator for special cases that join operators inside the SubplanOperator can be re-targeted for correlations; 2. Blindly inline NestedTupleSource and remove SubplanOperator for general cases where the condition of the special case is not met.

Change-Id: I4dd130a25f3c81272cc23f844ea20e376e990612

Reviewed-on: https://asterix-gerrit.ics.uci.edu/579

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

    • -0
    • +30
    ./restaurants/restaurants.adm
  1. … 84 more files in changeset.
ASTERIXDB-1044 Allow Reading large records from HDFS

This change tests reading of HDFS records larger than 8192.

The fix for the issue was merged previously.

Change-Id: I3cf52be4bc0fd8af2555062eeb421d7235088d98

Reviewed-on: https://asterix-gerrit.ics.uci.edu/529

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

    • -0
    • +11
    ./hdfs/large-record.txt
  1. … 5 more files in changeset.
patch 1

    • -0
    • +7
    ./tinycollege/staff.adm
    • -0
    • +7
    ./tinycollege/students.adm
  1. … 88 more files in changeset.
ASTERIXDB-1126 Correct comparators and type traits for external index

In case of external data, the type definition does not contain information

about primary keys and we should get them based on the data input format.

Change-Id: I71d924d7e2b7a7e6c752bc97679e612946afc17c

Reviewed-on: https://asterix-gerrit.ics.uci.edu/504

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

  1. … 15 more files in changeset.
ASTERIXDB-1102: VarSize Encoding to store length of String and ByteArray

This patch is to change the encoding format that stores the length value

of

the variable length type (e.g. String, ByteArray) from fix-size encoding

(2bytes) to variable-size encoding ( 1 to 5bytes)

It will solve the issue 1102 to enable us to store a String that longer

than 64K. Also for the common case of storing the short string ( <=

127), it will save one byte per string.

Some important changes include:

1. The UTF8StringSerDer and ByteArraySerDer is not Singleton instance

any more. I need some state to speedup the serialization and avoid the

object creatation. Luckily, 99% percent of Serializer were used as

factory way. The other 1% has been fixed.

A separate Test support, the ExcutionTest now can produce the only.xml

which stores the previous failed runtime test.xml. It can speedup the

debug process.

Change-Id: I41fff780f5c071742ef10129d83c8f945d5886d7

Reviewed-on: https://asterix-gerrit.ics.uci.edu/450

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Jianfeng Jia <jianfeng.jia@gmail.com>

    • -0
    • +1500
    ./big-object/order.tbl.verylong.big
  1. … 322 more files in changeset.
ASTERIXDB-920: Fixed consecutive index-nested-loop joins failure

Index-nested-loop join now tries to use an index from the inner relation first.

Change-Id: Ie7bcc70c19d2c1e70ce274eaacfe2c3ce89c8de8

Reviewed-on: https://asterix-gerrit.ics.uci.edu/434

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Yingyi Bu <buyingyi@gmail.com>

    • -0
    • +8
    ./index-join/results.adm
    • -0
    • +13
    ./index-join/tw_messages.adm
    • -0
    • +12
    ./index-join/tw_users.adm
  1. … 28 more files in changeset.
Almost full support for clean JSON, plus some lossless JSON fixes

Clean JSON correctly supports all numeric and spatial types, plus hex and uuid.

Lossless JSON correctly supports uuid and hex, and "circle" is correct now.

Date/time types remain unfixed for both JSON formats.

Change-Id: I5c37d3b5aad7f286eba1cb8cb5f05ad456a6521d

Reviewed-on: https://asterix-gerrit.ics.uci.edu/362

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Chris Hillery <ceej@lambda.nu>

    • -0
    • +25
    ./nontagged/allData.adm
  1. … 46 more files in changeset.
Added the UUID constructor and changed the relevant documentation

Change-Id: I2276bf2b44975b13c99d56f448ae3e9e4a74b04e

Reviewed-on: https://asterix-gerrit.ics.uci.edu/326

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

    • -10
    • +10
    ./pub-small/dblp-small-id-autogenerated-pk_including_uuid.adm
  1. … 33 more files in changeset.
Adding introspection for getting record details.

Change-Id: Iba15acff0f6830cffbc33d9dac3b5f0c417be844

    • -0
    • +4
    ./classifications/animals.adm
  1. … 196 more files in changeset.
Adding introspection for getting record details.

Change-Id: Iba15acff0f6830cffbc33d9dac3b5f0c417be844

Reviewed-on: https://asterix-gerrit.ics.uci.edu/313

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

  1. … 196 more files in changeset.
Introduces Feeds 2.0

commit c3f577861fc705d848c1641605689cadd6973bae

Merge: ebc4cae fc0c2c0

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Fri Jun 26 13:04:05 2015 -0700

Merge branch 'raman/feeds_2_release' of https://code.google.com/p/asterixdb-sandbox into raman/feeds_2_release

Conflicts:

asterix-app/src/main/java/edu/uci/ics/asterix/api/http/servlet/FeedServlet.java

asterix-external-data/src/main/java/edu/uci/ics/asterix/external/library/java/JObjectAccessors.java

commit ebc4cae21a7302869f953df1ebda601e798d12d2

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Sat Jun 20 17:14:45 2015 -0700

Introduces Feeds 2.0

Some of the prominent chnages introduced are as follows

a) Support for building a cascade network of feeds (via secondary feeds feature)

b) Feed Management Console for tracking active feeds and associated metrics

c) Support for elastic runtime for data ingestion

d) Improved fault-tolerance with support for logging of failed records

Documentation has been added at asterix-doc/src/site/markdown/feeds/

commit fc0c2c0549a6ee8b202e57607d2e110478cd57bb

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Sat Jun 20 17:14:45 2015 -0700

Introduces Feeds 2.0

Some of the prominent chnages introduced are as follows

a) Support for building a cascade network of feeds (via secondary feeds feature)

b) Feed Management Console for tracking active feeds and associated metrics

c) Support for elastic runtime for data ingestion

d) Improved fault-tolerance with support for logging of failed records

Documentation has been added at asterix-doc/src/site/markdown/feeds/

Change-Id: I498f01c591a229aaf51cec43ab20f3e5c4f072f4

Reviewed-on: https://asterix-gerrit.ics.uci.edu/297

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

Reviewed-by: abdullah alamoudi <bamousaa@gmail.com>

    • -0
    • +12
    ./twitter/obamatweets_duplicate.adm
  1. … 383 more files in changeset.
VariableSizeFrame(VSizeFrame) support for Asterix (Runtime Only)

Apply the https://asterix-gerrit.ics.uci.edu/#/c/234/ API changes to

Asterix level.

Change-Id: I5459e877707a1494fc1bebf03d4457a7427e9e0f

Reviewed-on: https://asterix-gerrit.ics.uci.edu/259

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Yingyi Bu <buyingyi@gmail.com>

    • -0
    • +150
    ./big-object/customer.tbl.big
    • -0
    • +6005
    ./big-object/lineitem.tbl.big
    • -0
    • +1500
    ./big-object/order.tbl.big
  1. … 41 more files in changeset.
Issue 867: Test cases for CSV with headers and various line separators.

Change-Id: I046aa30824d14cbc894a28719c6c6560f46d0133

Reviewed-on: https://asterix-gerrit.ics.uci.edu/247

Reviewed-by: Ian Maxon <imaxon@uci.edu>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Preston Carman <ecarm002@ucr.edu>

    • -0
    • +1
    ./csv/sample_08_header.csv.cr
    • -0
    • +9
    ./csv/sample_08_header.csv.crlf
    • -0
    • +9
    ./csv/sample_08_header.csv.lf
  1. … 16 more files in changeset.
Temporal update based on a merge from an old branch.

Includes the following new functions: overlap-bins, get-overlapping-interval, get-interval-start-date, get-interval-start-datetime, get-interval-start-time, get-interval-end-date, get-interval-end-datetime, get-interval-end-time

Change-Id: Ie15ed39ae7de83ce71c63c4e7490f2ebf5911540

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/244

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ildar Absalyamov <ildar.absalyamov@gmail.com>

  1. … 108 more files in changeset.
Added support of typed indexes over open fields & indexes over nested fields

Open indexes requires user to provide a type along with a indexed field name.

This type would be enforced for all the indexed records, i.e. index cannot be created if in some records a field with provided name has a different type.

Index-specific rewrite rules match provided type with the inferred types of other arguments in join\select statements and trigger index rewrite.

Nested indexes use the same semantics as the regular indexes, with exception that field could be located arbitrarily deep inside nested structure

Change-Id: I53d00aba243ccf7cf79cf7d775dd305813d24f98

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/97

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

    • -0
    • +21
    ./hdfs/spatialDataNested.json
    • -0
    • +250
    ./hdfs/tw_for_indexleftouterjoin_nested.adm
    • -1000
    • +1000
    ./semistructured/co1k_olist/customer.adm
    • -0
    • +12
    ./tinysocial/twm-nested.adm
  1. … 1785 more files in changeset.
- Fixed Type Casting issue - Reorganized duplicated internal class in the DelimitedDataParser and DelimitedDataParserFactory - Prevented a user from creating an inverted index on a dataset with a variable-length PK - INT64 is now the default type - Issue 852 fixed

Change-Id: I2d71e8a21da4f709c3259a3d3f678c640f9e1160

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/192

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <westmann@gmail.com>

    • -0
    • +56
    ./types/deptDataset.adm
    • -0
    • +20
    ./types/deptDataset_minus_data.adm
    • -0
    • +55
    ./types/empDataset.adm
    • -0
    • +20
    ./types/empDataset_minus_data.adm
  1. … 844 more files in changeset.
Add test case and documentation for CSV parsing with headers.

Also fix a minor CSV parsing bug with line counting.

Change-Id: Ib875d60aa2465d4a50ee50c5c0e9356185f35c73

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/228

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Taewoo Kim <wangsaeu@gmail.com>

Reviewed-by: Ian Maxon <imaxon@uci.edu>

    • -0
    • +9
    ./csv/sample_08_header.csv
  1. … 8 more files in changeset.
Add the Binary data type and corresponding helper functions to Asterix.

The binary data type is implemented as a bytearray. Its storage format

follows the String type which has 2 bytes for length and then store the

bytes contents.

Binary data will take hex("") or base64("") as the constructor method to

passing a hex string or base64 string into Asterix. For output we use

hex("") format.

The parse-[hex|base64](string) function will parse the corresponding hex

or base64 string to binary type. The print-[hex|base64](binary)

functions will print the binary to hex or base64 STRING format.

The sub-binary(binary, offset, [length]) function works the same as

substring(string, offset, [length])

The find-binary(srcbinary, targetbinary, [start-offset]) will find the

position of the targetbinary in the srcbinary.

Change-Id: I5ecf0cc115c44070fb5c1fc5b0ec12a95d4243a4

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/175

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Yingyi Bu <buyingyi@gmail.com>

    • -0
    • +1
    ./adm-load/binary_type.adm
    • -0
    • +25
    ./adm-load/usermd5.adm
    • -0
    • +20
    ./adm-load/usermd5copy.adm
  1. … 124 more files in changeset.
Fixed CSV parser to recognize quote and delimiter inside a string

Change-Id: Iaaabc23e86df4f9bbee9f06b7976d7fbdcbb0f3f

Reviewed-on: http://fulliautomatix.ics.uci.edu:8443/135

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <westmann@gmail.com>

    • -0
    • +5
    ./csv/sample_04_quote_error.csv
    • -0
    • +5
    ./csv/sample_05_space_error_1.csv
    • -0
    • +5
    ./csv/sample_06_space_error_2.csv
  1. … 71 more files in changeset.