asterixdb

Clone Tools
  • last updated a few minutes ago
Constraints
Constraints: committers
 
Constraints: files
Constraints: dates
Optimized the binary tokenizer - get the total number of tokens Change-Id: I83765b6cd7cb92648aa55b260b0044fa3ffdcd7d Reviewed-on: https://asterix-gerrit.ics.uci.edu/349 Reviewed-by: Young-Seok Kim <kisskys@gmail.com> Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Fix for issue 923

Change-Id: I87053315fc7650682fcbedd573b1155c17810073

Reviewed-on: https://asterix-gerrit.ics.uci.edu/347

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Young-Seok Kim <kisskys@gmail.com>

This change includes: 1. avoid calling ByteBuffer.get() and ByteBuffer.put() for bloom filter set/get operations. ByteBuffer will check boundaries of the index for get/put. It's a very frequent operation. 2. let the hash table size in BufferCache be 3X of the buffer cache page count. This will even reduce conflicts.

Change-Id: Ifd47c410338ce557a494ab3605660bad1899e786

Reviewed-on: https://asterix-gerrit.ics.uci.edu/345

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Young-Seok Kim <kisskys@gmail.com>

Reviewed-by: Ian Maxon <imaxon@apache.org>

improve the buffer cache perf. with 1) a better hash function for fileid-pageid, 2) reduce synchronization in clock page replacement policy.

Change-Id: I296c589a556a9afa7f27c6f560fa07fc4e2c1861

Reviewed-on: https://asterix-gerrit.ics.uci.edu/342

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

Reviewed-by: Young-Seok Kim <kisskys@gmail.com>

Performance fix for BufferCache. The dpid of a page of file is calculated by fileid<<32 + pageid. But BufferCache.hash(long dpid) returns the hash value dpid%pageMap.length. In many cases, the asterix configuration results in power-of-2 pageMap.length (buffer-cache-size/page-size), which makes fileid useless. That used to result in serious consequences: different partitions contend for the same cache bucket (which contains a link list of size #partitions) for most of the time and therefore the CPU couldn't be saturated.

Change-Id: I4afc406d612e569e23f65afdedc469459235ce7d

Reviewed-on: https://asterix-gerrit.ics.uci.edu/341

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Young-Seok Kim <kisskys@gmail.com>

Reviewed-by: Pouria Pirzadeh <pouria.pirzadeh@gmail.com>

Fix really bad typo in POM

- One change to a UUID result which doesn't correspond to its query

- Disable the one introspection test case that fails temporarily

Change-Id: I4e62a505754967c590f195bd997aba44ddb5a535

Reviewed-on: https://asterix-gerrit.ics.uci.edu/340

Reviewed-by: Taewoo Kim <wangsaeu@gmail.com>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Fix the temporary dataset life time threshold.

Change-Id: I076c085d4decfa23ab7413e85fe60458dd96d058

Reviewed-on: https://asterix-gerrit.ics.uci.edu/339

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

1. Fix the memory bloat issue introduced by https://github.com/apache/incubator-asterixdb/commit/c66d23a5ac65ec5218ee47134aea423fd62a32cc Recycle object pools in ADMDataParser before parsing each top-level ADM record.

2. Avoid using Strings as parameters of object pools because string comparison is slow and several comparisons done for parsing each ADM record.

Change-Id: I31848d8bd42a07f72035aefa2d671fdb2fce037d

Reviewed-on: https://asterix-gerrit.ics.uci.edu/334

Reviewed-by: Pouria Pirzadeh <pouria.pirzadeh@gmail.com>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Preston Carman <prestonc@apache.org>

Doc updates to HTTP API

-Rename "REST" API to HTTP API

-Minor documentation for /aql endpoint

Change-Id: I5c6520495752da312f4db7641fcfd14f7d63c829

Reviewed-on: https://asterix-gerrit.ics.uci.edu/331

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Chris Hillery <ceej@lambda.nu>

    • -1
    • +31
    /asterix-doc/src/site/markdown/api.md
Add missing documentation to sidebar, fix typos

Change-Id: Id957eb1321eb7539b951418178b989eb194ed625

Reviewed-on: https://asterix-gerrit.ics.uci.edu/330

Reviewed-by: Taewoo Kim <wangsaeu@gmail.com>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

NKC fix for int64

Change-Id: I60b3ec3f15921093b1fa11ac9fc157b7930f4e4d

Reviewed-on: https://asterix-gerrit.ics.uci.edu/329

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

Reviewed-by: Yingyi Bu <buyingyi@gmail.com>

Added the UUID constructor and changed the relevant documentation

Change-Id: I2276bf2b44975b13c99d56f448ae3e9e4a74b04e

Reviewed-on: https://asterix-gerrit.ics.uci.edu/326

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

    • -10
    • +10
    /asterix-app/data/pub-small/dblp-small-id-autogenerated-pk_including_uuid.adm
  1. … 19 more files in changeset.
Introducing data replication API to LSM indexes

Change-Id: I80565fc9d74e30440d2df5917911904ba8f33c25

Reviewed-on: https://asterix-gerrit.ics.uci.edu/322

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: abdullah alamoudi <bamousaa@gmail.com>

  1. … 25 more files in changeset.
This change includes the following: 1. allowing short-circuit reads to be used with external data. 2. enhancing behaviour of index access for Text data. 3. updating the external data documentation.

Rebased on current Master.

The following commits from your working branch will be included:

commit 3176d741b8ef187703c7346044c9531a99f2b716

Author: Abdullah Alamoudi <bamousaa@gmail.com>

Date: Wed Feb 4 18:32:24 2015 +0300

added changes from external experiments and reflected external data documentation to include external data indexing

Change-Id: I041c71391d8704cd800c4446a085beef197e7acf

Reviewed-on: https://asterix-gerrit.ics.uci.edu/218

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Raman Grover <ramangrover29@gmail.com>

    • -51
    • +164
    /asterix-doc/src/site/markdown/aql/externaldata.md
Add record type and primary key in the response of connector servlet api.

Change-Id: Ieefe79557cfb3786a6b22371a2e64ac4161ff900

Reviewed-on: https://asterix-gerrit.ics.uci.edu/315

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Ian Maxon <imaxon@apache.org>

New feed fixes: Updated the ExternalFunctionProvider to improve handling "setResult"

Change-Id: I7c026e2f3b927bda2628835c15318d6c96f8b043

Reviewed-on: https://asterix-gerrit.ics.uci.edu/321

Reviewed-by: Ian Maxon <imaxon@apache.org>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reconcile ASF and Gerrit master branches

Both parents of this merge commit are the same content-wise, excepting the commit message.

We have to make this commit to get both repositories back to a common version of master.

Adding introspection for getting record details.

Change-Id: Iba15acff0f6830cffbc33d9dac3b5f0c417be844

    • -0
    • +4
    /asterix-app/data/classifications/animals.adm
  1. … 182 more files in changeset.
Adding introspection for getting record details.

Change-Id: Iba15acff0f6830cffbc33d9dac3b5f0c417be844

Reviewed-on: https://asterix-gerrit.ics.uci.edu/313

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

  1. … 182 more files in changeset.
Temporarily disable flaky Feeds tests.

Change-Id: I9ddd27484b0b26b79954c17ccabd571d2791ee0e

Reviewed-on: https://asterix-gerrit.ics.uci.edu/312

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

Change Java package from edu.uci.ics to org.apache

Change-Id: I99172d856e88954b00cf7cfb24d33bb400f53994

Reviewed-on: https://asterix-gerrit.ics.uci.edu/308

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Till Westmann <tillw@apache.org>

    • -3
    • +3
    /algebricks/algebricks-compiler/pom.xml
  1. … 2019 more files in changeset.
Fix for metadata indexes first LSN

Change-Id: I6ce08ee38e49e0f0f0c2acd27b64415d771bda67

Reviewed-on: https://asterix-gerrit.ics.uci.edu/293

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: abdullah alamoudi <bamousaa@gmail.com>

Add Apache RAT License Auditor plugin, and fix missing licenses

Change-Id: Ic0fa0b7836e8a69d0d8beed83f9c3b41c168afd4

Reviewed-on: https://asterix-gerrit.ics.uci.edu/263

Reviewed-by: Chris Hillery <ceej@lambda.nu>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

  1. … 1530 more files in changeset.
Add Apache RAT License Auditor plugin, and fix missing licenses

Change-Id: I39d92ec6654c73b4e6b8ba76dd66770bb60c7b79

Reviewed-on: https://asterix-gerrit.ics.uci.edu/260

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Chris Hillery <ceej@lambda.nu>

Reviewed-by: Till Westmann <tillw@apache.org>

  1. … 18 more files in changeset.
YARN integration for AsterixDB

This is an initial version of YARN integration for AsterixDB.

- Uses static assignment of CC and NC nodes to NM locations

- Stores state locally on each NM, outside of HDFS

- "All or nothing" container allocation. We don't attempt to

move or rellocate containers the RM may kill (yet).

- Retains feature parity with managix.

Change-Id: I49c849179d17fc7faa446b9be57a0695df6836ab

Reviewed-on: https://asterix-gerrit.ics.uci.edu/161

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Murtadha Hubail <hubailmor@gmail.com>

    • -0
    • +103
    /asterix-common/src/main/resources/schema/yarn_cluster.xsd
    • -0
    • +236
    /asterix-doc/src/site/markdown/yarn.md
    • binary
    /asterix-doc/src/site/resources/images/asterix_webui.png
    • binary
    /asterix-doc/src/site/resources/images/running_inst.png
    • binary
    /asterix-doc/src/site/resources/images/yarn_clust.png
    • -0
    • +241
    /asterix-yarn/pom.xml
    • -0
    • +101
    /asterix-yarn/src/main/assembly/binary-assembly.xml
  1. … 51 more files in changeset.
Introduces Feeds 2.0

commit c3f577861fc705d848c1641605689cadd6973bae

Merge: ebc4cae fc0c2c0

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Fri Jun 26 13:04:05 2015 -0700

Merge branch 'raman/feeds_2_release' of https://code.google.com/p/asterixdb-sandbox into raman/feeds_2_release

Conflicts:

asterix-app/src/main/java/edu/uci/ics/asterix/api/http/servlet/FeedServlet.java

asterix-external-data/src/main/java/edu/uci/ics/asterix/external/library/java/JObjectAccessors.java

commit ebc4cae21a7302869f953df1ebda601e798d12d2

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Sat Jun 20 17:14:45 2015 -0700

Introduces Feeds 2.0

Some of the prominent chnages introduced are as follows

a) Support for building a cascade network of feeds (via secondary feeds feature)

b) Feed Management Console for tracking active feeds and associated metrics

c) Support for elastic runtime for data ingestion

d) Improved fault-tolerance with support for logging of failed records

Documentation has been added at asterix-doc/src/site/markdown/feeds/

commit fc0c2c0549a6ee8b202e57607d2e110478cd57bb

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Sat Jun 20 17:14:45 2015 -0700

Introduces Feeds 2.0

Some of the prominent chnages introduced are as follows

a) Support for building a cascade network of feeds (via secondary feeds feature)

b) Feed Management Console for tracking active feeds and associated metrics

c) Support for elastic runtime for data ingestion

d) Improved fault-tolerance with support for logging of failed records

Documentation has been added at asterix-doc/src/site/markdown/feeds/

Change-Id: I498f01c591a229aaf51cec43ab20f3e5c4f072f4

Reviewed-on: https://asterix-gerrit.ics.uci.edu/297

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

Reviewed-by: abdullah alamoudi <bamousaa@gmail.com>

    • -0
    • +12
    /asterix-app/data/twitter/obamatweets_duplicate.adm
  1. … 369 more files in changeset.
Changes to support Feeds 2.0 (random partitioning of tuples)

commit 8b5c352d831aa0d7e006457f0b1430ac12b54731

Author: ramangrover29 <ramangrover29@gmail.com>

Date: Sat Jun 20 17:16:11 2015 -0700

Changes to support Feeds 2.0 (random partitioning of tuples)

Change-Id: I712c1f019cbc43f66d50620772c3df03c5944394

Reviewed-on: https://asterix-gerrit.ics.uci.edu/296

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

Make managix scripts properly executable

Most of the scripts managix was using were missing the shebang at the beginning of the script.

This can be problematic if one uses a shell besides bash. This patch fixes that.

Change-Id: Ie7722f832d0d25bec049e123a0ed1570d0aeb650

Reviewed-on: https://asterix-gerrit.ics.uci.edu/286

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Raman Grover <ramangrover29@gmail.com>

  1. … 6 more files in changeset.
Remove erroneous casts of list item offsets

The offset of an item in a list is read and written as an int,

but for some reason we were casting it down to a short when

recording the offsets during list building.

Change-Id: Ic058aa219e016d5efd719ddec0d1bb90a23e0c48

Reviewed-on: https://asterix-gerrit.ics.uci.edu/292

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>

Reviewed-by: Steven Jacobs <sjaco002@ucr.edu>

Fix for issue 872

Change-Id: If8cf621b8b89302cb28bd34f63e23f9c5f684351

Reviewed-on: https://asterix-gerrit.ics.uci.edu/290

Reviewed-by: Ildar Absalyamov <ildar.absalyamov@gmail.com>

Reviewed-by: Ian Maxon <imaxon@apache.org>

Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>