Move Pregelix and Hivesterix codebase to new repositories: 1. Move Pregelix codebase to; 2. Move Hivesterix codebase to .

Support big vertices in Pregelix. -- Vertices larger than the page size are stored on HDFS as immutable files. -- An update to such a big vertex triggers the creation of a new immutable file.

Several major changes in hyracks:

-- reduced CC/NC communications for reporting partition requests and availability; partition requests/availability are now reported only for send-side materialized (non-pipelined) policies, in case of task re-attempt

-- changed the buffer cache to allocate memory dynamically, as needed, instead of pre-allocating it

-- changed each network channel to allocate memory lazily, and changed materialized connectors to allocate files lazily

-- changed several major CCNCFunctions to use non-Java serde

-- added a sort-based group-by operator which pushes group-by aggregations into an external sort

-- made external sort a stable sort

Changes 1, 3, and 4 reduce the job overhead.

Change 2 reduces unnecessary NC resource consumption, such as memory and files.

5 and 6 are improvements to runtime operators.
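
The idea behind a sort-based group-by can be sketched in a few lines: once records are sorted by key, equal keys are adjacent, so aggregation is a single pass over the sorted run. The class and method names below are hypothetical and this is an in-memory illustration only; it does not model the Hyracks operator, which folds the aggregation into the external sort itself.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical illustration, not Hyracks code.
public class SortGroupBySketch {
    /** Sorts (key, value) pairs by key, then sums the values of each run of equal keys. */
    static List<long[]> sortThenAggregate(long[][] pairs) {
        long[][] sorted = pairs.clone(); // shallow copy; the input rows are never modified
        // Arrays.sort on object arrays is documented to be stable, so records
        // with equal keys keep their original relative order.
        Arrays.sort(sorted, Comparator.comparingLong((long[] p) -> p[0]));
        List<long[]> groups = new ArrayList<>();
        for (long[] p : sorted) {
            int last = groups.size() - 1;
            if (last >= 0 && groups.get(last)[0] == p[0]) {
                groups.get(last)[1] += p[1];            // same key: fold into the running sum
            } else {
                groups.add(new long[] { p[0], p[1] });  // new key: start a new group
            }
        }
        return groups;
    }

    public static void main(String[] args) {
        long[][] input = { {3, 1}, {1, 10}, {3, 2}, {2, 5}, {1, 20} };
        for (long[] g : sortThenAggregate(input)) {
            System.out.println(g[0] + " -> " + g[1]);
        }
    }
}
```

Running main prints "1 -> 30", "2 -> 5", and "3 -> 3", one group per line.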

One change in algebricks:

-- implemented a rule to push group-by aggregation into sort, i.e., using the sort-based gby operator

Several important changes in pregelix:

-- remove static states in vertex

-- check the halt bit directly, without deserialization

-- optimize the sort algorithm by packing yet-another 2-byte normalized key into the tPointers array
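
A minimal sketch of the normalized-key idea, assuming a pointer array with two int slots per record (record index, 2-byte key prefix). The layout, class name, and insertion sort are hypothetical stand-ins chosen for brevity, not the actual Pregelix tPointers format. Most comparisons are resolved by the packed prefix alone; the full key is read only when the prefixes tie.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration, not Pregelix code.
public class NormalizedKeySketch {
    /** First two key bytes (zero-padded), packed into an int in the range 0..65535. */
    static int normalizedKey(byte[] key) {
        int b0 = key.length > 0 ? (key[0] & 0xFF) : 0;
        int b1 = key.length > 1 ? (key[1] & 0xFF) : 0;
        return (b0 << 8) | b1;
    }

    /** Unsigned lexicographic comparison of the full keys. */
    static int compareFull(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int c = (a[i] & 0xFF) - (b[i] & 0xFF);
            if (c != 0) return c;
        }
        return a.length - b.length;
    }

    /** Cheap prefix comparison first; fall back to the full key only on a tie. */
    static int compare(byte[][] keys, int recA, int nkA, int recB, int nkB) {
        if (nkA != nkB) return nkA - nkB; // both in 0..65535, so no overflow
        return compareFull(keys[recA], keys[recB]);
    }

    /** Returns the record indices in ascending key order. */
    static int[] sortedOrder(byte[][] keys) {
        int n = keys.length;
        int[] ptrs = new int[2 * n]; // assumed layout: [recordIndex, normalizedKey] per record
        for (int i = 0; i < n; i++) {
            ptrs[2 * i] = i;
            ptrs[2 * i + 1] = normalizedKey(keys[i]);
        }
        // Insertion sort over the pointer slots (kept simple on purpose).
        for (int i = 1; i < n; i++) {
            int rec = ptrs[2 * i], nk = ptrs[2 * i + 1];
            int j = i - 1;
            while (j >= 0 && compare(keys, ptrs[2 * j], ptrs[2 * j + 1], rec, nk) > 0) {
                ptrs[2 * (j + 1)] = ptrs[2 * j];
                ptrs[2 * (j + 1) + 1] = ptrs[2 * j + 1];
                j--;
            }
            ptrs[2 * (j + 1)] = rec;
            ptrs[2 * (j + 1) + 1] = nk;
        }
        int[] order = new int[n];
        for (int i = 0; i < n; i++) order[i] = ptrs[2 * i];
        return order;
    }

    public static void main(String[] args) {
        String[] words = { "pear", "apple", "peach", "fig", "apricot" };
        byte[][] keys = new byte[words.length][];
        for (int i = 0; i < words.length; i++) {
            keys[i] = words[i].getBytes(StandardCharsets.US_ASCII);
        }
        System.out.println(Arrays.toString(sortedOrder(keys)));
    }
}
```

Because the prefix sits next to the record pointer, the comparator avoids touching the record data in the common case, which is the cache-friendliness such packing is after.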

fix conflict in comment

add support for Hadoop Counters via job.setCounterAggregatorClass

PregelixJob.setCounterAggregatorClass sets up a user-specified global aggregator and an iterationComplete hook to save Counter values. The user-specified Counter-based aggregator (which must extend HadoopCountersAggregator) is saved to HDFS in each iteration and should be restart/snapshot-aware.

To set up counters, call job.setCounterAggregatorClass. After job completion, the Counters may be retrieved from HDFS using BspUtils.getCounters(job).

Note that there is currently only one slot for iterationComplete hooks, and this behavior occupies it.

api for specifying update state for activate() and voteToHalt()

Merge branch 'master' into zheilbron/hyracks_msr_demo

fix IIndexAccessor interface, add a boolean exclusiveMode parameter for the createSearchCursor method

fix file write race condition

Merge branch 'master' into dev/hyracks_msr

merge from zheilbron/hyracks_msr

use hadoop's vlong and fix driver's re-attempt to be 1

add partition early termination support

address Vinayak's comments on open/close of vertex

add message overflow support

refactor dataflow operators to be more general -- support both B-tree and LSM B-tree

add the support for customized partitioner

add/update license headers

add normalized key computer support in Pregelix

Merged fullstack_lsm_staging upto r3336

fix the halt-activate issue reported by Anbang

cross merge fullstack_release_candidate into trunk

Merge fullstack_asterix_stabilization with fullstack_hyracks_result_distribution.

Merged fullstack_asterix_stabilization -r 3157:3163

Merged fullstack_asterix_stabilization -r 2933:3157

fix an issue reported by anbang

Merge fullstack_asterix_stabilization into fullstack_hyracks_result_distribution.

fix addVertex in the inner join plan

fix addVertex and add a test case

Merged fullstack_asterix_stabilization -r 2813:2933

