JavierJia <jianfeng.jia@gmail.com> committed on 23 Oct 15
ASTERIXDB-1102: VarSize Encoding to store length of String and ByteArray

This patch changes the encoding that stores the length of variable-length
types (e.g. String, ByteArray) from a fixed-size encoding (2 bytes) to a
variable-size encoding (1 to 5 bytes). It resolves issue 1102 by making it
possible to store a String longer than 64K. In the common case of a short
string (length <= 127), it also saves one byte per string.
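
To illustrate how a 1-to-5-byte length prefix can work, here is a minimal
sketch of one common variable-size scheme: 7 payload bits per byte, with the
high bit set on every byte except the last. The class and method names
(VarLenSketch, encodedLength, encode, decode) are hypothetical, and the byte
order is an assumption; this is not the code from the patch and may differ
from the layout Hyracks actually uses.

```java
// Hypothetical sketch of a variable-size length prefix; not the patch's code.
final class VarLenSketch {

    // Number of bytes needed for a non-negative length: 1 up to 5 for a 32-bit int.
    static int encodedLength(int length) {
        if (length < 0) {
            throw new IllegalArgumentException("length must be non-negative");
        }
        int bytes = 1;
        while ((length >>>= 7) != 0) {
            bytes++;
        }
        return bytes;
    }

    // Encode most-significant group first (an assumed byte order).
    // Every byte except the last carries the continuation bit 0x80.
    static byte[] encode(int length) {
        int n = encodedLength(length);
        byte[] out = new byte[n];
        for (int i = n - 1; i >= 0; i--) {
            int group = length & 0x7F;
            out[i] = (byte) (i == n - 1 ? group : group | 0x80);
            length >>>= 7;
        }
        return out;
    }

    // Decode a length prefix starting at offset; stops at the first byte
    // whose continuation bit is clear.
    static int decode(byte[] bytes, int offset) {
        int value = 0;
        for (int i = 0; i < 5; i++) {
            int b = bytes[offset + i] & 0xFF;
            value = (value << 7) | (b & 0x7F);
            if ((b & 0x80) == 0) {
                return value;
            }
        }
        throw new IllegalArgumentException("length prefix longer than 5 bytes");
    }
}
```

Under this scheme a length of 127 or less fits in a single byte, which is
where the one-byte saving over the old 2-byte prefix comes from, while a
length of 65536 takes 3 bytes and the largest int takes 5.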

Some important changes include:

1. Add a hyracks-util package to consolidate all the Hyracks-independent
   utility functions. This reduces the chance of duplicate utilities
   appearing in different packages.

2. Move parts of the Asterix string functions down into the Hyracks
   UTF8StringPointable object, which benefits other projects that build
   on Hyracks, such as VXQuery.

Change-Id: I7e95df0f06984b784ebac2c84b97e56a50207d27
Reviewed-on: https://asterix-gerrit.ics.uci.edu/449
Tested-by: Jenkins <jenkins@fulliautomatix.ics.uci.edu>
Reviewed-by: Taewoo Kim <wangsaeu@gmail.com>
Reviewed-by: Jianfeng Jia <jianfeng.jia@gmail.com>
