hadoop - Reading compressed (.xz) file in Apache pig -
i trying read .xz file compressed using hadoop-xz codec using pig script.
the sample code tried is,
register hadoop-xz-1.4.jar set output.compression.enabled true; set output.compression.codec io.sensesecure.hadoop.xz.xzcodec; msg = load 'pigtest/newxz.xz' using pigstorage(); store msg 'pigtest/output' using pigstorage(); dump msg;
the result still in compressed format. doing wrong or have use xzinputstream
inside pig?
the running environment hortonworks sandbox 2.2 (hue)
depends on want do.
it seems want read xz file assume need setup input codec not output one.
i'm not pig user gather cannot handle custom compression (unlike hive , streaming example).
Comments
Post a Comment