Skip to content

Latest commit

 

History

History
42 lines (34 loc) · 1.25 KB

README.md

File metadata and controls

42 lines (34 loc) · 1.25 KB

aws-logs-serde

Hive deserializer for various Aws log formats.

Usage

Download the latest release from [here](https://github.com/federicob/aws-logs-serde/releases/latest directly) on your Hive client.

Inside your script or from the Hive interactive shell you should add first the jar and then create the table specifying the row format.

add jar /home/hadoop/aws-logs-serde-0.1.jar;

create table s3_accesslogs(
bucketowner             string,
bucketname              string,
rdatetime               string,
rip                     string,
requester               string,
requestid               string,
operation               string,
rkey                    string,
requesturi              string,
httpstatus              int   ,
errorcode               string,
bytessent               int   ,
objsize                 int   ,
totaltime               int   ,
turnaroundtime          int   ,
referer                 string,
useragent               string,
versionid               string
)
ROW FORMAT SERDE 'com.amazonaws.hive.serdes.s3.S3LogDeserializer'
LOCATION 's3://<yours3bucket>/<yourprefixifany>/';

Status

Build Status