Skip to content

federicobaldo/aws-logs-serde

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

aws-logs-serde

Hive deserializer for various Aws log formats.

Usage

Download the latest release from [here](https://github.com/federicob/aws-logs-serde/releases/latest directly) on your Hive client.

Inside your script or from the Hive interactive shell you should add first the jar and then create the table specifying the row format.

add jar /home/hadoop/aws-logs-serde-0.1.jar;

create table s3_accesslogs(
bucketowner             string,
bucketname              string,
rdatetime               string,
rip                     string,
requester               string,
requestid               string,
operation               string,
rkey                    string,
requesturi              string,
httpstatus              int   ,
errorcode               string,
bytessent               int   ,
objsize                 int   ,
totaltime               int   ,
turnaroundtime          int   ,
referer                 string,
useragent               string,
versionid               string
)
ROW FORMAT SERDE 'com.amazonaws.hive.serdes.s3.S3LogDeserializer'
LOCATION 's3://<yours3bucket>/<yourprefixifany>/';

Status

Build Status

About

Hive deserializer for various Aws log formats.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages