Tag: oozie

Avro MapReduce jobs in Oozie

Normally, when using Avro files as input or output to a MapReduce job, you write a Java main() method to set up the Job using AvroJob. That documentation page does a good job of explaining where to use AvroMappers, AvroReducers,…


Making Oozie Logs A Little Easier On The Eyes

Today we’re having a quick one. Earlier in the day, I had to peruse an Oozie log for the first time. And it looked like: 2014-02-11 20:13:14,211 INFO ActionStartXCommand:539 - USER[running_user] GROUP[-] TOKEN[] APP[some-big-job-workflow] JOB[0004636-140111040403753-oozie-W] ACTION[0004636-140111040403753-oozie-W@:start:] Start action [0004636-140111040403753-oozie-W@:start:] with…


Oozing Caribou

Meet Oozie’s Workflows
Oozie is a workflow scheduler for Hadoop, but that’s not terribly important right now. What is important is that it defines its workflows using an XML dialect. And as all XML things go, the result is… shall…
