Normally, when using Avro files as input or output to a MapReduce job, you write a Java main() method to set up the Job using AvroJob. That documentation page does a good job of explaining where to use AvroMappers, AvroReducers,…
Today we’re having a quick one. Earlier today, I had to peruse an Oozie log for the first time, and it looked like this: 2014-02-11 20:13:14,211 INFO ActionStartXCommand:539 – USER[running_user] GROUP[-] TOKEN[] APP[some-big-job-workflow] JOB[0004636-140111040403753-oozie-W] ACTION[[email protected]:start:] Start action [[email protected]:start:] with…
Meet Oozie’s Workflows
Oozie is a workflow scheduler for Hadoop, but that’s not terribly important right now. What is important is that it defines its workflows using an XML dialect. And as all XML things go, the result is… shall…