Oozie is formidable because it is entirely written in XML, which is hard to debug when things go wrong. However, once you've figured out how to work with it, it's like magic. Complex dependencies, managing a multitude of jobs at different time schedules, managing entire data pipelines are all made easy with Oozie.
Oozie allows you to manage Hadoop jobs as well as Java programs, scripts and any other executable with the same basic set up. It manages your dependencies cleanly and logically.
Knowing the right configurations parameters which gets the job done, that is the key to mastering Oozie
Install and set up Oozie
Configure Workflows to run jobs on Hadoop
Configure time-triggered and data-triggered Workflows
Configure data pipelines using Bundles
Students should have basic knowledge of the Hadoop eco-system and should be able to run MapReduce jobs on Hadoop
Working with Oozie requires some basic knowledge of the Hadoop eco-system and running MapReduce jobs.
Who is this course intended for?
Engineers, analysts and sysadmins who are interested in big data processing on Hadoop
This course is not recommended for the beginners who have no knowledge of the Hadoop eco-system.