关键词:数据处理;自动化;一致性
摘 要:Big data in its raw form rarely satisfies the Hadoop developer's data requirements forperforming data processing tasks. Different extract/transform/load (ETL) and pre-processingoperations are usually needed before starting any actual processing jobs. Oozie is a frameworkthat helps automate this process and codify this work into repeatable units or workflows thatcan be reused over time without the need to write any new code or steps. Learn how Oozie canbe used to create different types of workflows.