Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Apache Oozie Essentials

You're reading from  Apache Oozie Essentials

Product type Book
Published in Dec 2015
Publisher
ISBN-13 9781785880384
Pages 164 pages
Edition 1st Edition
Languages
Author (1):
Jagat Singh Jagat Singh
Profile icon Jagat Singh

Running our first Oozie job


We will start with a very simple example. In this chapter, our use case is to delete a given folder on HDFS via Oozie. In our case study, we get data daily in one folder in HDFS, but we want to delete the previous day's data. We want to keep just latest version in our system. Let's solve our business problem:

  1. Log in to Hue and go to Workflows | Editor.

  2. In the top row of editor, there are various types of actions. Select the Hadoop Fs action.

    Tip

    Take some time with your mouse over and read the names of various types of actions that Oozie can run.

  3. Drag the Hadoop Fs action to the editor as shown in the next screenshot.

  4. Give a meaningful name to this action, for example, my_delete_folder_action.

  5. Give the path of the folder that you want to delete. I have used /user/hue/learn_oozie/my_first_oozie_job. I have also set the name of the Workflow as My First Oozie Job, as shown in the following screenshot:

    Hue Workflow editor

  6. Make these changes and click on Save for the Workflow...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}