We are considering integrating Apache Spark into our calculation process, where we originally planned to use Apache Oozie with standard MapReduce (MR) or map-only (MO) jobs.
After some research, several questions remain:
- Is it possible to orchestrate an Apache Spark process using Apache Oozie? If yes, how?
- Is Oozie still necessary, or could Spark handle the orchestration by itself? (Unification seems to be one of Spark's main design goals.)
Please consider the following scenarios when answering:
- executing a workflow every 4 hours
- executing a workflow whenever specific data becomes available
- triggering a workflow and configuring it with parameters
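For context, here is the kind of thing we have in mind for the first and third scenarios: a minimal Oozie coordinator sketch that runs a workflow every 4 hours and passes a parameter to it. All paths, dates, and property names below are placeholders, not our actual setup:

```xml
<!-- Runs the workflow at ${app-path} every 4 hours between start and end. -->
<coordinator-app name="calc-every-4h" frequency="${coord:hours(4)}"
                 start="2015-01-01T00:00Z" end="2016-01-01T00:00Z"
                 timezone="UTC" xmlns="uri:oozie:coordinator:0.4">
  <action>
    <workflow>
      <!-- Placeholder path to the workflow definition on HDFS -->
      <app-path>${nameNode}/user/${user.name}/apps/calc-wf</app-path>
      <configuration>
        <!-- Example of passing a parameter into the workflow -->
        <property>
          <name>inputDir</name>
          <value>${nameNode}/data/input</value>
        </property>
      </configuration>
    </workflow>
  </action>
</coordinator-app>
```

The second scenario (triggering on data availability) would presumably use coordinator `<datasets>` and `<input-events>` instead of a pure time-based frequency, but we are unsure whether this is the recommended approach with Spark jobs.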
Thanks in advance for your answers.