Question d’entretien chez IBM

Used Spark API over Hadoop YARN to perform analytics on data in Hive. Developed job processing scripts for Oozie workflow. Scheduling Oozie workflow engine to run multiple Hive and spark jobs. Created Python scripts to read, process and store required details from files. Developed Spark code based on business requirements.