Hadoop Eco System (HDFS, yarn, mapreduce, oozie, hive), Spark core, Scala, Spark SQL, core Java, ETL tools (any tool such as AbInitio, Talend, SyncSort, Informatica, SSIS, DataStage, Kettle, etc.)