These two courses are a must for anyone who wants to learn more about Apache Spark. The instructors put a lot of effort into designing the course materials, especially the IPython notebooks for the exercises. They are difficult, but so much fun! In my opinion, Apache Spark is gradually replacing Hadoop because it offers more flexibility and functionality.
- Big Data Specialization on Coursera: So far, the first two courses in this specialization are fairly general. Students have complained about doing a lot of copying rather than learning, but I think that, as an introduction, it is doing its job. I’m still looking forward to the rest of the courses.
- Implementing Real-Time Analytics with Hadoop in Azure HDInsight by Microsoft on edX
- Implementing Predictive Analytics with Hadoop in Azure HDInsight by Microsoft on edX
All in all, big data is such a complex subject that we cannot count on any single course to teach it all. Currently, I feel that Apache Spark is the future, and I plan to focus on it.