Into Big Data

Courses:

Completed:

These two courses are must-learn if anyone want to know more about Apache Spark. Course instructors put tons of effort to design the course materials, especially the ipython notebooks for exercises. They are difficult, but so much fun! In my opinion, Apache Spark is gradually replacing Hadoop with more flexibility and functionality.

Ongoing:

  • Big Data Specialization on Coursera: So far, the first two courses in this specialization are more general. Students complained about doing a lot of copying rather than learning. I think as an introductory course, it is doing its job. I’m still looking forward to the rest of the courses.
  • Implementing Real-Time Analytics with Hadoop in Azure HDInsight by Microsoft on Edx
  • Implementing Predictive Analytics with Hadoop in Azure HDInsight by Microsoft on Edx

Resources:

In all, Big data is such an complex concept. We just cannot count on any single courses to learn it all. Currently, I feel like Apache Spark is the future and plan on focusing on it.

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s