Setting up Google Cloud Dataproc with Jupyter and Python 3 stack

Modern big data world is hard to imagine without Hadoop. It made a small revolution in how analysts deal with large amount of emerging data (before Hadoop, it used to be a torture). Sparkis “Hadoop 2.0”, it much improves on the original MapReduce engine.


This is a companion discussion topic for the original entry at https://blog.sourced.tech/post/dataproc_jupyter/