You can try CloudxLab . Although it is not free, it is quite affordable. It provides a complete environment for practicing Hadoop, Spark, Kafka, Hive, Pig, HBase, Oozie, Zookeeper, Flume, Sqoop, Mahout, R, Linux, Python, Scala, NumPy, Scipy, scikit-learn, etc. You will not need to install or configure any software on your local machine to use CloudxLab. Many of the popular trainers already use CloudxLab.
source
share