Shark
Hive on Spark

Cliff Engle, Antonio Lupher, Reynold Xin, Matei Zaharia, Michael Franklin, Ion Stoica, Scott Shenker

Spark Review

Resilient distributed datasets (RDDs):
- Immutable, distributed collections of objects
- Can be cached in memory for fast reuse

Operations on RDDs:
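
To make the two RDD properties named above concrete (immutability and in-memory caching), here is a minimal single-process sketch in Python. It is a toy model, not the Spark API: the class `ToyRDD`, its `compute_count` counter, and the sample data are all hypothetical names introduced for illustration, and nothing here is actually distributed across machines.

```python
from typing import Callable, Iterable, List, Optional


class ToyRDD:
    """Hypothetical single-process stand-in for an RDD; NOT the Spark API."""

    def __init__(self, compute: Callable[[], Iterable]):
        self._compute = compute            # recipe that produces the elements
        self._cached: Optional[List] = None
        self._cache_enabled = False
        self.compute_count = 0             # how many times the recipe really ran

    def map(self, fn: Callable) -> "ToyRDD":
        # Immutability: this dataset is left untouched; a new one is returned.
        return ToyRDD(lambda: [fn(x) for x in self.collect()])

    def filter(self, pred: Callable) -> "ToyRDD":
        # Same pattern: derive a new dataset from this one.
        return ToyRDD(lambda: [x for x in self.collect() if pred(x)])

    def cache(self) -> "ToyRDD":
        # Ask for the result to be kept in memory after the first computation.
        self._cache_enabled = True
        return self

    def collect(self) -> List:
        # Serve from memory when a cached copy exists.
        if self._cached is not None:
            return list(self._cached)
        self.compute_count += 1
        result = list(self._compute())
        if self._cache_enabled:
            self._cached = result
        return list(result)


base = ToyRDD(lambda: range(1, 6))
squares = base.map(lambda x: x * x).cache()

first = squares.collect()     # computed from base
second = squares.collect()    # served from the in-memory copy
evens = squares.filter(lambda x: x % 2 == 0).collect()  # reuses the cached squares
```

In this sketch `squares` is computed only once even though it is read three times (twice directly and once through `filter`), which is the "fast reuse" the slide refers to; `base` itself is never modified by `map`.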