A Fault Tolerant Abstraction For In Memory Cluster Computing
Abstract we present resilient distributed datasets rdds a dis tributed memory abstraction that allows programmers to perform in memory computations on large clusters while retaining the fault tolerance of data ow models like mapreduce.
A fault tolerant abstraction for in memory cluster computing. Rdds are motivated by two types of applications that current computing frameworks handle inefficiently. A fault tolerant abstraction for in memory cluster computing. Rdds are motivated by two types of applications that current data ow systems handle in. Rdds are motivated by two types of applications that current computing frameworks han dle inefciently.
We present resilient distributed datasets rdds a distributed memory abstraction that lets programmers perform in memory computations on large clusters in a fault tolerant manner. In both cases keeping data in memory. Iterative algorithms and interactive data mining tools.