Apache Spark is an open-source unified analytics engine for large-scale data processing. Spark provides an interface for programming entire clusters with implicit data parallelism and fault tolerance. Originally developed at the University of California, Berkeley's AMPLab, the Spark codebase was later donated to the Apache Software Foundation, which has maintained it since.
Edit details Edit relations Attach new author Attach new topic Attach new resource
7.0 rating 3.0 level 8.0 clarity 2.0 background – 1 rating
We compare Hadoop vs Spark platforms in multiple categories including use cases. Which big data frame...