  1. Apache Spark Tutorial - Online Tutorials Library

    This tutorial has been prepared for professionals aspiring to learn the basics of Big Data Analytics using Spark Framework and become a Spark Developer. In addition, it would be useful for …

  2. Prerequisite - Online Tutorials Library

    Apache Spark computation. It is based on Hadoop MapReduce and it extends the MapReduce model to efficiently use it for more types of computations, which include interactive queries …

  3. Spark SQL Tutorial - Online Tutorials Library

    This tutorial has been prepared for professionals aspiring to learn the basics of Big Data Analytics using Spark Framework and become a Spark Developer. In addition, it would be useful for …

  4. Apache Spark - Introduction - Online Tutorials Library

    Apache Spark is a lightning-fast cluster computing technology, designed for fast computation. It is based on Hadoop MapReduce and it extends the MapReduce model to efficiently use it for …

  5. PySpark Tutorial - Online Tutorials Library

    This is an introductory tutorial, which covers the basics of PySpark and explains how to deal with its various components and sub-components. PySpark is the Python API for …

  6. Apache Spark - Quick Guide - Online Tutorials Library

    Apache Spark is a lightning-fast cluster computing technology, designed for fast computation. It is based on Hadoop MapReduce and it extends the MapReduce model to efficiently use it for …

  7. Apache Spark - Core Programming - Online Tutorials Library

    Spark uses a specialized fundamental data structure known as an RDD (Resilient Distributed Dataset), a logical collection of data partitioned across machines.

  8. Apache Spark - RDD - Online Tutorials Library

    Spark uses the concept of RDDs to achieve faster and more efficient MapReduce operations. Let us first discuss how MapReduce operations take place and why they are not so efficient.

  9. Apache Spark - Installation - Online Tutorials Library

    Spark is Hadoop's sub-project. Therefore, it is better to install Spark on a Linux-based system. The following steps show how to install Apache Spark.

  10. Advanced Spark Programming - Online Tutorials Library

    Spark allows you to perform different operations on numeric data, using one of the predefined API methods. Spark's numeric operations are implemented with a streaming algorithm that allows …