Apache Spark is the most recent data processing framework from open source. It is enormous scope information preparing engine that will in all likelihood supplant Hadoop’s MapReduce. Apache Spark and Scala are indivisible terms as in the most straightforward manner to start utilizing Spark are by means of the Scala shell. Be that as it may, it additionally offers support for Java and python. The system was created in UC Berkeley’s AMP Lab in 2009. So far there is a major gathering of 400 engineers from in excess of fifty organizations expanding on Spark. It is obviously an immense venture. The spark training in Pune has gotten one of the most looked for after exercises for any goal-oriented programming proficient. Apache Spark has been sought after since its dispatch.