It does not have its own storage system, but runs analytics on other storage systems like hdfs, or other popular stores like amazon redshift, amazon s3, couchbase, cassandra, and others. It is available in either scala (which runs on the java vm and is thus a good way to use existing java libraries) or python. Spark provides an interface for programming clusters with implicit data parallelism and fault tolerance.
Spark has libraries for cloud sql, streaming, machine learning, and graphs. Spark’s shell provides a simple way to learn the api, as well as a powerful tool to analyze data interactively All spark examples provided in this apache spark tutorial for beginners are basic, simple, and easy to practice for beginners who are enthusiastic about learning spark, and these sample examples were tested in our development environment.