Check out some basic features of the Apache Spark Data Source API V2, create your own data source, and explore push-down optimization techniques.
A tutorial for developers who want to customize their Spark application with their own optimizer, parser, analyzer, or physical planning strategy rules.
Learn how to use Apache Spark together with Jupyter Notebooks to analyze data that resides on z/OS and mainframe systems.
Alluxio is fast virtual storage for Big Data. Formerly known as Tachyon, it’s an open-source, memory-centric, virtual distributed storage system (yes, all that!), offering data...
Quickly set up a development environment for using Stocator with Apache Spark.