Apache® Spark™ is the open-source, in-memory computing framework for distributed data processing. One of the nicest things about this technology is that it features a simple programming model that hides the complexity inherent to distributed computing. As an added bonus, the APIs come in multiple flavors: Scala, Java, Python, and R. Added integration with SWIFT Object Storage, Cloudant, Db2 Warehouse on Cloud (formerly dashDB), SQLDB and other IBM Cloud Data Services makes development and analytics with Apache Spark more accessible, centralized and useful.
Access your Spark instance through IBM Cloud or IBM Watson Studio (formerly Data Science Experience):
Use Apache Spark through IBM Cloud
Watch this video to see how to provision an Apache Spark instance through IBM Cloud.
Next steps on IBM Cloud
- On IBM Cloud, use spark-submit which to run Spark jobs programmatically. Learn more about using spark-submit with IBM Cloud.
- Read how to Get started on IBM Cloud.
Use Apache Spark through IBM Watson Studio
Watch this video to see how to create a Watson Studio project.