This IBMÂ® Redguideâ„˘ publication describes big data and analytics (BD&A) deployments that are built on IBM Spectrum Scaleâ„˘. Spectrum Scale is a proven enterprise-level distributed file system that is a high-performance and cost-effective alternative to Hadoop Distributed File System (HDFS) for Hadoop analytics services.
IBM Spectrum Scale includes NFS, SMB, and Object services and meets the performance that is required by many industry workloads, such as technical computing, big data, analytics, and content management. IBM Spectrum Scale provides world-class, web-based storage management with extreme scalability, flash accelerated performance, and automatic policy-based storage
tiering from flash through disk to the cloud, which reduces storage costs up to 90% while improving security and management efficiency in cloud, big data, and analytics environments.
This Redguide publication is intended for technical professionals (analytics consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for providing Hadoop analytics services and are interested in learning about the benefits of the use of Spectrum Scale as an alternative to HDFS.
Table of contents
IBM Spectrum Scale with Big Data and Analytics Solutions
IBM Spectrum Scale for Spark
IBM Spectrum Scale Features and Benefits for Big Data and Analytics
IBM Spectrum Scale Versus HDFS
When to Consider IBM Spectrum Scale for Big Data and Analytics Solution
IBM Spectrum Scale is a preferred platform for running Big Data and Analytics workloads. IBM Spectrum Scale in-place analytics for file and object data solves traditional analytics solution challenges.
HDFS Remote Procedure Call (RPC) based IBM Spectrum Scale HDFS transparency Hadoop connector provides enhanced High Availability (HA) capability, performance, and security for Big Data and Analytics workloads. The IBM Spectrum Scale Big Data and Analytics solution is deployed in Storage Rich Server architecture, SAN shared storage, and an integrated system Elastic Storage Server.
POSIX compatibility with various enterprise class features provides flexible data management and protection for Big Data and Analytics workloads.