Published August 15, 2023
Author Sarika Mehta, Kapil Karkra, Wayne Gao
Current data center real-estate and power consumption budgets are trying to keep up with unprecedented data growth and the need for sufficient storage performance to feed that data back to users or to AI training models. The industry is seeing a thirst for high-density storage to maximize the usage of existing data center space.
While the HDD segment is working hard to find ways to increase density and performance, Solidigm’s 3D NAND QLC SSDs have already achieved both and, in fact, have been in production since 2018.
With the newest 61.44TB D5-P5336 3D NAND QLC SSD, Solidigm has launched its largest capacity NAND drive yet, which comes with a cost and performance benefit that can help data centers achieve higher density.
Read-intensive applications can already benefit from high-density QLC SSDs like the Solidigm D5-P5336, but what about mixed workloads and data placement applications?
To further expand the application benefits of high-density Solidigm 61.44TB SSDs with mixed workloads and emergent NVMe designs like data placement technologies such as streaming, flexible data placement (FDP), and zoned namespace (ZNS) the Solidigm team is leveraging CSAL, the Cloud Storage Acceleration Layer, a new open-source cloud-scale share-nothing storage software layer (bdev, i.e., block device) in the Storage Performance Development Kit (SPDK).
CSAL allows design flexibility to tune endurance of the SSD for the entire platform. This allows the solution to maintain high application write performance through Emerging Storage Class Memory (SCM) SSDs, such as Solidigm’s first-generation D7-P5810 SSDs, and optimizes TCO by leveraging low-cost and high-density QLC storage while taking advantage of the TLC-equivalent read performance offered by Solidigm’s QLC SSDs.  Figure 1 shows memory tiering hierarchy with the slowest tier at the bottom of the pyramid and the fastest tier at the top.
In traditional cache architecture, high-performance storage, such as a storage class memory (SCM) SSD, is put in front of primary storage like a QLC SSD. Instead of writing data to primary storage directly, writes are acknowledged to users or applications as soon as data is written to the cache tier. Then, data is written back to the capacity tier.
Traditional caches can help high-density NAND media to maintain write performance per TB and boost endurance for high-temporal-locality workloads. For example, a high-performance, high-endurance SCM tier can absorb frequently updated writes without sending them to the QLC NAND tier.
The key strategy of CSAL is to leverage an SCM SSD as the cache to compact and shape user random writes to SSD-friendly writes. The goal of a CSAL design is to minimize the system-level write amplification and the wear for NAND SSDs, hence improving overall performance and system endurance of NAND-based primary storage.
CSAL improves on traditional cache technologies in three ways:
Figure 2 below shows the major differences between a traditional write cache and a write shaping cache.
In a given scenario, CSAL is implemented in SPDK for high-performance storage systems. SPDK offers a full- stack storage system from a logic volume, a generic block layer to an NVMe driver. CSAL is implemented in the SPDK block layer and exposed as a virtual block device that consists of two physical block devices:
Storage applications, such as NVMe-oF (NVMe over Fabrics), can use this virtual block device as a generic block device.
Figure 3 shows the overall architecture of CSAL. There are several key points to highlight:
To achieve the above data transition, CSAL manages four key components:
CSAL software solution architecture is not limited to any specific hardware architecture and can be deployed on various server architectures including Intel, AMD, ARM, IPU/DPU, and GPU etc. We are eager to see the open-source community’s involvement and organic growth of support for various architectures.
CSAL is a write-shaping cache that unleashes the value of high-density NAND flash media. By leveraging the host-side FTL, CSAL preserves the existing software interface while transforming any write workload to a sequential write workload. Furthermore, CSAL minimizes the frequency of writes by caching frequently updated or temporary data on P5810 SSDs.
With these two strategies, CSAL enhances endurance of the entire platform and delivers application performance. CSAL is a software-defined and flexible storage architecture for next-gen media and data placement technologies. It is easy to scale-out in data centers and can easily be tuned to various performance and TCO requirements.
Achieving Optimal Performance and Endurance on Coarse-grained Indirection Unit SSDs
Sarika Mehta is a Storage Solutions Architect with over 15 years of storage experience. Her focus is to work closely with Solidigm’s customers and partners to optimize their storage solutions for cost and performance.
Kapil Karkra is a Sr. Principal Engineer and the Chief Storage Platform Architect at Solidigm responsible for the architecture of Cloud Storage Acceleration Layer (CSAL), a host based FTL. His current focus is to define a turnkey Reference Storage Platform (RSP), both software and hardware, that helps develop insights about Cloud use cases, and speeds high-density NAND SSD development and adoption. Kapil has over 25 years of storage experience and has over 20 patent filings/grants. Kapil holds a bachelor’s degree in electrical engineering from National Institute of Technology (NIT) in India and an MBA from Arizona State University.
Wayne Gao is a Principal Engineer as Storage solution architect and worked on CSAL from PF to Alibaba commercial release. Wayne has over 20 years of storage developer experience as previous DellEMC ECS all flash object storage team and has 4 US patent filings/grants and 1 EuroSys paper published.