Most new arrays stripe data across their spindles automatically to increase performance and better use disk capacity. With capabilities like that, RAID could become a thing of the past.
Most Storage readers don't require an introduction to the concepts of RAID. But the rules of the game are changing. As recently as five years ago, storage administrators were constantly challenged by die-hard application administrators to control data placement on the disk array at a very granular level. Not content with merely specifying "table spaces on RAID 5 and logs on RAID 0/1," some database administrators asked for particular data stripe placement on the platter itself so that, for example, the highest-usage online transaction processing (OLTP) instances could get better performance by being written to volumes occupying the outermost disk cylinders.
Caches keep getting larger, improving performance as they grow. Only when your application seeks data outside the cache (a cache "miss") does the selection of RAID level affect performance. With new cache algorithms and proper tuning, cache misses can be kept to a minimum. With cache "hit" rates of 90% or more, RAID level selection will have a relatively minor impact. This hasn't gone unnoticed by storage array vendors.
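To put rough numbers on the cache argument, here is a minimal Python sketch of the usual weighted-average latency estimate. The latency figures and the names `effective_latency_ms`, `raid_gap_ms`, `CACHE_MS` and `MISS_MS` are assumptions chosen purely for illustration; they are not measurements from any array.

```python
def effective_latency_ms(hit_ratio, cache_ms, miss_ms):
    """Average I/O latency when hit_ratio of requests are served from cache."""
    if not 0.0 <= hit_ratio <= 1.0:
        raise ValueError("hit_ratio must be between 0 and 1")
    return hit_ratio * cache_ms + (1.0 - hit_ratio) * miss_ms


# Hypothetical service times in milliseconds (illustration only).
CACHE_MS = 0.5
MISS_MS = {"RAID 5": 12.0, "RAID 0/1": 8.0}


def raid_gap_ms(hit_ratio):
    """Difference in average latency between the two assumed RAID levels."""
    slow = effective_latency_ms(hit_ratio, CACHE_MS, MISS_MS["RAID 5"])
    fast = effective_latency_ms(hit_ratio, CACHE_MS, MISS_MS["RAID 0/1"])
    return slow - fast


if __name__ == "__main__":
    for ratio in (0.50, 0.90, 0.99):
        print(f"hit ratio {ratio:.0%}: RAID gap = {raid_gap_ms(ratio):.2f} ms")
```

Under these assumed figures, the 4 ms difference between RAID levels on a miss shrinks to roughly 0.4 ms of average latency at a 90% hit rate, because only one request in ten ever reaches the disks.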
Storage administrators have always been confronted with the task of matching the application storage workload to the storage configuration. For many enterprise applications, this goes beyond the simple "logs on 0/1, tables on 5" to complex volume management involving both hardware and software RAID. Some array vendors actually recommend such techniques as software RAID 0 striping across their array controllers. But the storage workload profile of an application can change over time, requiring constant re-tuning to maintain performance.
Many storage administrators, managing hundreds of terabytes of data, are beginning to realize that twiddling knobs to achieve better performance isn't as much fun as it used to be when 2TB was considered a very big array. Today, the hottest trend in storage arrays is data stripe abstraction. Blocks of data are written to many disks in the array according to a pattern determined by the intelligence in the array; the RAID level, as it were, is fixed and not configurable. In the most advanced arrays, the data block layout pattern can change dynamically as the application workload profile changes.
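As a rough illustration of what "a pattern determined by the array" can mean, the sketch below maps logical block numbers onto disks with plain round-robin striping. This is the simplest possible fixed pattern, not any vendor's actual layout; the function name `locate_block` and its parameters are hypothetical.

```python
def locate_block(logical_block, num_disks, stripe_unit_blocks=1):
    """Return (disk index, block offset on that disk) for round-robin striping.

    stripe_unit_blocks is the number of consecutive logical blocks written
    to one disk before the layout moves on to the next disk.
    """
    if logical_block < 0 or num_disks < 1 or stripe_unit_blocks < 1:
        raise ValueError("invalid striping parameters")
    stripe_unit = logical_block // stripe_unit_blocks  # which stripe unit
    within_unit = logical_block % stripe_unit_blocks   # position inside it
    disk = stripe_unit % num_disks                     # rotate across disks
    row = stripe_unit // num_disks                     # full passes completed
    return disk, row * stripe_unit_blocks + within_unit


if __name__ == "__main__":
    # Eight logical blocks spread over four disks, one block per stripe unit.
    for block in range(8):
        print(block, locate_block(block, num_disks=4))
```

The administrator only ever sees the logical block numbers; which spindle holds a given block is an internal detail the array is free to change.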
Let the array decide
"At NetApp, we have the view that selecting RAID levels for specific applications has become rather anachronistic," says David Dale, industry evangelist at Network Appliance (NetApp) Inc. "The best modern arrays offer automated RAID with architectural mitigation of performance or capacity trade-offs."
The idea of "carving a LUN" and not knowing which physical spindles the data will end up on may be unnerving to some, but it's the wave of the future. Technologies like storage virtualization, thin provisioning, index copying/hardware continuous data protection (CDP) and dynamic volume sizing--just to name a few--mandate the automation of physical disk layout.
New, distributed disk or grid storage technologies are taking on some of RAID's data protection duties. "There are many ways to create redundant copies of data without using RAID," writes John Spiers, founder and CTO at LeftHand Networks. "LeftHand's volume replication can be configured to withstand multiple drive failures, array level failures and complete site failures without losing data, and all without the use of traditional RAID algorithms ... the days of traditional RAID systems may be coming to an end."
Other storage array companies are also heralding the end of RAID as we know it. "While most vendors assign a single tier or RAID level to a volume, Compellent assigns these parameters on a block basis," observes Bob Fine, Compellent's director of product marketing. "Both data classification and data movement are automatically tuned by the array."
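To make the idea of per-block assignment concrete, here is a toy Python sketch that places the most frequently accessed blocks on a fast tier and everything else on a capacity tier. It is an assumption-laden illustration only and does not represent Compellent's algorithm; the function `assign_tiers`, the tier names and the access counts are all hypothetical.

```python
def assign_tiers(access_counts, fast_capacity_blocks):
    """Assign each block to a tier based on how often it was accessed.

    access_counts maps block id -> number of recent accesses.
    The fast_capacity_blocks hottest blocks go to the "fast" tier;
    ties are broken by the lower block id.
    """
    if fast_capacity_blocks < 0:
        raise ValueError("fast_capacity_blocks must not be negative")
    ranked = sorted(access_counts, key=lambda b: (-access_counts[b], b))
    hot = set(ranked[:fast_capacity_blocks])
    return {b: ("fast" if b in hot else "capacity") for b in access_counts}


if __name__ == "__main__":
    # Hypothetical access counts for four blocks of the same volume.
    counts = {0: 50, 1: 2, 2: 900, 3: 7}
    print(assign_tiers(counts, fast_capacity_blocks=2))
```

Rerunning the assignment as access counts change is what lets blocks of a single volume migrate between tiers without the administrator re-laying out the volume.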
For highly specialized application workload profiles, such as CASE and video editing, manual storage configuration might still be worth considering. But for mainstream applications, like Exchange, SQL Server, Oracle and ERP, storage vendors offer appropriate automation solutions.
Balancing requirements
In the final analysis, business decisions need to be made before technical factors can be considered. What, for example, are the requirements of the application and the operating system? How will the data be protected? What are the infrastructure requirements? How much will it cost?
Picking a storage technology is all about balancing parameters. Choosing a RAID level, or a product with data stripe abstraction, is no different. There is a wide variety of options, and no single selection is "best" for all circumstances. As a storage administrator, you want happy "internal customers" with a minimum of work and stress. This may mean giving up some control to the array for disk management.