The case for high-end arrays


This article can also be found in the Premium Editorial Download "Storage magazine: Is it time for SAN/NAS convergence?."

Download it now to read this article plus other related content.

All performance isn't the same

Requires Free Membership to View

Even though the feature of performance may be thought of as a commodity in storage arrays, Hitachi Data Systems' (HDS) Claus Mikkelsen, senior director of storage applications, sees a point of differentiation. He says that while pure raw performance is about equivalent in midrange and high-end storage arrays, providers of the traditional high-end monolithic storage arrays distinguish themselves in terms of how they handle and balance the same workload.

HDS achieves this handling and balancing of work loads on their 9900 series Lightning boxes through the use of its CruiseControl software. It continuously monitors its 9900 series arrays and can dynamically move data between disks while the data is being accessed. While it monitors and analyzes performance, it can either automatically make tuning adjustments or generate reports with tuning recommendations for the administrators to review. Furthermore, it achieves this without the deployment of agents on servers using the 9900 series storage on the backend.

EMC uses a similar tool in their environment called Workload Analyzer and sets itself apart from its competitors in two ways. While this tool collects, graphs, analyzes and archives performance data on EMC supplied storage, it works not only with all of EMC's high-end Symmetrix storage arrays, but also with their midtier Clariion offering. However, getting this functionality requires the deployment of agents on servers attached to their storage arrays.

IBM offers its ESS Expert, which also enables administrators to monitor and report on the performance statistics of their Enterprise Storage Servers (ESS) Sharks anywhere in the enterprise storing the collected information in a relational database. However, this tool currently lacks the ability to make any necessary changes dynamically and requires the administrators to make decisions about volume placement and data movement based on the information the ESS Expert gathers.

Performance always shows up as a check mark when buying a high-end storage array. Typically, high performance is the result of internal architecture and optimal disk drive sizes and speeds. And while you should certainly use these as guides in defining what comprises a high-end storage array in your environment, your mileage will depend on the application for which the storage array is being used.

For instance, applications that are highly sequential in nature--such as mainframe batch jobs--better exploit storage arrays that contain large amounts of cache with caching algorithms that optimize prefetching data. Storage arrays with large cache configurations can partially overcome drives with slower speeds and larger sizes because much of the data can be loaded into cache prior to the application even requesting it.

High-end storage arrays such as EMC's DMX 3000, IBM's 2105 Model 800 (Shark) and HDS' 9980 V (Lightning) that follow the traditional monolithic model make the most sense for these applications. They minimize the I/O requests to disk the storage array has to make due to the large amounts of cache they support and the caching algorithms they use. Each of these storage arrays support at least 64GB of cache; the DMX 2000 and 3000 models each support up to 128GB of cache.

Conversely, many of today's random access, read-intensive relational database applications negate some of the benefits of a large amount of cache in a high-end storage array. Because of the random nature of the queries and the fact that the caching algorithms can't easily predict the appropriate data to load into memory, I/O requests must bypass memory and read data directly from the disk.

Here's where a midrange storage array's internal architecture and disk drives may equal or even outperform a traditional architecture. Fremont, CA-based 3PAR's InServ S800 storage array uses a backplane with mesh architecture that it says supercedes both internal bus and switch architectures with up to 28GB/s of internal bandwidth. The InServ S800's controller nodes also separate the processing of controller commands from the data movement thereby removing another possible performance bottleneck that may exist in today's systems. These two features combine with FC disk drives with faster rotational speeds to potentially offer equal or better performance for today's applications than their monolithic counterparts.

Arrays following the traditional monolithic models shouldn't be disregarded for these sorts of applications, but they should no longer be thought of as the only option.

Availability and reliability
Availability and reliability are not just must-haves on high-end storage arrays--they are assumed to be there. Chuck Hollis, EMC's vice president of storage platforms marketing, points out that today's mission-critical environments are "always on," so everything from routine configuration changes to maintenance code upgrades must be nondisruptive. With the high cost of downtime in these environments, the cost of high-end storage arrays is more than justified by the savings that they generate by avoiding the possibility of any outages, planned or unplanned.

3PAR's president and CEO, David Scott, contends that there's currently very little difference between storage arrays classified as midrange and high-end in the areas of reliability and availability. Both of these classes of storage arrays generally use the same highly available and reliable components purchased from essentially the same set of underlying hardware suppliers. The differences that do exist in availability and reliability on the different storage arrays frequently depend on how each storage array vendor's puts that hardware together in their array, how well they test it in their labs and how their proprietary software works with it.

One factor that does influence the availability and reliability of storage arrays is the RAID configuration of the disk drives within the storage array. The two most common deployments that preserve information if one of the disk drives fails are RAID 1 (mirroring the data on two disk drives) and RAID 5 (striping the information across five disk drives).

All high-end storage vendors offer at least one--if not both--of these configurations. Some such as HDS' 9980 V and IBM's 2105 Model 800 now offer users the ability to mix and match various RAID configurations within their storage arrays to meet the needs of specific applications. In addition, some arrays also offer advanced RAID functions such as RAID 10 that offer spare drives that will immediately replace any disk drive, should it fail as part of the primary RAID configuration.

But one factor in evaluating availability that rarely gets the attention it deserves is how code updates are applied on the storage arrays themselves, a task you will probably confront once or twice a year. Code upgrades can be needed for any number of reasons, from routine maintenance to fixing a known issue to gaining some additional functionality. Keep in mind that all code upgrades aren't created equal.

In a poorly laid out or misunderstood environment, they can be disruptive and will likely require outages lasting for several hours or longer on hosts connected to a single port on a storage array. This minor task can wreak havoc, especially where multiple hosts with different service level agreements and maintenance windows connect into the same high-end storage array. Extensive forethought and planning is required to assess existing storage network connections and ascertain if specific system outages will occur when the storage array code upgrade takes place.

This was first published in September 2003

There are Comments. Add yours.

TIP: Want to include a code block in your comment? Use <pre> or <code> tags around the desired text. Ex: <code>insert code</code>

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy
Sort by: OldestNewest

Forgot Password?

No problem! Submit your e-mail address below. We'll send you an email containing your password.

Your password has been sent to: