Is HSM ready for open-systems storage?

This article can also be found in the Premium Editorial Download "Storage magazine: Distance: the new mantra for disaster recovery."

HSM in today's environment
So, is there a place for hierarchical storage management (HSM)? The answer is a qualified yes. Some reasons to consider HSM today are:

  • To reduce costs of storage management
  • To improve backup/restore performance
  • To improve management of large e-mail repositories and other databases

It would be difficult to build an effective business case for HSM on disk drive costs alone, so look instead at the overall cost of managing storage. In a multitiered storage environment, HSM can help manage the rate of storage growth at each tier automatically, keeping each tier closer to steady-state operation. In some environments, it's feasible to hold storage utilization at a consistent target of 80% or more with HSM.
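To make the steady-state idea concrete, here is a minimal Python sketch of threshold-driven migration. Only the 80% target comes from the text. The `FileInfo` type, the `select_for_migration` function and the "least recently accessed first" ordering are illustrative assumptions, not the algorithm of any particular HSM product.

```python
from dataclasses import dataclass


@dataclass
class FileInfo:
    path: str
    size: int           # bytes
    last_access: float  # seconds since epoch


def select_for_migration(files, capacity, target_utilization=0.80):
    """Pick the coldest files until used space falls to the target.

    Hypothetical policy for illustration; real products apply richer rules.
    """
    used = sum(f.size for f in files)
    target_bytes = capacity * target_utilization
    selected = []
    # Oldest access time first, so the coldest data leaves the tier first.
    for f in sorted(files, key=lambda f: f.last_access):
        if used <= target_bytes:
            break
        selected.append(f)
        used -= f.size
    return selected


files = [
    FileInfo("/data/a.log", 40, last_access=100.0),
    FileInfo("/data/b.db", 30, last_access=300.0),
    FileInfo("/data/c.doc", 25, last_access=200.0),
]
# The tier holds 100 bytes and 95 are in use, so utilization is above 80%.
chosen = select_for_migration(files, capacity=100)
print([f.path for f in chosen])
```

In this toy case, migrating the single coldest file is enough to bring the tier back under its target. A production policy would probably also weigh file size, ownership and data classification.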

When disk wasn't cheap
With today's steeply falling prices, it's easy to forget that in the early days of computing, resources such as memory and disk were extremely expensive. To maximize the utilization of these valuable and often limited resources, engineers employed some creative techniques.

For instance, virtual memory (VM) was one major development that greatly enhanced the capabilities of computer systems by maximizing utilization of core memory. Unused pages in memory were migrated to slower magnetic disk storage to make room for other pages. If and when the unused pages were needed again, they were recalled into memory and other pages were migrated out. The algorithms developed to manage virtual memory have become extremely efficient and reliable, and VM has become a standard function of virtually (no pun intended) every modern operating system.

Because this concept worked so well with memory, it stood to reason that it should also work with disk. The analogy was essentially the same: Magnetic disk was a faster, more expensive medium than magnetic tape. Wouldn't it make sense to migrate infrequently accessed data from the more expensive medium to the cheaper one? As you might suspect, the answer to this question was yes, and HSM was born. Just as the first VM systems ran on mainframes, so did the first HSM systems. With HSM, mainframe operators were able to maintain consistently high utilization of their valuable direct-access storage devices (DASD) as data sets were migrated to and from tape, as needed.

Compared with mainframe environments, early Unix and Windows platforms had relatively primitive memory management capabilities, but over time, they adopted some of the more advanced mainframe techniques and developed some of their own, as well. HSM, however, didn't follow the same path. Although a number of software vendors have introduced HSM products for open systems, by and large, they haven't been widely embraced.

HSM also dramatically improves backup operations. In traditional backup environments, full backups are performed on a regular basis. Studies have shown that a large percentage of files on file servers are rarely accessed after a few months, yet these files continue to impact the time required to perform backups and the amount of media consumed.
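Rather than rely on general studies, you can estimate how much rarely accessed data sits on your own file servers. The following Python sketch walks a directory tree and reports the fraction of bytes not accessed within a given number of days. The function name `cold_fraction` and the 90-day cutoff are assumptions chosen for illustration, and the result is only as good as the file system's access-time tracking, which some systems disable or relax for performance.

```python
import os
import tempfile
import time

DAY = 86_400  # seconds


def cold_fraction(root, idle_days, now=None):
    """Return the fraction of bytes under root not accessed for idle_days."""
    now = time.time() if now is None else now
    cutoff = now - idle_days * DAY
    total = cold = 0
    for dirpath, _dirs, names in os.walk(root):
        for name in names:
            try:
                st = os.stat(os.path.join(dirpath, name))
            except OSError:
                continue  # file vanished or is unreadable; skip it
            total += st.st_size
            if st.st_atime < cutoff:
                cold += st.st_size
    return cold / total if total else 0.0


# Demo on a throwaway directory with one "old" and one "recent" file.
with tempfile.TemporaryDirectory() as root:
    now = time.time()
    old_path = os.path.join(root, "old.dat")
    new_path = os.path.join(root, "new.dat")
    with open(old_path, "wb") as fh:
        fh.write(b"\0" * 300)
    with open(new_path, "wb") as fh:
        fh.write(b"\0" * 100)
    # Backdate one file's access time by 200 days for the demo.
    os.utime(old_path, (now - 200 * DAY, now))
    os.utime(new_path, (now, now))
    fraction = cold_fraction(root, idle_days=90, now=now)

print(f"{fraction:.0%} of bytes idle for 90+ days")
```

Run against a real share, a number like this gives a rough upper bound on how much data a full backup could stop carrying once those files are migrated.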

With HSM, these files would be migrated to tape with stubs (or fingerprints) left on primary storage, greatly reducing the size of the primary data stores. They would no longer be constantly backed up, thereby improving backup and recovery times and reducing tape consumption.
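The stub mechanism can be sketched in a few lines of Python. This is a deliberately simplified model under stated assumptions: a second directory stands in for tape, the stub is a small JSON file with a hypothetical `.stub` suffix, and recall is an explicit function call. Real HSM products typically hook into the file system so that opening a stub triggers the recall transparently.

```python
import json
import os
import shutil
import tempfile

STUB_SUFFIX = ".stub"  # hypothetical naming convention for this sketch


def migrate(path, archive_dir):
    """Move a file to the archive tier and leave a small stub behind."""
    archived = os.path.join(archive_dir, os.path.basename(path))
    size = os.path.getsize(path)
    shutil.move(path, archived)
    stub = path + STUB_SUFFIX
    with open(stub, "w") as fh:
        json.dump({"archived_at": archived, "size": size}, fh)
    return stub


def recall(stub):
    """Bring the archived file back to primary storage and remove the stub."""
    with open(stub) as fh:
        info = json.load(fh)
    original = stub[: -len(STUB_SUFFIX)]
    shutil.move(info["archived_at"], original)
    os.remove(stub)
    return original


with tempfile.TemporaryDirectory() as root:
    primary = os.path.join(root, "primary")
    archive = os.path.join(root, "archive")  # stands in for tape
    os.makedirs(primary)
    os.makedirs(archive)

    path = os.path.join(primary, "report.txt")
    with open(path, "w") as fh:
        fh.write("x" * 10_000)

    stub = migrate(path, archive)
    stub_is_smaller = os.path.getsize(stub) < 10_000
    restored = recall(stub)
    restored_size = os.path.getsize(restored)
```

After `migrate`, only the stub remains on primary storage, so a backup of that tier copies a small pointer file instead of the full 10,000 bytes.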

Similarly, a problem plaguing many environments today is the growth of e-mail and databases. Several vendors offer HSM-related products specifically designed for use with applications such as Microsoft Exchange or Oracle that enable the migration of old messages, attachments and infrequently accessed records to other media (see "Application-focused HSM/HSM-related products"). The promised result is a reduction in the size of the primary repositories.

Additional considerations
Integrating an HSM solution into a storage management framework shouldn't be approached without evaluating the impact on the rest of the organization. You should consider four main questions:

How well do you know your data? There needs to be a solid understanding of the data being managed in order to establish appropriate policies that correctly align with the value of the data at risk. Simply determining that a file hasn't been accessed for a certain period of time isn't sufficient to make it a candidate for migration. A clearly defined data classification methodology with broad support within the organization is one requirement for a successful HSM implementation.
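As a rough illustration of why access age alone isn't enough, the Python sketch below combines idle time with a data classification label before a file becomes a migration candidate. The class names, idle thresholds and the `POLICY` table are all hypothetical; an actual policy would come out of your organization's own classification work.

```python
from dataclasses import dataclass

DAY = 86_400  # seconds


@dataclass
class Record:
    path: str
    data_class: str     # label from a hypothetical classification scheme
    last_access: float  # seconds since epoch


# Hypothetical policy: minimum idle days before migration, per class.
# None means the class is never migrated.
POLICY = {
    "scratch": 30,
    "project": 180,
    "regulatory": None,
}


def is_candidate(rec, now, policy=POLICY):
    """True only if the file's class permits migration and it is idle enough."""
    min_idle_days = policy.get(rec.data_class)
    if min_idle_days is None:
        return False  # unclassified or pinned data stays on primary storage
    return (now - rec.last_access) >= min_idle_days * DAY


now = 400 * DAY
records = [
    Record("/tmp/build.o", "scratch", now - 45 * DAY),
    Record("/proj/spec.doc", "project", now - 45 * DAY),
    Record("/legal/audit.pdf", "regulatory", now - 365 * DAY),
]
candidates = [r.path for r in records if is_candidate(r, now)]
print(candidates)
```

Note that two files with the same idle time are treated differently, and the file that has gone untouched longest is never migrated, because its class forbids it. Defaulting unclassified data to "stay put" is a conservative choice made for this sketch.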

What's the impact on users and applications? The impact of delays in accessing data needs to be understood before deploying HSM. Are delays acceptable? Can they be mitigated with near-line storage? Can your applications handle them appropriately?

How does HSM impact backup and other storage operations? It's important to understand what operational changes will be required to accommodate HSM. Also, where will HSM software or agents need to be deployed, and what's the impact?

How can I back out the HSM solution from the environment, if necessary? If HSM turns out not to be the right solution, or there's a desire to change vendors, it's important to understand how much effort it would take to return the environment to its non-HSM state and what the impact of doing so would be.

With the evolution of storage networks, low-cost disk, and enhanced software offerings, HSM is worth another look. Application-focused HSM solutions, in particular, have the potential to provide some unique benefits. The success of these solutions depends on a clear understanding of requirements, benefits and risks.

This was first published in May 2003
