Storage.com

hierarchical storage management (HSM)

By Robert Sheldon

What is hierarchical storage management?

Hierarchical storage management (HSM) is policy-based management of data files that uses storage media economically and without the user being aware of when files are retrieved from storage. HSM automatically moves data among different storage tiers based on a defined policy. It can be set up as a standalone system, but it is often used in distributed enterprise networks.

HSM consists of two or more types of storage media that make up the storage tiers. At one end is a high-performance tier that is typically more expensive and accessed more frequently than the other tiers. This tier includes storage class memory (SCM), enterprise-grade flash solid-state drives (SSDs) and high-performing hard disk drives (HDDs).

At the other end of the storage tier spectrum are slower, less expensive devices, such as optical disks and tape systems. This tier is used for data archives and cold data. Other tiers fall in between these extremes, based on data requirements, supported workloads and how often users access the data.

How does hierarchical storage management work?

Each tier in the HSM hierarchy represents a different cost and performance pairing. As a file ages and is accessed less often, the system moves it to slower and less expensive form of storage.

A file that has moved to a slower tier can be retrieved and moved back to a higher performing tier if it is needed for more critical workflows. Administrators set data governance policies that manage how data moves among the tiers. Once the policies are set, the HSM software manages the data itself.

HSM helps organizations make more efficient use of storage devices and lowers storage costs. It is especially useful in large-scale environments that support massive data sets. Older files can be moved to less expensive storage yet still appear to be immediately accessible. When a user tries to access the files, the HSM software restores them automatically and transparently from the lower data tier to a higher performing tier.

HSM software often uses stub files -- abbreviated representations of the original files -- to point to the real location of the files in secondary storage. Some HSM software enables administrators to set high and low thresholds for disk capacity. With this approach, the software determines when to move older or less-frequently accessed files to another storage medium. In addition, administrators can usually exclude certain file types, such as executable files, from being automatically moved.

How does storage tiering work?

Tiered storage divides data based on its business value and how often users and applications access it. Data is assigned to a specific storage tier based on these factors. More valuable, frequently used and mission-critical data is assigned to faster, more expensive storage flash SSDs and SCM devices. Less important and less frequently used data goes to optical disks and HDDs. Older, seldom accessed data is archived on tape drives or in the cloud.

The exact configuration of a tiered system depends on the storage needs of an organization and how it uses and classifies its data. These systems generally have between two and five tiers. Possible tiers include the following:

Benefits of HSM

HSM systems provide a range of advantages, such as the following:

HSM products

HSM software is available as standalone products that can be used with specific hardware systems. It also comes as part of software platforms that address other storage, data lifecycle management and data security needs.

The following products are examples of how HSM is used.

Learn 12 ways to manage your storage infrastructure and get it under control.

01 Feb 2022

All Rights Reserved, Copyright 2000 - 2024, TechTarget | Read our Privacy Statement