Nearline storage is the on-site storage of data on removable media. The removable storage concept dates back to the IBM mainframe computer and remains a popular option for individuals, small businesses and large enterprises. The term nearline is a combination of the words near and online.
Comparing nearline storage, always-on storage and archival storage
Data housed on nearline storage does not require the high availability or redundancy of primary storage. As such, it straddles online storage and long-term archiving. Although it is accessed only frequently, the data needs to be available to users on demand. Nearline storage relies primarily on inexpensive disk storage to accomplish this.
Online storage provides rapid access to frequently used data, often to a larger number of users simultaneously. Archival storage retains cold backup -- sometimes called offline backup -- data on disk-based appliances or cloud storage.
By definition, online storage refers to electromechanical magnetic disks that need to remain continuously available to support ongoing business operations, while access to an archived backup copy typically requires some human intervention.
Examples of removable nearline storage media
As the term implies, nearline storage is a midpoint between fast storage and archiving. The data in nearline storage is kept on a secondary or tertiary tier. In that respect, nearline storage works in a similar manner as an active archive. Nearline drives are not attached to a computer, but may be made available quickly to handle I/O tasks, without direct intervention of IT staff.
High-capacity nearline HDDs have emerged as a staple for bulk storage in enterprise data centers. These are Serial Advanced Technology Attachment (SATA) drives that work with storage devices equipped for the serial-attached SCSI (SAS) protocol. Nearline SAS drives also are widely used for high-performance secondary storage.
Several hundred or more nearline SAS drives grouped together is known as a massive array of idle disks (MAID). This configuration offers the potential to reduce the cost per terabyte of storage to a level on par with magnetic tape.
In a MAID, the only drives that rotate are those performing reads or writes. For the most part, the drives in a MAID remain idle, which helps extend their overall life and helps to reduce power consumption. A drive will spin into action only when an application calls for data to be read or written.
A nearline storage system can request data from a specific cartridge in a physical tape library. A robotic autoloader -- also known as a stackloader -- handles the request and automatically loads the correct cartridge in a tape drive, either sequentially or in another specified order. The time between requesting the data and the insertion of the cartridge takes a few seconds.
Although it has given way to other backup storage media, many organizations continue to use tape for archiving due to its high capacities, relatively low cost and durability. Tape performs read and write operations faster than disk, although spooling is required to enable reads and writes in a sequential format.
Cloud nearline storage
The emergence of cloud platforms provides an additional option for parking data off site for rapid retrieval when needed. Many customers choose to buy some, if not all, of their storage as a service (SaaS) from cloud services providers. Using a SaaS model, a company leases space as a tenant on a service provider's physical IT infrastructure. This can help reduce costs associated with managing backups.
Google Cloud Storage Nearline is one of four services in the public Google Cloud Platform, intended for archiving, backup and disaster recovery. The other classes are Google Cloud Multi-Regional Storage, Google Cloud Regional Storage and Google Cloud Storage Coldline.
Google Cloud Storage was launched to compete with market leader Amazon Web Services' Amazon Glacier cloud storage, which is designed for data that is retrieved in three to five hours. Amazon Glacier is less expensive than Amazon Simple Storage Service, which retrieves data in real time.
Nearline storage media, when on the shelf, are immune to infection by online viruses, Trojan horses and worms because the media are physically disconnected from networks, computers, servers and the internet.