Definition

cloud archive

By Dave Raffo

What is a cloud archive?

A cloud archive is storage as a service for long-term data retention. The archive holds data that is infrequently accessed and may be optimized for security and compliance with data regulation policies.

Archiving was among the first popular use cases for cloud storage for several reasons:

Storing archived data in the cloud can be cost-effective when compared with storing and maintaining large amounts of nonessential data in-house.
Using the cloud alleviates the need for buying and upgrading on-premises disk or tape hardware systems and archiving software to manage and store nonprimary data.
Archived data rarely needs to be retrieved from the cloud, so the time and expense of bringing it back down are seldom incurred.

Cloud archiving services

Cloud archiving is often done completely in a public cloud, although there are hybrid setups where data that may require faster access is stored on premises with only rarely accessed cold data moved off-site.
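A hybrid setup of this kind needs some rule for deciding which data counts as cold. The following is a minimal sketch of one possible rule, based only on last-access time. The `FileRecord` type, the `split_by_temperature` function and the 180-day threshold are hypothetical choices made for illustration; they are not part of any vendor's product.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List, Tuple


@dataclass
class FileRecord:
    """Hypothetical catalog entry for one file held on premises."""
    path: str
    last_accessed: datetime


def split_by_temperature(
    records: List[FileRecord],
    now: datetime,
    cold_after_days: int = 180,  # assumed threshold, tune per organization
) -> Tuple[List[FileRecord], List[FileRecord]]:
    """Return (warm, cold): warm stays on premises, cold is a candidate
    for the cloud archive."""
    cutoff = now - timedelta(days=cold_after_days)
    warm = [r for r in records if r.last_accessed >= cutoff]
    cold = [r for r in records if r.last_accessed < cutoff]
    return warm, cold


# Example usage with made-up file names and dates.
now = datetime(2023, 1, 31)
catalog = [
    FileRecord("reports/q4-2022.pdf", datetime(2023, 1, 10)),
    FileRecord("projects/2019-audit.zip", datetime(2020, 3, 2)),
]
warm, cold = split_by_temperature(catalog, now)
print("keep on premises:", [r.path for r in warm])
print("move to cloud archive:", [r.path for r in cold])
```

A real policy would usually weigh more than access time, such as retention requirements or file size, but the same split applies.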

Public clouds require no special on-premises hardware or software. An organization can reduce its data center footprint and use less power and cooling by storing data in the cloud. Public cloud archive attributes include elasticity, abstraction, durability and low cost.

Popular public cloud archiving services for cold data -- such as Amazon S3 Glacier, the archive tier of Microsoft Azure Blob Storage and Google Cloud Archive Storage -- store data for less than a penny per gigabyte per month. However, some low-cost services take hours to restore data. There may also be extra costs to retrieve data and transfer it back out of the cloud.
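The trade-off between cheap storage and costly retrieval can be estimated with simple arithmetic. The sketch below assumes per-gigabyte pricing for storage, retrieval and outbound transfer. All three prices are hypothetical placeholders, not any provider's published rates, and the function names are illustrative.

```python
# Hypothetical prices in U.S. dollars; substitute a provider's current rates.
STORAGE_PRICE_PER_GB_MONTH = 0.004  # assumed archive-tier storage price
RETRIEVAL_PRICE_PER_GB = 0.01       # assumed fee to restore archived data
EGRESS_PRICE_PER_GB = 0.09          # assumed fee to transfer data out


def monthly_storage_cost(size_gb: float, price_per_gb_month: float) -> float:
    """Cost of keeping size_gb in the archive for one month."""
    return size_gb * price_per_gb_month


def retrieval_cost(
    size_gb: float, retrieval_price_per_gb: float, egress_price_per_gb: float
) -> float:
    """One-time cost of restoring size_gb and moving it out of the cloud."""
    return size_gb * (retrieval_price_per_gb + egress_price_per_gb)


# Example: a 50 TB archive from which 1 TB is recalled once.
archive_gb = 50_000
recalled_gb = 1_000

storage = monthly_storage_cost(archive_gb, STORAGE_PRICE_PER_GB_MONTH)
recall = retrieval_cost(recalled_gb, RETRIEVAL_PRICE_PER_GB, EGRESS_PRICE_PER_GB)

print(f"Monthly storage for {archive_gb} GB: ${storage:,.2f}")
print(f"One-time recall of {recalled_gb} GB: ${recall:,.2f}")
```

Under these assumed prices, recalling a small fraction of the archive costs a noticeable share of a month's storage bill, which is why retrieval frequency matters when comparing services.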

Cloud archiving vendors

Cloud gateways are frequently used to help move data into the cloud in the right format. These gateways are sold by vendors, including AWS, Ctera, Microsoft, Nasuni, NetApp and Panzura.

When recovering data, the cloud may need to support the application used to create the data. Retrieval times also vary by service. Amazon S3 Glacier, for example, may require more than three hours to restore data, whereas Google Cloud Archive Storage has a response time of seconds.

There are specialized enterprise cloud archiving vendors that usually focus on vertical markets. For example, Proofpoint has an archiving service for the financial industry for email, documents, instant messages, social media and other forms of electronic communication. Mimecast archives email and files for industries such as healthcare, legal and manufacturing.

When looking for a cloud archiving provider, organizations should consider the provider's service-level agreement for data recovery, what tools are available to find data when it is needed, whether the cloud has a self-service portal, whether the cloud meets all of the organization's compliance requirements and whether the application that stores the data is supported.

Cloud archiving security

As with any other archiving product or application, a cloud archive service must provide secure storage optimized for long-term data retention that complies with data regulation policies. For security in flight, data moving in and out of the cloud is protected with HTTPS. Most providers can also encrypt data stored in their clouds. Customers can supply their own encryption keys for an extra layer of security or encrypt data before sending it to the cloud.

An archive in the cloud must be easily searchable, protected from tampering or overwriting, and enable easy access to specific data when it is required for a compliance audit or e-discovery.
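One common way to detect tampering or accidental overwriting is to record a checksum for each object at archive time and compare it again later. The sketch below illustrates that idea with SHA-256 from Python's standard library. The `build_manifest` and `find_altered` helpers and the in-memory dictionary standing in for the archive are assumptions for illustration; a cloud service's own immutability or object-lock features would work differently.

```python
import hashlib
from typing import Dict, List


def sha256_of(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hex string."""
    return hashlib.sha256(data).hexdigest()


def build_manifest(objects: Dict[str, bytes]) -> Dict[str, str]:
    """Record a checksum for every object at the time it is archived."""
    return {name: sha256_of(content) for name, content in objects.items()}


def find_altered(objects: Dict[str, bytes], manifest: Dict[str, str]) -> List[str]:
    """List objects that are missing or whose content no longer matches."""
    altered = []
    for name, expected in manifest.items():
        content = objects.get(name)
        if content is None or sha256_of(content) != expected:
            altered.append(name)
    return sorted(altered)


# Example: an in-memory dict stands in for the archive (hypothetical data).
archive = {
    "email/2021-03.mbox": b"original mailbox contents",
    "contracts/acme.pdf": b"signed contract",
}
manifest = build_manifest(archive)

# Simulate an unauthorized overwrite, then run the audit.
archive["contracts/acme.pdf"] = b"modified contract"
print("altered or missing:", find_altered(archive, manifest))
```

The manifest itself has to be stored somewhere the archive's writers cannot modify, otherwise a tampered object and a tampered checksum would still match.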

Cloud vs. tape

The cloud is an alternative to on-premises tape, which is frequently used to archive data for long-term retention. To replace tape, a cloud archive must match tape's low cost, longevity, scalability and security.

Tape has the advantage of portability; it can be shipped across locations without the need to rewrite data. Cloud's advantages are geographical redundancy that mitigates the risk of data loss from hardware failures, advanced search capabilities and the elimination of costs from technology refreshes.

Cloud archive vs. cloud backup

A cloud archive should not be confused with a cloud backup. Just as on-premises archive and backup differ, cloud archive and cloud backup are not the same thing.

Backup involves copying data at regularly scheduled intervals, often capturing only the data that has changed since the previous backup. Cloud archiving moves data off-site once, and that data will not be changed after it goes to the cloud. Archiving is usually done to free up storage space for more frequently accessed data.

This was last updated in January 2023
