Ezine

This article can also be found in the Premium Editorial Download "Storage magazine: Choosing the best disaster recovery planning tool."

Download it now to read this article plus other related content.

Email often starts the process

Most archives start small, and many begin with one type of data like email (see "What to archive: Different data types," below). IT decides it needs to control growth, so they go looking for a system that can stub out attachments or move data out of the Exchange server. Then the legal department demands a complete set of email messages for a litigation-related search. Later, records management needs to retain certain messages for compliance with Sarbanes-Oxley or other industry regulations.


Requires Free Membership to View

What to archive: Different data types
Although most business information is digital today, not all systems manage information equally. Manageability requires organization and structure, the ability to search for information and meta data to categorize content. We use these elements to classify data as structured, semi-structured or unstructured.

Structured applications are inherently organized, although the identification, description and relation between data can be highly customized. In the enterprise, structured databases are often core applications with specialized administrators managing the data and archiving.

Systems like email have some structure, but they weren't developed with information management in mind, and it shows. What structure they have is functional, designed to serve specific application needs rather than the higher goal of manageability.

Finally, there's the class of unstructured data that's so familiar in file systems. Although some basic systems are used to organize and describe these files, they can't be called truly structured as they lack information about their functional or organizational relationships.

Structured

Semi-structured

Unstructured

Example

Relational database

Email, medical images, document repositories

File servers, multimedia files

Organization

Highly organized with intrinsic interrelation

Moderately organized by meta data

Basic location and name "tree" organization

Meta data

Complex custom attributes

Some intrinsic attributes

Basic attributes and rarely used extended abilities

Search

Built around query language

Keyword search and categorization by meta data

Third-party tools for search


This was first published in January 2009

There are Comments. Add yours.

 
TIP: Want to include a code block in your comment? Use <pre> or <code> tags around the desired text. Ex: <code>insert code</code>

REGISTER or login:

Forgot Password?
By submitting you agree to receive email from TechTarget and its partners. If you reside outside of the United States, you consent to having your personal data transferred to and processed in the United States. Privacy
Sort by: OldestNewest

Forgot Password?

No problem! Submit your e-mail address below. We'll send you an email containing your password.

Your password has been sent to: