Home > Storage Technology News > Tech Closeup: Content-addressed storage -- Strengths and weaknesses
Storage Technology News:
EMAIL THIS

Tech Closeup: Content-addressed storage -- Strengths and weaknesses

By Stephen J. Bigelow, Features Writer
12 Jan 2006 | searchStorage.com

Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   

Content-addressable storage: Strengths and weaknesses

Content-addressed storage (CAS) can potentially offer significant benefits to the enterprise. On the surface, costs are lowered because less expensive SATA or SAS hard disks are used rather than Fibre Channel drives. Features like data reduction (a.k.a. "commonality factoring") also lower the total amount of data being stored. This means fewer hard disks to purchase or replace. Less data translates to faster backup/replication processes and can be particularly important with bandwidth-intensive WAN data transfers. The move to less-expensive (though lower-performing) disks allows CAS data to reside near-line instead of offline to tape. This offers a superior user experience and provides a far better service level than tape backup or archival systems.

CAS systems are touted as offering improved manageability and scalability. Management is enhanced by enforcing corporate compliance and retention policies -- a notable step toward risk mitigation. "CAS should be easier to scale and manage," says Tony Asaro, senior analyst at Enterprise Strategy Group. "The idea is to have a system that is as easy to manage with 100 files as it is with 100 million files."

Other analysts highlight the value of enriched metadata. "By being more tightly integrated with the application, more metadata can be stored -- more 'information about the information'," says Greg Schulz, founder and senior analyst, StorageIO. "There's much more awareness of the content." Schulz explains that enriched metadata content supports more comprehensive content-based searches and enables powerful business capabilities such as data classification [see the searchStorage.com Tech Closeup on Data Classification].

Appropriate for any size of organization

Although it's easy to see how a large storage infrastructure can apply CAS technology, analysts are clear that the technology is appropriate for any size organization -- as long as there is enough data to justify the cost of acquisition and deployment. Today, CAS is being deployed by companies that span all sizes and industries. "ESG has seen the financial sector embrace CAS," Asaro says. "The healthcare industry is another big market. Any publicly traded company that is concerned about SOX compliance should consider CAS."

Beyond the clear market segments like healthcare or legal, there are also creative uses for CAS technology. Schulz points to e-mail as one powerful use -- saving space up front by moving attachments to directly to CAS. "Open those mailboxes up and enable a GB, but the active data is only going to be on fast disk. Everything else, whether it is archived or inactive, is moved off onto object-based storage," he says. "In other words, it's tiered storage for e-mail."

Sorting data to CAS

Corporate information does not transition to a CAS system automatically. Storage administrators must make conscious decisions about which data is moved, and configure a CAS system accordingly. Such decisions about corporate data fit ideally as part of a data classification initiative, though a formal data classification process may not be necessary. "Yes, it [CAS] is impacted by data classification and regulatory requirements," says Jim Damoulakis, chief technical officer at GlassHouse Technologies Inc. "A lot of it is just dictated by retention requirements for certain kinds of data."

Other analysts take a somewhat broader view, noting the importance of holistic business consideration -- looking at each business function and determining what business interests are best served by moving particular data types to a CAS system. "How are those applications being used, and how is the data from those applications used?" Schulz says. "It's taking a little broader view of data and getting into 'information classification'."

Understanding CAS limitations

While CAS offers a number of benefits, it also presents several notable limitations which should be considered carefully. The first issue is performance. CAS was clearly designed around secondary (tier two) storage. A CAS system is much faster than tape, but CAS is not suitable for high-performance storage tasks. "Given the implementations today, it's not well suited for transaction-type processing applications, update-intensive or heavy read/write activity," Schulz says.

Schulz suggests the only time that CAS might be considered as a primary storage platform is when your primary application involves the collection and preservation (archiving) of data. However, CAS vendors like Bycast Inc. are challenging the common Tier two status, and are pushing CAS toward primary storage (tier one) using products like Bycast StorageGRID. StorageTek's IntelliStor system is another example where archival data can be migrated and stored across multiple tiers.

Another area to guard against is "hash collisions". Ideally, a hashing algorithm produces a unique identification value for every portion of data processed onto the CAS system. When another portion of data produces the same hashing result, the CAS system knows that data is already on the system, and that duplicated data is not resaved -- this is the cornerstone of data reduction (a.k.a. commonality factoring) technology. However, analysts note rare cases where a single hashing algorithm has returned the same result for two or more different data elements. When these hash collisions occur, new data can be lost because the CAS system already thinks that the data exists. Vendors like EMC claim to have resolved the problem of hash collisions by integrating multiple hash schemes into the same algorithm.

CAS is often implemented to address regulatory requirements, but SNIA warns against CAS products that are sold specifically to meet government regulations. Regulatory requirements involve more than just products. Best practices and other protection schemes are necessary to meet regulatory requirements adequately. "The good vendors are not making those claims," says Mark Carlson, chair of the SNIA Technical Council. He emphasizes the need to understand the practices and procedures necessary to meet regulatory compliance first, then evaluate CAS products based on that comprehensive understanding. "Once you get down to the actual requirements for a storage box, look at it from that point of view."

Go to the next part of this article: Content-addressed storage: The vendors

Or skip to the section of interest:

  • Introduction
  • Content-addressed storage: An overview
  • Content-addressed storage: Strengths and weaknesses
  • Content-addressed storage: The vendors
  • Content-addressed storage: User perspectives
  • Content-addressed storage: Future directions


    Tags: Data management toolsVIEW ALL TAGS

    Digg This!    StumbleUpon Toolbar StumbleUpon    Bookmark with Delicious Del.icio.us   



    RELATED CONTENT
    Data management tools
    Green storage essentials: Addressing power, cooling and space issues
    Performance metrics: Evaluating your data storage efficiency
    Tools and techniques for reducing your enterprise data storage footprint
    School district maintains uptime with Xiotech, DataCore
    Tools for using your enterprise data storage resources more efficiently
    Enterprise data storage technologies rise from the dead
    SAN sales boosted by need for storage efficiency
    Thin provisioning brings utilization and capacity benefits to data storage, but with a caveat
    Improving storage utilization with thin provisioning
    Managing capacity planning with thin provisioning
    Data management tools Research

    RELATED GLOSSARY TERMS
    Terms from Whatis.com − the technology online dictionary
    application-aware storage  (SearchStorage.com)
    capacity optimization  (SearchStorage.com)
    compression artifact  (SearchStorage.com)
    data classification  (SearchDataManagement.com)
    data deduplication  (SearchStorage.com)
    depository  (SearchStorage.com)
    storage consolidation  (SearchStorage.com)
    storage provisioning  (SearchStorage.com)
    storage resource management (SRM)  (SearchStorage.com)
    wide-area file services  (SearchStorage.com)

    RELATED RESOURCES
    2020software.com, trial software downloads for accounting software, ERP software, CRM software and business software systems
    Search Bitpipe.com for the latest white papers and business webcasts
    Whatis.com, the online computer dictionary



  • Backup Solution Directory
    TechTarget Storage Media
    Storage Magazine View this month\\'s issue and subscribe today.
    Storage Decisions Apply online for free conference admission.
    SearchStorage.com
    HomeNewsMagazineTopicsLearningMultimediaWhite PapersBlogsEventsAbout Us

    About Us  |  Contact Us  |  For Advertisers  |  For Business Partners  |  Site Index  |  RSS
    TechTarget provides technology professionals with the information they need to perform their jobs - from developing strategy, to making cost-effective purchase decisions and managing their organizations' technology projects - with its network of technology-specific websites, events and online magazines.

    TechTarget Corporate Web Site  |  Media Kits  |  Site Map




    All Rights Reserved, Copyright 2000 - 2009, TechTarget | Read our Privacy Policy
      TechTarget - The IT Media ROI Experts