Johns Hopkins selects Caringo CAS software for data archiving

By Dave Raffo, News Director
03 Jul 2008 | SearchStorage.com

A research center at Johns Hopkins University has turned to Caringo Inc.'s CAStor content-addressed storage (CAS) software to archive and manage its sensitive and rapidly expanding genotyping data.

The Center for Inherited Disease Research (CIDR) provides genotyping and statistical genetics services for investigators trying to identify genes that contribute to human disease. The work of CIDR is, to put it bluntly, a data hog. As part of its research, CIDR might scan up to 12 DNA samples on one slide, according to Lee Watkins, Jr., the Center's director of bioinformatics. One sample can produce files ranging from 2 GB to 4 GB.
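To put those per-sample figures in perspective, the rough arithmetic for a fully loaded slide can be sketched as follows. This is only a back-of-the-envelope illustration: the function and constant names are hypothetical, and it assumes every one of the 12 positions on a slide is used and that the 2 GB to 4 GB range applies to each sample.

```python
# Back-of-the-envelope estimate of raw data per slide, using the figures
# quoted in the article. All names here are illustrative, not CIDR's tooling.

SAMPLES_PER_SLIDE_MAX = 12   # "up to 12 DNA samples on one slide"
GB_PER_SAMPLE_MIN = 2        # low end of the quoted per-sample file size
GB_PER_SAMPLE_MAX = 4        # high end of the quoted per-sample file size


def slide_size_range_gb(samples=SAMPLES_PER_SLIDE_MAX,
                        gb_min=GB_PER_SAMPLE_MIN,
                        gb_max=GB_PER_SAMPLE_MAX):
    """Return (low, high) estimated data volume in GB for one slide."""
    return samples * gb_min, samples * gb_max


low, high = slide_size_range_gb()
print(f"One full slide: roughly {low} GB to {high} GB")  # 24 GB to 48 GB
```

Under these assumptions a single full slide accounts for roughly 24 GB to 48 GB.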

CIDR uses CAStor to archive the data and delete it from the Windows file share. With data from tens of thousands of DNA samples in its system, the archive builds up fast. The Baltimore-based CIDR often generates terabytes of data a week, sometimes hitting a terabyte in one day. The Center used high-capacity PetaBox systems from Capricorn Technologies to store the data, but last summer the 50-person research team realized they needed help managing it all.

"We knew we needed to have an archiving strategy," Watkins said. "Keeping up with all the data became unmanageable. People wanted to recover files by project, keeping track of which files go with each slide scanned."

But perhaps the hardest part was finding technology that wouldn't deplete the budget. "We're well-funded, but we can't go out and buy a system from EMC or Hitachi to do this," Watkins said. "We said, 'There has to be somebody who has written software that can keep track of this.'"

CIDR became aware of Caringo through Capricorn. Caringo gave CIDR a free trial period to test CAStor. CAStor passed the test and CIDR became a paying customer last November. The Center started with a 30 TB CAStor cluster and is now up to a 99.9 TB cluster with 80 TB used, and it is still growing.

To keep up with its data growth, the Center is installing a high-density Rackable Systems array for more capacity and will install CAStor clusters on that as well. This new set-up is scheduled to go live in August.

At first, CAStor had trouble keeping up with the data that the Center was throwing at the clusters. "It wasn't 100% robust," Watkins said. "There were cases where a disk wouldn't fail but it would stop performing and act weird, give us little hiccups now and then. They wrote a fix a few months ago, and we haven't had that problem."

Derek Gascon, Caringo marketing vice president, said, "They wanted to have disk capacity freed up much quicker, so we put together a new version for them that includes a faster turnaround in releasing disk capacity." That fix is now included in the general release of the product.

According to Watkins, no relief from data growth is in sight. "Our plan is to keep data online for a year," he said. "We haven't gotten to that point yet where we've released projects, so we can't predict our high water mark. But we suspect it will be between 300 and 400 TB."

CIDR keeps its data on tape for long-term archiving, but uses CAStor for active data. "We've had to recover a lot of stuff we didn't think we would have to recover, and it's there," Watkins said. "What we were doing before was not scalable, and we couldn't keep track of everything. We had to do everything on a separate storage device."

"Now it's simple as simple can be," he said. "You need more storage, add another storage device, boot up from a NetBoot server and you're done."

Watkins said CAStor has also helped provide disaster recovery, surviving various mechanical failures, and even a flood in the lab where the clusters were temporarily installed while CIDR was expanding its server. "We've had random disk failures, and power failures where all the nodes went down and we had to power it back up," he said. "We never had a problem with that, which is amazing to me."



All Rights Reserved, Copyright 2000 - 2009, TechTarget