Once the retention policy has been implemented for a few quarters, and depending on the desired schedule of your tape vault vendor, you can determine how large of a tape library you'll need for your solution. Three quarters - including fiscal and calendar year end if possible - worth of trending data under the retention policy should give you some idea about the projected growth that your selected tape library must accommodate.
In that scenario, the library supporting the backup of a few servers may be best served by using DLT or LTO technologies. However, when those servers' data is migrated from disk to tape as a result of HSM, the larger user community could potentially overrun the HSM tape library because of the slow load times of the selected tape drive technology. Although it isn't desirable to support two tape formats, in some cases it may be necessary, depending on the traffic patterns of the applications involved.
One particular tape format that serves both ends of the spectrum well is Sony's AIT-3 tape drive. The AIT-3 tape drive has the highest capacity per square inch. Combining a native capacity of 100GB with a sustained transfer rate of 12MB/s and 27-second load time, covers most if not all of your application concerns, including cost when compared to some high-end tape solutions. The memory-in-cartridge feature also enhances the load time and file location on the tape, thereby providing your solution with an optimal condition for large numbers of tape mount requests such as in a complete disaster recovery, or HSM application. AIT's 3.5-inch form factor also provides the scalability you will need by allowing the greatest tape drive count in your chosen library.
Judging by market share, Sony's AIT-3 features have not caught the attention of many storage managers looking to ease the burdens of tape management. Or possibly, no matter how good a different technology has become, storage managers or their management aren't willing to change from their current format because it indirectly means that someone has made a bad decision in the past.
If you're an administrator tasked with implementing a tape library using a format that may not necessary match the needs of the application, appeal to your management to consider the benefits of a new tape format. It really could mean the difference between success and failure. But be prepared with a plan to show your management how you intend to preserve the recoverability of the data remaining on those archived tapes.
Connecting the library
There are various ways you can connect a tape library to your storage area network (SAN). For the most part, the connecting interface of your tape drives will determine the hardware requirements representing the data path between your backup servers and the library. For example, if the plan is to move forward with direct-attached SCSI tape drives, you'll need to bridge the SCSI traffic onto a Fibre Channel (FC) highway via a SCSI-FC bridge.
However, if your tape drives are outfitted with native FC connections, you get the benefit of accessing your tape drives directly, without any translation. Native connections also enhance the manageability of your SAN by reducing the number of devices in it. And depending on your library vendor, you may also experience the benefit of placing your library in a lights-out location without any IP security vulnerabilities. That's because some vendors are now offering the ability to control the library's control port via a native FC connection. Otherwise, you'll need to make the library's control port available via a SCSI-FC bridge, which often have management IP ports.
In my experience, FC-SCSI bridging has been problematic. Not only does it impose the additional management overhead of keeping firmware revisions current, it also adds computational overhead to the data path between the backup server and destination storage device, thereby possibly lowering performance and usability.
For example, during one implementation I participated in, the tape library was located more than the allowable distance from the backup server to use a SCSI cable. As a result, the engineer driving the project decided to bridge the gap by connecting the library's robotic arm to an FC-SCSI bridge, and then connected the bridge to the SAN to allow the backup server to address the robotic arm via the SAN (fiber optic cabling was used in the SAN). Often, the robotic arm was slow to respond to the SCSI commands of the backup server during the busiest of times, resulting in failed backups and the inability to restore data. To remedy this problem, we relocated the backup server within SCSI's distance limitations and directly connected the library's robotic arm to the backup server and the problem went away.
Yet the simplicity of the native FC tape drives can often lead to bad practices in the field as well. If a tape library solution called for 10 native FC tape drives, common practices in the field are to connect the 10 tape drives into 10 separate ports on one 16-port fabric switch. Considering the speed of the fastest tape drives available today, this is quite a bit of resources that will never be utilized per port. And if 2Gb/s switches are used, the problem and costs are compounded even further.
Remember that when connecting a native FC tape drive to a fabric, the tape drive usually shows up as a public loop device: the tape interface is talking FC-AL and the now initialized FL_Port on the switch is also talking FC-AL with a view into the nameserver's database in the switch. This allows fabric-attached hosts on the SAN to address and establish a path to the public loop tape drive.
In this configuration, a more economical and scalable approach would be to connect two 8-port arbitrated loop switches to two ports on the fabric switch (see "Connecting libraries to SANs: dos and don'ts,"). The five tape drives will share the bandwidth of the FL_Port, making it a more likely match of required resources and lowering the cost of the overall solution.
When designing a library solution, be creative but pragmatic. Once you know precisely what you need, there are many ways to exploit the various technologies involved to achieve your ends.