Learn about SAN components and architecture, hardware and software configuration and testing, common problems, rules of thumb, plans for the future, and references for Veritas NetBackup Shared Storage Option.
NetBackup Shared Storage Option
VERITAS
Bill.Coleman@veritas.com
Agenda
• SAN Components/Architecture
• Hardware Configuration & Testing
• Software Configuration & Testing
• Common Problems
• Rules of Thumb
• Plans for the Future
• References
Enabling Technology
[Diagram: Application Host A and Application Host B are both Media Servers, each with a Host Bus Adapter attached to a shared hardware connection. Host A runs the MM daemons avrd, ltid, vmd, and tldd. Host B runs avrd, ltid, vmd, tldd, and tldcd.]
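SAN Components
[Diagram: an Application Host with a Host Bus Adapter connects over Fibre Channel to an FC-AL hub or switched fabric. From there, Fibre Channel links lead to an FC tape drive and to a Fibre Channel bridge, which attaches a tape drive over SCSI.]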
Hardware Testing
• Identify each physical drive within the robot
• Connect tape drives to SCSI adapters
• Record the physical configuration
  • Adapter
  • SCSI address
  • Fibre Channel Logical Unit Number (LUN)
• Create device files manually or automatically, depending on the OS (e.g., boot -r)
• Ensure each tape drive is visible to the operating system (e.g., mt -f /dev/rmt/0 status)
Physical to Logical Mapping
# /usr/openv/volmgr/bin/sgscan
/dev/sg/c2t1l0: Tape (/dev/rmt/21): "Quantum DLT4000"
/dev/sg/c2t2l0: Tape (/dev/rmt/18): "Quantum DLT4000"
/dev/sg/c2t4l1: Tape (/dev/rmt/19): "Quantum DLT4000"
/dev/sg/c2t4l2: Tape (/dev/rmt/20): "Quantum DLT4000"
/dev/sg/c2t5l0: Tape (/dev/rmt/12): "QUANTUM DLT7000"
/dev/sg/c5t0l0: Changer: "ADIC Scalar 1000"
/dev/sg/c5t2l0: Tape (/dev/rmt/13): "QUANTUM DLT7000"
/dev/sg/c5t3l0: Tape (/dev/rmt/14): "QUANTUM DLT7000"
/dev/sg/c5t4l0: Tape (/dev/rmt/15): "QUANTUM DLT7000"
/dev/sg/c5t5l0: Tape (/dev/rmt/16): "QUANTUM DLT7000"
/dev/sg/c6t2l0: Changer: "STK 9738"
/dev/sg/c6t3l0: Changer: "STK 9730"
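The sgscan listing pairs each SCSI pass-through path with its tape device file, which is the mapping worth recording per server. A minimal Python sketch of extracting it might look like the following. It assumes one device per line in the format shown on the slide and reads the cXtYlZ name as controller, target, and LUN; `SgDevice` and `parse_sgscan` are hypothetical names, not part of NetBackup.

```python
import re
from typing import List, NamedTuple, Optional


class SgDevice(NamedTuple):
    sg_path: str              # pass-through path, e.g. /dev/sg/c2t1l0
    kind: str                 # "Tape" or "Changer"
    tape_path: Optional[str]  # /dev/rmt/N for tapes, None for changers
    inquiry: str              # quoted vendor/product string
    controller: int
    target: int
    lun: int


# Assumed line format, taken from the slide's sample output.
LINE_RE = re.compile(
    r'^(?P<sg>/dev/sg/c(?P<c>\d+)t(?P<t>\d+)l(?P<l>\d+)):\s+'
    r'(?P<kind>Tape|Changer)'
    r'(?:\s+\((?P<rmt>/dev/rmt/\d+)\))?'
    r':\s+"(?P<inq>[^"]*)"$'
)


def parse_sgscan(text: str) -> List[SgDevice]:
    """Return one SgDevice per recognizable line; other lines are skipped."""
    devices = []
    for line in text.splitlines():
        m = LINE_RE.match(line.strip())
        if m:
            devices.append(SgDevice(
                m.group("sg"), m.group("kind"), m.group("rmt"),
                m.group("inq"), int(m.group("c")), int(m.group("t")),
                int(m.group("l")),
            ))
    return devices


# Three lines copied from the slide's sample output.
SAMPLE = '''\
/dev/sg/c2t4l1: Tape (/dev/rmt/19): "Quantum DLT4000"
/dev/sg/c5t0l0: Changer: "ADIC Scalar 1000"
/dev/sg/c5t2l0: Tape (/dev/rmt/13): "QUANTUM DLT7000"
'''

devices = parse_sgscan(SAMPLE)
# Pass-through path -> tape device file, tapes only.
tape_by_sg = {d.sg_path: d.tape_path for d in devices if d.kind == "Tape"}
for d in devices:
    print(d.sg_path, d.kind, d.tape_path, d.lun)
```

Saving one such mapping per Media Server makes it easier to compare how the same physical drive appears on each host.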
tpconfig
# /usr/openv/volmgr/bin/tpconfig -d
Index DriveName DrivePath      Type Multihost Status
***** ********* **********     **** ********* ******
  2   s1kd1     /dev/rmt/12cbn dlt  Yes       DOWN
        TLD(55) Definition DRIVE=1
  3   s1kd2     /dev/rmt/13cbn dlt  Yes       UP
        TLD(55) Definition DRIVE=2
Currently defined robotics are:
  TLD(55) robotic path = /dev/sg/c5t0l0, volume database host = sunflower
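To spot multihosted drives that are DOWN, the tpconfig -d listing can be read into records. The Python sketch below is illustrative only: it assumes each drive row is followed by a "Definition DRIVE=n" line, as in the slide's sample, and `parse_tpconfig` is a hypothetical helper rather than a NetBackup utility.

```python
import re

# Drive row: index, name, device path, type, multihost flag, status.
DRIVE_RE = re.compile(
    r'^\s*(?P<index>\d+)\s+(?P<name>\S+)\s+(?P<path>/\S+)\s+'
    r'(?P<type>\S+)\s+(?P<multihost>Yes|No)\s+(?P<status>\S+)\s*$'
)
# Continuation row: robot type(number) and the robot drive number.
ROBOT_RE = re.compile(
    r'^\s*(?P<robot>[A-Z0-9]+\(\d+\))\s+Definition\s+DRIVE=(?P<num>\d+)\s*$'
)


def parse_tpconfig(text):
    """Return a list of drive dicts; unrecognized lines are ignored."""
    drives = []
    for line in text.splitlines():
        m = DRIVE_RE.match(line)
        if m:
            d = m.groupdict()
            d["index"] = int(d["index"])
            d["robot"] = None
            d["robot_drive"] = None
            drives.append(d)
            continue
        m = ROBOT_RE.match(line)
        if m and drives:
            # Attach the robot definition to the drive row just above it.
            drives[-1]["robot"] = m.group("robot")
            drives[-1]["robot_drive"] = int(m.group("num"))
    return drives


# Sample laid out as assumed above, using the values from the slide.
SAMPLE = """\
Index DriveName DrivePath      Type Multihost Status
***** ********* **********     **** ********* ******
  2   s1kd1     /dev/rmt/12cbn dlt  Yes       DOWN
        TLD(55) Definition DRIVE=1
  3   s1kd2     /dev/rmt/13cbn dlt  Yes       UP
        TLD(55) Definition DRIVE=2
Currently defined robotics are:
  TLD(55) robotic path = /dev/sg/c5t0l0, volume database host = sunflower
"""

drives = parse_tpconfig(SAMPLE)
down_shared = [d["name"] for d in drives
               if d["multihost"] == "Yes" and d["status"] == "DOWN"]
print(down_shared)
```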
Analyzer (mmcrawl)
• Common errors found are:
  • Inconsistent robotic paths
  • Shared drive with different robot drive numbers
  • Daemons cannot communicate
  • Not all daemons running
  • Duplicate robotic definitions (robot control belongs on only one host)
  • Drive configured as both shared & dedicated
  • Storage unit not defined for each media server
  • Using tpconfig instead of the configuration wizards
  • Not testing drive paths on every Media Server
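Several of these errors are cross-host inconsistencies, so they can be illustrated by comparing per-host configuration records. The Python sketch below is not mmcrawl and does not reproduce its logic; the `configs` dictionary layout and the `check_sso_config` function are hypothetical, and only three of the listed checks are covered.

```python
from collections import defaultdict


def check_sso_config(configs):
    """Compare per-host device records and return a list of problem strings.

    configs maps host name -> {
        "robot_control": [robot, ...],   # robots this host controls
        "drives": {drive_name: {"robot_drive": int, "shared": bool}},
    }
    This layout is an assumption made for the example.
    """
    controllers = defaultdict(list)    # robot -> hosts claiming control
    drive_numbers = defaultdict(set)   # drive -> robot drive numbers seen
    shared_flags = defaultdict(set)    # drive -> shared/dedicated flags seen

    for host, cfg in sorted(configs.items()):
        for robot in cfg.get("robot_control", []):
            controllers[robot].append(host)
        for name, drive in cfg.get("drives", {}).items():
            drive_numbers[name].add(drive["robot_drive"])
            shared_flags[name].add(drive["shared"])

    problems = []
    for robot, hosts in sorted(controllers.items()):
        if len(hosts) > 1:
            problems.append(
                f"{robot}: robot control defined on {', '.join(hosts)}")
    for name in sorted(drive_numbers):
        if len(drive_numbers[name]) > 1:
            problems.append(
                f"{name}: different robot drive numbers "
                f"{sorted(drive_numbers[name])}")
        if len(shared_flags[name]) > 1:
            problems.append(
                f"{name}: configured as both shared and dedicated")
    return problems


# Made-up records for two hosts, deliberately inconsistent.
configs = {
    "hostA": {
        "robot_control": ["TLD(55)"],
        "drives": {
            "s1kd1": {"robot_drive": 1, "shared": True},
            "s1kd2": {"robot_drive": 2, "shared": True},
        },
    },
    "hostB": {
        "robot_control": ["TLD(55)"],
        "drives": {
            "s1kd1": {"robot_drive": 1, "shared": False},
            "s1kd2": {"robot_drive": 3, "shared": True},
        },
    },
}

problems = check_sso_config(configs)
for p in problems:
    print(p)
```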
Software Testing
• Start the NetBackup & Media Manager daemons on the server controlling robotics
• Identify the Device Allocator (volmgr/bin/vmdareq)
• Use volmgr/bin/robtest to verify that each server can access all tapes & drives
• Test each Media Server accessing drives separately
• Run volmgr/bin/tpreq to write/read a tape
• Use Device Monitor to verify mounting status
• Check drive status with vmdareq or in the Java Device Monitor Drive Summary report
Definitions
• Scan host - the media server that monitors a tape drive (avrd). Only one host in a shared drive configuration monitors a particular tape drive. If the scan host becomes unavailable, the role moves to another Media Server. Each tape drive can have a different scan host.
• Device Allocator (DA) - the media server that controls access to shared drives. This is the same host as the Media Manager volume database host for the drive. Typically there is one volume database host per site, so one DA can manage multiple robots connected to many Media/Master servers.
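The scan host rule (one monitoring host per drive, with the role moving when that host is unavailable) can be modeled in a few lines of Python. This is a toy model, not the Device Allocator's actual algorithm: the choice of the first available sharer is an assumption, and `reassign_scan_hosts` is a hypothetical name.

```python
def reassign_scan_hosts(scan_hosts, sharers, available):
    """Return a new drive -> scan host mapping.

    scan_hosts: drive -> current scan host
    sharers:    drive -> ordered list of hosts sharing the drive
    available:  set of hosts that are currently reachable
    A drive keeps its scan host if that host is available; otherwise the
    first available sharer takes over (assumed policy), or None if no
    sharer is available.
    """
    result = {}
    for drive, current in scan_hosts.items():
        if current in available:
            result[drive] = current
            continue
        candidates = [h for h in sharers[drive] if h in available]
        result[drive] = candidates[0] if candidates else None
    return result


# Each drive may start with a different scan host.
scan_hosts = {"drive1": "hostA", "drive2": "hostB"}
sharers = {"drive1": ["hostA", "hostB"], "drive2": ["hostA", "hostB"]}

# hostA goes away: drive1 moves to hostB, drive2 is unaffected.
after_failure = reassign_scan_hosts(scan_hosts, sharers, {"hostB"})
print(after_failure)
```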
Daemons
[Diagram: Application Host A (Media Server) runs the MM daemons avrd, ltid, vmd, and tldd. Application Host B (Media Server) runs avrd, ltid, vmd/DA, tldd, and tldcd. Each host has a Host Bus Adapter, and a volume database (Voldb) is also shown. The legend distinguishes the data path, the control path, and the hardware connection.]
Device Allocation
# /usr/openv/volmgr/bin/vmdareq -h host1
drive1 - AVAILABLE
    hostA SCAN_HOST
    hostB
drive2 - AVAILABLE
    hostA SCAN_HOST
    hostB
# /usr/openv/volmgr/bin/vmdareq -h host1 -unavailable -H host1
drive1 - AVAILABLE
    hostA
    hostB SCAN_HOST
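To confirm that the scan host moved between the two vmdareq listings, the output can be parsed and compared. The Python sketch below assumes each "drive - STATE" line is followed by one line per registered host, with SCAN_HOST marking the current scan host; that layout is inferred from the slide, and `parse_vmdareq` is a hypothetical helper.

```python
import re

# A drive line looks like "drive1 - AVAILABLE" (assumed layout).
DRIVE_RE = re.compile(r'^(?P<drive>\S+)\s+-\s+(?P<state>\S+)$')


def parse_vmdareq(text):
    """Return drive -> {"state", "hosts", "scan_host"} from a listing."""
    drives = {}
    current = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blanks and echoed command lines
        m = DRIVE_RE.match(line)
        if m:
            current = {"state": m.group("state"), "hosts": [],
                       "scan_host": None}
            drives[m.group("drive")] = current
            continue
        if current is None:
            continue  # host line before any drive line; ignore
        parts = line.split()
        current["hosts"].append(parts[0])
        if "SCAN_HOST" in parts[1:]:
            current["scan_host"] = parts[0]
    return drives


BEFORE = """\
drive1 - AVAILABLE
    hostA SCAN_HOST
    hostB
drive2 - AVAILABLE
    hostA SCAN_HOST
    hostB
"""

AFTER = """\
drive1 - AVAILABLE
    hostA
    hostB SCAN_HOST
"""

before = parse_vmdareq(BEFORE)
after = parse_vmdareq(AFTER)
moved = [d for d in after
         if d in before and before[d]["scan_host"] != after[d]["scan_host"]]
print(moved)
```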
Common Problems
• SSO software not installed on each server
• VERITAS tape-config-list entries not added to /kernel/drv/st.conf (Solaris)
• No configuration entries for fast/wide adapter cards in the sg.links and sg.conf files (Solaris)
• Device file not added manually (Tru64)
• Berkeley-style close not used on the tape path
• Outdated firmware in the hub, switch, HBA, or bridge (these change rapidly)
• Same robotic path configured on every server; only a single server controls the robot
• Forceload the sg/st/fcaw drivers on Solaris
• Set JNI failover to “0” seconds to avoid I/O hangs (working with bridge/HBA vendor fix)
Rules of Thumb
• Ensure the hardware and SAN configuration is understood and stable before configuring NetBackup Media Manager
• Configure one host to hold the volume database for multiple servers or masters
• Test backup/restore with dedicated tape drives before sharing
• Start sharing with a small number of tape drives across 2 to 3 Media Servers
• Try to reproduce Storage Area Networking issues using native operating system commands
• Mixing SAN components adds risk (e.g., GBIC or Host Bus Adapter)
Plans for the Future
• Improved configuration automation for tape drives, robotics, and shared tape drive (SSO) configurations
• Automatic configuration of serialized peripherals starting with NB3.3 (not SSO)
• Continued testing (e.g., Compaq)
• Frequently Asked Questions (FAQ)
• Improved documentation on verifying an SSO configuration
• Continue supporting configurations beyond what is tested
• More VERITAS product interoperability testing in SAN configurations
References
• 3.2 Media Manager Device Configuration Guide (100-001009A)
  • Chapter for each specific Operating System
• 3.2 Media Manager System Administrator’s Guide (100-000986A)
  • Appendix B (page 310) Multihosted Drives
  • Chapter 2 Configuring Storage Devices (page 37)
• 3.2 NetBackup Troubleshooting Guide (100-001046A)
  • Appendix A Functional Overview
  • Appendix B Networks & Hostnames
  • Appendix C Robotic Test Utilities
• NetBackup 3.2 Shared Storage Option White Paper
  • http://www.veritas.com/library/pi/SSO_SAN_WP.pdf
• NetBackup 3.2 Datasheet
  • http://www.veritas.com/products/nbux/datasheet.html
• NetBackup 3.2 SAN Tested Matrix
  • http://dept-slo.veritas.com/prodman/products/products/netbackup/netbackup/supported_platforms.htm