Download Remote Site Recovery of Oracle Enterprise Data Warehouse

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Microsoft Jet Database Engine wikipedia , lookup

Entity–attribute–value model wikipedia , lookup

Relational model wikipedia , lookup

Extensible Storage Engine wikipedia , lookup

Database wikipedia , lookup

Functional Database Model wikipedia , lookup

Oracle Database wikipedia , lookup

Clusterpoint wikipedia , lookup

Database model wikipedia , lookup

Transcript
White Paper
REMOTE SITE RECOVERY OF ORACLE
ENTERPRISE DATA WAREHOUSE USING
EMC DATA DOMAIN
A Detailed Review
EMC SOLUTIONS GROUP
Abstract
This white paper describes how a 12 TB Oracle data warehouse was transported
from a data center in Massachusetts, U.S, to Cork, Ireland. The infrastructure
was re-created at the Cork site on EMC® Symmetrix® VMAX™ storage and the
database was restored and recovered. EMC Data Domain® Encryption was used
to secure the data during transport.
July 2011
Copyright © 2011 EMC Corporation. All Rights Reserved.
EMC believes the information in this publication is accurate as of its
publication date. The information is subject to change without notice.
The information in this publication is provided “as is.” EMC Corporation makes
no representations or warranties of any kind with respect to the information in
this publication, and specifically disclaims implied warranties of
merchantability or fitness for a particular purpose.
Use, copying, and distribution of any EMC software described in this
publication requires an applicable software license.
For the most up-to-date listing of EMC product names, see EMC Corporation
Trademarks on EMC.com.
All trademarks used herein are the property of their respective owners.
Part Number H8238.1
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
2
Table of contents
Executive summary ............................................................................................................... 5
Business case .................................................................................................................................. 5
Solution overview ............................................................................................................................ 5
Key results ....................................................................................................................................... 6
Introduction .......................................................................................................................... 7
Purpose ........................................................................................................................................... 7
Scope .............................................................................................................................................. 7
Audience.......................................................................................................................................... 7
Terminology ..................................................................................................................................... 7
Technology overview ............................................................................................................. 8
Overview .......................................................................................................................................... 8
EMC Data Domain DD660 ................................................................................................................. 8
EMC Data Domain Encryption ........................................................................................................... 9
EMC Data Domain file system encryption lock .................................................................................. 9
EMC Symmetrix VMAX ...................................................................................................................... 9
EMC Solutions Enabler ..................................................................................................................... 9
EMC Symmetrix TimeFinder .............................................................................................................. 9
EMC PowerPath .............................................................................................................................. 10
Oracle Database ............................................................................................................................ 10
Solution overview ............................................................................................................... 11
Use case requirements................................................................................................................... 11
The challenge................................................................................................................................. 11
Nondisruptive to production ...................................................................................................... 11
Distance and latency ................................................................................................................. 11
Database size............................................................................................................................ 12
The solution ................................................................................................................................... 12
Project timeline .............................................................................................................................. 14
Discovery – primary site ...................................................................................................... 15
Overview ........................................................................................................................................ 15
Security.......................................................................................................................................... 15
Business and technical .................................................................................................................. 16
Primary data center configuration information................................................................................ 16
Remote site infrastructure .............................................................................................................. 17
Architecture ............................................................................................................................... 17
Hardware resources................................................................................................................... 18
Software resources .................................................................................................................... 18
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
3
Backup appliance...................................................................................................................... 18
Oracle database backup – primary site ................................................................................ 19
Overview ........................................................................................................................................ 19
DD660 setup .................................................................................................................................. 19
Network settings ....................................................................................................................... 19
Encryption ................................................................................................................................. 20
RMAN backup ................................................................................................................................ 20
File system lock.............................................................................................................................. 20
Data compression results............................................................................................................... 21
Oracle server deployment and storage provisioning – remote site ........................................ 23
Overview ........................................................................................................................................ 23
Deploying the remote site server .................................................................................................... 23
Configuring the remote site storage ............................................................................................... 23
Primary data warehouse configuration information.................................................................... 23
Provisioning storage at the remote site ...................................................................................... 24
Setting up the Oracle database environment.................................................................................. 25
Oracle database recovery – remote site ............................................................................... 27
Overview ........................................................................................................................................ 27
DD660 setup .................................................................................................................................. 27
Network settings ....................................................................................................................... 27
Unlock file system ..................................................................................................................... 28
RMAN restore ................................................................................................................................. 29
Verifying the integrity of the database ............................................................................................ 32
Data Domain consolidation – remote site ............................................................................ 34
Overview ........................................................................................................................................ 34
Replication process........................................................................................................................ 34
Conclusion ......................................................................................................................... 37
Summary ....................................................................................................................................... 37
Findings ......................................................................................................................................... 37
References .......................................................................................................................... 38
White papers ................................................................................................................................. 38
Product documentation.................................................................................................................. 38
Other documentation ..................................................................................................................... 38
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
4
Executive summary
Business case
EMC offers an extensive series of solutions for quick, efficient, and nondisruptive
backup of large Oracle data warehouse environments. Increasingly, customers are
looking to repurpose or reuse these backups to facilitate operations such as:
•
Migration of Oracle data warehouses to new data center locations
•
Seeding of Oracle data warehouse environments at remote disaster recovery
locations
•
Deployment of remote database copies for test and development purposes
These scenarios present challenges in relation to determining the most appropriate
mechanism for transferring the data from the primary site to the remote site, and
developing validated procedures for re-creating the production environment and
recovering the database at the remote site.
The main considerations in relation to data transfer to the remote site include:
•
Security and corporate governance compliance
•
Distance and latency
•
Database size
•
Cost effectiveness
For one-off migration scenarios, where the data warehouse is many terabytes in size
and the distance between data centers is great, it is impractical to transfer the data
across the WAN. Consequently, customers are looking for an offline transport solution
that is economical and secure.
Solution overview
At its headquarters in Hopkinton, Massachusetts, EMC IT maintains a 12 TB Oracle
data warehouse, deployed on an Oracle Real Application Clusters (RAC) 11g
environment. To demonstrate the benefits and implementation of the solution
documented in this white paper, this real-world production database was cloned and
backed up, shipped by air from the primary data center in Hopkinton, Massachusetts
to the EMC Solutions Center in Cork, Ireland, and then restored and recovered for test
and development use.
The remote data center infrastructure consisted of a two-node Oracle RAC 11g cluster
that accessed the Oracle database on an EMC® Symmetrix® VMAX™ storage system.
This reflects the infrastructure at the primary data center.
The production database in Hopkinton was cloned using EMC TimeFinder® and then
backed up to an EMC Data Domain® DD660 appliance using Oracle Recovery Manager
(RMAN). Data Domain Encryption and file system encryption locking were used to
ensure security and corporate governance compliance during transport.
At the remote site, the DD660 was added to the environment and the file system was
unlocked. The database was then restored and recovered to a test system using
Oracle RMAN.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
5
After verifying that the database had been successfully restored, Data Domain
Replicator was used to move the database image from the DD660 to a shared Data
Domain DD880 appliance in the data center in Cork, so that the DD660 could be
repurposed for another project.
Key results
The key benefits demonstrated by the solution include:
•
Nondisruptive backup
Creating a TimeFinder clone has no impact on production. Mounting the clone
to a separate backup server provides a fully-usable, independent, point-in-time
copy that can be backed up to the Data Domain appliance with no disruption to
the production environment.
•
Reduced backup footprint and shipping costs
Data Domain deduplication reduces the footprint of the backup database
significantly. In this solution, a 70 percent reduction in capacity requirements
enabled use of the midrange DD660 appliance, which minimized shipping
costs.
•
Seamless Oracle RMAN integration
Data Domain systems integrate seamlessly into Oracle architectures,
supporting existing backup strategies, including Oracle RMAN.
•
Data security and corporate governance compliance
Data Domain Encryption ensures that all data is physically stored in an
encrypted manner and cannot be accessed without first decrypting it. Data
Domain file system encryption lock adds a further level of protection that
enables shipping on commercial air transport while satisfying corporate
governance compliance requirements.
•
Validated and documented end-to-end procedures
Database integrity testing at the end of the project demonstrated successful
recovery of the database at the remote data center.
This white paper documents the procedures for achieving these results, from
the initial discovery phase necessary for designing the infrastructure for the
remote site, to a complete restore and recovery of a working data warehouse.
These procedures are applicable to any solution involving recovery of an Oracle
database.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
6
Introduction
Purpose
This white paper documents a case study that involved cloning and backing up a realworld 12 TB database, shipping it by air from the primary data center in Hopkinton,
Massachusetts to the EMC Solutions Center in Cork, Ireland, and then restoring and
recovering the database for test and development use.
Scope
The white paper covers the following:
Audience
Terminology
•
An overview of the technologies involved in the solution
•
The discovery process carried out at the production site to develop a build plan
for the remote site
•
Re-creation of the data warehouse at the remote site
•
Backup and securing of the DD660 for transfer by commercial air transport
•
Restore and recovery of the database at the remote site
•
Validation of the recovered database
•
Replication of the database from the DD660 to a DD880 appliance
This white paper is intended for Oracle, storage, server, and backup administrators
who want to understand how to migrate an Oracle database to a remote site and the
build requirements at the remote site to enable recovery of the database. It is
assumed that the reader is familiar with the following EMC and Oracle products:
•
EMC Symmetrix VMAX
•
EMC Data Domain
•
Oracle Database 11g R1 Enterprise Edition with Oracle Clusterware
Table 1 defines terms used in this white paper.
Table 1.
Terminology
Term
Definition
AES
Advanced Encryption Standard
ASM
Automatic Storage Management
DD
Data Domain
DD OS
Data Domain operating system
RAC
Real Application Clusters
RMAN
Recovery Manager
SISL™
Stream-Informed Segment Layout architecture
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
7
Technology overview
Overview
EMC Data Domain
DD660
The case study uses the following hardware and software components:
•
EMC Data Domain DD660
•
EMC Data Domain Encryption
•
EMC Data Domain file system encryption lock
•
EMC Symmetrix VMAX
•
EMC Symmetrix TimeFinder
•
EMC PowerPath®
•
Oracle Database 11g R1 Enterprise Edition with Oracle Clusterware
•
Oracle RMAN
For the solution, an EMC Data Domain DD660 appliance was used to back up the
source database and transport it to the recovery site in Cork.
Data Domain deduplication storage systems dramatically reduce the amount of disk
storage needed to retain and protect enterprise data, requiring a storage footprint
that is 10 to 30 times smaller, on average, than that required by the original data set.
The DD660 is a midrange appliance that offers up to 2 TB/hour of aggregate inline
deduplication throughput, and up to 700 GB/hour for a single stream. The model
used in the case study provides 8 TB of storage capacity, with much higher data
storage capacity enabled by deduplication.
Like all Data Domain systems, the DD660 derives its performance advantages from
the Data Domain Stream-Informed Segment Layout (SISL) scaling architecture. SISL
identifies 99 percent of duplicate segments in RAM, inline, ensuring that only unique
data is stored to disk. The storage of unique data is invisible to backup software,
which sees the entire virtual file system. The integrity and recoverability of backup
data are secured by the Data Domain Operating System (DD OS) and Data
Invulnerability Architecture™, which ensures that data is stored and recoverable with
continuous write verification, fault-detection, and self-healing mechanisms.
When data enters the Data Domain system, using any of the supported protocols
(NFS, CIFS, VTL, OST, and Distributed Segmented Processing), the stream is
segmented, fingerprinted, deduplicated (Global Compression™), grouped into multisegment compression regions, and then locally compressed. When confidentiality is
required, deduplicated and compressed data can be encrypted inline and the file
system can then be locked.
Data Domain systems integrate seamlessly with Oracle RMAN, with no changes to
scripts, backup processes, or system architecture.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
8
EMC Data Domain
Encryption
Data Domain Encryption protects data if the Data Domain system is stolen or lost
during transit, and eliminates accidental exposure during replacement of failed
drives. DD Encryption is designed such that if an intruder circumvents other network
security controls and gains access to encrypted data, the data is unreadable and
unusable to that person without the proper cryptographic keys.
The inline encryption process integrates seamlessly with the inline deduplication
process, encrypting data before it is written to disk. Similar to the advantages of
inline deduplication, inline encryption requires minimal resources to provide fast,
reliable, and secure backup and recovery.
EMC Data Domain
file system
encryption lock
File system encryption lock is specifically designed to lock an encrypted Data Domain
appliance and its external storage prior to transporting the appliance between
locations.
File system encryption lock requires an encryption passphrase. This is a humanreadable key that is used to generate a machine-usable AES256 encryption key that
encrypts the Data Domain system encryption key. Locking the file system involves
changing the passphrase. This changes the encryption of the Data Domain system
key and removes the passphrase from the system. Locking the system allows a Data
Domain system to be transported with the encryption keys on the system. The keys
are not recoverable until the file system is unlocked using the passphrase (which is
not stored on the appliance).
EMC Symmetrix
VMAX
An EMC Symmetrix VMAX provides the storage platform at both the primary and
remote data centers.
Symmetrix VMAX storage arrays provide high performance and scalability for
demanding enterprise storage environments. Built on the strategy of simple,
intelligent, modular storage, the Symmetrix VMAX incorporates a highly scalable
Virtual Matrix Architecture™ that enables it to grow seamlessly and cost-effectively
from an entry-level configuration into the world’s largest storage system. The VMAX
supports Flash drives, Fibre Channel drives, and SATA drives within a single array, as
well as an extensive range of RAID types.
EMC Solutions
Enabler
EMC Solutions Enabler is the software that provides a host with the Symmetrix
Command Line Interface (SYMCLI), which is a comprehensive command set for
managing your Symmetrix storage environment. SYMCLI is used in single command
line entries and scripts to monitor device configuration and status, and perform
control operations on devices and data objects within a managed storage system.
EMC Symmetrix
TimeFinder
EMC Symmetrix TimeFinder software provides powerful replication capabilities for
creating local point-in-time copies that are independent of the hosts and operating
systems, applications, and databases. These copies are immediately available for
read and write access.
For the solution, a TimeFinder copy of the Oracle database was used to offload
backup overhead from the production environment to another server.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
9
EMC PowerPath
EMC PowerPath® is server-resident software that enhances performance and
application availability by supporting multiple I/O paths to logical devices and
intelligently distributing I/O requests across all available paths. PowerPath also
provides automatic failover in the event of a hardware failure by automatically
detecting the path failure and redirecting I/O to another path.
For the solution, PowerPath provides intelligent path management and automatic
path failover at the remote site.
Oracle Database
The solution involves an Oracle Database 11g R1 data warehouse and uses the
features of Oracle ASMLib, Oracle Database 11g, and Oracle Clusterware 11g such as
RAC, Automatic Storage Management (ASM), and RMAN.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
10
Solution overview
Use case
requirements
This solution is based on a real-world use case and is essentially a case study. The
requirements of the use case were to take a production database and transfer it to a
lab environment for test and development purposes.
The EMC IT Production DB resides in EMC IT corporate headquarters in Hopkinton,
Massachusetts, as shown in Figure 1. The test and development lab is located in the
EMC Solutions Center in Cork, Ireland. The database is approximately 12 TB.
Figure 1.
The challenge
An economical and secure mechanism was required to transport the data, and the
process also had to have zero impact to the EMC IT production system.
The challenge
Nondisruptive to production
As the project involved a real-world, production system, it was critical that the backup
procedure did not impact database operations. This factor dictated a nondisruptive
backup solution.
Distance and latency
The distance between the data centers in Hopkinton and Cork is approximately 4,740
kilometers (2,946 miles). The minimum latency for the round-trip is 47.4 ms, not
including other delays introduced by encryption or the network itself. In fact, the
average measured ping time between the data centers is 90 ms.
For example, a dedicated WAN link, with 47.4 ms round-trip latency, 1 Gb/s
bandwidth, and a TCP window size of 65,536 bytes, would restrict the maximum
throughput to 1.3 MB/s and take approximately 112 days to move 12 TB of data.
As this was a one-off project, it was not envisaged that sufficient dedicated
bandwidth would be made available on the corporate WAN for the data migration.
These factors dictated an offline method for transporting the data.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
11
Database size
The size of the database—approximately 12 TB— was a significant challenge and
another factor that made it impractical to transport the data across the WAN. Figure 2
indicates just how big 12 TB is in real terms.
Figure 2.
The solution
What 12 TB looks like
To facilitate transporting the data in an economical and secure fashion, a Data
Domain DD660 appliance was chosen. The reasons for this are:
•
Small form factor
The DD660 used has a 2U form factor and weighs 30 kg (66 lb)—perfectly
suited for low shipping costs.
•
Data compression
The DD660 has 8 TB of data storage space. However, Data Domain systems
store only unique data, using Global Compression to pool redundant data from
each backup image. This enables much higher data storage rates than the raw
capacity of the device.
Figure 3 shows the difference in size between the original database, the database
backed up by RMAN, and the deduplicated, encrypted database on the DD660.
Figure 3.
Original and deduplicated database sizes
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
12
•
Data integrity
The Data Domain Data Invulnerability Architecture protects against data loss
from hardware and software failures.
•
Encryption of data at rest
Data Domain Encryption protects data if the Data Domain system is stolen or
lost during transit.
•
Encrypted file system lock
Data Domain file system encryption lock protects an encryption-enabled Data
Domain system (and its external storage devices) while in transit. The encryption
keys are not recoverable without a passphrase, which is not stored with the
system.
To ensure that the production environment was not impacted by the project, it was
decided to create a TimeFinder clone of the database and mount this to a backup
server. This would provide a fully-usable, independent, point-in-time copy of the
production database that could be backed up to the DD660 nondisruptively.
Figure 4 illustrates the solution designed for the project and outlines the procedures
involved.
Figure 4.
The solution
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
13
Project timeline
The initial planning and feasibility phase of this project took place over several weeks
prior to the official kickoff of the project. After the official kickoff, the entire project
took just 22 working days to complete, from commencing discovery to verification of
the restored database at the remote site. Figure 5 shows the project schedule.
Predictably, discovery was the lengthiest phase (2 weeks) and was iterative in nature.
It progressed in parallel with, and was informed by, the build process at the remote
site. The comprehensive information gathering and planning during this phase was
key to ensuring successful completion of the project. The result was a complete and
successful restore and recovery of the database at the remote site in only 28 hours.
Figure 5.
Project timeline
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
14
Discovery – primary site
Overview
This section describes the discovery process carried out at the start of the project to
determine the feasibility of the project and to gather infrastructure and configuration
information from the production site. This information was used to determine the
transport mechanism between the two data centers and to create a build plan for the
remote site. This phase of the project involved the steps shown in Figure 6.
Figure 6.
Discovery process
The stakeholders involved in the project were:
•
•
Security
Primary site

Stakeholders: EMC IT; EMC IT Security and Compliance

Location: EMC IT headquarters, Hopkinton, Massachusetts, U.S.

Database: EMC IT Production DB
Remote site

Stakeholders: EMC Global Solutions (Oracle Engineering)

Location: EMC Solutions Center, Cork, Ireland (CSC)

Database: CSC test and development database
The security of the data to be transferred between the data centers was of critical
importance to the feasibility of the project. EMC IT and EMC IT Security considered
data security issues in terms of:
•
Corporate governance and compliance—that is, ensuring data security both in
transit and at the remote data center so as to meet legislative requirements and
EMC and industry standards.
•
Identifying any potential personal/sensitive data for masking/cleaning.
•
Choosing a backup/copy mechanism that would be nondisruptive to the
production database.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
15
This feasibility phase concluded that a copy of the EMC IT Production DB would be
supplied to EMC Global Solutions under the following conditions:
Business and
technical
•
The data would be stored in a secure EMC data center at the remote site.
•
Access to the data would be limited to specified EMC personnel.
•
No data would be made public or available to external parties.
•
A Data Domain DD660 would be used to transport the data, with Data Domain
Encryption and Data Domain file system encryption lock enabled.
To enable recovery and repurposing of the data warehouse, the infrastructure at the
primary site would be re-created at the remote site. In preparation for this, the
following business and technical issues were considered:
•
Mapping and configuration of the underlying infrastructure—servers, storage,
and IP and storage networks
•
Identifying hardware and software versions for the:
•

Server operating system

Oracle database

Symmetrix VMAX storage system

Data Domain system
Identifying the groups and individuals within EMC IT who could supply
information and carry out the necessary tasks in the production environment
A project plan was then agreed to that would minimize the impact on EMC IT staff and
would move the data to Cork with minimal delay.
Primary data center To facilitate the project, EMC IT made several important documents available to the
EMC Global Solutions team in Cork. These documents detailed the design and
configuration
information
architecture of the EMC IT data warehouse.
The Detailed Solutions Design document for the data warehouse was key to
understanding the production environment. This document contains the following
information:
•
High-level architecture
•
Hardware inventory and configuration information for the database and backup
servers and cluster, including CPU and memory, internal drives, and HBA and
network cards and ports
•
List of required Shared Infrastructure Services, including: SAN, storage array,
and IP networks—private (Cluster Interconnect), data center, and backup
•
Storage rationale and design, hardware and software, and configuration
•
Security—user and group access and authentication
•
Data center hardware provisioning—environment and cabling details
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
16
In addition, the following “live” information was supplied:
•
A copy of the current symapi_db.bin file from the production Symmetrix array
•
An extract of the Oracle data dictionary views including V$ASM_DISKGROUP
and V$ASM_DISK
•
A listing of the Oracle ASM disks
•
A mapping of the PowerPath devices and paths to the Symmetrix LUNs
•
Multiple sample Automatic Workload Repository (AWR) reports
•
A listing of the latest operating system and Oracle Clusterware and database
patch levels
Discovery was an iterative process, and continued in parallel with, and in response
to, the remote site build.
Remote site
infrastructure
Architecture
From the information gathered during the discovery phase, the infrastructure for the
remote site was designed as shown in Figure 7.
Figure 7.
Remote site architecture
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
17
Hardware resources
Table 2 details the hardware environment for the remote site.
Table 2.
Hardware resources
Purpose
Quantity
1
Storage array
Resources
Symmetrix VMAX with:
• 2 x VMAX 64 GB engines
• 8 Gb FC connectivity
• 450 GB 15k FC drives
Oracle RAC database servers
2
Linux server with:
• Quad-core CPU
• 96 GB RAM
FC switches
2
8 GB/s FC switches
Ethernet switches
2
1 Gigabit Ethernet switches
Software resources
Table 3 details the software resources for the remote site.
Table 3.
Software resources
Software
Purpose
Red Hat Enterprise Linux 5.5
Server OS for database and application servers
Oracle Database 11g R1 (11.1.0.7)
Database and cluster software
EMC Enginuity™ 5875
Symmetrix VMAX operating environment
Backup appliance
Table 4 details the configuration of the Data Domain appliance used for backing up
and transporting the database.
Table 4.
Data Domain appliance configuration
Appliance
Details
EMC Data Domain DD600
• 12 x 1 TB SATA drives
• 1 Gb IP connectivity
• DD OS 4.9
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
18
Oracle database backup – primary site
Overview
The process for configuring the DD660 and backing up the production database at
the primary data center involved the steps shown in Figure 8. When the process was
complete, the appliance was secure and ready to ship to Cork—that is, the data was
encrypted and the file system locked.
Local site
backup
Configure DD660
network settings &
add to network
Enable DD660 file
system encryption
Create clone copy
of production
database
RMAN backup to
DD660
Test database
restore
DD660 secure & ready to ship
Data encrypted
File system locked
Lock DD660 file
system
Figure 8.
DD660 setup
Backup
complete
Oracle database backup process
Network settings
The DD660 used for this solution had two 1 Gb network interfaces. To improve
network performance and resiliency, these were aggregated and used in parallel.
Figure 9 shows the link aggregation setup.
Figure 9.
DD660 link aggregation setup
To enable backup of the database, the appliance was then provided with an IP
address in the local network.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
19
Encryption
Data Domain Encryption was enabled prior to enabling the file system on the DD660.
Data was encrypted using the 128-bit Advanced Encryption Standard (AES) algorithm
(aes_128_cbc), as shown in Figure 10. This option strikes a reasonable balance
between security and performance for the use case.
Figure 10.
RMAN backup
DD660 encryption
On the production site, a fresh clone copy of the storage was created using
TimeFinder. An Oracle RMAN backup of this copy was then taken, with the database
mounted, but not opened, on a backup server. A recovery catalog was used to record
RMAN metadata.
With Data Domain Encryption enabled, the data being backed up to the DD660 was
deduplicated and then encrypted inline, before it was written to disk.
The following were backed up to the DD660:
•
A full RMAN backup of the Oracle database
•
The database control file
•
A schema export of the RMAN recovery catalog
The Oracle team on the production site then validated the backup by restoring the
control file and database to the backup server. Table 5 shows the duration of the
backup and restore operations.
Table 5.
File system lock
Backup and restore times
Process
Timings
Database backup to DD660
28 hours and 38 minutes
Database restore (for verification)
30 hours and 2 minutes
The filesys encryption lock command was used to lock the Data Domain file system.
This required a passphrase change, as shown in Figure 11, automatically re-encrypts
the system encryption key, and removes the new passphrase from the DD system.
The new passphrase was communicated separately to the remote site.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
20
Figure 11.
Data compression
results
Data Domain file system encryption lock
Data Domain deduplication eliminated redundant data segments during the backup
process. This allowed the Data Domain appliance to store and manage much more
data than would be possible with a traditional storage device.
Figure 12 shows a DD OS view that summarizes data storage and compression
statistics for the use case and illustrates the value of Data Domain deduplication
when transporting an Oracle database. Table 6 also summarizes these statistics.
Figure 12.
Use case compression summary
The total physical space available on the appliance is 7.5 TB, the total uncompressed
data written by RMAN to the appliance was 10.6 TB, and the space required to store
the compressed data was 3.2 TB. This demonstrates a 69.7 percent reduction in the
space required to store the database.
Table 6.
Use case compression summary
Item
Value
Total physical space available
7,528.6 GiB (7.5 TB)
Total data written to appliance (uncompressed)
10,613.6 GiB (10.6 TB)
Data stored on appliance (compressed)
3,217.0 GiB (3.2 TB)
Deduplication factor
3.3
Percent reduction—storage space saving
69.7%
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
21
Figure 13 shows the allocated, used, and free space information for the source
database. This closely matches the statistics gathered by the DD660 and shown in
Figure 12.
Figure 13.
SQL output showing database allocated and free space information
The database had 11.6 TB of storage assigned to its data files, not including temp
files, redo logs, control files and archive logs. Of this, 10.13 TB was allocated to
tables and indexes within the data files, with 1,504 GB of free space available.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
22
Oracle server deployment and storage provisioning – remote site
Overview
This section describes preparation of the hardware and software infrastructure at the
Solutions Center lab in Cork, as outlined in Figure 14.
Figure 14.
Deploying the
remote site server
Remote site preparation process
To simplify the restore process, the Oracle Solutions Engineering team decided to
restore the database as a single instance before migrating it to Oracle RAC.
The database server was configured with Red Hat Enterprise Linux 5.5 and prepared
for a standard Oracle ASM/Database install. Server configuration involved the
following steps:
1.
Install and configure EMC PowerPath.
2.
Install and configure ASMLIB.
Primary data warehouse configuration information
Configuring the
remote site storage During the discovery phase of the project, EMC IT provided the Oracle Solutions
Engineering team in Cork with the configuration database (symapi_db.bin) for the IT
data warehouse storage array.
At the remote site, the configuration information for the storage array was extracted
by querying symapi_db.bin in offline mode using standard Solutions Enabler display
commands. The procedure involved the following steps:
1.
Copy symapi_db.bin to a host with Solutions Enabler installed.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
23
2.
Use the following Solutions Enabler commands to set up Solutions Enabler to
query the database file:
export SYMCLI_OFFLINE=1
export SYMCLI_DB_FILE=”/DWproject/symapi_db.bin”
export SYMCLI_SID=XXX
3.
Use the following command to extract the configuration details for all devices
in the database storage group (GDWORAPRD), as shown in Figure 15:
symsg show SG_GDWORAPRD
Figure 15.
Extracting device configuration information from a storage group
With this information it was possible to understand how the production storage was
provisioned. In conjunction with the other information sources supplied by EMC IT
during the discovery phase, the Cork team was able to determine how much storage
was required on the target Symmetrix VMAX, the Oracle ASM disk groups required,
and the number of LUNs in each disk group.
Provisioning storage at the remote site
Provisioning storage at the remote site involved the following steps:
1.
Create the devices on the target Symmetrix VMAX array, as listed in Table 7.
Table 7.
Storage assigned to Symmetrix VMAX array at remote site
ASM disk group
2.
Number of devices
Device size (MB)
GDWP_TEMP
32
32,263
GDWP_ARCH
16
64,528
GDWP_REDO01
16
8,066
GDWP_REDO02
16
8,066
GDWP_T0_DATA01
16
129,056
GDWP_T1_DATA01
80
129,056
GDWP_T5_DATA01
40
129,056
Total Devices
216
Configure each device with a device identifier label that corresponds to the
name of the ASM disk group to which the device will belong. This label can be
read from the host using the Inquiry utility, which makes device identification
from the host much easier.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
24
Figure 16 shows these labels being created with Solutions Enabler.
Figure 16.
3.
Setting device identifiers with Solutions Enabler
Create a masking view for the devices so that they are visible to the RAC
cluster.
At this point, the storage is ready to be incorporated into the Oracle configuration at
the remote site.
Setting up the
Oracle database
environment
Setting up the Oracle database environment at the remote site involved the following
steps:
1.
Partition and assign the PowerPath devices for use with Oracle ASM.
Figure 17 shows the mapping of the Oracle ASM disks to the Symmetrix
assigned “device_name”.
Figure 17.
2.
Oracle ASM disk mapping to Symmetrix devices
Install Oracle Clusterware and the database binaries.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
25
3.
Configure the ASM instance and create the ASM disk groups listed in Table 8.
Table 8.
ASM disk groups and sizing
Allocation unit
Size
(MB)
(GB)
GDWP_ARCH
1
1,008
GDWP_REDO01
1
126
GDWP_REDO02
1
126
GDWP_T0_DATA01
16
2,016
GDWP_T1_DATA01
16
10,081
GDWP_T5_DATA01
16
5,041
GDWP_TEMP
1
1,008
Disk group
For example, Figure 18 lists the 16 Oracle ASM disks assigned to disk group
GDWP_ARCH.
Figure 18.
Oracle ASM disks assigned to disk group GDWP_ARCH
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
26
Oracle database recovery – remote site
Overview
This section describes the restore and recovery of the Oracle database at the remote
site, as outlined in Figure 19.
Figure 19.
DD660 setup
Database restore and recovery process
Network settings
When the appliance arrived at the remote data center, the IP address had to be
changed to enable connectivity to the remote network. Link aggregation was already
in place, so this was simply a case of changing the IP address of the virtual interface,
as shown in Figure 20.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
27
Figure 20.
Reset the IP address
Unlock file system
The Data Domain file system was locked during transit and could not be enabled
without the passphrase, as indicated in Figure 21. The passphrase was sent to Cork
separately from the DD660 appliance.
Figure 21.
Data Domain file system enable refused—passphrase required
The filesys encryption unlock command allows the user to enter the passphrase to
unlock the file system, as shown in Figure 22.
Figure 22.
Unlocking and enabling the Data Domain file system
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
28
RMAN restore
With the Data Domain file system unlocked, the database was then restored from the
DD660 using RMAN. This process involved the following steps:
1.
Mount the DD660 on the /backup directory, as shown in Figure 23.
Figure 23.
2.
Mounting the DD660 on the remote site server
Create the Oracle database and build the RMAN recovery catalog from the
export file held on the DD660. Figure 24 shows a summary listing of the
database and control file backups from the catalog.
Figure 24.
Summary of database and control file backups
3.
Use RMAN to connect to the target database and the recovery catalog.
4.
Start up the Oracle instance without mounting the database (nomount mode),
as shown in Figure 25.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
29
Figure 25.
5.
Restore the backup control file to the control_file location specified in the
SPFILE, as shown in Figure 26.
Figure 26.
6.
Starting up the Oracle instance in nomount mode
Restoring the backup database control file
Allocate the channels and restore the database, as shown in Figure 27.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
30
Figure 27.
7.
Database restore
Mount and open the database, as shown in Figure 28.
Figure 28.
Mounting and opening the database
In this use case, further recovery was not required as the database had been
backed up in a consistent state, a backup control file was used, and no
further archive logs needed to be applied.
8.
Re-create and size the Oracle temp files for the temporary tablespaces TEMP
and TEMP1. The SQL commands used to do this are shown in Figure 29.
Figure 29.
Re-creating the Oracle temp files
This completed the successful restore and recovery of the database and provided the
first step in validating the integrity of the database.
The total time taken for the restore and recovery operations was 27 hours, 8 minutes.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
31
Verifying the
integrity of the
database
Figure 30 shows a breakdown of the restored database objects and files and their
mappings to the Oracle ASM disk groups.
Figure 30.
Mapping of Oracle database objects to Oracle ASM disk groups
As part of the discovery process, EMC IT supplied the Cork team with Automatic
Workload Repository (AWR) reports from the production database. A number of key
SQL queries in the reports were identified to verify the integrity of the restored
database. At the remote site, each of these queries was extracted from the AWR
reports and run against the restored database.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
32
Figure 31 demonstrates this process for one of the queries. The first item is an extract
from the SQL ordered by Gets section of an AWR report. The second item is an extract
from the AWR report for SQL ID: 0v7knzwvnc9u7. The third item shows the Oracle
TKProf output of the trace file gathered for this SQL query when it was run against the
restored database.
SQL
query ID
SQL
statement
Figure 31.
Testing the integrity of the restored database
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
33
Data Domain consolidation – remote site
Overview
The Solutions Center in Cork uses a shared Data Domain DD880 for consolidated
backup on many projects within the lab. To ensure consistency, and to maintain a
copy of the EMC IT Production DB in its original state, Data Domain Replicator was
used to copy the database image to the DD880 at the end of the project, as shown in
Figure 32. The DD660 was then repurposed to another project.
Cork Solutions Center
EMC Data Domain
DD660
DD Replicator
EMC Data Domain DD880
Cork Solutions Center shared
backup target
Figure 32.
Database consolidation to shared DD880
Data Domain Replicator software provides automated, fast, and reliable replication of
data for disaster recovery (DR), remote office data protection, and multiple site tape
consolidation. Replicator duplicates the compressed, deduplicated data over a
network (including WANs), greatly reducing the demands on the network.
Once replication has been configured between a source and a destination, any new
data written to the source can be automatically replicated to the destination—for the
use case, a one-off copy was made to the DD880 but continued replication was not
required.
Replication
process
Setting up replication between the DD660 and DD880 involves simply creating a
replication pair by specifying the replica source and destination. This can be done
using the DD Enterprise Manager, as shown in Figure 33, or using the DD OS CLI.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
34
Figure 33.
Creating a replication pair
The Replication view in DD Enterprise Manager provides configuration, status,
performance, and topology information for the replication pair. Figure 34 shows this
information for the use case replication process at completion.
Figure 34.
Data Domain Enterprise Manager—Replication view
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
35
Figure 35 shows the timeline for the replication process, which took 23 hours to
complete.
Figure 35.
Data Domain replication timeline
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
36
Conclusion
Summary
This white paper captures and describes the process used to transport a 12 TB Oracle
data warehouse between two EMC data centers and to restore and recover the
database at the target site. The data centers were located on different continents and
over 4,000 kilometers apart.
Data security was a critical requirement. If the transport mechanism was not secure,
then it would not meet EMC corporate governance compliance and the move could
not happen. In addition, as the process involved a real-world data warehouse, the
solution had to be nondisruptive to EMC core production activities.
The large size of the database and the long distance involved meant that:
•
Transfer over a shared WAN link was neither practical nor efficient
•
Creating a dedicated WAN link was neither practical nor economical
In comparison, backing up the data to a Data Domain DD660 appliance, and
transporting this by commercial carrier to the target site, proved to be a secure,
practical, economical, and efficient solution.
At the primary site, EMC TimeFinder enabled offloading of the backup process to a
non-production server, and using an NFS mount allowed simple RMAN backup and
restore on the DD660.
Findings
The key findings of this solution include:
•
Nondisruptive backup
EMC TimeFinder and seamless Data Domain integration with Oracle RMAN
ensured zero impact to the production environment.
•
Security and corporate governance compliance
Data Domain data integrity, Data Domain Encryption, and Data Domain file
system encryption locking secured the system, enabling it to be shipped by
standard commercial transport.
•
Simple restore and recovery
Using an NFS mount allowed simple RMAN backup and restore from disk. Data
Domain technology ensured balanced backup and restore speeds.
The 12 TB data warehouse was available for use within 3 days of arrival in Cork.
•
Economic transport
Data Domain deduplication reduced the database storage and transport
requirements from 12 TB to 3.3 TB—a small form factor DD660 (2U, 60 kg)
minimized shipping costs.
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
37
References
White papers
Product
documentation
Other
documentation
For additional information, see the white papers listed below.
•
EMC IT's Migration to the Open, Expandable Oracle BI Grid — Applied
Technology
•
EMC Backup and Recovery for Oracle Database 11g Data Warehouse — A
Detailed Review
For additional information, see the product documents listed below.
•
EMC DD OS 4.9 Administration Guide
•
EMC Solutions Enabler Symmetrix CLI 7.2 Command Reference
For additional information, see the documents listed below.
•
Oracle Clusterware Administration and Deployment Guide 11g Release 1 (11.1)
•
Oracle Real Application Clusters Administration and Deployment Guide 11g
Release 1 (11.1)
•
Oracle Clusterware Installation Guide 11g Release 1 (11.1) for Linux
•
Oracle Database Installation Guide 11g Release 1 (11.1) for Linux
•
Oracle Real Application Clusters Installation Guide 11g Release 1 (11.1) for
Linux and UNIX
•
Oracle Database Backup and Recovery Reference 11g Release 1 (11.1)
•
Oracle Database Backup and Recovery User's Guide 11g Release 1 (11.1)
Remote Site Recovery of Oracle Enterprise Data Warehouse Using EMC Data Domain
A Detailed Review
38