Challenge
When backing up to a deduplication storage appliance from Veeam Backup & Replication, performance or deduplication ratio is low without using best practice settings.
Solution
The default options of Veeam Backup & Replication are intended for non-deduplication storage.
If you use deduplication storage systems, you need to additionally configure the backup architecture and Veeam settings to achieve best performance. Below are some hardware and software configuration options that can be used with deduplication storages. In some cases, you may need to combine more than one option, but in all cases, it is strongly encouraged that you consult your deduplication appliance vendor and/or a value added reseller to discuss individual needs.
For the general architecture planning, Veeam’s best practices is to follow the industry 3-2-1 backup rule:
3: Maintain at least three copies of your data and applications. That’s the one copy you’re using and two backups. This way, if one of your backups is unavailable for any reason (for example, if you store the backup in the same locations with the copy you use and both go down), you can still recover what you need.
2: Store your backups on at least two different types of media or storage controller logics. One reason for this is that each type of media has its own vulnerabilities, and you don’t want both of your backups susceptible to the same problem.
For example: run your business on a general purpose fast storage system and store your backups on a storage system that is specialized for this purpose, or a separate server with local disks.
1: Keep one of the backups in a different location. Consider a crisis in your primary data center, such as a fire or power failure. If all your copies are collocated, they can all be affected, taking down your organization.
To achieve this, use the Veeam best practices. However, each deduplication storage vendor has specific unique feature sets that allow the usage of their storages in different ways. Please check the following links for vendor specific guidelines:
- Best Practices for HPE
- Best Practices for ExaGrid
- Best Practices for Dell EMC DataDomain
- Best Practices for Quantum
General Deduplication Storage Best Practices
Step 1: Creating backups on the first site with long term retention.
Use a backup target storage system (general purpose storage system) for short-term primary backups and instruct Veeam to copy the backups to a deduplication storage system for long-term retention.A variant of this approach is to use a standard server with battery backed RAID controller and disks to store the primary backups (cache approach) and use backup copy to deduplication storage systems for long-term etention.
Step 2: Add offsite target.
In addition to the above scenarios, you can use the following options to create an offsite copy of your data.
The scenario above reflects general best practices. Please contact your deduplication storage vendor for further guidance and check the vendor links provided above for additional usable scenarios with the specific storage.
Best Practices For Deduplication Storage Systems
Backup Chains
Veeam offers a wide range of backup chains but for deduplication storages the following are recommended:
- General Deduplication Storages / ExaGrid / Quantum DXI + Backup Copy Job That Reads From Source Backup Chain:
- HPE StoreOnce Catalyst / Dell EMC DataDomain DDboost + Copy Over WAN + Backup Copy Job:
The previous described option provides fast processing in situations that do not involve transport over WAN links. Also, optimized integration with Catalyst and Dell EMC DDBoost allows using synthetic processing. For these environments uncheck the “Read the entire restore point from source backup instead of synthesizing it from increments” but leave the weekly GFS backup enabled.
- Special Scenarios Based on Vendor Own Guidelines + Backup Job With Active Full or Synthetic Full Backup:
Inline Deduplication
Disable inline deduplication setting when writing into deduplication storages.However for deduplication devices that have a non deduplicated landing zone (Like ExaGrid) and backup only VMs, you can save some space: to do this, enable inline deduplication
Compression
By default, the Veeam source DataMover (Veeam Proxy or Agent) compression level is set to “optimal” to reduce data that needs to be transported over the network. The repository or the gateway server uncompress the data before storing the data to the deduplication device.For backup copy jobs, set the comression level to “Auto” to leave the data in the way it was stored originally on the primary backup target.
For deduplication devices that have a non deduplicated landing zone (Like ExaGrid) and backup only VMs, you can save some space: to do this, set the compression level to “Dedup Friendly” and uncheck the “Decompres backup data blocks before storing” option.
The general best practices are:
Block Size
Veeam can use different block sizes to store backups on the target. The block size is defined in the backup job settings and will be kept across all copies. Non-VM backups settings are set to “Local Target” automatically, and the size can not be modified.Block size changes are only activated by an active full and will only be used for the new backup chains (the data stored already are not touched).
Only 2 of the block size settings are recommended for deduplication storage usage at VM backups:
- Storage optimization: Local target (large blocks)
- Storage optimization: Local target
Use this option for ExaGrid.