VM Loses Connection During Snapshot Removal

Challenge

During the snapshot removal step of a Veeam job, the source VM loses connectivity temporarily.

Cause

Veeam does not remove the snapshot itself, Veeam sends an API call to VMware to have the action performed.

The snapshot removal process significantly lowers the total IOPS that can be delivered by the VM because of additional locks on the VMFS storage due to the increase in metadata updates, as well as the added IOP load of the snapshot removal process itself. In most environments, if you are already over 30-40% IOP load for your target storage, which is not uncommon with a busy SQL/Exchange server, then the snapshot removal process will easily push that into the 80%+ mark and likely much higher. Most storage arrays will see a significant latency penalty once IOP's get into the 80%+ mark which will of course be detrimental to application performance.

Solution

Isolation Test

The following test should be performed during a time when connectivity to the VM is not sensitive, for instance during off peak hours.

To isolate this issue to the specific VMware snapshot removal event, Veeam suggests the following isolation test:

Create a snapshot on the VM in question.
Leave the snapshot on the VM for duration of time that a Veeam job runs against that VM.
Remove the snapshot.
Observe the VM during the snapshot removal.

If while performing the test above you observe the same connectivity issues as during the Veeam job run, the issue very likely exists within the VMware environment itself. Please review the following list of troubleshooting steps and known issues. If none of the following work to resolve the issue, we advise that you contact VMware support directly regarding the snapshot removal issue.

Common Troubleshooting Tasks

Check the VM for snapshots while no job is running and remove any that are found.
Check for orphaned snapshots on the VM. (See: http://kb.vmware.com/kb/1005049)
Reduce the number of concurrent tasks that are occurring within Veeam, this will in turn reduce the number of active snapshot tasks on the datastores.
Move VM to a datastore with more available IOPS, or split the disks of the VM up into multiple datastores to more evenly spread the load.
If the VMs CPU resources spike heavily during Snapshot consolidation, consider increasing the CPU reservation for that VM.
Ensure you are on the latest build of your current version of vSphere, hypervisors, VMware Tools and SAN firmware when applicable.
Move VM to a host with more available resources.
If possible, change the time of day that the VM gets backed up or replicated to a time when the least storage activity occurs.
Use a workingDir to redirect Snapshots to a different datastore than the one the VM resides on. http://kb.vmware.com/kb/1002929
Disable VMware Tools Sync driver on the VM: http://kb.vmware.com/kb/1009886

Known Issue with NFS 3.0 Datastores

Note: This issue will present as multiple minutes worth of stun. Normal snapshot stun is only mere seconds.

At the time of the writing of this KB there is a known issue with NFS Datastores and Virtual Appliance (HOTADD) transport mode. The issue is documented in this VMware KB article: http://kb.vmware.com/kb/2010953

Veeam advises that if this issue occurs one of three things can be done to work around this:

1. Use Direct NFS Mode

More details available here:
https://helpcenter.veeam.com/docs/backup/vsphere/direct_nfs_access.html

2. Configure Veeam Backup & Replication to use a Proxy on the same host as the VM

Note: This option is only available in versions newwer than 7.0.0.839

Create a VM proxy on every host in your VMware cluster where backups occur
On the server where the Veeam Backup & Replication console is installed open ‘Registry Editor’
Create the following registry value in the key below:
HKLM\Software\Veeam\Veeam Backup and Replication\
Type: DWORD
Name: EnableSameHostHotaddMode
value:
For the value there are two options, "1" or "2" these options are explained below:
- 1- if proxy on same host as VM is unavailable, Veeam Backup & Replication will fail over to a proxy on a different host and use available transport mode, which may cause stun
- 2 - If proxy on same host as VM is unavailable, Veeam Backup & Replication will use an available proxy on a different host, but force it to use network transport mode, so that no stun occurs; this may be preferable when stun is not tolerable.

Note: Both values 1 or 2 will enable the SameHostHotaddMode, which forces Veeam B&R to first attempt to use the Proxy that is on the same host as the VM to be backed up.

3. Reconfigure proxies to use Network Transport mode.

Edit the proxies listed under [Backup Infrastructure]>[Backups Proxies]
Click the [Choose] button next to “Transport mode”
Select the radio option for “Network” mode
Click [OK] to close the prompt and then [Finish] to commit the change.

More Information

The following links are provided to help understand more about the Snapshot Removal process within VMware:

http://up2v.nl/2011/05/09/know-the-performance-impact-of-snapshots-used-for-backup/

http://kb.vmware.com/kb/1002836

VM Loses Connection During Snapshot Removal

Challenge

Cause

Solution

Isolation Test

Common Troubleshooting Tasks

Known Issue with NFS 3.0 Datastores

More Information

Trending Articles

Scuffham Amps - S-GEAR 2.6.0 VST, AAX, STANDALONE x86 x64 (R2R NO iLok2, +NO...

Practice Sheet of Right form of verbs for HSC Students

VHSE First (1st) Allotment 2025 - vhscap.kerala.gov.in

UNIVERSE LEAGUE – UNIVERSE LEAGUE – WAR (We Are Ready) – EP [iTunes Plus M4A]

City Hunter Teledrama – Episode 18 – 07th May 2016

Comment on Proposed Criteria for Identifying Predatory Conferences by Luke...

Bureau of Internal Revenue: Regional Offices (Directory)

Kendrick Lamar – Not Like Us (2024) [24Bit-88.2kHz] [PMEDIA] ⭐️

Inception 2010 Hindi Dual Audio 650MB BRRip 720p ESubs HEVC

East Hull MD admits sexual assaults after another victim comes forward

Download: FK ft Shenky – Nakuyewa ”Prod by: Shenky”

R. v. Sargeant, 2023 ONSC 6406 (CanLII)

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

Who’s been sentenced at Northampton Magistrates’ Court

मतलबी दोस्त स्टेट्स | Matlabi Dost Status in Hindi – Selfish Friends Status

Family cries out as traditional ruler allegedly abducts brother, extorts N2.5m

Long-Running Conflict In Springfield (MA) Gangland Sphere Has Manzi Family &...

Wondershare Filmora X v10.1.20.16 x64

Man arrested after fracas in flat

Man charged in ongoing Sexual Assault Investigation Derek Nyilas, 46, Faces...