DP 10.30 / Service Guard / Huge restore of 52TB fails randomly.


A large restore totaling 52 TB needs to be performed. The restore starts normally, but at some point it crashes.

The failure point varies: it can happen at 14 TB, at 3 TB, or even earlier.

DP 10.30 running on RHEL 7 with Service Guard.

Solution

---Investigation---

While investigating the problem, we saw that the cluster was switching from one node to the other at arbitrary moments.

omnisv reported a false negative, saying that HPDP-AS was down when it was not.
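If you want to record when omnisv reports HPDP-AS as down, so the timestamps can be compared with the cluster failovers, a small poller along these lines may help. This is only a sketch, not part of the original investigation. It assumes that `omnisv -status` is on the PATH of the user running it, that the service appears on a line containing `hpdp-as`, and that a healthy service line contains the word `Active`. The function names, the log file name, and the polling interval are hypothetical; adjust them to what your system actually prints.

```python
import subprocess
import time
from datetime import datetime


def service_reported_down(status_text, service="hpdp-as", up_word="active"):
    """Return True if any line naming `service` lacks `up_word`.

    Assumption: a healthy service line contains the word "Active".
    If the service name does not appear at all, this returns False.
    """
    for line in status_text.splitlines():
        lowered = line.lower()
        if service.lower() in lowered and up_word.lower() not in lowered:
            return True
    return False


def poll_status(command=("omnisv", "-status"), interval_s=30,
                log_path="omnisv_poll.log"):
    """Run the status command in a loop and append a timestamped verdict.

    Assumption: `command` is how the status is queried on this system.
    Stop the loop with Ctrl+C.
    """
    while True:
        result = subprocess.run(command, capture_output=True, text=True)
        verdict = "DOWN" if service_reported_down(result.stdout) else "ok"
        with open(log_path, "a") as log:
            log.write(
                f"{datetime.now().isoformat()} "
                f"hpdp-as={verdict} rc={result.returncode}\n"
            )
        time.sleep(interval_s)
```

Calling `poll_status()` on the active node while the restore runs produces one log line per interval. A `DOWN` entry shortly before a failover would support the false-negative theory.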

 

---Solution---

This incorrect status causes Service Guard to fail over to the other node, which crashes the restore.

As a workaround, we disabled Service Guard and kept only one node running. With that setup, the restore worked with no issues.

 

Note: This is a temporary solution. The incorrect omnisv output is still under investigation, but this workaround can help if you need to run a restore and see the same behavior.


Version history
Revision #:
1 of 1
Last update:
‎2020-04-03 18:13