Skip to content

VCF 9 Management Domain Deployment hangs within VCF Ops Collector (Cloud Proxy) #46

@freddyblue

Description

@freddyblue

Describe the bug

Task in VCF Installer:
Retrieve the status for VCF Operations with VCF Operations collector Deployment request
Error message:
Request createenvironment failed with error cause [{"messageId":"LCMVROPSSYSTEM29002","message":"Operations collector group operation failure","eventId":"15e218b6-168f-4256-9d2b-7b120af1d42a","retry":true,"exceptionMessage":"Wait timeout, Cloud Proxy node didn't join the Operations cluster, retry after sometime.","exceptionStackTrace":"com.vmware.vrealize.lcm.plugin.common.vrops.exceptions.CollectorGroupOperationsException: Wait timeout, Cloud Proxy node didn't join the Operations cluster, retry after sometime.\n\tat com.vmware.vrealize.lcm.plugin.core.vrops.tasks.VropsOnPremCpNodeWaitForStartTask.execute(VropsOnPremCpNodeWaitForStartTask.java:67)\n\tat com.vmware.vrealize.lcm.plugin.core.vrops.tasks.VropsOnPremCpNodeWaitForStartTask.retry(VropsOnPremCpNodeWaitForStartTask.java:122)\n\tat com.vmware.vrealize.lcm.automata.core.TaskThread.run(TaskThread.java:60)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source)\n\tat java.base/java.lang.Thread.run(Unknown Source)\n","localizedMessageId":"VROPS_ON_PREM_CP_NODE_WAIT_FOR_START_TASK_00002","parameters":null,"properties":{"deleteVm":"false"}}] Reference Token: F4V4HD

Error message in Holo-Router:
27-09-2025 03:06:45 SddcMgmtDomain[64348]: [INFO] Current task in progress: Retrieve the status for VCF Operations with VCF Operations collector Deployment request
27-09-2025 03:06:45 SddcMgmtDomain[64348]: [INFO] Management Domain is not ready yet. Sleeping for 5 mins
27-09-2025 03:11:45 SddcMgmtDomain[64348]: [ERROR] Management Domain deployment failed. Check the logs below for more details
27-09-2025 03:11:45 SddcMgmtDomain[64348]: [ERROR] @{name=Retrieve the status for VCF Operations with VCF Operations collector Deployment request; description=Retrieve the status for VCF Operations with VCF Operations collector Deployment request; status=COMPLETED_WITH_FAILURE; creationTimestamp=09/27/2025 01:41:49; updateTimestamp=09/27/2025 03:10:42; errors=System.Object[]}
27-09-2025 03:11:45 SddcMgmtDomain[64348]: [INFO] Retrying Management Domain deployment
27-09-2025 03:11:46 SddcMgmtDomain[64348]: [INFO] Management Domain is not ready yet. Sleeping for 5 mins

..... after six retrys, script stops

27-09-2025 04:31:52 SddcMgmtDomain[64348]: [INFO] Management Domain is not ready yet. Sleeping for 5 mins
27-09-2025 04:36:52 SddcMgmtDomain[64348]: [INFO] Current task in progress: Retrieve the status for VCF Operations with VCF Operations collector Deployment request
27-09-2025 04:36:52 SddcMgmtDomain[64348]: [INFO] Management Domain is not ready yet. Sleeping for 5 mins
27-09-2025 04:41:52 SddcMgmtDomain[64348]: [ERROR] Management Domain deployment failed. Check the logs below for more details
27-09-2025 04:41:52 SddcMgmtDomain[64348]: [ERROR] @{name=Retrieve the status for VCF Operations with VCF Operations collector Deployment request; description=Retrieve the status for VCF Operations with VCF Operations collector Deployment request; status=COMPLETED_WITH_FAILURE; creationTimestamp=09/27/2025 03:11:55; updateTimestamp=09/27/2025 04:40:49; errors=System.Object[]}
27-09-2025 04:41:52 SddcMgmtDomain[64348]: [ERROR] SDDC Deployment fails after 5 retries.
27-09-2025 04:41:52 SddcMgmtDomain[64348]: [ERROR] SDDC Deployment fails after 5 retries.
Exception: Exiting

Reproduction steps

1.PS /root> New-HoloDeckInstance -version "9.0" -InstanceID 'HOLOH1v9' -vsanmode "ESA" -managementonly -depottype "Offline"
2.
3.
...

Expected behavior

deployment of VCF9 MD without problems

Additional context

Offline Depot ODA works fine

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions