The Issue? A host in the VCF Management Domain Failed
Once the host has been removed from the inventory, perform your maintenance tasks to resolve the underlying issue. This step will also work for brand new hosts.
Populate the details relevant to the host you are adding, selecting the correct network pool, then click on add.
There are various reasons a host could fail unexpectedly, and hopefully you will not have to use this method in your environment. It is always preferred to remove hosts non-forcefully, however in this article I have demonstrated an approach that works for me. As mentioned in the article, perform some of these tasks at your own risk.
How to forcefully remove the failed host?
Step 1: Clean up the failed host
Select the check mark next to confirm fingerprint and then click on validate all.
To add the host back into the Management Domain, you must first obtain the hosts ID. Using the bearer token generated in the previous step, issue the GET command in the images below to get identify the host. The host ID is labelled “id”.
Related
The host will display as a standalone and disconnected host in NSX-T, it is now safe to remove it from there as well.
The following section will demonstrate the method I used to remove the host from my inventory.
Step 3: Add the host to the Management Domain
After the successful validation of the host, change the method from POST to PATCH, and modify the URI to what is shown below, changing the domain and cluster details to suit your environment.
Step 3a: Generate a bearer token
The code below can be used to generate a bearer token, change the details to suit your environment.
The image below shows the validation was successful.
Step 3b: Find Unassigned Usable Hosts
In this post I discuss the method I used to rebuild a failed VCF managed host. The host is already unresponsive, therefore, removing the host from the SDDC Manager inventory will not work. The images below show the process of attempting to remove the failed host through SDDC Manager.
Step 4: Add the host to the Management Domain
Make sure you take a snapshot before making any database changes, performing database changes may leave you in an unsupported state.
The example JSON below can be used to add the host identified in the previous step into the Management Domain. Update the details relevant for your environment; use the id obtained from the earlier step, the vDS name that the current hosts are attached to, and the vmnics you wish to use.
For some unknown reason, after attempting to remove the host, it remained in the cluster. Nothing at all happened. I had a quick look at the lcm, operationsmanager and domainmanager log files, and couldn’t see anything that stuck out.
This section will highlight the procedure to add the host back into the Management Domain using API. The reason I have used API is due to the host using custom NICs. Ordinarily, you would use vmnic0 and vmnic1, however in this case the host will need to use vmnic4 and vmnic5. The following steps detail this approach.