vCSA /storage/log Running Full

June 1, 2021 By Allan Kjaer

I have been troubleshooting a problem for a customer where /storage/log on the vCSA kept running full.

The cause was a single file, /storage/log/vmware/vmware-sps/sps-runtime.log.stderr, which was growing very large.
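If it is not yet clear which file is filling the partition, a short script can rank the files by size. This is only a sketch, assuming a Python 3 interpreter is available on the appliance; the function name `largest_files` is made up for illustration.

```python
import os

def largest_files(root, top=10):
    """Return up to `top` (size_in_bytes, path) pairs under root, largest first."""
    sizes = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            path = os.path.join(dirpath, name)
            try:
                sizes.append((os.path.getsize(path), path))
            except OSError:
                # The file was rotated or removed while scanning; skip it.
                continue
    return sorted(sizes, reverse=True)[:top]

if __name__ == "__main__":
    # /storage/log is the partition discussed in this post.
    for size, path in largest_files("/storage/log"):
        print(f"{size / 1024**2:10.1f} MiB  {path}")
```

In my case a listing like this would have had sps-runtime.log.stderr at the top.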

In the log file I found that vmware-sps (the Storage Profile Service) was having problems. The errors looked like this:

May 31, 2021 7:18:02 AM org.bouncycastle.jsse.provider.ProvTlsClient notifyAlertRaised
 INFO: Client raised fatal(2) certificate_unknown(46) alert: Failed to read record
 org.bouncycastle.tls.TlsFatalAlert: certificate_unknown(46)
Caused by: Certification path could not be validated.   
Caused by: Unable to construct a valid chain
Caused by: Could not validate certificate: certificate expired on 20200917181137GMT+00:00

I filtered on the last line and saw that the expiry date differed between entries. That indicated that the service was communicating with multiple endpoints, which must be the ESXi hosts.
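The filter on the "certificate expired on" line can be sketched in Python as below. It assumes the timestamp always has the form shown in the excerpt above (14 digits followed by GMT+00:00) and treats it as UTC; the function name `expiry_counts` is made up for illustration.

```python
import re
import sys
from collections import Counter
from datetime import datetime, timezone

# Matches the timestamp in lines such as:
#   Caused by: Could not validate certificate: certificate expired on 20200917181137GMT+00:00
EXPIRED_RE = re.compile(r"certificate expired on (\d{14})GMT")

def expiry_counts(lines):
    """Count how often each distinct certificate expiry time appears."""
    counts = Counter()
    for line in lines:
        match = EXPIRED_RE.search(line)
        if match:
            when = datetime.strptime(match.group(1), "%Y%m%d%H%M%S")
            counts[when.replace(tzinfo=timezone.utc)] += 1
    return counts

if __name__ == "__main__":
    # Pass the log file(s) to scan as command-line arguments.
    for path in sys.argv[1:]:
        with open(path, errors="replace") as fh:
            for when, n in sorted(expiry_counts(fh).items()):
                print(f"{when:%Y-%m-%d %H:%M:%S} UTC  {n} occurrences")
```

Run it with /storage/log/vmware/vmware-sps/sps-runtime.log.stderr as the argument. More than one distinct expiry time suggests that more than one certificate, and so probably more than one host, is involved.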

So I took a look at some of the hosts in vCenter and found that some of them had expired certificates. I tried to renew the certificates from vCenter, but that did not work, because I had earlier followed this blog post: vCenter 6.7 Update 3, Unable to add Host. So I had to change the setting back to its original value of “vmca”, and then I could renew the certificates on the hosts.

vpxd.certmgmt.mode = vmca

After the certificates had been renewed on all hosts where they had expired, the file stopped growing.

UPDATED June 7, 2021:

The above was only one part of the problem; there was a second problem as well.

The hosts that were added while vCenter was using “thumbprint” instead of “vmca” were using a custom certificate that the vmware-sps service could not validate, so they also filled up the log file.

To find which hosts had an expired or custom certificate, I used a script from William Lam that can be found through this blog post: Is vCenter Server & ESXi hosts using VMware Certificate Authority (VMCA) or custom CA certificates?

The result of the script looked like this.

To fix the problem I ran “Renew Certificate” and “Refresh CA Certificates” on all the hosts with expired or “custom” certificates.
