We are currently investigating this issue.
We have re-established a fresh cluster architecture. Our monitoring shows that the issue is gone and confirms that it was caused by the failing nodes.
We are continuing to monitor the cluster closely.
We have identified the root cause: the provided nodes were corrupted. We are currently rebuilding the cluster with fresh nodes.
We're working closely with the Public Cloud provider to identify the root cause and deploy fixes.