Solo node became unresponsive and required reboot

max · May 8, 2019, 7:50am

Last week I couldn’t connect to datomic through the bastion and the lambdas timed out. The cloud watch had no new events, and the instance system log had no strangeness as far as I could tell. The EC2 health checks all indicated that everything was ok.

I did a reboot on the instance and had to wait 4 minutes which likely meant that EC2 had to do a forced reboot. When it came up again everything was working as normal.

Have you seen similar issues? Is there a good way to handle it? Can I monitor the Datomic instance in a way that catches errors like this?

marshall · May 28, 2019, 3:25pm

Max,

We have seen a couple of issues that could explain this behavior. They have been corrected in the latest release of Datomic Cloud: https://docs.datomic.com/cloud/releases.html#477-8741

If you upgrade to the latest version and see this behavior again, please let us know and we can look into the logs/system details to determine what might be the underlying cause.

-Marshall

max · November 25, 2019, 7:55am

Today this happened again. The log became completely silent Friday night and I had to reboot to solo node be able to connect again. We’re just about to upgrade to the production topology. I would be happy if you could have a look. What do you need?

marshall · December 11, 2019, 3:26pm

Max,

You can file a support ticket at support.cognitect.com and we can help investigate.

Topic		Replies	Views
Busy anomaly Datomic Cloud	3	767	May 6, 2019
Missing data when busy, then data reappears when not busy Datomic Cloud	2	703	May 28, 2019
Failed to get catalog items Datomic Cloud	5	671	April 24, 2020
Stuck with connecting to Datomic Ions during "Getting Started" Datomic Cloud	8	1753	January 14, 2019
Datomic Cloud Solo subscription/install failing Datomic Cloud	8	1353	December 21, 2018

Solo node became unresponsive and required reboot

Related topics