We are using Datomic to store time-series data. We get the data hourly.
We have around ~40 billion datoms.
When we query, we typically load the data for the last ~5 days. We have defined indexes on the relevant attributes and use date-range-based queries, roughly like the sketch below.
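A rough sketch of the shape of our queries (the attribute names here are made up for illustration; ours differ):

```clojure
(require '[datomic.api :as d])

;; Hypothetical attributes :reading/timestamp and :reading/value stand in
;; for our real schema; the query shape is what matters.
(defn readings-in-range
  "Return [entity timestamp value] tuples whose :reading/timestamp falls
   in [start, end). Datomic's built-in comparison predicates in query
   work across comparable values, including instants."
  [db start end]
  (d/q '[:find ?e ?ts ?v
         :in $ ?start ?end
         :where
         [?e :reading/timestamp ?ts]
         [(<= ?start ?ts)]
         [(< ?ts ?end)]
         [?e :reading/value ?v]]
       db start end))

;; For an attribute with :db/index true, walking the AVET index directly
;; between two values is another option:
;; (d/index-range db :reading/timestamp start end)
```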
At times, some queries take up to ~1 minute. The problem is that the latency is very inconsistent.
Sadly, we have realized Datomic is not optimized for such a huge amount of data.
“Time-series data” is called out in various places as one of the things Datomic is not well suited for. You might want to consider a store that’s more geared to that use case, and potentially use Datomic for ‘projections’ of that data, etc., depending on your needs.
40B datoms doesn’t sound that bad. I believe Nubank and HCA both roll over to a new database every quarter or so when they reach the limit, and there is a crossover technique; see this talk for the details: http://2018.clojure-conj.org/igor-ges/
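To make the crossover idea concrete, here is one rough way to picture it; this is purely illustrative and not necessarily the technique the talk describes:

```clojure
(require '[datomic.api :as d])

;; Illustrative only: while the most recent days of data straddle the old
;; and new databases, run the same range query against both database
;; values and merge the result sets.
(defn q-across-rollover
  [query old-db new-db & args]
  (into (set (apply d/q query old-db args))
        (apply d/q query new-db args)))

;; e.g. (q-across-rollover range-query (d/db old-conn) (d/db new-conn) start end)
```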
Thanks Dustin. The video was helpful; it mentions sharding and rollover. I had watched the Nubank presentation on InfoQ, and they also mention sharding.
Is there any way we can identify how many trips are made to storage to complete a query?
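Ideally something per query, e.g. along the lines of the io-stats support mentioned for newer Datomic releases. A rough sketch of what I have in mind (the exact call shape and result keys are assumptions on my part, not verified against the docs):

```clojure
(require '[datomic.api :as d])

;; ASSUMPTION: modeled on the io-stats feature in recent Datomic releases;
;; the exact API shape and result keys may differ, so treat as pseudocode.
(defn count-in-range-with-io-stats
  [db start end]
  (let [{:keys [ret io-stats]}
        (d/query {:query      '[:find (count ?e)
                                :in $ ?start ?end
                                :where
                                [?e :reading/timestamp ?ts]
                                [(<= ?start ?ts)]
                                [(< ?ts ?end)]]
                  :args       [db start end]
                  :io-context :app/last-5-days})]
    ;; io-stats should break reads down by source (object cache vs. storage),
    ;; which is essentially "how many trips to storage did this query make?"
    (println io-stats)
    ret))
```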
Thanks