Excision is very slow for huge number of entities

nikolayandr · June 16, 2021, 1:36pm

Hi, my task is to determine all entities which were existing before but are missing in the current version of DB and finally say goodbye to all of them. For example I have 1 million of such entities, all of them are mapped into list of objects made from template {:db/excise e}. Then this list of one million items is passed to transactor. What I see that only one CPU of 8 is used. Are there any ways to make the process faster, 7 hours went, the excision process is still running.

P.S. When I tried to excise the whole DB passing all attributes it took only 1.5 hour. Yes it means to delete all DB, but anyway why partial excision tooks more time then excision of whole db

P.P.S. The DB under datomic is H2

What I’m doing in words of code:

(def get-deleted
		(d/q '[:find [?e ...]
		:in $ $hdb
		:where
		[$hdb ?e :IModel/uid ]
		(not [?e :IModel/uid])
		]
		(d/db conn) (d/history (d/db conn)) )
		)

(defn makeExciseSt
  [e]
  {:db/excise e})

(def exciseSt (map makeExciseSt get-deleted))

(def trResult (d/transact conn exciseSt))
(def trResultData (.get user/trResult))
(def dbAfter (:db-after user/trResultData))
(def afterBasisT (.basisT dbAfter))
(.get (d/sync-excise conn afterBasisT))
(d/gc-storage conn (java.util.Date.))

Datomic on-prem 1.0.6269

jaret · June 16, 2021, 2:57pm

Hi @nikolayandr

What are you timing? Everything in the code block? Am I correct in understanding you are trying to identify retracted entities and then excise them? Can you explain why? If you retracted these facts why are you interested in excising them?

P.P.S. Then DB under datomic is H2

Additionally, you are using the dev protocol which does not have the same performance characteristics as other production storage protocols. Are you planning on using this same strategy against a production database?

I should point out that excision was designed specifically to meet the following two scenarios and should be an extremely infrequent operation:

removing data for privacy reasons
removing data older than some domain-defined retention period

Excision is lossy. You cannot ask “was this datom in the database prior to an excision?” (that would defeat the purpose of excision.) Excision | Datomic
Excision creates a substantial burden on background indexing jobs in whose execution time is proportional to the size of the entire database, leading to back pressure and reduced write availability.

https://docs.datomic.com/on-prem/reference/excision.html#performance

nikolayandr · June 16, 2021, 3:11pm

Am I correct in understanding you are trying to identify retracted entities and then excise them?

Right

Are you planning on using this same strategy against a production database?

No, dev mode only for local development, postgres for production use

Can you explain why? If you retracted these facts why are you interested in excising them?

Would say a lot of data were created in DB by mistake what has lead to increasing memory requirements, since every megabyte is payable we need to rid of unnecessary data

stu · June 16, 2021, 3:24pm

Hi @nikolayandr , can you double-check that Datomic version on both the transactor and the peer(s)?

Thanks!

nikolayandr · June 16, 2021, 9:44pm

Sorry, my mistake: datomic version is 1.0622, the above example is run inside repl which comes with datomic, so both are on the same version

jaret · June 17, 2021, 11:50am

Sorry, my mistake: datomic version is 1.0622, the above example is run inside repl which comes with datomic, so both are on the same version

@nikolayandr before we delve too deeply, we made a big performance improvement on excision in the latest release:

https://docs.datomic.com/on-prem/changes.html#1.0.6269

Can you upgrade your peer and transactor to the latest and retest?

Cheers,
Jaret

nikolayandr · June 18, 2021, 8:08am

Yeah, it helped to make the excision process much faster, I saw alternation between high CPU usage and disk read(much bigger in comparing to the previous library) but I have faced another issue: 6288656 entities were sent to the excision process, but after the end of the process I still see 6192721 of entities which I have expected to be excised

P.S. I’m trying to do the same actions second time to check the number of excised entities after retrying

P.P.S. Peer has been restarted, also I have executed requestIndex, but judging by CPU usage/disk read/sync-index nothing happened.

P.P.P.S. Is a reindexing the part of the excision process? The doc says:
At some point after the excision request, an indexing job will run.
So when I run sync-excise, does it wait the finish of that indexing job? I just look for a way which give me the answer: everything is done, all entities to be excised are already wiped from everywhere

P.P.P.P.S I have restarted the transactor when the future returned from sync-excise has provided ready db(what was the indication for me that all all activities completed)

nikolayandr · June 18, 2021, 11:51am

An Interesting result I have got after the second run of the excision process. There are still the same number of entities which actually were expected to be excised: 6192721.

Futhermore interesting: I have checked one of entities, and I see that datomic has marked this entity as excised(we see two ids of excising because I did two attempts):

and this entity is still found in history DB:

P.S. The peer restarted, gc storage executed, requestIndex run(but still the same no effect, nothing happened)

jaret · June 21, 2021, 7:11pm

@nikolayandr what are you trying to tell here? I am very confused when you use terms like “nothing happened.” Can you share what you are expecting to have happened when you run excision vs what actually happened?

The screenshots you show seem to show that you are looking at the :db/excise attributes. Per our docs (Excision | Datomic):

Note that the excise attributes themselves are protected from excision, so there is no way to ‘erase your tracks’. Every excision creates a permanent record. This helps preserve a fundamental value proposition of Datomic - it is a database that greatly facilitates your knowing why it is in the state it is in, and how it got there.

Thus excision strikes a delicate balance between forgetting, and remembering that you forgot.

Does that explanation match your observatrion?

nikolayandr · June 22, 2021, 6:48am

@jaret
Right, I’m showing that datomic has marked these entities as excised(what is expected) but actually they continue existing in history DB(what is not expected). I thought that it is necessary to request indexing after ending the excision process but the DB still has these entities(looks like nothing happened after requesting index, no index job has started)

nikolayandr · June 22, 2021, 6:52am

In another words: Is it expected that some number of entities sent to the excision process might be still available in the history DB once the excision process has ended?

nikolayandr · June 23, 2021, 3:41pm

Guys, any thoughts on that?

jaret · June 23, 2021, 7:02pm

Hi @nikolayandr,

I did some quick testing and was unable to see the same results. Would it be possible for you to make a gist of a small reproduction of the behavior you are seeing. I think you have most of the steps in this thread, but it would be nice to see a complete gist with an eye towards formatting the gist as we describe in writing a datomic problem report (i.e. environment, expectation, actual results etc).

More specifically, you say:

actually they continue existing in history DB(what is not expected).

Do you mean the counts or are you actually able to query the data? Is this only in console that you see this? I am hoping you can demonstrate this issue using the REPL.

nikolayandr · June 24, 2021, 6:21am

Do you mean the counts or are you actually able to query the data? Is this only in console that you see this? I am hoping you can demonstrate this issue using the REPL.

I’m able to query data in history DB. The test was done in REPL packed in datomic distribution package.

I did some quick testing and was unable to see the same results.

I starting to think that it might be issue related to our data too. So reproducing can be possible using our DB

nikolayandr · June 24, 2021, 6:44am

Environment : Ubuntu 20.01, Datomic on-prem 1.0.6269
History :

Start clear datomic: ./bin/transactor -Xms20g -Xmx20g /opt/datomic/config/transactor.properties
Restore DB: ./bin/datomic -Xmx1g -Xms1g restore-db “file:” "datomic:dev://localhost:4334/testExc?password=datomic"
Restart transactor
Run the next script in REPL:

(require '[datomic.api :as d]) 
(def conn (d/connect "datomic:dev://localhost:4334/testExc?password=datomic")) 

(def get-deleted
		(d/q '[:find [?e ...]
		:in $ $hdb
		:where
		[$hdb ?e :IModel/uid ]
		(not [?e :IModel/uid])
		]
		(d/db conn) (d/history (d/db conn)))
		)

(defn makeExciseSt
  [e]
  {:db/excise e})

(def exciseSt (map makeExciseSt get-deleted))

(def trResult (d/transact conn exciseSt))
(def trResultData (.get user/trResult))
(def dbAfter (:db-after user/trResultData))
(def afterBasisT (.basisT dbAfter))
# The question: Does sync-excise  wait on finishing index job which is run in result of excision
(.get (d/sync-excise conn afterBasisT))
(d/gc-storage conn (java.util.Date.))

Restart all peers(including REPL too)

Expectation : All entities which were in get-deleted do not existing anymore in datomic
Actual : Some number of entities from get-deleted are still available in history DB.
Evidence : I’m still able to query entities from get-deleted in history DB
Impact : Can’t decrease the size of db to make peers consume less RAM

jaret · June 30, 2021, 1:08pm

@nikolayandr Apologies for not being able to immediately review I plan to look at this today and Friday. I noticed that you had previously provided the backup file. If you want to send that confidentially you can share that with me via support@cognitect.com.

nikolayandr · February 8, 2025, 11:51am

Hi @jaret, Is there any chance that this issue will be addressed in a future update? It’s causing significant inconvenience in our use case and we have this issue for all our clients where datomic database is used. The issue is rerpoduced for dev and sql(postgres) protocols

P.S. The excision process issue still present in the latest datomic version

Thanks in advance for any insights!

nikolayandr · February 9, 2025, 9:09am

On the screen, you can see that 17592206943037 was excised(the left window proves that excision was executed),

but I can still find data in the history db(the right window on the screen), whereas Excision | Datomic states:

Excision is lossy. Given a datom, you cannot ask "Was this datom in the database before an excision?" If you could ask that question, then the datom is still present in some sense, which defeats the purpose of excision.

P.S. Sync excise, GC storage, and Postgres full vacuum were executed before I did the screenshot

jaret · February 9, 2025, 1:32pm

Hi @jaret, Is there any chance that this issue will be addressed in a future update? It’s causing significant inconvenience in our use case and we have this issue for all our clients where datomic database is used. The issue is rerpoduced for dev and sql(postgres) protocols

P.S. The excision process issue still present in the latest datomic version

Thanks in advance for any insights!

Can you provide full details on what you are doing with the exact excision you are running?

Excision is much more performant than in previous versions of Datomic, but there is certainly overhead involved in excising datoms, rebuilding indexes etc. I would like to know the size of your machines, transactor properties, review metrics in the dashboard to understand the performance impact you are seeing.

On the screen, you can see that 17592206943037 was excised(the left window proves that excision was executed),

Can you close/kill your console, re-open and view the same entity?

If you can, I am interested in a full reproduction and will happily chat on a call or collect info via support ticket to understand what you are observing here. For a support ticket you can e-mail support@datomic.com or go to the support portal and create a ticket: https://support.cognitect.com/hc/en-us/requests/new

nikolayandr · February 10, 2025, 9:56am

Hi Jaret, sure, I will create a support ticket, it is a good idea, I will do it once I get approval to send the customer’s production db, but I will give answers to some of your questions here:

Can you provide full details on what you are doing with the exact excision you are running?

you can see it here Excision is very slow for huge number of entities - #15 by nikolayandr, but for the current case, I will provide a slightly different script in the support ticket

I would like to know the size of your machines, transactor properties, review metrics in the dashboard to understand the performance impact you are seeing.

I will provide this information in the support ticket, but you can see which env was used for the case described earlier by checking for the URL mentioned above

Can you close/kill your console, re-open and view the same entity?

It was already done before I did these screenshots

Topic		Replies	Views
Entities Excision doesn't take effect Peer API	1	1071	May 8, 2020
Entities Excision doesn’t take effect Troubleshooting	1	1110	May 8, 2020
Datomic entity-api on large amount of entities Datomic Pro	6	2453	March 2, 2018
Does excision count towards the 1 billion datom limit? Datomic Pro	4	1029	August 20, 2019
Slow query performance (200ms+) Datomic Pro	2	2981	June 19, 2018

Excision is very slow for huge number of entities

Related topics