Remove $EGGNOG_DBMEM from eggnog env vars by cat-bro · Pull Request #104 · galaxyproject/tpv-shared-database

cat-bro · 2026-01-30T11:18:08Z

Galaxy Australia gives each eggnog job 32GB of RAM. The presence of `--dbmem` (introduced recently) in the command line causes the db to be loaded into memory. GA has strict OOM killing, and all eggnog jobs since this change have been OOM killed. I’m about to try overriding this in our local config.

cat-bro · 2026-01-30T13:21:28Z

I think this is the right move. After removing the env var on Galaxy Aus the jobs are OK. The db file in CVMFS is 40GB+, so even if we allocated enough memory, it would take a long time to load into memory and would add runtime for a lot of jobs.

bgruening · 2026-01-30T14:03:26Z

Should we maybe include a rule so that we use this option depending on the number of sequences? We are using this option because most of the inputs we see have many hundreds or thousands of sequences, and our understanding is that in those cases it dramatically reduces disk I/O and runtime.

bgruening · 2026-01-30T14:04:11Z

Double-checking: are you also using version 5.0.2 of the database?

cat-bro · 2026-01-30T14:45:25Z

> Double-checking: are you also using version 5.0.2 of the database?

Yes

> Should we maybe include a rule so that we use this option depending on the number of sequences? We are using this option because most of the inputs we see have many hundreds or thousands of sequences, and our understanding is that in those cases it dramatically reduces disk I/O and runtime.

Sounds good. We’ll need to give these jobs enough memory for `--dbmem` under this rule.

cat-bro · 2026-01-30T15:06:18Z

Ages ago we were contemplating a sequence check for blast and never got around to using it. This worked, though I’m not sure if pop() is the best operation.

toolshed.g2.bx.psu.edu/repos/devteam/ncbi_blast_plus/ncbi_blastn_wrapper/.*:
    context:
      fasta_sequence_limit: 10000
    rules:
    - if: |
        input_fasta_dataset = job.input_datasets.pop()
        num_sequences = input_fasta_dataset.dataset.metadata.sequences
        num_sequences > fasta_sequence_limit
      fail: "Fasta input exceeds limit of {fasta_sequence_limit}. Email help@genome.edu.au if you think this is in error, or for some advice"

bgruening · 2026-01-30T15:26:08Z

Oh interesting, I didn't know about this use of the context. Yes, let's use that - this is cool!
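
For illustration, the blast sequence check above could be adapted to eggnog roughly as follows. This is only a sketch under several assumptions, not the rule that was merged: the tool ID pattern, the `dbmem_sequence_threshold` name and its value of 1000, the rule `id`, and the 64GB figure are all hypothetical, and it assumes the first input dataset is the FASTA file and exposes a `sequences` metadata attribute. It indexes the input list instead of calling `pop()`, so the list is left unmodified.

```yaml
toolshed.g2.bx.psu.edu/repos/galaxyp/eggnog_mapper/eggnog_mapper/.*:
    context:
      # Hypothetical cut-off: below this, --dbmem is not set and the db is read from disk.
      dbmem_sequence_threshold: 1000
    rules:
    - id: eggnog_dbmem_large_input
      if: |
        # Assumes the first input is the FASTA dataset; [0] does not mutate the list.
        input_fasta_dataset = job.input_datasets[0]
        num_sequences = input_fasta_dataset.dataset.metadata.sequences
        num_sequences is not None and num_sequences > dbmem_sequence_threshold
      # Assumed figure: has to cover the 40GB+ database plus working memory.
      mem: 64
      env:
        EGGNOG_DBMEM: --dbmem
```

With a rule like this, small inputs would keep the default allocation and skip the load-into-memory cost, while large inputs would get both `--dbmem` and enough memory to hold the database. Sites with strict OOM killing would still need to confirm the memory figure against their own copy of the database.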

cat-bro requested a review from bgruening January 30, 2026 13:21

cat-bro wants to merge 1 commit into main from cat-bro-patch-5
