pyCoGAPS gets stuck on Sampling Phase #77

@pikapika505

I've been trying to run pyCoGAPS on a dataset with ~10K cells and ~16K genes. The job runs on an HPC server with 12 CPUs allocated to it. The run is distributed, with nSets = 8. Here is the script I use:

if __name__ == "__main__":
    import time  # needed for the time.time() calls below

    from PyCoGAPS.parameters import *
    from PyCoGAPS.pycogaps_main import CoGAPS
    import scanpy as sc

    dog_adata = sc.read('dog_adataCoGAPS.h5ad')

    params = CoParams(adata=dog_adata) 
    setParams(params, {
        'nIterations': 10000,
        'seed': 42,
        'nPatterns': 20,
        'useSparseOptimization': True,
        'distributed': "genome-wide"
    })

    params.setDistributedParams(nSets=8)

    start = time.time()
    result = CoGAPS(dog_adata, params)
    end = time.time()

    print("TIME:", end - start)
    result.write("dogOS1_cogaps_result20.h5ad")
    # To save the result as .csv files, use the following line:
    result.write_csvs(dirname='./cogaps20', skip_data=True, sep=',')

It runs for approximately 10 hours and then never produces any output. The last messages from the sampling phase look like this:

10000 of 10000, Atoms: 22414(A), 132246(P), ChiSq: 110973088, Time: 09:30:20 / 09:30:21
worker 1 is finished! Time: 09:30:22
10000 of 10000, Atoms: 22292(A), 133690(P), ChiSq: 109608360, Time: 09:34:45 / 09:34:46
worker 1 is finished! Time: 09:34:48
8000 of 10000, Atoms: 23256(A), 133080(P), ChiSq: 115623760, Time: 09:37:37 / 10:49:31
10000 of 10000, Atoms: 22121(A), 132537(P), ChiSq: 109911624, Time: 09:46:17 / 09:46:18
worker 1 is finished! Time: 09:46:20
10000 of 10000, Atoms: 22939(A), 134133(P), ChiSq: 112030752, Time: 09:49:59 / 09:50:00
worker 1 is finished! Time: 09:50:02
9000 of 10000, Atoms: 22402(A), 136417(P), ChiSq: 109871584, Time: 09:50:50 / 10:25:34
9000 of 10000, Atoms: 23230(A), 133072(P), ChiSq: 115623528, Time: 10:09:16 / 10:45:05
10000 of 10000, Atoms: 22432(A), 136955(P), ChiSq: 109873688, Time: 10:21:06 / 10:21:08
worker 1 is finished! Time: 10:21:09
10000 of 10000, Atoms: 23300(A), 133025(P), ChiSq: 115627600, Time: 10:40:29 / 10:40:31
worker 1 is finished! Time: 10:40:31
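
To check whether every set actually reported completion before the hang, the log can be summarized with a few lines of Python. This is a minimal sketch, not part of pyCoGAPS: the function name `summarize_cogaps_log` and the returned keys are made up for illustration, and the regular expressions assume the log lines look exactly like the ones above. Every worker prints itself as "worker 1", so the messages can only be counted, not attributed to a specific set.

```python
import re

# Patterns assume the line format shown in the log excerpt above.
FINISHED_RE = re.compile(r"worker \d+ is finished! Time: (\d{2}:\d{2}:\d{2})")
PROGRESS_RE = re.compile(r"^\s*(\d+) of (\d+), Atoms:", re.MULTILINE)


def summarize_cogaps_log(log_text, n_sets):
    """Count completion messages in a captured log (hypothetical helper)."""
    finished_times = FINISHED_RE.findall(log_text)
    # Progress lines where the iteration counter reached its total.
    completed = sum(
        1 for done, total in PROGRESS_RE.findall(log_text) if done == total
    )
    return {
        "finished_messages": len(finished_times),
        "completed_progress_lines": completed,
        # Zero-padded HH:MM:SS strings sort correctly as plain text.
        "last_finished_at": max(finished_times) if finished_times else None,
        "all_sets_reported": len(finished_times) >= n_sets,
    }
```

Run on the full log with `n_sets=8`, this shows whether eight "finished" messages were printed. The excerpt above contains six of them, the last at 10:40:31, and no progress line in the excerpt is left without a later "10000 of 10000" line.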

I ran it twice, and both times it got stuck at exactly this point.
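
One way to find out where the process is sitting once the workers have finished is Python's standard `faulthandler` module, which can dump the Python traceback of every thread at a fixed interval. The sketch below is only a diagnostic aid under stated assumptions: the helper names `enable_hang_diagnostics` and `disable_hang_diagnostics` are hypothetical, and the dump only covers the process in which it is enabled, so if the distributed workers are separate processes, only the parent is traced.

```python
import faulthandler
import sys


def enable_hang_diagnostics(interval_seconds=1800, stream=None):
    """Dump all thread tracebacks every `interval_seconds` (hypothetical helper)."""
    if stream is None:
        stream = sys.stderr
    # Also report fatal signals such as segfaults in native code.
    faulthandler.enable(file=stream)
    # repeat=True keeps dumping until cancelled, so a hang shows up
    # as the same stack printed again and again.
    faulthandler.dump_traceback_later(interval_seconds, repeat=True, file=stream)


def disable_hang_diagnostics():
    """Stop the periodic dumps once the run has completed."""
    faulthandler.cancel_dump_traceback_later()
```

Calling `enable_hang_diagnostics()` just before `CoGAPS(dog_adata, params)` and `disable_hang_diagnostics()` after it returns would leave a periodic stack trace in the job's stderr file. If the trace stops changing after the last "finished" message, it shows which Python call the parent process is blocked in.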
