data import script #203

cooperlab · 2025-05-13T03:50:10Z

Adds a command-line script for data import or upload and documentation of data formats.

cooperlab · 2025-05-13T03:53:02Z

@abs711 update the pixelmap_annotation function to capture the additional information needed by the platform.

andsild · 2025-05-13T16:04:21Z

README.rst

+whole-slide image (various formats)
+    Any format that is supported by `large image <https://girder.github.io/large_image/formats.html>`_ can be used.
+feature (.h5)
+    This file contains a single array where each row is a feature embedding for the object. A single blank row should be prepended if the image contains non-object background pixels.


instead of "the object" we could say "each superpixel"

also for the rest of the paragraph

cooperlab · 2025-05-13T17:35:00Z

@abs711 we need to check a few things:
-Are bounding boxes defined at the pixelmap image resolution, or the wsi resolution?
-Do we need a dummy bounding box for the background?

@andsild see if the superpixelSize is used anywhere after data generation.

abs711 · 2025-05-13T18:03:38Z

@abs711 we need to check a few things: -Are bounding boxes defined at the pixelmap image resolution, or the wsi resolution?

They are defined at wsi resolution:

if str(opts.bounding).lower() not in {'', 'none'}:
    regions = skimage.measure.regionprops(1 + segments)
    for _pidx, props in enumerate(regions):
        by0, bx0, by1, bx1 = props.bbox
        bboxes.append((
            ((bx0 + bx1) / 2 + tx0) * scale + x0,
            ((by0 + by1) / 2 + ty0) * scale + y0,
            (bx1 - bx0) * scale,
            (by1 - by0) * scale))
        bboxesUser.extend([
            (bx0 + tx0) * scale + x0,
            (by0 + ty0) * scale + y0,
            (bx1 + tx0) * scale + x0,
            (by1 + ty0) * scale + y0,
        ])

andsild · 2025-05-13T18:10:06Z

@andsild see if the superpixelSize is used anywhere after data generation.

TL;DR nowhere important as far as I can tell.

superpixelSize is one of the parameters defined in the mongoDB database (which is why we struggled with "missing jobId" last week"). It would be fetched here:
https://github.com/girder/slicer_cli_web/blob/master/slicer_cli_web/rest_slicer_cli.py#L568

but seems to only really be used if you ask for heatmaps with predictions from a chain here:
https://github.com/DigitalSlideArchive/superpixel-classification/blob/main/superpixel_classification/SuperpixelClassification/SuperpixelClassificationBase.py#L844

heatmaps are disabled by default for now. So I would say we don't have have to address anything.

Probably not relevant now, not sure about scenarios like:

a user uploads slides through this import job
adds more slides in the UI
asks UI to generate superpixels for new slides

I assume a default superpixelsize will be used for item 3, which may cause a mismatch.

cooperlab · 2025-05-13T18:16:29Z

Great, thank you guys. I will update the README and will make a note of possible issues with superpixel size.

Someone who is uploading data is unlikely to generate additional data through the interface, and so it may be mostly irrelevant.

andsild · 2025-05-13T18:27:08Z

FYI I'm working on integrating my changes to work with AML for this and the superpixel_classification repository.
it should fix problems with background both for UI and in the backend.

abs711 · 2025-05-13T19:12:15Z

@abs711 update the pixelmap_annotation function to capture the additional information needed by the platform.

@manthey Can I have the permissions to push changes?

manthey · 2025-05-14T15:08:41Z

@abs711 update the pixelmap_annotation function to capture the additional information needed by the platform.

@manthey Can I have the permissions to push changes?

You should have an invite

cooperlab · 2025-05-16T21:01:36Z

@manthey wondering how progressCallback can be used with tqdm in GirderClient.uploadFileToFolder?

manthey · 2025-05-19T14:49:02Z

For any of the girder client upload function that take a progress call back, you should be able to do something like:

with tqdm.tqdm(total=0) as pbar:
    def progFunc(prog):
        pbar.total = prog['total']
        pbar.n = prog['current']
        pbar.update(0)

    gc.uploadFileToFolder(..., progressCallback=progFunc)

or as an obtuse lambda

with tqdm.tqdm(total=0) as pbar:
    gc.uploadFileToFolder(..., progressCallback=lambda prog: (pbar.total = prog['total'], pbar.n = prog['current'], pbar.update(0))

I'm not sure if

cooperlab · 2025-05-20T19:22:11Z

data_import.py

@@ -251,7 +251,7 @@ def main():
    features = [row[1] for row in inputs]
    pixelmaps = [row[2] for row in inputs]
    boxes = [row[3] for row in inputs]
-    scales = [row[4] for row in inputs]
+    scales = [int(row[4]) for row in inputs]


@abs711 Should this be float? I don't think the scales will always be integer.

data import script

7a7710d

cooperlab linked an issue May 13, 2025 that may be closed by this pull request

Import and upload script #201

Open

cooperlab added 2 commits May 13, 2025 00:33

Update README.rst

39754f2

key help

71fa0b2

andsild reviewed May 13, 2025

View reviewed changes

andsild added 2 commits May 13, 2025 11:19

Add pixelmap example

a7a16d3

Add bounding box example

e5a3de3

added attributes to pixelmap_annotation

40dc931

abs711 added 2 commits May 20, 2025 14:19

Added bounding box coordinates to annotation

3195a2b

Ensure integer scale value

1bcdef2

cooperlab commented May 20, 2025

View reviewed changes

Ensure float scales

cf923a4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

data import script #203

data import script #203

Uh oh!

cooperlab commented May 13, 2025

Uh oh!

cooperlab commented May 13, 2025 •

edited

Loading

Uh oh!

andsild May 13, 2025

Uh oh!

andsild May 13, 2025

Uh oh!

cooperlab commented May 13, 2025

Uh oh!

abs711 commented May 13, 2025

Uh oh!

andsild commented May 13, 2025

Uh oh!

cooperlab commented May 13, 2025

Uh oh!

andsild commented May 13, 2025 •

edited

Loading

Uh oh!

abs711 commented May 13, 2025

Uh oh!

manthey commented May 14, 2025

Uh oh!

cooperlab commented May 16, 2025

Uh oh!

manthey commented May 19, 2025

Uh oh!

cooperlab May 20, 2025

Uh oh!

Uh oh!

data import script #203

Are you sure you want to change the base?

data import script #203

Uh oh!

Conversation

cooperlab commented May 13, 2025

Uh oh!

cooperlab commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

andsild May 13, 2025

Choose a reason for hiding this comment

Uh oh!

andsild May 13, 2025

Choose a reason for hiding this comment

Uh oh!

cooperlab commented May 13, 2025

Uh oh!

abs711 commented May 13, 2025

Uh oh!

andsild commented May 13, 2025

Uh oh!

cooperlab commented May 13, 2025

Uh oh!

andsild commented May 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

abs711 commented May 13, 2025

Uh oh!

manthey commented May 14, 2025

Uh oh!

cooperlab commented May 16, 2025

Uh oh!

manthey commented May 19, 2025

Uh oh!

cooperlab May 20, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cooperlab commented May 13, 2025 •

edited

Loading

andsild commented May 13, 2025 •

edited

Loading