Back to
Projects List
NCI Imaging Data Commons - user support and platform development
Key Investigators
- Andrey Fedorov (BWH, USA)
- Deepa Krishnaswamy (BWH, USA)
- Vamsi Thiriveedhi (BWH, USA)
- Cosmin Ciausu (BWH, USA)
- Leonard Nuerenberg (AIM Lab, USA)
- Suraj Pai (AIM Lab, USA)
- Steve Pieper (Isomics Inc, USA)
- Ron Kikinis (BWH, USA)
- Michael Onken (OpenConnections GmbH, Germany)
- Daniela Schacherer (Fraunhofer MEVIS, Germany)
- Andras Lasso (Queen's University, Canada)
Presenter location: In-person
Project Description
NCI Imaging Data Commons is a cloud-based environment containing publicly available cancer imaging data co-located with analysis and exploration tools and resources.
IDC provides a growing amount of publicly available cancer imaging data (>65TB at the moment, radiology and digital pathology, including images, annotations, analysis results and clinical data) curated in the cloud to support highly efficient access and to simplify analysis.
Objective
- Raise awareness about IDC, help users, collect feedback to help prioritize future development.
- Identify robust AI models that can be applied to IDC data to enrich IDC with annotations.
- Work on various issues related to the development of IDC platform and related software tools.
Approach and Plan
- Interact with current and prospective users to answer questions and collect feedback.
- Support any project that has a need for public datasets available for testing, cloud-based notebook implementations of the analysis, scaling up analysis to large cohorts within IDC.
- Work on priority aspects of the project: maintenance and improvement of SlicerIDCBrowser and idc-index, improvements of the documentation and other learning materials
- Improve/simplify access to the NLST/TotalSegmentator analysis results.
- Work on maintenance of dcmqi priority issues: https://github.com/QIICR/dcmqi/issues/489, python wrapper API
- MRTotalsegmenator SCT codes - Andras
- DCMTK upgrade in Slicer - JC
Progress and Next Steps
- Update MHub+IDC tutorial in how it accesses IDC.
- Prepared initial version of the query to extract processing steps for slide microscopy (SM) images using DICOM metadata (https://github.com/ImagingDataCommons/idc-index-data/pull/30). When completed, this will allow selecting SM images by embedding method, staining (H&E), and fixative without using BigQuery, and with queries of significantly lower complexity as compared to querying full index.
- Implemented new feature in the dcmqi converter that allows including into DICOM SEG references to the segmented images when geometry of the segmentation is different from the image (e.g., when segmentation was done on the slices orthogonal to the segmented image) (https://github.com/QIICR/dcmqi/issues/489). Lacking this feature, ReMIND collection encoded images that are disconnected from the segmented MR images.
- Mapped model-specific segmentation labels for OMAS and TotalSegmentator to SNOMED-CT (related PRs https://github.com/wasserth/TotalSegmentator/pull/324 and https://github.com/wasserth/TotalSegmentator/pull/325). Those interested to map labels from their model can follow instructions in https://qiicr.gitbook.io/dcmqi-guide/opening/coding_schemes/searching_codes_outside_dicom and of course contact Andrey and/or ask questions on the IDC forum.
- Presented IDC updates at the Thu breakout session (see notes and references in this document).
- Reviewed beta (aka pita) release of the pydcmqi python wrapper of dcmqi prepared by Leo. pydcmqi aims to simplify pythonic access to dcmqi functionality.
Illustrations
Background and References
- Fedorov, A., Longabaugh, W. J. R., Pot, D., Clunie, D. A., Pieper, S. D., Gibbs, D. L., Bridge, C., Herrmann, M. D., Homeyer, A., Lewis, R., Aerts, H. J. W., Krishnaswamy, D., Thiriveedhi, V. K., Ciausu, C., Schacherer, D. P., Bontempi, D., Pihl, T., Wagner, U., Farahani, K., Kim, E. & Kikinis, R. National Cancer Institute Imaging Data Commons: Toward Transparency, Reproducibility, and Scalability in Imaging Artificial Intelligence. RadioGraphics (2023). https://doi.org/10.1148/rg.230180
- Thiriveedhi, V. K., Krishnaswamy, D., Clunie, D., Pieper, S., Kikinis, R. & Fedorov, A. Cloud-based large-scale curation of medical imaging data using AI segmentation. Research Square (2024). https://doi.org/10.21203/rs.3.rs-4351526/v1