Back to Projects List
Integration of clinical data into medical imaging pipelines
Key Investigators
- Philipp Schader (German Cancer Research Center, Germany)
- Andrey Fedorov (BWH)
Project channel in Discord #kaapana-clinical-data-integration
Project Description
Clinical data like age, blood type, diagnosis and other non-imaging biomarkers are highly relevant in medical image processing as they provide context for the analysis of imaging datasets. Incorporating this additional data layer into image processing tools facilitate the development of complex biomarkers. While platforms for medical image processing like Kaapana focus on the imaging layer they often lack for features to relate clinical data to it.
This projects aims to integrate clinical data better into the Kaapana medical image processing platform by using the FHIR standard. To facilitate this a FHIR server will be integrated in the platform and linked to the imaging data stored in the internal PACS. Additionally the workflow component will be extended to be able to store and retrieve FHIR objects from the internal server. This forms the basis to create import procedures allowing the import of clinical data from tabular data into the internal FHIR store. By extending the preexisting radiomics workflow of the platform to store its results in the FHIR server a first imaging biomarker is made available. In a last step a joint analysis using the Jupyter Lab service of the platform joining the clinical data with the imaging biomarkers from the radiomics analysis is performed.
Objective
- Integration of a data store for clinical data in the Kaapana Platform
- Integration of clinical data retrieval and storage from processing pipelines (Try to store analysis results like radiomics reports in FHIR)
- Establish import procedures for clinical data (e.g. for data formats available via TCIA or IDC - up for discussion)
- Integration of the clinical data store into the analysis frameworks of the platform and example of a joint analysis
Approach and Plan
- Review clinical data available in IDC, start with the clinical data IDC tutorial
- Find a small collection of datasets including images, segmentations of pathologies and according clinical data
- Decide on an open-source FHIR server (like HAPI FHIR) integrate it into Kaapana (and link it to the PACS if possible)
- Creation of query / retrieve operators for FHIR objects within the workflow components
- Create import workflow to import different clinical data fromats from the object store into the FHIR server and discuss which data formats to support (csv, Excel, RedCap, odm)
- Represent the results of an workflow (maybe by extending the preixisitng radiomics workflow - feedback welcome) as FHIR objects and store them in the FHIR Server of the platform
- Perform an example analysis using the workflow results and enriching them using other clinical data
Progress and Next Steps
- Explored TCIA clinical data collection via IDC and selected NSCLC-Radiomics since it contains a csv as well as segmentations to perforem radiomics.
- Created a Kaapana Extension for the HAPI FHIR JPA server to store and retreive the resources created in the next steps.
- Draft a mapping from the clinical data from NSCLC-Radiomicss into FHIR Resources.
- Created Jupyther Notebook to create the the actual resources for imaging and clinical data and import them into the platforms FHIR Server.
- WiP: Translate the Jupyther Notebook into Workflow Operators to automate the import
- WiP: Wrap Radiomics Features
- WiP: Performe Joint Analysis
Illustrations
Background and References
- Kaapana the imaging platform to use
- HAPI FHIR a potential open source FHIR server
- CCE_DART a project using Kaapana to discover complex biomarkers
- Clinical data in IDC - start with this tutorial