FHIR-PYrate: a data science friendly Python package to query FHIR servers

Abstract

We present FHIR-PYrate, a Python package that handles the full clinical data collection and extraction process. The software is designed to be plugged into a modern hospital environment, where electronic patient records hold each patient's entire history. Most research institutes follow the same procedures to build study cohorts, but mainly in a non-standardized and repetitive way. As a result, researchers spend time writing boilerplate code that could instead be spent on more challenging tasks.

The package can improve and simplify existing processes in the clinical research environment. It collects all needed functionality into a straightforward interface that can be used to query a FHIR server, download imaging studies, and filter clinical documents. The full capacity of the FHIR REST API search mechanism is available to the user, so all resources are queried in a uniform way, which simplifies customizing each use case. Additionally, features such as parallelization and filtering are included to improve performance.

As an exemplary practical application, the package can be used to analyze the prognostic significance of routine CT imaging and clinical data in breast cancer with tumor metastases in the lungs. In this example, the initial patient cohort is first collected using ICD-10 codes. For these patients, the survival information is also gathered. Some additional clinical data is retrieved, and CT scans of the thorax are downloaded. Finally, the survival analysis can be computed using a deep learning model that takes the CT scans, the TNM staging, and the positivity of relevant markers as input. This process may vary depending on the FHIR server and the available clinical data, and can be customized to cover even more use cases.

FHIR-PYrate makes it possible to quickly and easily retrieve FHIR data, download image data, and search medical documents for keywords within a single Python package. With the demonstrated functionality, it offers an easy way to assemble research cohorts automatically.
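
The cohort-collection step described above relies on the standard FHIR REST search mechanism. The following is a minimal sketch of that underlying idea using only the Python standard library; it is not FHIR-PYrate's own interface. The helper names, the server address `https://example.org/fhir`, the sample Bundle, and the choice of ICD-10 codes (C50.9 and C78.0) are assumptions made for illustration, and a given server may use a different code system identifier (for example, a national ICD-10 variant).

```python
from urllib.parse import urlencode

# Assumption: the server identifies ICD-10 with the HL7-published system URI.
ICD10_SYSTEM = "http://hl7.org/fhir/sid/icd-10"


def build_condition_search_url(base_url, icd10_codes, count=100):
    """Build a FHIR search URL for Condition resources matching any given code."""
    # Comma-separated token values are combined with OR by FHIR search.
    code_param = ",".join(f"{ICD10_SYSTEM}|{code}" for code in icd10_codes)
    query = urlencode({"code": code_param, "_count": count})
    return f"{base_url.rstrip('/')}/Condition?{query}"


def patient_ids_from_bundle(bundle):
    """Collect unique patient IDs referenced by Condition entries in a Bundle."""
    ids = []
    for entry in bundle.get("entry", []):
        resource = entry.get("resource", {})
        if resource.get("resourceType") != "Condition":
            continue
        reference = resource.get("subject", {}).get("reference", "")
        if reference.startswith("Patient/"):
            patient_id = reference.split("/", 1)[1]
            if patient_id not in ids:
                ids.append(patient_id)
    return ids


# Hypothetical server address; no request is sent in this sketch.
url = build_condition_search_url("https://example.org/fhir", ["C50.9", "C78.0"])

# Hand-written Bundle standing in for a server response.
sample_bundle = {
    "resourceType": "Bundle",
    "type": "searchset",
    "entry": [
        {"resource": {"resourceType": "Condition", "subject": {"reference": "Patient/p1"}}},
        {"resource": {"resourceType": "Condition", "subject": {"reference": "Patient/p2"}}},
        {"resource": {"resourceType": "Condition", "subject": {"reference": "Patient/p1"}}},
    ],
}
cohort = patient_ids_from_bundle(sample_bundle)
print(url)
print(cohort)
```

In practice the URL would be requested from the server, paginated results would be followed through the Bundle's `next` link, and the resulting patient IDs would drive the later queries for survival data and imaging studies. A package such as FHIR-PYrate is meant to take over exactly this kind of repetitive plumbing.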

Publication
BMC Health Services Research
René Hosch
Team Lead

My research interests include distributed Computer Vision, Generative Adversarial Networks and Image-to-Image translation.

Giulia Baldini
Data Science

My research interests include deep learning, algorithms, and software development.

Katarzyna Borys
Data Science

My research interests include Deep Learning, Computer Vision, Radiomics, and Explainable AI.

Merlin Engelke
Data Science

My research interests include machine learning, digitalization, and automation.

Felix Nensa
Lead

My research interests include medical digitalization, computer vision and radiology.