Open Access

Quick-MIMIC: A Multimodal Data Extraction Pipeline for MIMIC with Parallelization

College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China, and also with Centre for Distributed and High Performance Computing, School of Computer Science, The University of Sydney, Darlington, NSW 2008, Australia
College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
Centre for Distributed and High Performance Computing, School of Computer Science, The University of Sydney, Darlington, NSW 2008, Australia
Faculty of Applied Sciences, Macao Polytechnic University, Macao 999078, China

Abstract

Combining medical big data with artificial intelligence is vital to advancing digital medicine. However, most medical data extraction is opaque and non-standardised, which makes it prone to batch effects and has become a significant obstacle to reproducing previous work. This paper develops an easy-to-use time-series multimodal data extraction pipeline, Quick-MIMIC, for standardised data extraction from MIMIC datasets. Our method can fully integrate structured, semi-structured, and unstructured data into a single time-series table. We also introduce two additional modules to Quick-MIMIC: a pipeline parallelization method that reduces data extraction time, and data analysis methods that present the characteristics of the extracted data intuitively. Extensive experimental results show that our pipeline efficiently extracts the needed data from the MIMIC dataset and converts it into the correct format for further analytic tasks.
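
To make the idea concrete, the following is a minimal sketch of merging records from different sources into one time-series table and processing patients in parallel. It is not the Quick-MIMIC implementation: the record layout, the toy data, and the function names `build_patient_table` and `extract_all` are all assumptions made for illustration, and a thread pool stands in for whatever parallelization method the paper actually uses.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy records, not real MIMIC data: (patient_id, hour, column, value).
STRUCTURED = [(1, 0, "heart_rate", 82), (1, 1, "heart_rate", 90), (2, 0, "heart_rate", 75)]
NOTES = [(1, 1, "note", "patient stable"), (2, 0, "note", "admitted to ICU")]


def build_patient_table(patient_id, sources):
    """Merge all records for one patient into rows keyed by hour."""
    rows = defaultdict(dict)
    for source in sources:
        for pid, hour, column, value in source:
            if pid == patient_id:
                rows[hour][column] = value
    # One row per time step, sorted chronologically.
    return [{"patient_id": patient_id, "hour": h, **rows[h]} for h in sorted(rows)]


def extract_all(patient_ids, sources, workers=2):
    """Process patients independently in a worker pool, then concatenate."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map yields results in the same order as patient_ids.
        per_patient = pool.map(lambda pid: build_patient_table(pid, sources), patient_ids)
        return [row for table in per_patient for row in table]


table = extract_all([1, 2], [STRUCTURED, NOTES])
```

Because each patient is handled independently, the per-patient step can be distributed across workers without coordination; the only shared step is the final concatenation.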

Big Data Mining and Analytics
Pages 1333-1346

Cite this article:
Dou Y, Li W, Zheng Y, et al. Quick-MIMIC: A Multimodal Data Extraction Pipeline for MIMIC with Parallelization. Big Data Mining and Analytics, 2024, 7(4): 1333-1346. https://doi.org/10.26599/BDMA.2024.9020024

Views: 1915 | Downloads: 192 | Citations: Crossref 3, Web of Science 1, Scopus 2, CSCD 0

Received: 13 October 2023
Revised: 05 March 2024
Accepted: 01 April 2024
Published: 04 December 2024
© The author(s) 2024.

The articles published in this open access journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/).