Update: 2025/01/22: We have created a repository for the paper titled Unveiling Discrete Clues: Superior Healthcare Predictions for Rare Diseases (https://arxiv.org/abs/2501.16373), which has been accepted by the WWW 2025. In this repository, we offer the original sample datasets, preprocessing scripts, and algorithm files to showcase the reproducibility of our work.
- torch==1.13.1+cu117
- dgl==1.1.2
- pyhealth==1.1.4
- seaborn==0.13.0
Owing to the copyright stipulations associated with the dataset, we are unable to provide direct upload access. However, it can be readily obtained by downloading directly from the official website: MIMIC-III, MIMIC-IV,eICU.
The structure of the data set should be like,
data
|_ DIAG
| |_ MIII
| |_ _processed
| |_ _ _datasets_pre_stand.pkl
| |_ _ rare.pkl
| |_ MIV
| |_ _ _datasets_pre_stand.pkl
|_ REC
|_ _MIII
| |_ _processed
| |_ _ _datasets_pre_stand.pkl
| |_ _ rare_patient.pkl
# run the main file
# change config.py
config = vars(UDCDIAGConfig)
config = {k:v for k,v in config.items() if not k.startswith('__')}
# please set pretrain=True (first, sencod stage)
# please set tuning=True (third stage)
python main.py
Please feel free to contact czhaobo@connect.ust.hk for assistance or collaboration.