CEAS - Classification of Early Ankylosing Spondylitis based on CNNs and clinical feature

Pytorch implement of classification for early ankylosing spondylitis. Paper can be found here.

TL;DR

Training CNNs

python train_script.py

Test CNNs

python loss_log_decode.py 
python test.py
python ci.py

Visualization

python GradCAM_demo.py

Machine Learning

python train_ML.py

Requirements

To train and test, you need:

Custom data set that contains at least one kind of sequence of SJI-MRIs with ROIs. If you have all three sequences (T1WI, T2WI and FS), our model should work without any modifications.
At least one nVidia GPU.
Python 3.6.5 (not sure whether other versions will work).
Install the necessary packages listed in requirements.txt.
A file that contain patients' clinical information.(unnecessary)

Install the requirements

pip install -r requirements.txt

Data set structure

Image data

Make sure your data set to be organized like following structure to avoid FileNotFoundError during training and testing.

├── npy_exp1
│   ├── train
│   │   ├── AS
│   │   │   ├── patient0001
│   │   │   │  ├── FS.nii.gz
│   │   │   │  ├── FS_mask.nii.gz
│   │   │   │  ├── T1.nii.gz
│   │   │   │  └── T1_mask.nii.gz
│   │   │   │  ├── T2.nii.gz
│   │   │   │  └── T2_mask.nii.gz
│   │   │   ├── patient0002
│   │   │   ├── patient0003
│   │   │   ...
│   │   └── nonAS
│   └── test
│       ├── AS
│       └── nonAS

Clinical information

Clinical information could be save as .xlsx(recommand) file under the root directory of this project with the following indices.

Patient_ID	Age(year old)	Gender(M:1,F:0)	Disease Duration(months)	ESR	CRP	HLA-B27(pos:1,neg:0)	Label(AS:1,non-AS:0)
patient0001	18	1	12	65	39	1	1
patient0002	24	0	6	0	11	0	0
patient0003	...	...	...	...	...	...	...

If the indices you use are not exactly same as the above, some modifications are need in some .py that use the clinical information.

Preprocessing

Supposing that all the image data in your custom data set has been converted to .nii format and has N4 bias field corrected.

Before training, there are several preprocesses to be done:

Resampling and save the largest ROI in the mask by

python resize_save_largest_ROI.py.

The target spacing is determined by the most frequent spacing in your custom data set.
To speed up the loading procedure of initialization of AS_Dataset, converting .nii to .npy.

python nii2npy.py
To speed up the dataloader and reduce the size of dataset, we crop the original volume to generate the smallest patch that contain the whole ROI:

python dataset_reduction.py
Clinical data imputation:

python data_imputation
Perform the LASSO regression to select the relevant clinical feature:

python lasso.py

Training

You should specify 3 parameters like the following example:

python train.py  --k 1 --gpus 0 --seq FS

Where --k indicates the k-fold, --gpus is for specifing the GPU, and --seq is for specifing which sequnce to be traind. Or you can directly run the training_script.py to perform the whole five-fold cross-validation on all sequences in one command:

python training_script.py

Test

You need to run loss_log_decode.py before test.py to get the optimal models from all the training iteration by calculating the largest AUROC.

python loss_log_decode.py
python test_DL.py

Commands listed above don't give a stats of the model assessment, but rather the predictions in patient level

To get the final results like the paper shows, you'll also need to calculate the confidential intervals via bootstrap:

python ci.py

Note that, the variable models in ci.py should be modified depending on combination of models you want to compare.

Visualization

python GradCAM_demo.py

Again, you should modify the variable patient_paths in GradCAM_demo.py to specify the patients exactly to be visualized.

Example result:

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
fig		fig
Nii_utils.py		Nii_utils.py
README.md		README.md
ci.py		ci.py
data_imputation.py		data_imputation.py
dataset.py		dataset.py
dataset_reduction.py		dataset_reduction.py
gradcam.py		gradcam.py
gradcam_demo.py		gradcam_demo.py
lasso.py		lasso.py
loss_log_decode.py		loss_log_decode.py
model.py		model.py
nii2npy.py		nii2npy.py
requirements.txt		requirements.txt
resize_save_largest_ROI.py		resize_save_largest_ROI.py
test_DL.py		test_DL.py
train_DL.py		train_DL.py
train_ML.py		train_ML.py
train_script.py		train_script.py
train_val_split.py		train_val_split.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CEAS - Classification of Early Ankylosing Spondylitis based on CNNs and clinical feature

TL;DR

Requirements

Install the requirements

Data set structure

Image data

Clinical information

Preprocessing

Training

Test

Visualization

About

Releases

Packages

Contributors 2

Languages

SMU-MedicalVision/CEAS

Folders and files

Latest commit

History

Repository files navigation

CEAS - Classification of Early Ankylosing Spondylitis based on CNNs and clinical feature

TL;DR

Requirements

Install the requirements

Data set structure

Image data

Clinical information

Preprocessing

Training

Test

Visualization

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages