Page 71 - AIH-2-2

P. 71

Artificial Intelligence in Health Improved liver tumor segmentation with dense networks

was proposed for multiclass semantic segmentation, employed to evaluate our method for 3DIRCADb dataset.
16
aiming for direct optimization of the mean intersection- In the training phase, three adjacent slices of size 224 × 224
over-union (mIoU) score. It shows substantially better were randomly cropped from raw CT volumes to train our
segmentation performance than the cross-entropy loss on I -DenseFCN. No other pre-processing operations were
2
several semantic segmentation datasets. performed in the period.
The cross-entropy loss is unable to take the regional Similar to the 2017 LiTS challenge (XX), the main
36
information into account during the optimization process, evaluation metric to assess the performance of liver tumor
but it is stable. Despite the improved segmentation segmentation was the Dice per case score, which refers to
performance achieved by the Lovász-Softmax loss, it the average Dice score per volume.
is unstable in the training process. Hence, the Lovász-
Softmax loss is exploited in a fine-tuning stage to refine 3.2. Implementation details
27
the segmentation results in the original study. The I -DenseFCN model was implemented under the
2
37
In this work, a hybrid loss is designed by combining PyTorch framework and was trained with Adam
38
the cross-entropy loss and the Lovász-Softmax loss with optimizer. The initial learning rate was set to 0.001 and
dynamic weights. The weights of two types of losses are decayed every 40 epochs at a rate of 0.1. The training of each
linearly determined by the iteration of the training process. model required approximately 30 h on a single NVIDIA
The hybrid loss function is defined in Equation (V): GTX 1080Ti GPU for 120 epochs, with a minibatch size
of 16. The inference time cost on the LiTS test dataset was
n n about 90 s/case.
L 1 L wce L ls (V)

N N In the training stage, a liver segmentation model based
on the 2D DenseUNet was first trained with CT slices in
29
Where L and L represent the weighted cross-
wce
ls
entropy loss and the Lovász-Softmax loss, respectively. their original size. Then the proposed model was trained
N is the total number of training iterations and n is the on the cropped dataset. To alleviate overfitting, standard
index of the training iteration. α is a hyperparameter data augmentation techniques, such as random flipping,
chosen to balance L and L magnitude and is rotating, and scaling, were applied. In the test stage, for a
wce
ls
determined empirically. The hybrid loss puts more weight given 3D CT volume, a coarse liver segmentation result
on the cross-entropy loss in the early stage of the training was first generated, and then the liver bounding box was
process and shifts gradually to the Lovász-Softmax loss used by the proposed model to predict the final liver tumor
in the late stage. This leads to a coarse-to-fine liver tumor segmentation.
segmentation process. 3.3. Analysis of the proposed method
3. Experiments In this section, we conducted several experiments on
the LiTS dataset to evaluate the contributions of each
3.1. Datasets and evaluation metrics
component in our proposed I -DenseFCN, including
2
Experiments were conducted to evaluate the performance the CT image pre-processing module, the I -DenseFCN
2
2
of the proposed I -DenseFCN using two public datasets, segmentation network, and the hybrid loss.
namely 2017 LiTS challenge and 3DIRCADb. The LiTS
36
dataset contains 201 contrast-enhanced abdominal CT 3.3.1. Effectiveness of CT image pre-processing
scans collected from multiple clinical sites, incorporating module
large variations in the in-plane resolution and the slice To validate the effectiveness of the CT image pre-
spacing due to the difference in scanners and protocols; processing module, we compared the tumor segmentation
the dataset includes 131 cases for training and 70 cases for performance of different CT image pre-processing
testing. The axial slices of all scans have an identical size approaches using the 2D DenseUNet model. The
of 512 × 512, and the number of slices in each case ranges approaches included HU value clipping with a preset
from 42 to 1026. The 3DIRCADb dataset is composed CT window (Window Clip), the full range of HU values
of 20 3D CT scans, of which 15 volumes have hepatic without any window (No Window), and the proposed
tumors in the liver. The number of slices in each case varies integrated window optimization pre-processing module
between 74 and 225. (Window Optimization). The preset CT window was set
In our experiments, 131 training cases were further split to [−200, 250]. The initialization parameters W width, W level,
into a training set and validation set at a ratio of 111:20 U, and ε for window optimization pre-processing module
for LiTS dataset, whereas 2-fold cross-validation was were set to 450, 50, 255, and 1, respectively. Experimental

Volume 2 Issue 2 (2025) 65 doi: 10.36922/aih.5001

66 67 68 69 70 71 72 73 74 75 76