Page 116 - AIH-2-2

P. 116

Artificial Intelligence in Health AI in early breast cancer diagnosis: A review

mammography images; and DDSM includes 2500 studies. maps between the encoder and decoder, achieving an F1
However, improvements are needed in recall rates and score of 0.81. However, the complexity of the network
capturing AD tracks from digital breast tomosynthesis architecture and its testing solely on private breast tumor
slices. Jianwei Liao et al. developed a DL model to detect ultrasound images may limit its performance on other
19
high-risk sites in ultrasound images that appear benign, tumor types. In addition, Wang et al. fused biological and
24
facilitating early cancer confirmation with urgent biopsy. The computational methods to detect early-stage breast cancer
reported accuracy was 94.4% on private datasets obtained using Raman spectroscopy. Raman spectra were collected
from three different hospitals in China, outperforming from 804 serum samples, of which 241 were from healthy
radiologists. The training dataset had Doppler ultrasound volunteers, 463 from breast cancer patients, and 100 from
images of 7955 lesions from 6795 patients. The model’s ductal carcinoma in situ patients – provided by Peking
performance was validated in two external cohorts that University People’s Hospital. The work combined Raman
included 448 lesions from 391 patients in the Tangshan spectroscopy of the serum with CNNs, demonstrating
People’s Hospital, Chongqing, China, and 245 lesions from high capability for rapid, accurate, and minimally invasive
235 patients in the Dazu People’s Hospital, Chongqing, breast cancer screening.
China. The system was designed to assist radiologists Fogliatto et al. applied ML techniques to the Wisconsin
21
and the experimental results indicated that this ensemble Breast Cancer Database (WBCD) and a protein microarray
DL model could enhance the diagnostic efficiency of database to design a decision support system for early
radiologists by identifying subtle and revealing elements breast cancer detection. The WBCD consists of 699
on ultrasound images of breast tissues. Another ensemble
DL model studied by Subramanian and Venugopal instances with nine attributes, and the protein microarray
26
achieved an accuracy of 96.71% through hyperparameter database includes 642 features regarding the levels of
optimization. The model was trained on the Breast Cancer protein expression analyzed by microarray chips. The study
Wisconsin Diagnostic dataset containing 569 instances, ranked features using a new feature importance index based
the Breast Cancer Wisconsin Original dataset with 699 on principal component analysis and Bayesian decision
instances, and the Breast Cancer Coimbra dataset with 116 parameters, yielding an average accuracy of 98.3% with
instances. While this model performed better than existing k-nearest neighbors (kNN), linear discriminant analysis
ones in the literature, it was validated on the same dataset; (LDA), and probabilistic neural network (PNN). Nomani
23
it was trained on rather than an external study, raising et al. attempted to improve early breast cancer diagnosis
23
concerns about its applicability in clinical settings. by image processing of mammograms from the Mini-
MIAS dataset, which contains 322 digitized mammograms.
Ayana et al. designed a novel patchless multi-stage The study developed a novel particle swarm-optimized
22
transfer learning technique to improve breast mass wavelet neural network that achieved a precision of 98.6%
classification. The datasets used were DDSM with 2620 on screening MIAS data, outperforming SVM, kNN, and
22
mammography images, INbreast database with 115 cases CNN. Malek et al. presented a new methodology to
28
(410 images), and Mammographic Image Analysis Society classify microcalcifications in mammogram images using
(MIAS) comprising 322 digitized film mammograms. persistent homology (PH) and ML models. The study
28
The model outperformed patch-and whole image- used MIAS and DDSM datasets containing 322 digitized
based methods, achieving an improved accuracy by 8% film mammograms and 2620 mammography images,
(91.41% vs. 99.34%) when tested on the INbreast dataset. respectively. The study demonstrated the use of PH in
Despite the high accuracy, further fine-tuning could be noise filtration and feature extraction, the results of which
beneficial if patient information and medical history are were used to train ML models. This approach achieved an
also considered. Yasir et al. implemented YOLOv5 in accuracy of 96.2% and 99.3% on MIAS and DDSM datasets,
27
combination with Mask R-convolutional neural networks leading to the discovery that PH-based ML methods can
(CNNs) on the INbreast dataset, achieving an accuracy increase classification performance in terms of accuracy. PH
of 98%. This novel technique proved to be better than a can capture topological features that conventional image-
single detection unit; however, it was conducted on a single analysis methods cannot extract, paving the way for a more
dataset and not validated against others. accurate diagnosis. Alfian et al. developed a web-based
29
Another study by Yu et al. focused on tumor application predicting breast cancer based on underlying
25
segmentation of breast ultrasound images, by enhancing risk factors with an accuracy of 80.23% using the Coimbra
the original U-Net model. The study used a private dataset dataset comprising 116 breast cancer instances. Although
from Xinhua Hospital, which contained 538 tumor this model was designed to assist healthcare professionals
images. The proposed approach integrated a Res Path using decision trees and SVM classifiers, its accuracy may
into the U-Net model to reduce discrepancies in feature not be sufficient for clinical reliance.

Volume 2 Issue 2 (2025) 110 doi: 10.36922/aih.4197

111 112 113 114 115 116 117 118 119 120 121