Page 74 - AIH-2-1
P. 74

Artificial Intelligence in Health





                                        ORIGINAL RESEARCH ARTICLE
                                        Vision transformers for glioma classification

                                        using T1 magnetic resonance imaging



                                        W. M. S. P. B. Wickramasinghe ,and Maheshi B. Dissanayake*

                                        Department of Electrical and Electronics Engineering, Faculty of Engineering, University of
                                        Peradeniya, Peradeniya, Sri Lanka
                                        (This article belongs to the Special Issue: Artificial intelligence for diagnosing brain diseases)



                                        Abstract

                                        Automated image analysis and classification have increasingly advanced in recent
                                        decades owing to machine learning and computer vision. In particular, deep learning
                                        (DL) architectures have become popular in resource-limited and labor-restricted
                                        environments such as the health-care sector. Transformer architecture, a DL method
                                        with self-attention mechanism, excels in natural language processing; however, its
                                        application in image-based diagnosis in health-care sector remains limited. Herein,
                                        the feasibility, bottlenecks, and performance of transformers in magnetic resonance
                                        imaging (MRI)-based brain tumor classification were investigated.  To this end, a
                                        vision transformer (ViT) model was trained and tested using the popular Brain Tumor
                                        Segmentation (BraTS) 2015 dataset for glioma classification. Owing to limited data
            *Corresponding author:      availability, domain adaptation techniques were used to pretrain the ViT model and
            Maheshi B. Dissanayake
            (maheshid@eng.pdn.ac.lk)    the BraTS 2015 dataset was used for its fine-tuning. With the model only trained for
                                        100 epochs, the confusion matrix for the two-class problem of tumor and nontumor
            Citation: Wickramasinghe,
            WMSPB and Dissanayake MB.   classification showed an overall classification accuracy of 81.8%. In conclusion,
            Vision transformers for glioma   although convolutional neural networks are traditionally used for DL-based
            classification using T1 magnetic   medical image classification owing to their attention mechanism and long-range
            resonance imaging. Artif Intell
            Health. 2025;2(1):68-80.    dependency-capturing capability,  ViTs can outperform them in MRI-based brain
            doi: 10.36922/aih.4155      tumor classification.
            Received: July 5, 2024
            1st revised: August 28, 2024  Keywords: Vision transformers; Medical image analysis; Deep neural networks; Magnetic
                                        resonance imaging; Convolutional neural network; Glioma detection
            2nd revised: September 9, 2024
            Accepted: September 19, 2024
            Published Online: November 6,
            2024                        1. Introduction
            Copyright: © 2024 Author(s).   Medical imaging is crucial in the health-care sector for noninvasive diagnostic procedures
            This is an Open-Access article   because it can provide functional and visual representations of internal organs. X-ray
            distributed under the terms of the
            Creative Commons Attribution   imaging, nuclear imaging, magnetic resonance imaging (MRI), mammography,
            License, permitting distribution,   computed tomography (CT), and ultrasound imaging are some popular imaging
            and reproduction in any medium,
                                                 1
            provided the original work is   techniques.  The four primary phases of medical image analysis include image formation,
            properly cited.             reconstruction, processing, and analysis. These phases help to create two-dimensional
            Publisher’s Note: AccScience   and three-dimensional (3D) images and enhance them; quantitative data are used for
            Publishing remains neutral with   segmentation, classification, and object identification.  Modern advancements in
                                                                                     2,3
            regard to jurisdictional claims in
            published maps and institutional   artificial intelligence (AI), computer vision (CV), machine learning, and deep learning
            affiliations.               (DL) techniques can qualitatively and quantitatively improve medical image analysis.

            Volume 2 Issue 1 (2025)                         68                               doi: 10.36922/aih.4155
   69   70   71   72   73   74   75   76   77   78   79