Page 13 - IJAMD-2-3
P. 13

International Journal of AI for
            Materials and Design                                                   AI applications in composite materials



              Similarly, as with DNNs, many studies have also utilized   or spatial-order relationships. Unlike feedforward neural
            transfer learning techniques to enhance the training   networks or CNNs, RNNs receive data one step at a time in
            efficiency of CNNs. Kojima et al.  used transfer learning   sequential order, as shown in Figure 5A, and have feedback
                                       47
            with CNNs to predict internal defects in carbon fiber-  connections in the hidden layers to recognize the sequential
            reinforced polymer (CFRP) laminates by combining data   structure of the data. This architecture is advantageous in
            generated through FEA with infrared stress distributions   problems where the order of inputs is important, and it
            obtained from experiments.  Figure  4D presents the   also offers the flexibility to handle variable-length data.
            ground  truth  of  defects  for  each  layer  of  the  composite   However, RNNs have a drawback known as the long-term
            laminate, together with the predicted results obtained   dependency problem, where the influence of distant inputs
            through transfer learning. By fine-tuning these models   diminishes over time despite their relevance. In addition, as
            with a limited set of experimental data, transfer learning   weights are multiplied across time steps, the model suffers
            improves their generalization capabilities across various   from vanishing or exploding gradient issues. To address
            composite structures. The results show that as the amount   these problems, advanced RNN-based models, such as
            of experimental stress measurement data increases, transfer   long short-term memory (LSTM) and gated recurrent unit
            learning-based defect prediction outperforms traditional   (GRU),  are  commonly  used  (Figure  5A).  LSTM  utilizes
            FEA-based approaches in terms of efficiency. Moreover, Liu   three gates to selectively store, transmit, and discard
            et al.  aimed to diagnose damage in composite materials   information, and uses a memory cell to retain long-term
                59
            using transfer learning with CNNs by leveraging Lamb wave   data. GRU, a simplified version of LSTM, applies an update
            signals generated from numerical simulations and applying   gate to determine how much of the previous information
            them to experimental data. By integrating transfer learning   to keep, and a reset gate to decide how much of the past
            with domain adversarial learning, the model achieved   information to forget, thereby overcoming the limitations
            high generalization performance across complex domain   of conventional RNNs.
            discrepancies, leading to highly accurate delamination   Since composites exhibit highly nonlinear and time-
            prediction. In addition to cases of domain adaptation,   dependent behavior, RNNs have been actively studied
            where only the domains differ while the tasks remain the   and applied in various research areas. 65,66,70-76  Chen et al.
                                                                                                            70
            same as in the aforementioned studies, transfer learning can   predicted  the  nonlinear  behavior  of  composites,  such  as
            also be applied when there are differences in the source and   the time-dependent nonlinear elastic-plastic response that
            target tasks. Xu et al.  extracted vector representations of   varies with the strain/loading path, using an LSTM network
                             60
            the geometric morphology and distribution characteristics   capable of processing the entire time sequence. As a result,
            of two-phase composites from images to establish similarity   the trained LSTM model accurately predicted stress with
            between  the  source  and  target  domains.  Their  method   a maximum error of <4%, and the model successfully
            enabled the use of small datasets through transfer learning,   reproduced the stress-strain hysteresis curves even under
            yielding high prediction accuracy for mechanical properties,   nonlinear cyclic loading conditions. Yousefi  et al.  also
                                                                                                        65
            such as stiffness, strength, and toughness. Techniques   employed an LSTM DL model, as shown  in  Figure  5B,
            such as data augmentation and transfer learning not only   to predict the long-term water absorption and thickness
            enhance the efficiency of predictive models but also enable   swelling behavior of a composite composed of hydrophilic
            broad applicability across diverse tasks and DL models.  cellulose fibers and hydrophobic polypropylene. By
              In addition, developing hybrid models by integrating   training the model with the initial 200 h of data obtained
            CNNs with other ML/DL methods holds promise for    from immersion experiments, they demonstrated that
            enhancing their ability to capture and analyze complex   the LSTM can effectively predict the long-term physical
            composite material structures. 61-64  Among various   behavior of the composite for the subsequent 1300  h.
            approaches, attention mechanism-based CNNs have    Furthermore, RNN-based models have also been utilized
            demonstrated strong capabilities in capturing complex   to predict the behavior of composites under dynamic
            failure behaviors. Liu  et al.  incorporated an attention-  loading conditions. Borkowski et al.  employed a GRU to
                                  64
                                                                                            71
            based loss function into a CNN framework, enabling the   predict the response of ceramic matrix composites (CMCs)
            model to focus more on high-damage regions in thick   under non-monotonic loading and microstructural
            composites with wrinkles, thereby achieving effective   variations, and demonstrated that stable predictions
            prediction of failure modes.                       could be achieved without overfitting across 60 test sets.
                                                                       66
                                                               Gao et al.  applied an LSTM to predict the axial stress of
            2.3. Recurrent neural networks (RNNs)              concrete-granite composites under impact conditions, as
            RNNs are a type of DL model designed to process sequential   shown in Figure 5C. The trained model was able to predict
            data, and they excel in handling data that contain temporal   stress responses to new impact velocities, joint roughness


            Volume 2 Issue 3 (2025)                         7                         doi: 10.36922/IJAMD025210016
   8   9   10   11   12   13   14   15   16   17   18