Page 53 - MSAM-2-1
P. 53

Materials Science in Additive Manufacturing                           Data imputation strategies of PBF Ti64





































            Figure 10. Distribution plots for k-nearest neighbor-imputed (left), multivariate imputation by chained equations-imputed (middle), and graph imputation
            neural network-imputed (right) datasets for selected incomplete variables.

            Table 3. Comparison of the mean square error of the   to the KNN (0.29) and GINN (0.16). GINN performs the
            distribution for each variable                     best in terms of the RMSE.
            Features                kNN     MICE    GINN         To further validate the results, a composite parameter
            Energy density (J/mm )  3.06E-07  1.71E-07 a  3.11E-07  (energy density × hatch spacing) was used to evaluate the
                          3
            Exposure duration (µs)  1.64E-05 a  4.46E-05  3.23E-05  models. There are 19 datarows in the dataset that contains
            Hatch spacing (µm)     1.32E-07 a  7.72E-08  6.77E-08  missing data for the energy density and hatch spacing,
            Laser focus (mm)       2.90E-02 a  3.03E-02  4.47E-02  while other parameters of the energy density equation such
                                                               as laser power, laser scan speed and layer thickness are
            Laser spot (µm)        1.19E-06  8.50E-07  2.97E-07 a  available. Figure 12 shows the performance of the models.
            Layer thickness (µm)   1.42E-09 a  2.88E-09  4.26E-09  It is found that the MICE model tends to overpredict the
            Point distance (µm)    1.23E-04 a  8.56E-04  1.03E-03  composite parameter, while the GINN model tends to
            Scan speed (mm/s)      1.11E-09  3.75E-10 a  7.04E-10  underestimate the composite parameter. The GINN model
            Density (%)            1.77E-03  2.27E-03  1.82E-03 a  has  a lowest  RMSE  (0.33)  for the composite  parameter,
            Elongation (%)         3.87E-03  2.29E-03  1.81E-03 a  whereas MICE model has a higher RMSE (3.87) for the
            Microhardness (HV)     1.50E-05  1.51E-05  4.81E-06 a  composite parameter.
            Macrohardness (HV)     1.69E-04  2.99E-05  2.36E-05 a  To get the best out of the three models, further
            Ultimate tensile strength (MPa)  2.08E-07  4.52E-07  4.86E-08 a  processing steps were taken. For each missing datapoint,
            Yield strength (MPa)   9.12E-07  7.23E-07  2.30E-07 a  the median of the imputed values obtained from the three
            Young’s modulus (GPa)  2.24E-04 a  2.26E-04  2.97E-04  models was taken as the final imputed value. This enhances
                                                               the statistical confidence of the imputed value and reduces
            Porosity (%)           6.01E-05  2.99E-05 a  4.71E-05  the chances of getting the outliers especially when dealing
            a Lowest MSE among the three models. kNN: k-nearest neighbor,   with small dataset. It resulted in a remarkable RMSE of
            GINN: Graph imputation neural network, MICE: Multivariate
            imputation by chained equations                    0.026,  which  is  significantly  lower  than  that  of  all  three
                                                               individual models (Figure 13). Nonetheless, it is believed
            general, but the presence of outliers causes the relative   that the models can be further improved with increased
            mean square error (RMSE) (1.36) to be higher compared   number of datapoints.


            Volume 2 Issue 1 (2023)                         12                       https://doi.org/10.36922/msam.50
   48   49   50   51   52   53   54   55   56   57   58