Page 52 - MSAM-2-1
P. 52

Materials Science in Additive Manufacturing                           Data imputation strategies of PBF Ti64







































            Figure 9. Cumulative distribution plots for k-nearest neighbor-imputed (left), multivariate imputation by chained equations-imputed (middle), and graph
            imputation neural network-imputed (right) datasets for selected incomplete variables.

            the best for six variables, including exposure duration,   correct relationship. Most of the sums of imputed density
            hatch spacing, laser focus, layer thickness, point distance,   and porosity values add up to 100% when rounded to 2
            and Young’s modulus. GINN gave the best imputation for   decimal places.
            seven variables, including laser spot, density, elongation,   The performances of the models were validated by
            microhardness, macrohardness, ultimate tensile strength,   comparing the values obtained from the models with
            and  yield  strength.  MICE  only  outperformed  kNN  and   the actual values calculated from a known relationship.
            GINN in two variables, that is, scan speed and porosity.   The laser scan speed is known to be related with other
            It is interesting to note that kNN performed better in   parameters by the equation below.
            imputing variables related to process parameters, whereas
            GINN performed better in variables related to material           V =  P                       (V)
            properties.                                                          Ehl
              For the MICE-imputed dataset, the imputed values were   where V is the laser scan speed, P is the laser power, E is
            much more varied, and the imputations showed a stronger   the volumetric energy density, h is the hatch spacing, and l
            relationship between  the process parameters and  material   is the layer thickness. There are 48 datarows in the dataset
            properties than the kNN-imputed dataset. While MICE   that contains missing data for the laser scan speed while
            performed better than kNN in that aspect, there were more   other parameters such as laser power, energy density, hatch
            instances where microhardness values were lower than   spacing and layer thickness are available. These datarows
            macrohardness values (137 for MICE versus 73 for kNN).   can be used to validate the performance of the imputation
            The sum of the imputed density and porosity values also do   models by comparing the imputed laser scan speed with the
            not add up to 100%, instead ranging from 99.92% to 100.12%.  calculated laser scan speed using the known relationship
              The GINN algorithm can be said to have performed   as shown in Equation V. Figure 11 shows the performance
            the best with the closest distribution to the original dataset   of the models. It is found that the KNN model tends to
            and the most varied imputed values, although with 109   overpredict the laser scan speed, while the GINN model
            instances of lower values of microhardness compared to   tends to underestimate the laser scan speed. The MICE
            macrohardness, translating to 73% of the data showing the   model is able to predict the laser scan speed correctly in


            Volume 2 Issue 1 (2023)                         11                       https://doi.org/10.36922/msam.50
   47   48   49   50   51   52   53   54   55   56   57