Page 203 - IJOCTA-15-4
P. 203

FastLoader: Leveraging large language models to accelerate cargo loading optimization with numerous
                Comparison of algorithm runtime. By           4.4. Module ablation analysis
            observing the algorithm runtime of evaluation un-  Settings. In this comparison, we set up ablation
            der four aircraft types, we find the following two  experiments for the three modules of FastLoader
            important observations: (1) compared with the     to evaluate the impact of each module on this
            heuristic algorithm, although FastLodaer loses
                                                              method.
            some accuracy, it has a higher improvement in
                                                                  Iteration analysis. As shown in Figure 5a-c,
            algorithm runtime. This is because LLM effec-
                                                              we set different numbers of iterations in the eval-
            tively reduces the search space, improves search
                                                              uation, and we have two key observations as fol-
            efficiency, and reduces the time cost of the algo-
                                                              lows: (1) when FastLoader has a small number of
            rithm through its own understanding of complex
                                                              iterations (e.g., 2 and 10 iterations), the balancing
            constraints; (2) although the combinatorial opti-
                                                              results %MAC and %index still achieve high accu-
            mization methods have the lowest algorithm run-
                                                              racy; (2) we particularly observed that when the
            time, they cannot handle complex constraints, re-
                                                              number of iterations is low, the solution accuracy
            sulting in the lowest solution accuracy.
                                                              of each large model is basically close. When the
                Results. FastLoader maintains stable solu-
                                                              number of iterations exceeds 10, in the evaluation
            tion accuracy in multiple LLMs and multiple air-
                                                              performance of %MAC, the results of Deepseek,
            craft types, with an average deviation of no more
                                                              phi, and llama are better than the Qwen, which
            than 1%. Compared with combinatorial optimiza-
                                                              shows these three LLMs have better capabilities
            tion methods, FastLoader has an average improve-
                                                              in simplifying the load balancing search space un-
            ment of 10% in accuracy.     Compared with the
                                                              der complex constraints.
            heuristic search methods, FastLoader has a loss
                                                                  Token length analysis. As shown in Fig-
            of no more than 1.5% in accuracy, but the algo-   ure 5 d-f, we set different length of token in the
            rithm runtime is reduced by an average of 90%,    evaluation, and we have two key observations: (1)
            which greatly reduces the time cost of calculating  the accuracy of the solution drops significantly
            the solution.
                                                              when the token is between 64 and 128 and tends
                                                              to converge in other token settings (256, 512); (2)
                                                              FastLoader has high accuracy and small differ-
            4.3. Evaluation of runtime optimization           ences under different token length settings. As a
                                                              result, FastLoader is robust to token length set-
            Settings. In this comparison, we break down       tings. The reason is that all methods almost over-
            each step of the heuristic search and evaluate the  lap when the token length is greater than 128 (the
            algorithm runtime separately. We aim to ana-      difference is less than 0.05% MAC). We conclude
            lyze in depth which step of the heuristic search  that whether it is Deepseek/phi based on instruc-
            is optimized by Fastloader. Following the hyper-  tion fine-tuning or the larger Qwen/Llama, as
            parameter settings in the previous evaluation, we  long as the length of the token exceeds 128, they
            choose four heuristic search methods as baselines  stably output the optimal solution in the same
            to conduct the experiment.                        order of magnitude.
                Comparison of runtime. Figures 3 and 4            Population size analysis. As shown in Fig-
            show the algorithm runtime across four aircraft   ure 5 g-i, we set different population sizes in the
            types. We have two key observations as follows:   evaluation, and we have the following three key
            (1) the evolution phase accounts for 99% of the   observations: (1) FastLoader achieves high accu-
            heuristic search runtime. The main reason why     racy in the results of %MAC and %index under
            the heuristic algorithm consumes too much time    different initial population size settings; (2) when
            is that the search speed in the evolution phase is  the population size exceeds 50, the solution ac-
            slow, and it takes a long time to get an accurate  curacy of all LLMs further improves, but when
            solution; (2) by simplifying the search space with  the population size exceeds 100, the accuracy of
            LLM, FastLoader significantly reduces the run-    Qwen3 decreases. This is because when Qwen3
            time of the algorithm in the evolution phase. This  generates the initial population, the similarity of
            is because the proportion of feasible solutions in  the solutions in the population is too high, caus-
            the search space increases, while the overall size  ing the result to fall into the local optimal solu-
            of the search space decreases, so the runtime of  tion. Under the same settings, other LLMs do not
            solving the problem is effectively reduced.       fall into the local optimal solution; (3) in terms
                Results.    By breaking down the heuristic    of algorithm runtime, when the population size
            search steps, we observe that FastLoader makes    is set to 200, except for the time reduction of
            effective optimizations to the runtime during the  Qwen3, the algorithm runtime of the other LLMs
            evolution phase, reducing the runtime by 90%.     increases. However, the overall time to complete
                                                           745
   198   199   200   201   202   203   204   205   206   207   208