Page 21 - JCAU-6-1
P. 21

Journal of Chinese
            Architecture and Urbanism                                Spatial exploration through image semantic segmentation






































                                     Figure 3. Partial image stitching of Street 1. Source: Images by the author
            was captured and saved separately every 10 s. Subsequently,   2.4.3. Semantic segmentation and subsequent
            the captured frames underwent brightness adjustment and   processing of street images of Lu Xun’s hometown
            noise reduction to address instances of excessive darkness   The pre-processed street images from Lu Xun’s hometown
            and noise, thereby facilitating the subsequent processing.  were incorporated into the abovementioned training
            (b) Normalization                                  dataset for semantic segmentation. This process resulted
                                                               in the generation of segmented color block images. The
            The resolution dimensions of the 279 images were obtained   results of semantic segmentation are exclusively affected by
            through the abovementioned process and normalized by Python.  the proportion of different color block regions. To reduce
            2.4. Street scene analysis based on semantic       computational workload, further normalization processing
            segmentation                                       was applied to the color block images. During this process, the
                                                               image resolution was compressed to facilitate pixel statistics.
            2.4.1. Selection of semantic segmentation model
                                                               2.4.4. Pixel-wise image analysis using Python
            The semantic segmentation process utilized a classic
            CNN model, ResNet-22, implemented through CUDA-    In this step, Python was utilized to scan images pixel by
            accelerated computation, with the training framework   pixel, tallying and merging pixels belonging to various
            facilitated by Baidu’s PaddlePaddle. Cityscape trained   color groups. Statistical charts were subsequently generated
            the datasets using a series of high-resolution images   based on the counting results. The results obtained from
            encompassing  various  urban  scenes,  including  roads,   various dimensions were aggregated to calculate quantities,
            buildings, pedestrians, and others. The extensive dataset,   averages, variances, and other parameters, facilitating
            consisting of thousands of high-precision annotated   subsequent analysis (Figures 5 and 6).
            training images, was methodically divided into training,   2.5. Comparison and analysis of different street
            validation, and test sets. The accuracy of well-trained   spatial features in Lu Xun’s hometown
            Cityscape images could reach up to 70% or even 80%.
                                                               2.5.1. Spatial features of the historic street (Street 1)
            2.4.2. Model network architecture and training process  in Lu Xun’s hometown

            Eight pre-defined labels were adopted in Cityscape, namely   The distribution of various elements in historic streets
            road, vegetation, building, sky, person, vehicle, wall, and sign.  is worthy of examination. As illustrated in Table 1, road


            Volume 6 Issue 1 (2024)                         4                        https://doi.org/10.36922/jcau.1736
   16   17   18   19   20   21   22   23   24   25   26