Page 84 - JCAU-7-1
P. 84

Journal of Chinese
            Architecture and Urbanism                                     Machine-simulated scoring of child-friendly streets



            to interval distance and time-of-day variations. However,   to extract the pixels of streetscape features (Figure 5). This
            a consensus has emerged that the methodology produces   approach enhanced scene comprehension at both global and
            reliable results when the study area is sufficiently large (Kim   local scales by up-sampling and concatenating features across
            et al., 2021). For example, some previous studies have used   various scales, allowing efficient semantic segmentation by
            50 m intervals for street greenery research (Lu et al., 2019; Ye   accurately categorizing each pixel through the classification
            et al., 2019), while others applied sampling points every 50   layer. PSPNet was trained on the ADE20K dataset, which
            and 100 m (Law et al., 2020; Yang et al., 2019). In this study,   can segment 150 object classes. For this study, we focused on
            we selected a 30 m interval to capture SVIs, using Python   10 object classes of streetscape features relevant to children
            in conjunction with the Google Maps platform and Google   commuting to school, to evaluate streetscape features
            API to capture parameters such as geolocation coordinates,   contributing to the concept of “child-friendly cities.”
            orientation, perspective, and field of view. Each sample point   To evaluate PSPNet’s performance on the output
            included four images facing south, north, east, and west.  dataset, we used the pixel accuracy metric, the simplest
              We used detailed street mapping data obtained from   classification  accuracy  measure  that  calculates  the
            the Hong Kong GeoData website, which required street   percentage of correctly classified pixels in the image.
            consolidation, simplification, and topological treatment   For each row (each SVI), we calculated the sum of the
            of road networks to streamline the SVI data collection   correctly classified pixel proportions across all categories:
            process. For this study, we selected 29 living streets  in
            the central area of Sham Shui Po (Figure 4) and excluded   Total Accuracy _{ } i  =   ∑ _{j  =1}^{ }  _{ }O p ij  (I)
            expressways and elevated roads to focus on public streets   where  p_{ij}  represents the proportion of correctly
            that foster connectivity and socializing for local residents.
                                                               classified pixels for category j in row i, and O is the total
              To ensure quality in SVI extraction, precise parameters   number of object classes.
            for perspective and dimensions were set. Python’s urllib   Next, we calculated the average of the total accuracies
            module was used to download SVIs with strategic overlaps,   across all rows (SVIs):
            enabling four images from different angles to be merged into
            a single panoramic view. Tools like OpenCV and Numpy   Pixel Accuracy
            facilitated image integration, enhancing the efficiency of   =    (1 /   ) N  ∑ _{i =1}^{ } Total Accuracy _{ }N  i  (II)
            the image segmentation processes. Our project adopted the
            semantic segmentation method PSPNet (Zhao et al., 2017)   where N is the total number of images.
































                  Figure 4. Site location, street segmentation, and example of the street view image collection process. Source: Drawing by the authors



            Volume 7 Issue 1 (2025)                         7                        https://doi.org/10.36922/jcau.3578
   79   80   81   82   83   84   85   86   87   88   89