| 英文摘要 |
This study developed a stacking model to predict passenger flow at a bus station in Taichung’s 7th Redevelopment Zone during the anniversary sale of a department store in the area. The study collected data on search engine index trends, ticketing information, and other relevant temporal and external factors corresponding to the sale period. A total of 19 features were considered, and an importance-performance analysis was conducted. The analysis results revealed that the top five most important features for passenger flow prediction were“previous_30_minutes_volume,”“same_time_2_weeks_ago_average_volume,”“same_time_previous_day_volume,”“same_time_3_weeks_ago_average_volume,”and“same_time_4_weeks_ago_average_volume.”The final prediction results indicated that when all considered features were selected, the developed model exhibited optimal performance, with its mean absolute error (MAE) and R2 value being 14.64 and 0.86, respectively, and a processing time of 319 seconds. When only the five most important features were used in the model, the MAE increased to 17.51. Therefore, other features were added in sequence to the top five features. The addition of“search engine index volume”caused the greatest improvement in the model performance, with the MAE reducing to 15.17 and the processing time significantly reduced to 143 seconds. The results of this study indicate that while using all features provides the best prediction accuracy, it also requires the longest processing time. If computational resources are limited, adding the“search engine index volume”to the top five most important features provides a good balance between prediction accuracy and computational efficiency during special events, making it a viable alternative. |