英文摘要 |
While loop or image detectors have been frequently adopted to collect traffic flow data as a basis for predicting and estimating travel time, missing values is an inevitable issue in real operations. Mean and moving average values based on historical data are common choices to replace missing values in past studies, which do not consider the features of vehicle flow continuation and lagging. To resolve this issue, this study proposes a novel approach which is based on data mining technique by combining the traffic information of traffic detector itself and its adjacent detectors. First, a regression model representing all road sections was developed based on the original historical traffic data of each loop detector. A decision tree was then established using Classification And Regression Tree (CART) to connect each detection point to the adjacent detectors and the Electronic Toll Collection (ETC) travel time on the associated road section. Finally, missing data were imputed based on the developed CART model. The empirical study showed that the CART imputation method based on traffic state works effectively to impute data with missing values, especially under the circumstance of long-period data missing. Moreover, it was found that under circumstances with different number of missing time-windows, hybrid imputation strategies fit better in meeting varying real-time needs. |