| 英文摘要 |
To promote the safety of imported food at the border, a border prediction intelligent system has been implemented by leveraging the Food Cloud of TFDA and open data from domestic and international sources, combining data science and machine learning, and using artificial intelligence models for decision support. This study reviewed the encoding methods for categorical factors in the current system and explored other commonly used encoding methods to evaluate their impact on the modeling. According to the analysis results, the current encoding method for categorical factors exhibited higher sensitivity and lower precision. In contrast, the model constructed using target encoding method in this study demonstrated lower sensitivity and higher precision. However, while validating with test data set, the effectiveness of high sensitivity characteristics was not significant, so that the model constructed using target encoding method outperformed the current one. It is recommended to evaluate the models using target encoding method for categorical factors in the border prediction intelligent system, in order to optimize its predictive capabilities and ensure the safety of imported food at the border. |