| 英文摘要 |
Most previous studies have manually labeled judgment data and used statistical methods to summarize their characteristics. Therefore, this study aims to extract key information from unstructured data, perform feature engineering through regular expressions, and find frequently occurring item sets. , helping auditors select high-risk government procurement cases more efficiently and accurately. This study uses Apriori and LDA topic analysis to identify the characteristics and frequent combinations of manufacturers, procurement bids, and bidding agencies that violate government procurement laws in the data set. This method can mark procurement cases with the characteristics of bid-rigging cases and summarize frequent items involving violations of common characteristics of government procurement cases. The research results found that the manufacturer is a limited company with a registered capital between NT$1 million and NT$10 million. The probability of occurrence of those who participate in the lowest bid amount is 0.62, which is a relatively frequent project; civil engineering contracting, construction business and local government partners are close to each other. These findings can become one of the references for selecting bid documents under limited audit manpower. |