| 英文摘要 |
This study analyzes river water quality data from 2020 to 2022, categorizing it into non-metallic and metallic groups. Using decision trees, random forests, SVM, XGBoost, and association analysis, the study evaluates model accuracy and identifies key factors influencing pollution. XGBoost achieved the highest predictive accuracy across both groups. For non-metallic data, major pollutants include suspended solids, ammonia nitrogen, biochemical oxygen demand (BOD), and chemical oxygen demand (COD). For metallic data, total phosphorus is the primary pollutant. Association analysis highlights key contributors like dissolved oxygen, conductivity, E. coli, mercury, and nitrite nitrogen. The findings suggest ammonia nitrogen and total phosphorus as critical pollution indicators. Recommendations include stricter wastewater controls in industrial and livestock areas and focusing on regions with high levels of suspended solids, BOD, and COD to improve river water quality. |