| 英文摘要 |
Introduction: Predicting the outcomes of sports competitions has become an intriguing area of interest for sports enthusiasts. With technological advancements and the rise of data science, various sports have increasingly emphasized data-driven predictions. Evaluating machine learning models involves examining numerous metrics; in this study, we focus on accuracy as the primary dependent variable to assess machine learning's potential in predicting team sports outcomes. Methods: This paper reviews relevant literature on predicting team sports outcomes from 1990 to 2022, screening a total of 534 papers. Each paper was sequentially evaluated, resulting in the final selection of 15 papers. The abstracts of these selected papers were visualized using word clouds, and the results were analyzed. Results: Findings indicate that machine learning models exhibit varying prediction rates depending on the attributes and quantity of features in the competitions. Sports with lower scores are more challenging to predict accurately, while those with higher scores can achieve accurate predictions with fewer features. Overall, basketball shows higher prediction rates with fewer features, whereas soccer requires more features but does not yield significantly higher accuracy. Models such as artificial neural networks and decision trees demonstrated higher accuracy compared to other models. Conclusion: Data dimensionality is a crucial factor influencing prediction rates, with higherdimensional data often leading to improved prediction accuracy. This study also observed that feature engineering enhances overall prediction accuracy. However, achieving high prediction accuracy remains challenging in sports with numerous uncontrollable factors that are not included in the models. With the development of machine learning, more empirical research is needed to increase the acceptance of data-driven approaches in sports. Leveraging accumulated historical data from competitions as training data for machine learning models can facilitate model validation and refinement. Future research could explore specialized sports and broaden the scope of investigation. |