Exploring Football Player Salary Prediction Using Random Forest: Leveraging Player Demographics and Team Associations
Isi Artikel Utama
Abstrak
This paper explores the prediction of football player salaries using a Random Forest Regressor model, leveraging player demographics and team associations as key features. The dataset consists of 684 football players, including variables such as age, nationality, position, team, weekly salary, and annual salary. The study applies exploratory data analysis (EDA) to understand the distribution of these features and identify patterns within the dataset. Data preprocessing involves handling missing values, one-hot encoding categorical variables, and splitting the dataset into training and testing sets. The Random Forest model is trained on the preprocessed data, and its performance is evaluated using common regression metrics, including R-squared (R²), Mean Absolute Error (MAE), and Root Mean Squared Error (RMSE). The results show that the model explains approximately 48.5% of the variance in player salaries, with an MAE of £1.92 million and an RMSE of £2.82 million. Key predictors of salary include player age, position, nationality, and team. The analysis of feature importance reveals that categorical variables such as Nation and Team have a significant impact on salary predictions. However, the model's performance is constrained by the lack of more granular data, such as player performance metrics or external economic factors. This research provides valuable insights for football team management, helping teams understand which factors contribute to salary setting and enabling more informed decisions in player recruitment and contract negotiations. It also highlights the potential for sponsorships to target players based on these predictive attributes. Future work could explore the integration of more advanced machine learning techniques and additional player data to improve predictive accuracy and model robustness.
Rincian Artikel

Artikel ini berlisensiCreative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with International Journal for Applied Information Management agree to the following terms: Authors retain copyright and grant the International Journal for Applied Information Management right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material) the work for any purpose, even commercially with an acknowledgement of the work's authorship and initial publication in International Journal for Applied Information Management. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in International Journal for Applied Information Management. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).