Forecasting AI Model Computational Requirements Using Random Forest and XGBoost with Entity and Domain Characteristics
Isi Artikel Utama
Abstrak
This research aims to predict the computational power required by artificial intelligence (AI) models, specifically measured in petaFLOP (Floating Point Operations Per Second), based on their domain and entity characteristics. The study employs Random Forest and XGBoost regression models to predict the amount of computational power needed by AI models. Both models were trained on a dataset that includes features such as the training year, domain (e.g., Language, Vision), and entity characteristics. The results demonstrate that the Random Forest model outperforms XGBoost in terms of prediction accuracy, as indicated by higher R-squared values and lower error metrics. Feature importance analysis revealed that the year of training and domain were the most significant predictors of computational power, with the Language domain emerging as the most influential in both models. The findings highlight the potential for machine learning models to forecast AI computational requirements, which can aid organizations in optimizing computational resources for AI projects. However, the study faces limitations due to data sparsity, particularly in the target variable, and the relatively simple nature of the models employed. Future work should explore incorporating additional features, such as hardware specifications, and leveraging deep learning models to better capture the complexity of AI computational demands. This study lays the groundwork for further research into more precise predictions of AI model resource consumption, helping organizations plan their AI initiatives more effectively.
Rincian Artikel

Artikel ini berlisensiCreative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with International Journal for Applied Information Management agree to the following terms: Authors retain copyright and grant the International Journal for Applied Information Management right of first publication with the work simultaneously licensed under a Creative Commons Attribution License (CC BY-SA 4.0) that allows others to share (copy and redistribute the material in any medium or format) and adapt (remix, transform, and build upon the material) the work for any purpose, even commercially with an acknowledgement of the work's authorship and initial publication in International Journal for Applied Information Management. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in International Journal for Applied Information Management. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).