Abstract
Machine learning algorithm emerges as a promising approach in energy fields, but its practicality is hindered by data barriers, stemming from high collection costs and privacy concerns. This study introduces a novel federated learning (FL) framework based on XGBoost models, enabling safe collaborative modeling with accessible yet concealed data from multiple parties. Hyperparameter tuning of the models is achieved through Bayesian Optimization. To ascertain the merits of the proposed FL-XGBoost method, a comparative analysis is conducted between separate and centralized models to address a classical binary classification problem in the geoenergy sector. The results reveal that the proposed FL framework strikes an optimal balance between privacy and accuracy. FL models demonstrate superior accuracy and generalization capabilities compared to separate models, particularly for participants with limited data or low correlation features, and offer significant privacy benefits compared to centralized models. The aggregated optimization approach within the FL agreement proves effective in tuning hyperparameters. This study opens new avenues for assessing unconventional reservoirs through collaborative and privacy-preserving FL techniques.
Original language | English |
---|---|
Article number | 123293 |
Journal | Applied Energy |
Volume | 367 |
DOIs | |
Publication status | Published - 1 Aug 2024 |
Keywords
- Bayesian optimization
- Data privacy and security
- Federated learning
- Geoenergy
- Production estimation
ASJC Scopus subject areas
- Building and Construction
- Renewable Energy, Sustainability and the Environment
- Mechanical Engineering
- General Energy
- Management, Monitoring, Policy and Law