TY - JOUR
T1 - Machine learning-enhanced Data Envelopment Analysis via multi-objective variable selection for benchmarking combined electricity distribution performance
AU - Dong, Hanjiang
AU - Wang, Xiuyuan
AU - Cui, Ziyu
AU - Zhu, Jizhong
AU - Li, Shenglin
AU - Yu, Changyuan
N1 - Publisher Copyright:
© 2025 Elsevier B.V.
PY - 2025/3
Y1 - 2025/3
N2 - The Data Envelopment Analysis (DEA) model can be interpreted as a machine learning-based Sign-constrained case of Convex Nonparametric Least-Squares (SCNLS) regression. However, the selection of input variables in SCNLS-based DEA for benchmarking electricity distribution performance is only for the single-output setting. This paper proposes a Multi-task Least Absolute Shrinkage and Selection Operator (M-LASSO)-enhanced DEA framework that evaluates combined performance scores of distributors in an end-to-end manner. A benchmarking criterion associated with M-LASSO-enhanced DEA, namely performance satisfaction measure, is defined to facilitate the multi-objective variable selection in M-LASSO-enhanced DEA. For implementation, we develop the Corrected M-LASSO (CM-LASSO) method as a two-stage solution approach for the M-LASSO-enhanced DEA model. Using Monte Carlo simulation data, the comparison among SCNLS-based, LASSO-enhanced, and M-LASSO-enhanced DEA models indicates the effectiveness of multi-objective variable selection. Using a real-world dataset from 1993 to 2021, the performance scores of distributors in China demonstrate a trend towards increasingly converging composite performance. This trend, which contains economic efficiency, supply reliability, and environmental sustainability, supports further deregulation in the context of achieving emission peak targets and advancing towards carbon neutrality.
AB - The Data Envelopment Analysis (DEA) model can be interpreted as a machine learning-based Sign-constrained case of Convex Nonparametric Least-Squares (SCNLS) regression. However, the selection of input variables in SCNLS-based DEA for benchmarking electricity distribution performance is only for the single-output setting. This paper proposes a Multi-task Least Absolute Shrinkage and Selection Operator (M-LASSO)-enhanced DEA framework that evaluates combined performance scores of distributors in an end-to-end manner. A benchmarking criterion associated with M-LASSO-enhanced DEA, namely performance satisfaction measure, is defined to facilitate the multi-objective variable selection in M-LASSO-enhanced DEA. For implementation, we develop the Corrected M-LASSO (CM-LASSO) method as a two-stage solution approach for the M-LASSO-enhanced DEA model. Using Monte Carlo simulation data, the comparison among SCNLS-based, LASSO-enhanced, and M-LASSO-enhanced DEA models indicates the effectiveness of multi-objective variable selection. Using a real-world dataset from 1993 to 2021, the performance scores of distributors in China demonstrate a trend towards increasingly converging composite performance. This trend, which contains economic efficiency, supply reliability, and environmental sustainability, supports further deregulation in the context of achieving emission peak targets and advancing towards carbon neutrality.
KW - Data envelopment analysis
KW - Electricity distribution
KW - Machine learning
KW - Variable selection
UR - http://www.scopus.com/inward/record.url?scp=85216306777&partnerID=8YFLogxK
U2 - 10.1016/j.eneco.2025.108226
DO - 10.1016/j.eneco.2025.108226
M3 - Journal article
AN - SCOPUS:85216306777
SN - 0140-9883
VL - 143
JO - Energy Economics
JF - Energy Economics
M1 - 108226
ER -