Editor:
Universidad Carlos III de Madrid. Departamento de Estadística
Issued date:
2022-10-03
ISSN:
2387-0303
Sponsor:
The authors gratefully acknowledge the financial support from the Spanish government through projects
PID2020-116694GB-I00 and from the Madrid Government (Comunidad de Madrid) under the Multiannual
Agreement with UC3M in the line of “Fostering Young Doctors Research” (ZEROGASPAIN-CM-UC3M), and
in the context of the V PRICIT (Regional Programme of Research and Technological Innovation).
Serie/No.:
Working paper Statistics and Econometrics 21-09
Project:
Gobierno de España. PID2020-116694GB-I00
Keywords:
Or in energy
,
Data-Driven
,
Electricity Retailer
,
Hyperparameter Selection
,
Machine Learning
Rights:
Atribución-NoComercial-SinDerivadas 3.0 España
Abstract:
We present a data-driven framework for optimal scenario selection in stochastic optimization with applications in power markets. The proposed methodology relies in the existence of auxiliary information and the use of machine learning techniques to narrow the We present a data-driven framework for optimal scenario selection in stochastic optimization with applications in power markets. The proposed methodology relies in the existence of auxiliary information and the use of machine learning techniques to narrow the set of possible realizations (scenarios) of the variables of interest. In particular, we implement a novel validation algorithm that allows optimizing each machine learning hyperparameter to further improve the prescriptive power of the resulting set of scenarios. Supervised machine learning techniques are examined, including kNN and decision trees, and the validation process is adapted to work with time-dependent datasets. Moreover, we extend the proposed methodology to work with unsupervised techniques with promising results. We test the proposed methodology in a realistic power market application: optimal trading strategy in forward and spot markets for an electricity retailer under uncertain spot prices. Results indicate that the retailer can greatly benefit from the proposed data-driven methodology and improve its market performance. Moreover, we perform an extensive set of numerical simulations to analyze under which conditions the best machine learning hyperparameters, in terms of prescriptive performance, differ from those that provide the best predictive accuracy.[+][-]