Open Access Open Access  Restricted Access Subscription Access

Machine Learning Based Prediction of Solar Power Generation Using Weather Data

Yash Jain Jain, Dr.Manish Kumar

Abstract


Accurate forecasting of solar photovoltaic (PV) power generation is critical for the efficient integration of renewable energy into modern power grids. This study presents a comparative evaluation of four supervised machine learning regression models — Linear Regression, Random Forest Regressor, Support Vector Regression (SVR), and XGBoost Regressor — for predicting DC power output from a solar PV installation. The dataset was constructed by merging solar generation records with meteorological data using aligned timestamps. Nighttime observations (solar irradiation = 0) were excluded from the analysis. The three principal input features employed were Solar Irradiation, Ambient Temperature, and Module Temperature, while the target variable was DC Power output. Experimental results demonstrate that ensemble-based models — Random Forest and XGBoost — achieve the highest predictive accuracy, attaining an R² of 0.968 with MAE values of 314.52 and 321.32, respectively. Linear Regression yielded moderate performance (R² = 0.956), while SVR exhibited substantially degraded accuracy (R² = 0.779, RMSE = 1828.67), attributed to its sensitivity to hyperparameter selection and high-variance input distributions. Solar irradiation was identified as the dominant predictor, consistent with the established physical relationship between photon flux and PV power conversion. These findings validate the superiority of ensemble methods for solar energy forecasting tasks and provide empirical guidance for model selection in grid-scale renewable energy management systems.


Full Text:

PDF

References


Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794. https://doi.org/10.1145/2939672.2939785

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324

Mellit, A., & Pavan, A. M. (2010). A 24-h forecast of solar irradiance using artificial neural network: Application for performance prediction of a grid-connected PV plant at Trieste, Italy. Solar Energy, 84(5), 807–821. https://doi.org/10.1016/j.solener.2010.02.006

Sharma, N., Mangalvedhe, N., & Parekh, A. (2019). Machine learning based solar irradiance forecasting using feature selection and ensemble methods. Renewable and Sustainable Energy Reviews, 116, 109360.

Liu, J., et al. (2020). Short-term photovoltaic power generation forecasting based on random forest feature selection and CEEMD: A case study. IEEE Access, 8, 29910–29927.

Wolff, B., et al. (2016). Comparing support vector regression for PV power forecasting to a physical modeling approach using measurement, numerical weather prediction, and cloud motion data. Solar Energy, 135, 197–208.

Zeng, J., & Qiao, W. (2013). Short-term solar power prediction using a support vector machine. Renewable Energy, 52, 118–127. https://doi.org/10.1016/j.renene.2012.10.009

Nelson, J. (2003). The Physics of Solar Cells. Imperial College Press. https://doi.org/10.1142/p276


Refbacks

  • There are currently no refbacks.