Big data analytics for seasonal crop patterns: integrating machine learning techniques

Authors

DOI:

https://doi.org/10.15587/1729-4061.2024.315066

Keywords:

seasonal crop patterns, random forest, grid search, SHAP, LIME, cluster analysis, predictive model, climate variability, local agricultural, model accuracy

Abstract

This study addresses the challenge of predicting rice growing season lengths, crucial for agricultural planning in tropical regions. Climate variability and season timing create uncertainties in decision-making, and while machine learning is widely used in agriculture, a gap persists in integrating spatial-temporal data for accurate season length prediction and region-specific pattern analysis influenced by rainfall. Using a combination of Random Forest algorithms with hyperparameter optimization (grid search), and clustering techniques such as PCA, K-Means, and Hierarchical Clustering, this study analyzes key features such as the start of the season (SOS), end of the season (EOS), and their significance indicators (sig_sos and sig_eos). The findings reveal a strong correlation (0.98) between SOS and EOS, with an optimal growing season ranging from day 93 to day 207 (113.82 days). The Random Forest model, optimized with Grid Search, achieved a MSE of 28.9474 and an R2 of 0.8636, showing an outstanding predictive result. SHAP and LIME analyses identified sos and eos as the most influential predictors, while cluster analysis highlighted three distinct growing season groups characterized by variations in rainfall and seasonal stability. These results underscore the importance of understanding localized agricultural conditions and provide actionable insights for optimizing planting schedules, resource allocation, and climate adaptation strategies. By integrating advanced machine learning techniques with spatial-temporal data, this study establishes a foundation for improving agricultural resilience and sustainability in the face of climate variability

Author Biographies

Roni Yunis, Universitas Mikroskil

Assistant Professor

Department of Information Systems

Arwin Halim, Universitas Mikroskil

Assistant Professor

Department of Informatics

Irpan Adiputra Pardosi, Universitas Mikroskil

Assistant Professor

Department of Informatics

References

  1. El Bilali, H., Henri Nestor Bassole, I., Dambo, L., Berjan, S. (2020). Climate change and food security. The Journal “Agriculture and Forestry,” 66 (3). https://doi.org/10.17707/agricultforest.66.3.16
  2. Molotoks, A., Smith, P., Dawson, T. P. (2020). Impacts of land use, population, and climate change on global food security. Food and Energy Security, 10 (1). https://doi.org/10.1002/fes3.261
  3. Favas, C., Cresta, C., Whelan, E., Smith, K., Manger, M. S., Chandrasenage, D. et al. (2024). Exploring food system resilience to the global polycrisis in six Asian countries. Frontiers in Nutrition, 11. https://doi.org/10.3389/fnut.2024.1347186
  4. Adesete, A. A., Olanubi, O. E., Dauda, R. O. (2022). Climate change and food security in selected Sub-Saharan African Countries. Environment, Development and Sustainability, 25 (12), 14623–14641. https://doi.org/10.1007/s10668-022-02681-0
  5. Javadi, A., Ghahremanzadeh, M., Sassi, M., Javanbakht, O., Hayati, B. (2022). Economic evaluation of the climate changes on food security in Iran: application of CGE model. Theoretical and Applied Climatology, 151 (1-2), 567–585. https://doi.org/10.1007/s00704-022-04289-w
  6. Sirsat, M. S., Mendes-Moreira, J., Ferreira, C., Cunha, M. (2019). Machine Learning predictive model of grapevine yield based on agroclimatic patterns. Engineering in Agriculture, Environment and Food, 12 (4), 443–450. https://doi.org/10.1016/j.eaef.2019.07.003
  7. Apat, S. K., Mishra, J., Srujan Raju, K., Padhy, N. (2022). State of the Art of Ensemble Learning Approach for Crop Prediction. Next Generation of Internet of Things, 675–685. https://doi.org/10.1007/978-981-19-1412-6_58
  8. Chlingaryan, A., Sukkarieh, S., Whelan, B. (2018). Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computers and Electronics in Agriculture, 151, 61–69. https://doi.org/10.1016/j.compag.2018.05.012
  9. Sharma, A., Jain, A., Gupta, P., Chowdary, V. (2021). Machine Learning Applications for Precision Agriculture: A Comprehensive Review. IEEE Access, 9, 4843–4873. https://doi.org/10.1109/access.2020.3048415
  10. Çeliktopuz, E. (2024). A Detailed Examination of Türkiye’s Projected Precipitation and Growth Season Trends under Climate Change Condition. Black Sea Journal of Agriculture, 7 (3), 215–223. https://doi.org/10.47115/bsagriculture.1416956
  11. Nainggolan, D., Abay, A. T., Christensen, J. H., Termansen, M. (2023). The impact of climate change on crop mix shift in the Nordic region. Scientific Reports, 13 (1). https://doi.org/10.1038/s41598-023-29249-w
  12. Benos, L., Tagarakis, A. C., Dolias, G., Berruto, R., Kateris, D., Bochtis, D. (2021). Machine Learning in Agriculture: A Comprehensive Updated Review. Sensors, 21 (11), 3758. https://doi.org/10.3390/s21113758
  13. Sharma, P., Dadheech, P., Aneja, N., Aneja, S. (2023). Predicting Agriculture Yields Based on Machine Learning Using Regression and Deep Learning. IEEE Access, 11, 111255–111264. https://doi.org/10.1109/access.2023.3321861
  14. Ip, R. H. L., Ang, L.-M., Seng, K. P., Broster, J. C., Pratley, J. E. (2018). Big data and machine learning for crop protection. Computers and Electronics in Agriculture, 151, 376–383. https://doi.org/10.1016/j.compag.2018.06.008
  15. Mavume, A. F., Banze, B. E., Macie, O. A., Queface, A. J. (2021). Analysis of Climate Change Projections for Mozambique under the Representative Concentration Pathways. Atmosphere, 12 (5), 588. https://doi.org/10.3390/atmos12050588
  16. Mishra, B., Busetto, L., Boschetti, M., Laborte, A. G., Nelson, A. (2023). Data underlying the research on RICA: Rice Crop Calendar for Asia. 4TU.ResearchData. https://doi.org/10.4121/13468929
  17. CMIP6 Climate Projections (2021). Climate Data Store. https://doi.org/10.24381/cds.c866074c
  18. Guo, X., Hao, P. (2021). Using a Random Forest Model to Predict the Location of Potential Damage on Asphalt Pavement. Applied Sciences, 11 (21), 10396. https://doi.org/10.3390/app112110396
  19. Fernandez-Gonzalez, P., Bielza, C., Larranaga, P. (2019). Random Forests for Regression as a Weighted Sum of k-Potential Nearest Neighbors. IEEE Access, 7, 25660–25672. https://doi.org/10.1109/access.2019.2900755
  20. Zaib, R., Ourabah, O. (2023). Large Scale Data Using K-Means. Mesopotamian Journal of Big Data, 2023, 36–45. https://doi.org/10.58496/mjbd/2023/006
  21. Shajalal, M., Boden, A., Stevens, G. (2024). ForecastExplainer: Explainable household energy demand forecasting by approximating shapley values using DeepLIFT. Technological Forecasting and Social Change, 206, 123588. https://doi.org/10.1016/j.techfore.2024.123588
  22. Bhandary, A., Dobariya, V., Yenduri, G., Jhaveri, R. H., Gochhait, S., Benedetto, F. (2024). Enhancing Household Energy Consumption Predictions Through Explainable AI Frameworks. IEEE Access, 12, 36764–36777. https://doi.org/10.1109/access.2024.3373552
  23. Chicco, D., Warrens, M. J., Jurman, G. (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7, e623. https://doi.org/10.7717/peerj-cs.623
Big data analytics for seasonal crop patterns: integrating machine learning techniques

Downloads

Published

2024-12-27

How to Cite

Yunis, R., Halim, A., & Pardosi, I. A. (2024). Big data analytics for seasonal crop patterns: integrating machine learning techniques. Eastern-European Journal of Enterprise Technologies, 6(4 (132), 46–56. https://doi.org/10.15587/1729-4061.2024.315066

Issue

Section

Mathematics and Cybernetics - applied aspects