Big data analytics for seasonal crop patterns: integrating machine learning techniques
DOI:
https://doi.org/10.15587/1729-4061.2024.315066Keywords:
seasonal crop patterns, random forest, grid search, SHAP, LIME, cluster analysis, predictive model, climate variability, local agricultural, model accuracyAbstract
This study addresses the challenge of predicting rice growing season lengths, crucial for agricultural planning in tropical regions. Climate variability and season timing create uncertainties in decision-making, and while machine learning is widely used in agriculture, a gap persists in integrating spatial-temporal data for accurate season length prediction and region-specific pattern analysis influenced by rainfall. Using a combination of Random Forest algorithms with hyperparameter optimization (grid search), and clustering techniques such as PCA, K-Means, and Hierarchical Clustering, this study analyzes key features such as the start of the season (SOS), end of the season (EOS), and their significance indicators (sig_sos and sig_eos). The findings reveal a strong correlation (0.98) between SOS and EOS, with an optimal growing season ranging from day 93 to day 207 (113.82 days). The Random Forest model, optimized with Grid Search, achieved a MSE of 28.9474 and an R2 of 0.8636, showing an outstanding predictive result. SHAP and LIME analyses identified sos and eos as the most influential predictors, while cluster analysis highlighted three distinct growing season groups characterized by variations in rainfall and seasonal stability. These results underscore the importance of understanding localized agricultural conditions and provide actionable insights for optimizing planting schedules, resource allocation, and climate adaptation strategies. By integrating advanced machine learning techniques with spatial-temporal data, this study establishes a foundation for improving agricultural resilience and sustainability in the face of climate variability
References
- El Bilali, H., Henri Nestor Bassole, I., Dambo, L., Berjan, S. (2020). Climate change and food security. The Journal “Agriculture and Forestry,” 66 (3). https://doi.org/10.17707/agricultforest.66.3.16
- Molotoks, A., Smith, P., Dawson, T. P. (2020). Impacts of land use, population, and climate change on global food security. Food and Energy Security, 10 (1). https://doi.org/10.1002/fes3.261
- Favas, C., Cresta, C., Whelan, E., Smith, K., Manger, M. S., Chandrasenage, D. et al. (2024). Exploring food system resilience to the global polycrisis in six Asian countries. Frontiers in Nutrition, 11. https://doi.org/10.3389/fnut.2024.1347186
- Adesete, A. A., Olanubi, O. E., Dauda, R. O. (2022). Climate change and food security in selected Sub-Saharan African Countries. Environment, Development and Sustainability, 25 (12), 14623–14641. https://doi.org/10.1007/s10668-022-02681-0
- Javadi, A., Ghahremanzadeh, M., Sassi, M., Javanbakht, O., Hayati, B. (2022). Economic evaluation of the climate changes on food security in Iran: application of CGE model. Theoretical and Applied Climatology, 151 (1-2), 567–585. https://doi.org/10.1007/s00704-022-04289-w
- Sirsat, M. S., Mendes-Moreira, J., Ferreira, C., Cunha, M. (2019). Machine Learning predictive model of grapevine yield based on agroclimatic patterns. Engineering in Agriculture, Environment and Food, 12 (4), 443–450. https://doi.org/10.1016/j.eaef.2019.07.003
- Apat, S. K., Mishra, J., Srujan Raju, K., Padhy, N. (2022). State of the Art of Ensemble Learning Approach for Crop Prediction. Next Generation of Internet of Things, 675–685. https://doi.org/10.1007/978-981-19-1412-6_58
- Chlingaryan, A., Sukkarieh, S., Whelan, B. (2018). Machine learning approaches for crop yield prediction and nitrogen status estimation in precision agriculture: A review. Computers and Electronics in Agriculture, 151, 61–69. https://doi.org/10.1016/j.compag.2018.05.012
- Sharma, A., Jain, A., Gupta, P., Chowdary, V. (2021). Machine Learning Applications for Precision Agriculture: A Comprehensive Review. IEEE Access, 9, 4843–4873. https://doi.org/10.1109/access.2020.3048415
- Çeliktopuz, E. (2024). A Detailed Examination of Türkiye’s Projected Precipitation and Growth Season Trends under Climate Change Condition. Black Sea Journal of Agriculture, 7 (3), 215–223. https://doi.org/10.47115/bsagriculture.1416956
- Nainggolan, D., Abay, A. T., Christensen, J. H., Termansen, M. (2023). The impact of climate change on crop mix shift in the Nordic region. Scientific Reports, 13 (1). https://doi.org/10.1038/s41598-023-29249-w
- Benos, L., Tagarakis, A. C., Dolias, G., Berruto, R., Kateris, D., Bochtis, D. (2021). Machine Learning in Agriculture: A Comprehensive Updated Review. Sensors, 21 (11), 3758. https://doi.org/10.3390/s21113758
- Sharma, P., Dadheech, P., Aneja, N., Aneja, S. (2023). Predicting Agriculture Yields Based on Machine Learning Using Regression and Deep Learning. IEEE Access, 11, 111255–111264. https://doi.org/10.1109/access.2023.3321861
- Ip, R. H. L., Ang, L.-M., Seng, K. P., Broster, J. C., Pratley, J. E. (2018). Big data and machine learning for crop protection. Computers and Electronics in Agriculture, 151, 376–383. https://doi.org/10.1016/j.compag.2018.06.008
- Mavume, A. F., Banze, B. E., Macie, O. A., Queface, A. J. (2021). Analysis of Climate Change Projections for Mozambique under the Representative Concentration Pathways. Atmosphere, 12 (5), 588. https://doi.org/10.3390/atmos12050588
- Mishra, B., Busetto, L., Boschetti, M., Laborte, A. G., Nelson, A. (2023). Data underlying the research on RICA: Rice Crop Calendar for Asia. 4TU.ResearchData. https://doi.org/10.4121/13468929
- CMIP6 Climate Projections (2021). Climate Data Store. https://doi.org/10.24381/cds.c866074c
- Guo, X., Hao, P. (2021). Using a Random Forest Model to Predict the Location of Potential Damage on Asphalt Pavement. Applied Sciences, 11 (21), 10396. https://doi.org/10.3390/app112110396
- Fernandez-Gonzalez, P., Bielza, C., Larranaga, P. (2019). Random Forests for Regression as a Weighted Sum of k-Potential Nearest Neighbors. IEEE Access, 7, 25660–25672. https://doi.org/10.1109/access.2019.2900755
- Zaib, R., Ourabah, O. (2023). Large Scale Data Using K-Means. Mesopotamian Journal of Big Data, 2023, 36–45. https://doi.org/10.58496/mjbd/2023/006
- Shajalal, M., Boden, A., Stevens, G. (2024). ForecastExplainer: Explainable household energy demand forecasting by approximating shapley values using DeepLIFT. Technological Forecasting and Social Change, 206, 123588. https://doi.org/10.1016/j.techfore.2024.123588
- Bhandary, A., Dobariya, V., Yenduri, G., Jhaveri, R. H., Gochhait, S., Benedetto, F. (2024). Enhancing Household Energy Consumption Predictions Through Explainable AI Frameworks. IEEE Access, 12, 36764–36777. https://doi.org/10.1109/access.2024.3373552
- Chicco, D., Warrens, M. J., Jurman, G. (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Computer Science, 7, e623. https://doi.org/10.7717/peerj-cs.623
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Roni Yunis, Arwin Halim, Irpan Adiputra Pardosi
This work is licensed under a Creative Commons Attribution 4.0 International License.
The consolidation and conditions for the transfer of copyright (identification of authorship) is carried out in the License Agreement. In particular, the authors reserve the right to the authorship of their manuscript and transfer the first publication of this work to the journal under the terms of the Creative Commons CC BY license. At the same time, they have the right to conclude on their own additional agreements concerning the non-exclusive distribution of the work in the form in which it was published by this journal, but provided that the link to the first publication of the article in this journal is preserved.
A license agreement is a document in which the author warrants that he/she owns all copyright for the work (manuscript, article, etc.).
The authors, signing the License Agreement with TECHNOLOGY CENTER PC, have all rights to the further use of their work, provided that they link to our edition in which the work was published.
According to the terms of the License Agreement, the Publisher TECHNOLOGY CENTER PC does not take away your copyrights and receives permission from the authors to use and dissemination of the publication through the world's scientific resources (own electronic resources, scientometric databases, repositories, libraries, etc.).
In the absence of a signed License Agreement or in the absence of this agreement of identifiers allowing to identify the identity of the author, the editors have no right to work with the manuscript.
It is important to remember that there is another type of agreement between authors and publishers – when copyright is transferred from the authors to the publisher. In this case, the authors lose ownership of their work and may not use it in any way.