Sequential Hybrid Approach for Reliable Detection of Rainfall Pauses at the Beginning of the Rainy Season in Senegal: Towards a Predictive Tool for False Starts

Authors: Pape El Hadji Abdoulaye Gueye; Cherif Bachir Deme; Diery Ngom; Adrien Basse
DIN
IJOEAR-OCT-2025-50
Abstract

This work focuses on the detection of false onsets of the rainy season in Senegal, a critical factor that can lead farmers, particularly smallholders, to initiate agricultural activities prematurely. Such errors, caused by misleading early rainfall events, result in yield losses and increase farmers’ vulnerability to climate variability. Unlike existing methods, our approach incorporates statistical tests (such as Pettitt, Kendall, and Lombard) to enrich the input dataset with relevant change points related to rainfall, soil moisture, and vegetation. This enrichment step, combined with a formal detection of false onsets based on climatic, phenological, and statistical criteria, enhances the relevance, robustness, and contextualization of detection compared to purely statistical or physical approaches. In this context, a deep learning methodology was developed to identify false onsets at an early stage using multivariate climatic data. We designed a hybrid model combining LSTM, GRU, and multi-head attention layers to extract complementary representations of the input sequence. Model hyperparameters were optimized through Bayesian search to enhance detection performance. Results show consistent improvements across all key metrics: accuracy increased from 0.84 to 0.88, F1-score from 0.833 to 0.86, recall remained perfect at 1.0, precision rose from 0.767 to 0.81, and AUC improved from 0.900 to 0.92. These gains demonstrate the overall robustness of the optimized model, ensuring more reliable detection of false onsets.

Keywords
False onset Deep learning Statistical tests LSTM GRU Attention Bayesian optimization
Introduction

In semi-arid regions, particularly in the Sahel, agricultural drought—defined as a water deficit affecting crops and their productivity—constitutes a major risk to food security [1]. This region, and notably Senegal, is characterized by strong interannual rainfall variability [2], frequent delays in the onset of the rainy season, an extended dry season, and a continuous increase inland surface temperature (LST) [3, 4]. These climatic changes disrupt traditional agricultural cycles and increase the vulnerability of rainfed production systems.

A critical yet often overlooked phenomenon in this context is the false onset of the rainy season. It occurs as a series of weak initial rains—sometimes totaling about 20 mm over 2–3 days—followed by a dry spell. This sequence can create a false sense of security for farmers, who may sow prematurely, exposing their crops to early-season water stress that can significantly reduce yields. Despite extensive research on drought detection, the false onset—a key factor in shifting the rainfall calendar— has received relatively little attention, limiting the predictive capabilities of existing models [5]. Early detection is therefore essential to adjust sowing schedules, mitigate water stress risks, and support farmers’ decision-making. Recent advances in remote sensing and machine learning have greatly enhanced drought monitoring and forecasting. Remote sensing provides continuous spatio-temporal information despite the scarcity of in situ measurements [4]. Vegetation and soil moisture indices such as NDVI, VHI, LSWI, and SIF have been widely used to assess vegetation water stress [6–9]. Moreover, composite indices integrating precipitation, temperature, and remote sensing data (e.g., CDI [10], SMADI [11], IDSI [12]) offer improved insights into drought dynamics.

In parallel, machine learning models have shown strong potential for drought and rainfall forecasting, particularly when combined with physical constraints that enhance predictive realism [9, 13]. Explainable Artificial Intelligence (XAI) techniques further highlight the relative importance of climatic variables, showing for example that temperature can be more influential than precipitation [1].

Within the Sahelian context, the timing of the rainy season is strongly influenced by large-scale climate phenomena such as ENSO. Some approaches, such as Kohonen maps, have been used to classify the onset and cessation of rainy seasons [5], while logistic regression models based on climate predictors (SST, precipitable water, dew point temperature, winds) have been proposed for seasonal onset forecasting [14]. However, these statistical models often show limited interregional transferability and struggle to capture complex intra-seasonal dynamics—particularly those linked to false starts.

To address these challenges, recurrent neural networks (RNNs) have been increasingly applied in hydrological and climatic studies. LSTM networks have proven effective for long-term temporal dependencies [15, 16], whereas GRU architectures better handle short-term fluctuations [20]. Transformer models, on the other hand, excel at capturing non-local relationships across long sequences. Hybrid architectures that integrate these models have recently emerged as a promising direction, leveraging heterogeneous data such as meteorological time series, soil properties, and remote sensing observations [18]. However, in West African contexts, the scarcity and heterogeneity of ground and satellite data remain a key limitation. Existing studies often overlook the false onset phenomenon, focusing instead on general drought indices or seasonal rainfall patterns. Statistical models alone lack the capacity for robust interregional generalization, while purely neural approaches may suffer from overfitting or reduced interpretability.

To overcome these limitations, this study proposes a hybrid neural architecture combining LSTM, GRU, and Transformer components for the detection and early prediction of false rainy season onsets in Senegal. The approach leverages multisource climatic and phenological data, enriched by statistical change-point detection tests (Pettitt, Kendall, Lombard) to identify structural shifts in rainfall, soil moisture, and vegetation dynamics. By integrating climatic, phenological, and statistical criteria, we define a False Onset Index (FOI) that enables more robust and realistic detection of false starts, improving the reliability of seasonal forecasts and supporting farmers in optimizing their sowing calendars.

The remainder of this paper is structured as follows: Section 2 describes the dataset, the LSTM–GRU–Transformer hybrid model, and hyperparameter optimization. Section 3 presents the results, including performance evaluation, robustness analysis, SHAP-based interpretability, and ablation studies. Section 4 also discusses the implications, limitations, and perspectives of the study, and Section 5 concludes by summarizing the main contributions.

Conclusion

This work demonstrated the effectiveness of a hybrid model combining LSTM, GRU, and multi-head attention for detecting false starts of the rainy season. Bayesian hyper-parameter optimization significantly improved all key metrics, including accuracy, F1-score, and AUC, ensuring reliable and comprehensive detection of critical events. SHAP value analysis enhanced the understanding of the respective contributions of climatic variables, reinforcing the model’stransparency and robustness. Despite limitations related to the quality and spatial resolution of the data used, this hybrid approach outperforms traditional methods and shows promising potential for operational deployment as a decision-support tool for agricultural stakeholders. Finally, this work highlights the importance of a continuous improvement approach, based on integrating higher-resolution data and thorough field validation. These steps are essential to optimize the agronomic impact of the model and strengthen the resilience of agricultural systems in the face of increasing climate variability.

Agriculture Journal IJOEAR Call for Papers

Article Preview