Skip to contents Skip to Global Navigation Menu
  • KDCA
  • Contact us
  • E-Submission

PHRP : Osong Public Health and Research Perspectives

OPEN ACCESS. pISSN: 2210-9099. eISSN: 2233-6052
Original Article

Time-series decomposition and modeling of dengue cases in Malaysia, 2022–2024: a nationwide observational study

Osong Public Health and Research Perspectives 2026;17(1):50-60.
Published online: January 27, 2026

1Department of Community Medicine, School of Medical Science, Universiti Sains Malaysia, Kubang Kerian, Malaysia

2Disease Control Division, Ministry of Health Malaysia, Putrajaya, Malaysia

Corresponding author: Kamarul Imran Musa Department of Community Medicine, School of Medical Science, Health Campus Universiti Sains Malaysia, 16150, Kubang Kerian, Kelantan, Malaysia E-mail: drkamarul@usm.my
• Received: September 22, 2025   • Revised: December 15, 2025   • Accepted: December 18, 2025

© 2026 Korea Disease Control and Prevention Agency.

This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).

  • 1,339 Views
  • 108 Download
prev next
  • Objectives
    This study aimed to examine the temporal dynamics of dengue cases in Malaysia from 2022 to 2024 using seasonal-trend decomposition and time-series modeling.
  • Methods
    Weekly dengue case counts from the national registry were analyzed across all states using seasonal-trend decomposition using LOESS (STL) to separate trend, seasonal, and irregular components. Autoregressive integrated moving average (ARIMA) and seasonal ARIMA (SARIMA) models were fitted to validate temporal structures, with model selection based on the Akaike information criterion (AIC), corrected AIC, and Bayesian information criterion. Diagnostic checks, including residual analysis and Ljung-Box testing, were performed to ensure model adequacy.
  • Results
    Dengue incidence showed marked heterogeneity across states. STL decomposition indicated that long-term trends contributed more strongly to case dynamics than seasonality in most states, although seasonal influences were significant in the states of Kedah and Kelantan. Seasonal peak timing varied between states, highlighting differences in epidemic cycles. ARIMA and SARIMA modeling confirmed that no single temporal structure could adequately represent all states; while some series were well fitted by simple ARIMA models, others required seasonal adjustments. Residual diagnostics demonstrated that the selected models were statistically adequate.
  • Conclusion
    Dengue dynamics in Malaysia are shaped by both trend and seasonal components, with considerable variation across states. Combining STL decomposition with ARIMA/SARIMA modeling strengthens the evidence base for state-specific forecasting and proactive vector control. Tailoring surveillance systems and interventions to local temporal patterns may improve early warning capacity and optimize resource allocation for dengue prevention.
Dengue fever is one of the most significant mosquito-borne viral infections worldwide, with an estimated 390 million infections occurring annually, of which nearly 96 million manifest clinically [1]. Over the past 2 decades, the global incidence of dengue has increased markedly, driven by rapid urbanization, globalization, climate variability, and the expanding geographic range of Aedes aegypti and Aedes albopictus mosquitoes [2,3]. The World Health Organization now regards dengue as one of the fastest-spreading arboviral diseases, with more than 100 countries reporting endemic transmission [4].
In Malaysia, dengue continues to pose a substantial public health burden, with recurrent epidemics occurring despite longstanding control efforts. Between 2015 and 2023, the country recorded hundreds of thousands of cases annually, and dengue remains a leading cause of morbidity and hospitalization, particularly in highly urbanized states such as Selangor, Johor, and the federal territories [5]. Multiple factors contribute to sustained dengue transmission, including unplanned urban growth, human mobility, rainfall-driven vector proliferation, and cyclical variations in population immunity [6,7]. The pronounced spatial and temporal heterogeneity of dengue across Malaysia underscores the importance of surveillance systems capable of capturing state-level dynamics, rather than relying solely on national aggregates.
Time-series methods have been widely applied in dengue research to characterize epidemic cycles, quantify seasonality, and forecast future outbreaks. Traditional approaches, including autoregressive integrated moving average (ARIMA) and seasonal ARIMA (SARIMA) models, remain among the most commonly used due to their interpretability and robustness for short-term forecasting [8]. For example, ARIMA/SARIMA models have been successfully applied in endemic settings such as Singapore, Thailand, and Brazil to predict dengue incidence with reasonable accuracy [9,10]. In recent years, hybrid approaches and machine learning models have increasingly been explored to enhance predictive performance [11,12]. However, despite their promise, many such studies focus on national-level summaries or single high-burden districts. This emphasis on aggregated data risks obscuring important state-level differences, where local climatic conditions, demographic structures, and vector ecology may shape dengue transmission in distinct ways [13].
To better understand these localized variations, decomposition approaches provide a valuable complement to classical time-series models. Seasonal-trend decomposition using LOESS (STL) is particularly useful because it separates observed data into trend, seasonal, and irregular components, thereby clarifying the relative contributions of long-term changes and recurrent cycles [14]. When combined with ARIMA/SARIMA modeling, STL enables a more comprehensive assessment of dengue dynamics by supporting both descriptive interpretation of temporal structure and predictive modeling of case counts.
This study therefore applied STL decomposition alongside ARIMA and SARIMA models to weekly dengue case data from Malaysia spanning 2022 to 2024. By examining both national- and state-level time series, the analysis seeks to capture heterogeneity in temporal patterns and to provide evidence supporting localized forecasting and vector control strategies. Such insights are essential for strengthening early warning systems and optimizing resource allocation in Malaysia’s ongoing efforts to combat dengue.
Study Design and Data Sources
This cross-sectional study utilized weekly dengue case counts for each state in Malaysia from January 1, 2022 to December 31, 2024, obtained from the Dengue Case Registry maintained by the Ministry of Health Malaysia. This registry systematically compiles all laboratory- and clinically confirmed dengue cases, providing a reliable data source for monitoring disease burden at the state level. Prior to analysis, the data were cleaned to remove duplicate records, correct inconsistencies, and assess the presence of missing values. Weekly case counts were then aggregated to generate a continuous time series for each state.
Study Area
This study was conducted in Malaysia, a country located in Southeast Asia with an approximate land area of 329,847 km². Malaysia comprises 13 states and 3 federal territories, which are further subdivided into a total of 160 administrative districts. Geographically, the nation is divided into Peninsular Malaysia, which borders Thailand, and East Malaysia, located on the island of Borneo. The country typically experiences a tropical rainforest climate, with environmental conditions influenced by 2 seasonal monsoons [15].
Data Analysis
The temporal structure of dengue cases was first examined using STL, which decomposes a time series into 3 additive components:
Yt = Tt + St+ Rt,
where Yt represents the observed number of dengue cases at week t, Tt reflects long-term changes, St denotes the seasonal component representing recurrent cycles in transmission (e.g., monsoon-driven peaks), and Rt is the irregular component capturing short-term fluctuations not explained by trend or seasonality [14]. STL decomposition was performed using the STL() function from the feasts package in R software [16]. The seasonal component was extracted using a periodic seasonal window (season(window="periodic")), which enforces stable annual seasonality across the series. All other STL parameters followed the default feasts implementation, including automatic LOESS span selection for trend extraction, standard low-pass filtering of the seasonal component, and iterative robustness updates to reduce the influence of outliers [14,17]. This approach ensured reproducibility and consistency across all state-level analyses.
ARIMA and SARIMA models were applied as linear Gaussian time-series approximations to weekly dengue case counts, an approach commonly used in epidemiological surveillance analyses when counts are sufficiently large to justify approximate normality assumptions [18]. Model identification followed a systematic diagnostic process. Prior to model estimation, stationarity and linearity properties of each time series were evaluated using the augmented Dickey-Fuller (ADF), Phillips-Perron (PP), and Kwiatkowski-Phillips-Schmidt-Shin (KPSS) tests for mean stationarity; Goldfeld-Quandt and Breusch-Pagan tests for variance stationarity; and Ramsey Regression Equation Specification Error Test (RESET), Terasvirta, and McLeod-Li tests to assess linearity and Autoregressive Conditional Heteroscedasticity (ARCH) effects [14]. Based on these diagnostics, appropriate differencing orders (d and D) or variance-stabilizing transformations were applied as needed. Parameter exploration was conducted using the auto.arima() algorithm from the fable package, which performs stepwise model selection across the ARIMA/SARIMA parameter space and evaluates competing models using the Akaike information criterion (AIC), AIC with finite-sample correction (AICc), and Bayesian information criterion (BIC). This procedure ensured a systematic and reproducible model search while maintaining transparency in the identification process [14]. The best-fitting model for each state was selected based on the lowest information criterion values. Given that the study period from 2022 to 2024 yielded a moderate number of weekly observations per state, the AICc was prioritized as the primary determinant of the optimal model. AICc adjusts for finite sample sizes and reduces the risk of overfitting associated with conventional AIC. BIC was used as a secondary reference to ensure that model parsimony was not compromised, as BIC applies a stronger penalty for model complexity. Final model selection therefore balanced goodness-of-fit, sample-size adjustment, and simplicity, with AICc serving as the principal selection metric [14].
To assess the adequacy of the fitted ARIMA/SARIMA models, post-estimation residual diagnostics were performed. Residual autocorrelation was evaluated using the Ljung-Box test, applied at 24 lags to capture short- and medium-term dependence in weekly data while avoiding the loss of statistical power associated with testing a full 52-week seasonal cycle. This choice aligns with methodological practice in weekly epidemiological time-series analyses, where approximately half-season windows provide a practical balance between sensitivity and parsimony. Residual autocorrelation was further examined through visual inspection of residual autocorrelation function (ACF) plots [14]. Homoscedasticity was assessed using the Breusch-Pagan test in conjunction with residual-versus-fitted value plots. Residual normality was evaluated using the Shapiro-Wilk test and supported by inspection of Q-Q plots to assess distributional shape. These diagnostic procedures were used as supplementary verification tools to determine whether key modeling assumptions were reasonably satisfied, rather than as formal proofs of model sufficiency [14].
All analyses were performed using R ver. 4.5.1 (R Foundation for Statistical Computing). Data wrangling and preprocessing were conducted using the tidyverse suite, and date handling employed the lubridate package. STL decomposition was carried out using the feasts and fabletools packages, while ARIMA and SARIMA models were implemented using the forecast package. Visualization of decomposition results and model outputs was performed with ggplot2 and arranged using patchwork.
Ethics Statement
The research was conducted in accordance with the principles of the Declaration of Helsinki and received ethical clearance from the Human Research Ethics Committee of Universiti Sains Malaysia under approval code USM/JEPeM/KK/25010107. Additional approval was granted by the Malaysia Medical Research and Ethics Committee under the code NMRR ID-25-00144-OVW. This research was also supported by a Universiti Sains Malaysia Medical Research Grant 2025 (Project No: R501-LR-GPP001-0000002020-0000).
Figure 1 summarizes the monthly distribution of dengue cases in Malaysia from 2022 to 2024 and demonstrates clear temporal variability across states. Overall, most states exhibited distinct epidemic peaks in 2023, with the state of Selangor consistently recording the highest number of cases, surpassing 10,000 cases at its peak. Other states, including Johor, Sabah, and Terengganu, showed pronounced surges in mid-2023, whereas smaller states such as Perlis displayed lower case counts with less marked fluctuations. Seasonal recurrence of dengue cases was evident in nearly all states, with multiple peaks observed across the study period.
STL Decomposition
The STL decomposition illustrated in Figure 2 quantified the relative contributions of trend and seasonality to dengue transmission dynamics for each state (Table 1). Seasonal strength values ranged from 0.27 in Sarawak, indicating weak seasonal influence, to 0.68 in Kelantan and Kedah, where seasonality played a more prominent role. In contrast, trend strength values were generally high across most states, exceeding 0.80 in Johor, Melaka, Selangor, and Perak, which suggests that long-term changes in transmission contributed more strongly to observed case counts than seasonal patterns. The timing of seasonal peaks and troughs varied between states, although many states exhibited recurrent peaks around mid-year. STL decomposition plots for all states are provided in Figure S1.
ARIMA/SARIMA Models
Stationarity and linearity diagnostics were conducted prior to model estimation to verify that ARIMA and SARIMA assumptions were satisfied (Table S1). Mean stationarity was assessed using the ADF, PP, and KPSS tests. The states of Johor, Melaka, Negeri Sembilan, Perak, Terengganu, and Wilayah Persekutuan demonstrated mean stationarity (p<0.05 for ADF and PP tests and p≥0.05 for KPSS) and therefore did not require additional non-seasonal differencing (d=0). Variance stationarity, evaluated using the Goldfeld-Quandt, Breusch-Pagan, Bartlett, and Levene tests, indicated heteroscedasticity in most states. Linearity diagnostics, including the Ramsey RESET, Terasvirta neural network, and McLeod-Li tests, detected significant nonlinearity or ARCH effects in the majority of states (p<0.05), particularly those experiencing rapid epidemic fluctuations, such as Kelantan, Sabah, and Selangor. These findings supported the use of differencing and seasonal modeling to stabilize residual dynamics. Given the annual epidemiological cycle of dengue in Malaysia and the consistent seasonal components identified through STL decomposition, seasonal differencing (D=1 with a 52-week seasonal period) was applied uniformly across all states. This adjustment is supported by strong STL seasonal strength values in several states, including Kelantan, Kedah, and Pulau Pinang, and ensures removal of yearly recurrence in transmission patterns. Collectively, these diagnostic outcomes informed the differencing strategy, summarized as follows:
d=1 for 9 states with non-stationary mean
d=0 for the remaining 5 states, and
d=1 for all states due to consistent annual seasonal patterns identified in STL decomposition.
Post-differencing diagnostics demonstrated that ADF and PP tests were significant for all states (p=0.01), confirming mean stationarity after differencing (Table S2). KPSS tests were non-significant (p=0.10) across all states, further supporting stationarity. No ARCH effects were detected, indicating stable residual variance and no requirement for GARCH-type model extensions. Overall, these findings confirm that the selected differencing orders were appropriate and that all state-level series satisfied the assumptions required for SARIMA modeling. The non-seasonal differencing order (d), seasonal differencing order (D), and post-differencing stationarity diagnostics are summarized in Table 2.
Model selection using ARIMA and SARIMA approaches identified the most suitable specification for each state based on AIC, AICc, and BIC values, as summarized in Table 3. Residual diagnostics revealed departures from ideal distributional assumptions in several states. Shapiro-Wilk tests indicated statistically significant deviations from residual normality, and Breusch-Pagan tests identified evidence of heteroscedasticity in some states (Table S3). Inspection of Q-Q plots (Figure S2) demonstrated mild tail departures from normality. Importantly, residual independence was preserved across all fitted models. Ljung-Box tests at lag 24 showed no significant residual autocorrelation (Table 4), and residual ACF plots confirmed white-noise behavior across all states (Figure S3), indicating that the ARIMA/SARIMA models adequately captured the temporal dependence structure of dengue incidence. Although residual non-normality and heteroscedasticity may affect the precision of prediction intervals, these features are commonly observed in infectious disease surveillance time series and do not invalidate model consistency or short-term forecasting performance. Accordingly, forecast uncertainty should be interpreted with appropriate caution in states exhibiting greater residual variability.
This study provides a state-level temporal analysis of dengue incidence in Malaysia from 2022 to 2024, revealing pronounced heterogeneity in transmission dynamics across states. Seasonal-Trend decomposition using STL demonstrated that both long-term trends and seasonal fluctuations contributed to observed dengue patterns, although the relative influence of these components varied substantially between states.
Most states exhibited high trend strength values (>0.80), indicating that sustained structural determinants such as urbanization, demographic growth, and vector ecological adaptation exerted greater influence on dengue dynamics than short-term seasonal drivers. For example, Johor, Melaka, Perak, and Selangor all demonstrated strong trend dominance, consistent with prior studies showing that long-term environmental change and urban expansion increasingly shape dengue risk across Malaysia and Southeast Asia [13,15,19].
In contrast, states such as Kelantan displayed strong seasonal signatures, with seasonal strength values reaching 0.68. This state exhibited recurrent epidemic peaks during specific months, particularly around mid-year, consistent with rainfall-driven mosquito breeding cycles associated with the southwest monsoon [2022]. In Kelantan, for instance, seasonal peaks were most prominent during the third quarter of each year, with troughs occurring earlier in the year. Such seasonal cycles echo findings from Thailand and Vietnam, where rainfall and temperature seasonality remain critical triggers for epidemic onset [23,24].
The weak seasonality observed in the Borneo region (Sabah and Sarawak) provides a contrasting pattern. In these states, dengue incidence was less tightly linked to seasonal climatic cycles and more influenced by gradual long-term increases. This pattern suggests that factors such as population mobility, land-use change, or sustained vector adaptation may play stronger roles than short-term rainfall variability. Studies from the Borneo region and eastern Indonesia have similarly documented weaker seasonal dependence, reflecting ecological differences in forested and peri-urban environments [2527]. Taken together, the STL results emphasize that while dengue dynamics in Malaysia are broadly trend-dominated, seasonal cycles continue to exert strong influence in a subset of states. This variability reinforces the need for locally tailored early warning systems that integrate both long-term and seasonal signals.
ARIMA and SARIMA modeling provided further insight into the temporal complexity of dengue transmission across Malaysian states. Model selection indicated that no single model structure was adequate for all states, confirming that dengue dynamics in Malaysia cannot be represented within a single temporal framework. The observed variation in optimal model structures likely reflects underlying environmental, demographic, and epidemiological heterogeneity across regions. States such as Kelantan (seasonal strength=0.679) and Pahang (0.523) demonstrated strong recurring annual cycles, with dengue case peaks typically occurring between June and September. These peak periods coincide with Malaysia’s inter-monsoon transition, which is characterized by intermittent rainfall, reduced water flushing, and warmer temperatures that collectively enhance Aedes breeding productivity [28]. In contrast to heavy rainfall during major monsoon seasons, which can wash away larvae and disrupt breeding habitats, inter-monsoon conditions promote the persistence of clean, stagnant water environments that favor A. aegypti proliferation and viral transmission [19,21,22]. This ecological mechanism supports the seasonal dependence detected in these states and justifies the inclusion of SARIMA terms in the optimal models—Kelantan: ARIMA(0,1,1)(0,0,1)(s=7) and Pahang: ARIMA(0,0,0)(1,0,0) (s=7) These findings align with previous studies in Malaysia demonstrating dengue amplification during inter-monsoon periods, reflecting predictable rainfall–breeding dynamics that sustain annual dengue resurgence [21,28].
Conversely, highly urbanized states such as Johor, Penang, and Selangor, despite recording high dengue case burdens, demonstrated weaker seasonal structuring. Continuous human mobility, extensive built environments, and relatively stable microclimatic conditions in dense urban settings reduce the dependence of Aedes proliferation on seasonal rainfall cycles, thereby attenuating predictable intra-annual fluctuations [28,29]. Consequently, dengue transmission in these states may be sustained year-round through persistent artificial breeding sites, including water storage containers, construction areas, and drainage systems, as well as uninterrupted mosquito–human contact [2,9]. This epidemiological profile supports the selection of non-SARIMA models for these settings.
Several states were best represented by highly parsimonious non-seasonal models, including ARIMA(0,0,0) for Melaka, Perak, Penang, and Terengganu. These simple model structures suggest that short-term dengue variability in these states is dominated by irregular fluctuations rather than persistent autocorrelation or strong seasonal cycles. Importantly, such models do not imply an absence of epidemiological drivers but instead reflect relatively weak temporal structure in the weekly surveillance data after differencing. In contrast, states with stronger seasonal or autocorrelated dynamics required more complex ARIMA or SARIMA specifications, underscoring heterogeneity in temporal dependence across Malaysia. While parsimony yielded the lowest information criterion values in the present analysis, future research incorporating cross-validation or out-of-sample predictive assessments would be valuable for evaluating the robustness and forecasting stability of these simpler models.
These findings mirror evidence from other Asian settings in which substantial subnational variability limits the applicability of a single forecasting framework and underscores the need for localized modeling approaches [30,31]. Although ARIMA and SARIMA models provide robust representations of historical temporal patterns and perform well for short-term forecasting, their reliance on past observations constrains predictive performance under conditions of abrupt structural change, such as climate anomalies or major public health interventions. In addition, prediction intervals derived from ARIMA/SARIMA models should be interpreted as approximate, particularly in states exhibiting greater residual variance, as uncertainty may not be fully captured by univariate specifications. Although extensive diagnostic procedures were conducted, these checks are intended to support model adequacy rather than to establish sufficiency, and the findings should therefore be interpreted in light of the known limitations of linear time-series models applied to surveillance data. Consequently, while these models remain valuable tools for near-term surveillance and planning, they should be complemented by integrated modeling approaches that incorporate climatic, entomological, and mobility-related covariates to improve outbreak prediction and uncertainty characterization [32].
This study advances dengue surveillance research in Malaysia by combining STL decomposition with ARIMA/SARIMA modeling to jointly characterize long-term trends and seasonal cycles, and by analyzing all 14 states individually to capture granular heterogeneity that is often obscured in national-level analyses. These state-specific insights can inform more targeted intervention strategies. For example, states with pronounced seasonal peaks, such as Kelantan and Pahang, may benefit from strengthened vector control measures implemented ahead of predictable surges, whereas states characterized by year-round transmission may require sustained, structurally oriented vector management and urban sanitation efforts.
Nevertheless, several limitations should be acknowledged. Dengue case notifications may be affected by underreporting and diagnostic misclassification [33]. In addition, the models applied were univariate and did not incorporate climatic, entomological, or sociodemographic covariates known to influence dengue transmission. Previous studies have demonstrated that rainfall, temperature, humidity, vector indices, and population dynamics strongly shape dengue patterns in Malaysia and across the region [2,6]. The ARIMA/SARIMA framework adopted in this study assumes linearity and approximately Gaussian errors and therefore represents a statistical approximation when applied to count-based dengue surveillance data. While this approach is widely used to model temporal dependence and support short-term forecasting in settings with moderate to high case counts, it may not fully capture the mean–variance relationships intrinsic to count processes [34]. Future research should therefore explore variance-adaptive and count-based modeling approaches, including Poisson or negative binomial time-series models, generalized linear autoregressive models, or Bayesian state-space frameworks, which may better accommodate overdispersion and nonlinearity in dengue transmission dynamics.
Despite these limitations, the findings have important practical implications for dengue surveillance and control. STL-derived seasonal signatures and ARIMA/SARIMA-based forecasts can be integrated into operational decision-support systems and automated early warning dashboards to trigger alerts when observed or projected case counts exceed expected thresholds. Such systems could facilitate the timely mobilization of vector control resources, hospital preparedness, and targeted community engagement ahead of anticipated transmission peaks. Embedding these analytical tools within district- and state-level surveillance infrastructure would further support more proactive, data-driven dengue control strategies across Malaysia.
This study highlights substantial heterogeneity in dengue dynamics across Malaysian states between 2022 and 2024. STL decomposition revealed that while long-term trends dominated overall patterns, seasonal influences remained significant in certain states, underscoring the value of localized temporal analyses. ARIMA and SARIMA modeling further confirmed that no single temporal structure adequately represents dengue dynamics across all states. Together, these complementary approaches strengthen the evidence base for state-specific forecasting systems and proactive vector control strategies in Malaysia. Importantly, tailoring surveillance and intervention efforts to local temporal dynamics has the potential to enhance early warning capacity, optimize resource allocation, and improve dengue prevention outcomes nationwide.
• The study identified substantial heterogeneity in dengue dynamics across Malaysian states from 2022 to 2024.
• Long-term trends dominated overall patterns, while seasonal influences remained significant in certain states. The findings also revealed differing seasonal peak timings across states, indicating localized epidemic cycles.
• Autoregressive integrated moving average (ARIMA) and seasonal ARIMA models vary in suitability, with no single structure fitting all states.
• This research highlights the importance of state-specific forecasting and tailored vector control strategies for effective dengue prevention.

Ethics Approval

The research was conducted in accordance with the Declaration of Helsinki guidelines and received ethical clearance from the Human Research Ethics Committee of Universiti Sains Malaysia, under the approval code USM/JEPeM/KK/25010107. Additionally, the Malaysia Medical Research and Ethics Committee granted approval under the code NMRR ID-25-00144-OVW. Due to the retrospective nature of the study, the requirement for informed consent was waived.

Conflicts of Interest

The authors have no conflicts of interest to declare.

Funding

The research was conducted in accordance with the Declaration of Helsinki guidelines and received ethical clearance from the Human Research Ethics Committee of Universiti Sains Malaysia, under the approval code USM/JEPeM/KK/25010107. Additionally, the Malaysia Medical Research and Ethics Committee granted approval under the code NMRR ID-25-00144-OVW. This research also was supported by a Universiti Sains Malaysia, Medical Research Grant 2025 with Project No: R501-LR-GPP001-0000002020-0000.

Availability of Data

The datasets are not publicly available but can be obtained from the corresponding author upon reasonable request.

Authors’ Contributions

Conceptualization: all authors; Data curation: all authors; Formal analysis: MAAH, KIM Methodology: all authors; Project administration: MAAH, KIM; Resources: all authors; Visualization: all authors; Writing–original draft: MAAH; Writing–review & editing: all authors. All authors read and approved the final manuscript.

Supplementary data are available at https://doi.org/10.24171/j.phrp.2025.0397.
Table S1.
Stationarity and linearity diagnostics for weekly dengue case series by state, Malaysia (2022–2024)
j-phrp-2025-0397-Supplementary-Table-S1.pdf
Table S2.
Post-differencing diagnostic test results for weekly dengue case series by state, Malaysia (2022–2024)
j-phrp-2025-0397-Supplementary-Table-S2.pdf
Table S3.
Residual diagnostic tests for ARIMA/SARIMA models by state
j-phrp-2025-0397-Supplementary-Table-S3.pdf
Figure S1.
Seasonal-trend decomposition using LOESS (STL) decomposition plot for every state. Panels show observed cases (top), long-term trend, seasonal component, and remainder. (A) Johor, (B) Negeri Sembilan, (C) Kedah, (D) Pahang, (E) Kelantan, (F) Perak, (G) Melaka, (H) Perlis, (I) Pulau Pinang, (J) Terengganu, (K) Selangor, (L) Sarawak, (M) Sabah, (N) Wilayah Persekutuan.
j-phrp-2025-0397-Supplementary-Figure-S1.pdf
Figure S2.
Q-Q plots of ARIMA/SARIMA residuals by state.
ARIMA, autoregressive integrated moving average; SARIMA, seasonal ARIMA.
j-phrp-2025-0397-Supplementary-Figure-S2.pdf
Figure S3.
Residual diagnostic for SARIMA/ARIMA models for every state (ACF and histogram).
ARIMA, autoregressive integrated moving average; SARIMA, seasonal ARIMA; ACF, autocorrelation function. (A) Johor, (B) Kedah, (C) Kelantan, (D) Melaka, (E) Negeri Sembilan, (F) Pahang, (G) Perak, (H) Perlis, (I) Pulau Pinang, (J) Sabah, (K) Sarawak, (L) Selangor, (M) Terengganu, (N) Wilayah Persekutuan.
j-phrp-2025-0397-Supplementary-Figure-S3.pdf
Figure 1.
Monthly dengue cases for every state in Malaysia between 2022 and 2024.
Figure 1. Monthly dengue cases for every state in Malaysia between 2022 and 2024.
	 
Figure 2.
Heatmap of seasonal-trend decomposition using LOESS (STL) trend and seasonal strength values across Malaysian states (2022–2024). This figure displays the relative strength of the seasonal and trend components obtained from STL decomposition for each state. Darker shading represents higher component strength. The visualization highlights spatial heterogeneity in temporal drivers of dengue transmission across Malaysia.
Figure 2. Heatmap of seasonal-trend decomposition using LOESS (STL) trend and seasonal strength values across Malaysian states (2022–2024). This figure displays the relative strength of the seasonal and trend components obtained from STL decomposition for each state. Darker shading represents higher component strength. The visualization highlights spatial heterogeneity in temporal drivers of dengue transmission across Malaysia.
	 
Time-series decomposition and modeling of dengue cases in Malaysia, 2022–2024: a nationwide observational study
Table 1.
STL decomposition features of dengue cases by state in Malaysia
Table 1.
State Seasonal strength Trend strength Seasonal peak year Seasonal trough year
Johor 0.423 0.869 7 11
Kedah 0.672 0.805 6 9
Kelantan 0.679 0.684 9 5
Melaka 0.624 0.867 7 11
Negeri Sembilan 0.505 0.787 1 4
Pahang 0.523 0.725 7 2
Perak 0.497 0.908 1 9
Perlis 0.421 0.616 6 4
Pulau Pinang 0.619 0.813 1 8
Sabah 0.411 0.746 1 4
Sarawak 0.267 0.725 4 9
Selangor 0.618 0.835 1 11
Terengganu 0.507 0.862 1 3
Wilayah Persekutuan 0.509 0.866 1 3

STL, seasonal-trend decomposition using LOESS.

Table 2.
Differencing orders and stationarity diagnostic results for each Malaysian state
Table 2.
State d D ADF (p) PP (p) KPSS (p)
Johor 0 1 0.01 0.01 0.1
Kedah 1 1 0.01 0.01 0.1
Kelantan 1 1 0.01 0.01 0.1
Melaka 0 1 0.01 0.01 0.1
Negeri Sembilan 1 1 0.01 0.01 0.1
Pahang 1 1 0.01 0.01 0.1
Perak 0 1 0.01 0.01 0.1
Perlis 1 1 0.01 0.01 0.1
Pulau Pinang 1 1 0.01 0.01 0.1
Sabah 1 1 0.01 0.01 0.1
Sarawak 1 1 0.01 0.01 0.1
Selangor 1 1 0.01 0.01 0.1
Terengganu 1 1 0.01 0.01 0.1
Wilayah Persekutuan 0 1 0.01 0.01 0.1

Values represent p-values for each test after differencing, confirming that the final transformed series meets SARIMA model assumptions.

ADF, augmented Dickey-Fuller; PP, Phillips-Perron; KPSS, Kwiatkowski-Phillips-Schmidt-Shin; ARIMA, autoregressive integrated moving average; SARIMA, seasonal ARIMA.

Table 3.
SARIMA/ARIMA model Summary to represent temporality of dengue cases by state in Malaysia
Table 3.
State SARIMA/ARIMA order AIC AICc BIC
Johor ARIMA(0,0,1) 497.18 497.7 502.92
Kedah ARIMA(0,1,1) 382.01 382.27 385.79
Kelantan ARIMA(0,1,1)(0,0,1) (s = 7) 339.08 339.62 344.76
Melaka ARIMA(0,0,0) 309.41 309.69 313.07
Negeri Sembilan ARIMA(0,0,0) 441.3 441.56 445.09
Pahang ARIMA(0,0,0)(1,0,0) (s = 7) 356.54 357.07 362.28
Perak ARIMA(0,0,0) 426.9 427.15 430.72
Perlis ARIMA(1,0,0) 248.92 249.84 253.12
Pulau Pinang ARIMA(0,0,0) 379.8 380.07 383.54
Sabah ARIMA(1,0,0) 418.52 419.03 424.31
Sarawak ARIMA(0,1,0) 281.65 282.98 291.31
Selangor ARIMA(0,1,1) 625.29 625.54 629.11
Terengganu ARIMA(0,0,0) 583.47 583.75 587.17
Wilayah Persekutuan ARIMA(0,1,1) 443.46 443.73 447.2

ARIMA, autoregressive integrated moving average; SARIMA, seasonal ARIMA; AIC, Akaike information criterion; AICc, AIC with finite-sample correction; BIC, Bayesian information criterion.

Table 4.
Model validity parameters using the Ljung-Box test
Table 4.
State SARIMA/ARIMA model Ljung-Box statistic Lower 95% CI Upper 95% CI p
Wilayah Persekutuan ARIMA(0,1,1) 38.45 12.40 39.36 0.051
Terengganu ARIMA(0,0,0) 23.40 12.40 39.36 0.496
Pulau Pinang ARIMA(0,0,0) 22.83 12.40 39.36 0.530
Sabah ARIMA(1,0,0) 20.66 12.40 39.36 0.658
Sarawak ARIMA(0,1,0) 17.28 12.40 39.36 0.836
Kedah ARIMA(0,1,1) 19.53 12.40 39.36 0.723
Melaka ARIMA(0,0,0) 17.34 12.40 39.36 0.834
Negeri Sembilan ARIMA(0,0,0) 17.22 12.40 39.36 0.839
Selangor ARIMA(0,1,1) 17.09 12.40 39.36 0.845
Johor ARIMA(0,0,1) 15.58 12.40 39.36 0.903
Pahang ARIMA(0,0,0)(1,0,0) [7] 14.89 12.40 39.36 0.924
Perlis ARIMA(1,0,0) 13.00 12.40 39.36 0.966
Kelantan ARIMA(0,1,1)(0,0,1) [7] 11.21 12.40 39.36 0.987
Perak ARIMA(0,0,0) 1.96 12.40 39.36 >0.900

ARIMA, autoregressive integrated moving average; SARIMA, seasonal ARIMA; CI, confidence interval.

p>0.05 suggests residual autocorrelation absents and no “white noise”

  • 1. Bhatt S, Gething PW, Brady OJ, et al. The global distribution and burden of dengue. Nature 2013;496:504−7.
  • 2. Kraemer MU, Reiner RC, Brady OJ, et al. Past and future spread of the arbovirus vectors Aedes aegypti and Aedes albopictus. Nat Microbiol 2019;4:854−63.
  • 3. Messina JP, Brady OJ, Golding N, et al. The current and future global distribution and population at risk of dengue. Nat Microbiol 2019;4:1508−15.
  • 4. World Health Organization (WHO) (CH). Dengue and severe dengue [Internet]. WHO; 2022 [cited 2025 Sep 21]. Available from: https://www.who.int/news-room/fact-sheets/detail/dengue-and-severe-dengue.
  • 5. Li Z, Gurgel H, Dessay N, et al. Semi-supervised text classification framework: an overview of dengue landscape factors and satellite Earth observation. Int J Environ Res Public Health 2020;17:4509.
  • 6. Cheong YL, Leitao PJ, Lakes T. Assessment of land use factors associated with dengue cases in Malaysia using boosted regression trees. Spat Spatiotemporal Epidemiol 2014;10:75−84.
  • 7. Hii YL, Zhu H, Ng N, et al. Forecast of dengue incidence using temperature and rainfall. PLoS Negl Trop Dis 2012;6:e1908.
  • 8. Hasan P, Khan TD, Alam I, et al. Dengue in tomorrow: predictive insights from ARIMA and SARIMA models in Bangladesh: a time series analysis. Health Sci Rep 2024;7:e70276.
  • 9. Racloz V, Ramsey R, Tong S, et al. Surveillance of dengue fever virus: a review of epidemiological models and early warning systems. PLoS Negl Trop Dis 2012;6:e1648.
  • 10. Ezanno P, Andraud M, Beaunee G, et al. How mechanistic modelling supports decision making for the control of enzootic infectious diseases. Epidemics 2020;32:100398.
  • 11. Ho TS, Weng TC, Wang JD, et al. Comparing machine learning with case-control models to identify confirmed dengue cases. PLoS Negl Trop Dis 2020;14:e0008843.
  • 12. Adde A, Roucou P, Mangeas M, et al. Predicting dengue fever outbreaks in French Guiana using climate indicators. PLoS Negl Trop Dis 2016;10:e0004681.
  • 13. Zhu S, Niu L, Aamir M, et al. Spatial and seasonal variations in air-soil exchange, enantiomeric signatures and associated health risks of hexachlorocyclohexanes (HCHs) in a megacity Hangzhou in the Yangtze River Delta region, China. Sci Total Environ 2017;599-600:264−72.
  • 14. Hyndman RJ, Athanasopoulos G. Forecasting: principles and practice. 3rd ed. OTexts; 2021.
  • 15. Tang KH. Climate change in Malaysia: trends, contributors, impacts, mitigation and adaptations. Sci Total Environ 2019;650:1858−71.
  • 16. Bandara K, Hyndman RJ, Bergmeir C. MSTL: A seasonal-trend decomposition algorithm for time series with multiple seasonal patterns. Int J Oper Res 2025;52:79−98.
  • 17. Cleveland RB, Cleveland WS, McRae JE, et al. STL: a seasonal-trend decomposition. J Off Stat 1990;6:3−73.
  • 18. Chen X, Moraga P. Assessing dengue forecasting methods: a comparative study of statistical models and machine learning techniques in Rio de Janeiro, Brazil. Trop Med Health 2025;53:52.
  • 19. Abdul Hadi Z, Che Dom N, Ishak AR, et al. Geospatial dynamics of dengue fever density in Kuantan, Malaysia: GIS-based approach. Geogr Environ Sustain 2025;18:54−64.
  • 20. Masrani AS, Nik Husain NR, Musa KI, et al. Prediction of dengue incidence in the Northeast Malaysia based on weather data using the generalized additive model. Biomed Res Int 2021;2021:3540964.
  • 21. Singh S, Herng LC, Sulaiman LH, et al. The effects of meteorological factors on dengue cases in Malaysia. Int J Environ Res Public Health 2022;19:6449.
  • 22. Masrani AS, Nik Husain NR, Musa KI, et al. Trends and spatial pattern analysis of dengue cases in Northeast Malaysia. J Prev Med Public Health 2022;55:80−7.
  • 23. Louis VR, Phalkey R, Horstick O, et al. Modeling tools for dengue risk mapping: a systematic review. Int J Health Geogr 2014;13:50.
  • 24. Suphanchaimat R, Thammavijaya P, Taweewigyakarn P, et al. Systemic investigation of dengue incidence and control measures in Surin, Thailand, 2018. Outbreak Surveill Investig Response OSIR J 2019;12:15−23.
  • 25. Kunasagran PD, Rahim A, Syed SS, et al. Spatio-temporal analysis of dengue cases in Sabah. Malays J Med Heal Sci 2023;19:88−94.
  • 26. Murphy A, Rajahram GS, Jilip J, et al. Incidence and epidemiological features of dengue in Sabah, Malaysia. PLoS Negl Trop Dis 2020;14:e0007504.
  • 27. Sasmono RT, Santoso MS, Pamai YW, et al. Distinct dengue disease epidemiology, clinical, and diagnosis features in Western, Central, and Eastern regions of Indonesia, 2017-2019. Front Med (Lausanne) 2020;7:582235.
  • 28. Cheong YL, Burkart K, Leitao PJ, et al. Assessing weather effects on dengue disease in Malaysia. Int J Environ Res Public Health 2013;10:6319−34.
  • 29. Abdullah NAMH, Dom NC, Salleh SA, et al. Dengue's climate conundrum: how vegetation and temperature shape mosquito populations and disease outbreaks. BMC Public Health 2025;25:4.
  • 30. Alam KE, Ahmed MJ, Chalise R, et al. Time series analysis of dengue incidence and its association with meteorological risk factors in Bangladesh. PLoS One 2025;20:e0323238.
  • 31. Zhao X, Li K, Ang CK, et al. A deep learning based hybrid architecture for weekly dengue incidences forecasting. Chaos Solitons 2023;168:113170.
  • 32. Lowe R, Gasparrini A, Van Meerbeeck CJ, et al. Nonlinear and delayed impacts of climate on dengue risk in Barbados: a modelling study. PLoS Med 2018;15:e1002613.
  • 33. Shepard DS, Undurraga EA, Halasa YA, et al. The global economic burden of dengue: a systematic analysis. Lancet Infect Dis 2016;16:935−41.
  • 34. Mills C, Falconi-Agapito F, Carrera JP, et al. Multi-model approach to understand and predict past and future dengue epidemic dynamics. R Soc Open Sci 2025;12:241870.

Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:

Include:

Time-series decomposition and modeling of dengue cases in Malaysia, 2022–2024: a nationwide observational study
Osong Public Health Res Perspect. 2026;17(1):50-60.   Published online January 27, 2026
Download Citation

Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

Format:
Include:
Time-series decomposition and modeling of dengue cases in Malaysia, 2022–2024: a nationwide observational study
Osong Public Health Res Perspect. 2026;17(1):50-60.   Published online January 27, 2026
Close

Figure