Skip Navigation
Skip to contents

PHRP : Osong Public Health and Research Perspectives



Page Path
HOME > Osong Public Health Res Perspect > Volume 4(6); 2013 > Article
Original Article
Forecasting the Number of Human Immunodeficiency Virus Infections in the Korean Population Using the Autoregressive Integrated Moving Average Model
Hye-Kyung Yua, Na-Young Kima, Sung Soon Kima, Chaeshin Chub, Mee-Kyung Keea
Osong Public Health and Research Perspectives 2013;4(6):358-362.
Published online: December 3, 2013

aDivision of AIDS, Korea National Institute of Health, Osong, Korea

bDivision of Epidemic Intelligence Service, Korea Centers for Disease Control and Prevention, Osong, Korea

∗Corresponding author.
• Received: October 1, 2013   • Revised: October 20, 2013   • Accepted: October 28, 2013

© 2013 Published by Elsevier B.V. on behalf of Korea Centers for Disease Control and Prevention.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

  • 19 Download
  • 24 Crossref
  • 28 Scopus
  • Objectives
    From the introduction of HIV into the Republic of Korea in 1985 through 2012, 9,410 HIV-infected Koreans have been identified. Since 2000, there has been a sharp increase in newly diagnosed HIV-infected Koreans. It is necessary to estimate the changes in HIV infection to plan budgets and to modify HIV/AIDS prevention policy. We constructed autoregressive integrated moving average (ARIMA) models to forecast the number of HIV infections from 2013 to 2017.
  • Methods
    HIV infection data from 1985 to 2012 were used to fit ARIMA models. Akaike Information Criterion and Schwartz Bayesian Criterion statistics were used to evaluate the constructed models. Estimation was via the maximum likelihood method. To assess the validity of the proposed models, the mean absolute percentage error (MAPE) between the number of observed and fitted HIV infections from 1985 to 2012 was calculated. Finally, the fitted ARIMA models were used to forecast the number of HIV infections from 2013 to 2017.
  • Results
    The fitted number of HIV infections was calculated by optimum ARIMA (2,2,1) model from 1985–2012. The fitted number was similar to the observed number of HIV infections, with a MAPE of 13.7%. The forecasted number of new HIV infections in 2013 was 962 (95% confidence interval (CI): 889–1,036) and in 2017 was 1,111 (95% CI: 805–1,418). The forecasted cumulative number of HIV infections in 2013 was 10,372 (95% CI: 10,308–10,437) and in 2017 was14,724 (95% CI: 13,893–15,555) by ARIMA (1,2,3).
  • Conclusion
    Based on the forecast of the number of newly diagnosed HIV infections and the current cumulative number of HIV infections, the cumulative number of HIV-infected Koreans in 2017 would reach about 15,000.
Human immunodeficiency virus (HIV) infection can show nonspecific symptoms in the early stage such as anorexia, weight loss, fever, lymph node enlargement, and fatigue, but has a long asymptomatic period after an acute infection period. Therefore, the real prevalence of HIV infection has been estimated to be more than the reported numbers. Since the first reported case of AIDS, the United Nations Programme on HIV/AIDS (UNAIDS) estimated the global people living with HIV/AIDS (PLWHA) to be 35.3 million in 2012 [1]. The cumulative number of HIV-infected people identified in the Republic of Korea from 1985 to 2012 was 10,453 (9,410 Koreans, 1,043 foreigners), relatively low compared to many other countries [2]. However, since 2000, the number of the HIV infected people in Korea has sharply increased; 800 to 1,000 persons are newly identified each year in the recent 3 years [2], and it is questionable whether this increasing trend will continue.
The United States stated the research on estimation in 1980, and many countries have estimated the HIV-infected persons using the model suggested by UNAIDS for identifying the trend of their own nation. The method to estimate incidence, prevalence, the number of infected persons, and the number of deaths has been improved since its development in late 1980. It includes the back calculation method, ratio method, Delphi survey method, mathematical and computer/simulation models, and Epimodel. No single optimal estimation method exists; each method has its own characteristics. Recent standard methods for estimation and prediction are the workbook method, estimation and projection package (EPP) method, and time series method (autoregressive moving average model).
As for the estimation of HIV-infected Koreans, the back calculation method, Epi-info model, and EPP model cannot be applied because of limitations including: the lack of exact statistics on AIDS cases and no specific information available for commercial sex workers (CWSs), intravenous drug users (IDUs), and the sexually transmitted infection risk group (STI RG). However, the autoregressive integrated moving average (ARIMA) time series method utilizes autocorrelation and can produce an estimation model with information on HIV cases identified yearly or monthly. ARIMA is used for the estimation of influenza mortality [3], malaria incidence [4], and other infectious diseases [5–7].
This study aimed to construct estimation models using the annual number of newly identified HIV-infected Koreans in Korea from 1985 to 2012, and to forecast the number of HIV cases and the trend in epidemiological characteristics.
2.1 Materials
This study has utilized 9410 identified HIV-infected persons' basic epidemiological data, including diagnosed year, sex, age, screening site, and date of death from 1985 to 2012 in Korea. The local public health centers conduct an epidemiological investigation on the route of infection once the HIV-infected persons are identified and report the results to the Korea Centers for Disease Control and Prevention (KCDC). The epidemiological data are managed by the KCDC HIV database system. The data for this study were collected from the KCDC HIV database system.
2.2 Methods
ARIMA models are the most commonly used time series prediction models. We constructed ARIMA models with data from HIV cases in Korea from 1985–2012, to forecast the annual number of newly diagnosed HIV-infected Koreans and to forecast the cumulative number of HIV cases in the next 5 years (2013–2017). To determine the degree of ARIMA, we have utilized the autocorrelation function (ACF) and partial autocorrelation function (PACF) graphs and constructed an optimal model on the basis of Akaike's information criterion (AIC) statistic and Schwartz's Bayesian criterion (SBC) statistic. Conditional least squares method (CLS) was used for parameter estimation.
This study constructed models by sex, age, and screening site, similar to that of the study by Debanne et al [8]; as a result, the sums of the forecasting numbers by variables are not equal to the forecasting number of newly diagnosed HIV cases. Forecasting numbers were calculated based on a percentage of the forecasting value of each category by each variable. To check the accuracy of each model, mean absolute percentage errors (MAPE) between the number of observed and fitted HIV infections from 1985 to 2012 were calculated. Finally, the number of HIV-infected Koreans in each year from 2013–2017 were forecasted. SAS version 9.3 (SAS Institute, Cary, NC, USA) was used for data analysis.
3.1 Status of HIV-infected persons in Korea until 2012
Figure 1 depicts the number of people with HIV in Korea from 1985 to 2012. The cumulative number of HIV-infected Koreans through 2012 is 9,410 (8,668 men, 742 women), and the number of PLWHA is 7,788 (7,165 men, 624 women). The number of HIV case sharply increases from 2000 to 2008. However, the past 5 years have shown a lower increase of newly diagnosed cases.
3.2 Construction of models to estimate the number of HIV-infected persons, and forecast of 2013–2017
Figure 2 represents observed yearly data (1985–2012), predicted values, and 95% confidence intervals (CI) for the predicted values. Most of observed data are contained within the 95% CI of the predicted value, indicating that the fitted models are reasonable.
The forecasts of each ARIMA model for the number of HIV-infected Koreans in 2013–2017 are shown in Table 1. Each model shows the number of HIV-infected Koreans increases each year, so the data were stabilized by difference and fitted. The MAPE for the cumulative number of HIV-infected Koreans is 5.5%, for the cumulative number of PLWHA is 6.3%, and for the number of newly-diagnosed HIV-infected Koreans is 13.7%. By 2017, these models estimate that Korea will have a cumulative number of HIV-infected Koreans of 14,724 (95% CI; 13,893–15,555), and a cumulative PLWHA of 12,355 (95% CI: 11,404–13,306). A total of 1,111 (95% CI: 805–1,418) Koreans are estimated to be newly diagnosed with HIV in 2017. Analysis by epidemiological characteristic shows that the difference between men and women will be greater in 2017: 1,288 men (95% CI: 882–1,693) vs. 68 women (95% CI: 48–87). The age group in which the largest number of HIV-infected Koreans is expected to be diagnosed is 20–29, with an average of 387 persons in their twenties (95% CI; 255–518) being newly diagnosed per year. HIV-infected Koreans are expected to be mostly diagnosed in hospitals, followed by public health centers and then blood banks.
Models reveals that by 2017 the cumulative number of HIV-infected Koreans will be 14,724, with 12,355 cumulative PLWHA, and 1,111 newly diagnosed persons in the year 2017. It took 29 years since the first diagnosis of HIV infection in 1985 to identify a cumulative number of 10,000 HIV-infected people in Korea. This study suggests that the cumulative trend will increase dramatically, to 14,724 in 2017 and over 15,000 in 2018, for an increase of 5,000 in 6 years. The number of newly diagnosed HIV-infected Koreans is also expected to increase, with more than 1,000 per year after 2014.
The current gender difference (11:1 men to women) is expected to widen even further; the gender ratio of newly diagnosed HIV-infected Koreans will be 19:1 in 2017. By 2012, the age group with the highest cumulative number of HIV-infected Koreans was 30–39 (29%), but there is also a recent increase of HIV-infected Koreans in their twenties. Based on the results of this study, we can expect a sharp increase of HIV-infected Koreans in their twenties for the upcoming 5 years. Increases in the number of teens and those in their fifties are also forecasted. This result suggests that a main target group for prevention should be those aged 20–29 in Korea. In previous studies on STIs prevalence (HIV and HSV-2), the prevalence were low in Koreans in their twenties [9–11], but it is necessary to monitor this age group carefully for any changes.
This study has at least one limitation: the fitted models using the number of HIV-infected Koreans during 1985–2012 assume that past HIV infection patterns are identical to future patterns, so if the patterns are different, the estimated error will be greater and the forecasts will be less accurate.
Korea took an active HIV/AIDS prevention policy with its first diagnosis of HIV infection in 1985, and then expanded HIV testing into specific target groups. Since 1998, Korea has reduced the scope and shifted the policy to voluntary HIV testing with more emphasis on prevention and support of treatment cost of HIV infected persons. This study can be used to estimate the scale of the Korean Government's budget for HIV/AIDS prevention and treatment, as well as inform changes to the HIV/AIDS prevention policy.
This study was supported by a Chronic Infections Disease Cohort Study grant (4800-4859-304) from the Korea Centers for Disease Control and Prevention. The authors thank professor Park Yong-Gyu in Catholic university of Korea for advice on statistical analysis, and staff of 17 local institutions of health for their work for HIV diagnosis in Korea.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.

  • 1. UNAIDS report on the global AIDS epidemic 2013. 2013. UNAIDS; Geneva.
  • 2. Korea Center for Disease Control and Prevention. The surveillance reports for HIV/AIDS in Korea 2011. 2013. KCDC; Osong.
  • 3. Reichert T.A., Simonsen L., Sharma A.. Influenza and the winter increase in mortality in the United States, 1959–1999. Am J Epidemiol 160(5). 2004 Sep;492−502. PMID: 15321847.ArticlePubMed
  • 4. Gaudart J., Toure O., Dessay N.. Modelling malaria incidence with environmental dependency in a locality of Sudanese savannah area, Mali. Malar J 8:2009 Apr;61PMID: 19361335.ArticlePubMed
  • 5. Luz P.M., Mendes B.V., Codeco C.T.. Time series analysis of dengue incidence in Rio de Janeiro, Brazil. Am J Trop Med Hyg 79(6). 2008 Dec;933−939. PMID: 19052308.ArticlePubMed
  • 6. Yi J., Du C.T., Wang R.H.. [Applications of multiple seasonal autoregressive integrated moving average (ARIMA) model on predictive incidence of tuberculosis]. Chinese Journal of Preventive Medicine 41(2). 2007 Mar;118−121. [In Chinese]. PMID: 17605238.PubMed
  • 7. Liu Q., Liu X., Jiang B.. Forecasting incidence of hemorrhagic fever with renal syndrome in China using ARIMA model. BMC Infect Dis 11:2011 Aug;218PMID: 21838933.ArticlePubMed
  • 8. Debanne S.M., Bielefeld R.A., Cauthen G.M.. Multivariate Markovian modeling of tuberculosis: forecast for the United States. Emerg Infect Dis 6(2). 2000 Mar–Apr;148−157. PMID: 10756148.ArticlePubMed
  • 9. Kee M.K., Lee J.H., Chu C.. Characteristics of HIV seroprevalence of visitors to public health centers under the national HIV surveillance system in Korea: cross sectional study. BMC Public Health 9:2009 May;123PMID: 19416524.ArticlePubMed
  • 10. Shin H.S., Park J.J., Chu C.. Herpes simplex virus type 2 seroprevalence in Korea: rapid increase of HSV-2 seroprevalence in the 30s in the southern part. J Korean Med Sci 22(6). 2007 Dec;957−962. PMID: 18162706.ArticlePubMed
  • 11. Kee M.K., Lee J.H., Wang J.. Ten-year trends in HIV prevalence among visitors to public health centers under the National HIV Surveillance System in Korea, 2000 to 2009. BMC Public Health 12:2012 Sep;831PMID: 23020818.ArticlePubMed
Figure 1
The trend in the number of HIV-infected Koreans, 1985–2012. The bars are the number of newly diagnosed HIV-infected Koreans, the dotted red line is the cumulative number of HIV-infected Koreans, and the yellow triangle line is the number of AIDS deaths per year.
Figure 2
Comparison of the reported and forecasted numbers of HIV cases in Korea from 1985 through 2012. Diamond line: actual number, dot line: forecasted number, band: 95% confidence interval of forecasted number. (a) The cumulative number of HIV-infected Koreans, (b) the number of newly diagnosed HIV-infected Koreans, (c) the number of newly diagnosed HIV infected male Koreans, and (d) the number of newly diagnosed HIV infected female Koreans.
Table 1
The forecasted number of HIV-infected Koreans in Korea in 2013–2017
Characteristics Model Forecast
2013 2014 2015 2016 2017
Cumulative HIV cases ARIMA (1,2,3) 10,372 (10,308–10,437) 11,425 (11,275–11,574) 12,470 (12,170–12,770) 13,595 (13,058–14,131) 14,724 (13,893–15,555)
People living with HIV ARIMA (1,2,2) 8,615 (8,533–8,698) 9,508 (9,325–9,691) 10,438 (10,075–10,802) 11,391 (10,768–12,013) 12,355 (11,404–13,306)
Newly-diagnosed HIV cases ARIMA (2,2,1) 962 (889– 1,036) 992 (882–1,102) 1,026 (856–1,196) 1,108 (867–1,349) 1,111 (805–1,418)
Man ARIMA (1,2,2) 902 (838–966) 934 (845–1,022) 969 (811–1,127) 1050 (801–1,299) 1055 (723–1,388)
Woman ARIMA (2,1,2) 60 (48–72) 58 (47–70) 57 (43–70) 58 (43–73) 56 (39–72)
<20 ARIMA (1,2,1) 44 (37–51) 42 (34–51) 51 (39–64) 54 (39–69) 56 (38–74)
20–29 ARIMA (1,1,1) 302 (260–344) 318 (254–382) 369 (276–463) 401 (282–521) 414 (273–554)
30–39 ARIMA (1,2,0) 205 (169–242) 200 (143–258) 212 (111–312) 214 (73–355) 204 (23–385)
40–49 ARIMA (2,2,0) 185 (151–218) 203 (161–245) 155 (96–213) 188 (96–279) 190 (85–296)
50–59 ARIMA (2,2,0) 148 (126–171) 153 (123–183) 163 (117–208) 173 (107–239) 172 (92–251)
≥60 ARIMA (2,2,0) 78 (58–97) 76 (44–108) 76 (31–122) 78 (13–143) 75 (0–157)
Screening site
Public institution ARIMA (2,1,2) 145 (99–192) 134 (75–193) 135 (65–205) 153 (65–241) 164 (60–269)
Hospital ARIMA (1,1,2) 714 (628–799) 743 (626–860) 780 (600–960) 829 (568–1089) 830 (499–1161)
Blood bank ARIMA (1,1,1) 103 (82–125) 115 (85–144) 111 (72–149) 126 (81–173) 117 (65–169)

Figure & Data



    Citations to this article as recorded by  
    • Intelligent Health Care and Diseases Management System: Multi-Day-Ahead Predictions of COVID-19
      Ahed Abugabah, Farah Shahid
      Mathematics.2023; 11(4): 1051.     CrossRef
    • Prevalence of HIV in Kazakhstan 2010–2020 and Its Forecasting for the Next 10 Years
      Kamilla Mussina, Shirali Kadyrov, Ardak Kashkynbayev, Sauran Yerdessov, Gulnur Zhakhina, Yesbolat Sakko, Amin Zollanvari, Abduzhappar Gaipov
      HIV/AIDS - Research and Palliative Care.2023; Volume 15: 387.     CrossRef
    • Integration models of demand forecasting and inventory control for coconut sugar using the ARIMA and EOQ modification methods
      Siti Wardah, Nunung Nurhasanah, Wiwik Sudarwati
      Jurnal Sistem dan Manajemen Industri.2023; 7(2): 127.     CrossRef
    • Deep learning-based forecasting model for COVID-19 outbreak in Saudi Arabia
      Ammar H. Elsheikh, Amal I. Saba, Mohamed Abd Elaziz, Songfeng Lu, S. Shanmugan, T. Muthuramalingam, Ravinder Kumar, Ahmed O. Mosleh, F.A. Essa, Taher A. Shehabeldeen
      Process Safety and Environmental Protection.2021; 149: 223.     CrossRef
    • Forecasting future HIV infection cases: evidence from Indonesia
      Maria Dyah Kurniasari, Andrian Dolfriandra Huruta, Hsiu Ting Tsai, Cheng-Wen Lee
      Social Work in Public Health.2021; 36(1): 12.     CrossRef
    • Forecasting Confirmed Malaria Cases in Northwestern Province of Zambia: A Time Series Analysis Using 2014–2020 Routine Data
      Dhally M. Menda, Mukumbuta Nawa, Rosemary K. Zimba, Catherine M. Mulikita, Jim Mwandia, Henry Mwaba, Karen Sichinga, Hamidreza Karimi-Sari
      Advances in Public Health.2021; 2021: 1.     CrossRef
    • An Adaptive Variational Mode Decomposition Technique with Differential Evolution Algorithm and Its Application Analysis
      Yuanxin Wang, Chaoqun Duan
      Shock and Vibration.2021; 2021: 1.     CrossRef
    • A comparative study on the prediction of the BP artificial neural network model and the ARIMA model in the incidence of AIDS
      Zeming Li, Yanning Li
      BMC Medical Informatics and Decision Making.2020;[Epub]     CrossRef
    • Hybrid Decomposition Time-Series Forecasting by DirRec Strategy: Electric Load Forecasting Using Machine-Learning
      Branislav Vuksanovic, Davoud Rahimi Ardali
      International Journal of Computer and Electrical E.2019; 11(1): 1.     CrossRef
    • Exploring an Ensemble of Methods that Combines Fuzzy Cognitive Maps and Neural Networks in Solving the Time Series Prediction Problem of Gas Consumption in Greece
      Konstantinos I. Papageorgiou, Katarzyna Poczeta, Elpiniki Papageorgiou, Vassilis C. Gerogiannis, George Stamoulis
      Algorithms.2019; 12(11): 235.     CrossRef
      Suci Retno Ningtiyas
      The Indonesian Journal of Public Health.2019; 13(2): 158.     CrossRef
    • Research into the high-precision marine integrated navigation method using INS and star sensors based on time series forecasting BPNN
      Qiu Ying Wang, Kaiyue Liu, Zhiguo Sun, Minghui Zhang
      Optik.2018; 172: 494.     CrossRef
    • Real-time predictive seasonal influenza model in Catalonia, Spain
      Luca Basile, Manuel Oviedo de la Fuente, Nuria Torner, Ana Martínez, Mireia Jané, Jeffrey Shaman
      PLOS ONE.2018; 13(3): e0193651.     CrossRef
    • Using an Autoregressive Integrated Moving Average Model to Predict the Incidence of Hemorrhagic Fever with Renal Syndrome in Zibo, China, 2004–2014
      Tao Wang, Yunping Zhou, Ling Wang, Zhenshui Huang, Feng Cui, Shenyong Zhai
      Japanese Journal of Infectious Diseases.2016; 69(4): 279.     CrossRef
    • Time series analysis of influenza incidence in Chinese provinces from 2004 to 2011
      Xin Song, Jun Xiao, Jiang Deng, Qiong Kang, Yanyu Zhang, Jinbo Xu
      Medicine.2016; 95(26): e3929.     CrossRef
    • Modelling the prevalence of hepatitis C virus amongst blood donors in Libya: An investigation of providing a preventive strategy
      Mohamed A Daw
      World Journal of Virology.2016; 5(1): 14.     CrossRef
    • Forecast analysis of any opportunistic infection among HIV positive individuals on antiretroviral therapy in Uganda
      John Rubaihayo, Nazarius M. Tumwesigye, Joseph Konde-Lule, Fredrick Makumbi
      BMC Public Health.2016;[Epub]     CrossRef
    • The Use of an Autoregressive Integrated Moving Average Model for Prediction of the Incidence of Dysentery in Jiangsu, China
      Kewei Wang, Wentao Song, Jinping Li, Wu Lu, Jiangang Yu, Xiaofeng Han
      Asia Pacific Journal of Public Health.2016; 28(4): 336.     CrossRef
    • Prevalence of hemorrhagic fever with renal syndrome in Yiyuan County, China, 2005–2014
      Tao Wang, Jie Liu, Yunping Zhou, Feng Cui, Zhenshui Huang, Ling Wang, Shenyong Zhai
      BMC Infectious Diseases.2015;[Epub]     CrossRef
    • Application of an autoregressive integrated moving average model for predicting injury mortality in Xiamen, China
      Yilan Lin, Min Chen, Guowei Chen, Xiaoqing Wu, Tianquan Lin
      BMJ Open.2015; 5(12): e008491.     CrossRef
    • Back propagation neural network with adaptive differential evolution algorithm for time series forecasting
      Lin Wang, Yi Zeng, Tao Chen
      Expert Systems with Applications.2015; 42(2): 855.     CrossRef
    • Direct Medical Costs of Hospitalizations for Cardiovascular Diseases in Shanghai, China
      Shengnan Wang, Max Petzold, Junshan Cao, Yue Zhang, Weibing Wang
      Medicine.2015; 94(20): e837.     CrossRef
    • Changing Patterns of HIV Epidemic in 30 Years in East Asia
      S. Pilar Suguimoto, Teeranee Techasrivichien, Patou Masika Musumari, Christina El-saaidi, Bhekumusa Wellington Lukhele, Masako Ono-Kihara, Masahiro Kihara
      Current HIV/AIDS Reports.2014; 11(2): 134.     CrossRef
    • What is Next for HIV/AIDS in Korea?
      Hae-Wol Cho, Chaeshin Chu
      Osong Public Health and Research Perspectives.2013; 4(6): 291.     CrossRef

    • PubReader PubReader
    • Cite
      export Copy
    • XML DownloadXML Download

    PHRP : Osong Public Health and Research Perspectives