Developing a national surveillance system for stroke and acute myocardial infarction using claims data in the Republic of Korea: a retrospective study
Article information
Abstract
Objectives
Limited information is available concerning the epidemiology of stroke and acute myocardial infarction (AMI) in the Republic of Korea. This study aimed to develop a national surveillance system to monitor the incidence of stroke and AMI using national claims data.
Methods
We developed and validated identification algorithms for stroke and AMI using claims data. This validation involved a 2-stage stratified sampling method with a review of medical records for sampled cases. The weighted positive predictive value (PPV) and negative predictive value (NPV) were calculated based on the sampling structure and the corresponding sampling rates. Incident cases and the incidence rates of stroke and AMI in the Republic of Korea were estimated by applying the algorithms and weighted PPV and NPV to the 2018 National Health Insurance Service claims data.
Results
In total, 2,200 cases (1,086 stroke cases and 1,114 AMI cases) were sampled from the 2018 claims database. The sensitivity and specificity of the algorithms were 94.3% and 88.6% for stroke and 97.9% and 90.1% for AMI, respectively. The estimated number of cases, including recurrent events, was 150,837 for stroke and 40,529 for AMI in 2018. The age- and sex-standardized incidence rate for stroke and AMI was 180.2 and 46.1 cases per 100,000 person-years, respectively, in 2018.
Conclusion
This study demonstrates the feasibility of developing a national surveillance system based on claims data and identification algorithms for stroke and AMI to monitor their incidence rates.
Introduction
An accurate computation of national health metrics, such as the incidence rates of stroke and acute myocardial infarction (AMI), is indispensable for crafting a robust national healthcare system adept at managing cardiovascular diseases [1–8]. In Korea, the National Health Insurance Service (NHIS) operates a comprehensive claims database that contains data from healthcare providers and insurers for reimbursement purposes. Given that this database covers the vast majority of the Korean population, it contains patient-level data pertaining to diagnoses, treatments, healthcare utilization, access, outcomes, and costs [9–13]. Consequently, this extensive dataset provides a promising foundation for establishing a national surveillance system for stroke and AMI.
Previous research efforts utilizing claims data to explore the incidence of stroke and AMI have encountered limitations in case identification. These challenges primarily originate from the use of International Classification of Diseases (ICD) codes, which are often not well-validated and are challenging to use for differentiating between acute and chronic events [14–22]. In 2004, there was an attempt to construct a surveillance system based on claims data, incorporating a validated diagnostic tool, to estimate the incidence of stroke and AMI. However, this system was predominantly centered around monitoring care quality at individual hospitals, relying heavily on ICD codes [23]. Since this attempt, there have been no additional endeavors to construct a similar system with national coverage. The present study seeks to fill this gap by constructing a national surveillance system for stroke and AMI, harnessing the NHIS claims data and employing validated identification algorithms for these conditions.
Materials and Methods
Overview of the Study Process and the Organization of Study Teams
This study was launched with the aim of constructing a national surveillance system for the incidence of stroke and AMI as a private consignment project of the Korean Disease Control and Prevention Agency (KDCA). It required collaboration of a variety of organizations, encompassing (1) the Central Support Group of Cardiovascular Disease Management, Public Health and Medical Services, Seoul National University Hospital; (2) the Department of Biostatistics, Korea University College of Medicine; (3) the Korean Stroke Society, (4) the Korean Society for Preventive Medicine, and (5) the Korean Society of Cardiology. The structure of the study teams is illustrated in Figure 1A.
The study was conducted in the following steps: first, we formulated definitions for epidemiological indices pertaining to stroke and AMI. This was achieved through interactive consultations involving the Advisory Board for Disease Definition, 3 participating societies, and other experts (Figure 1A). Second, we designed identification algorithms for stroke and AMI based on claims data. Third, the developed algorithms underwent validation via a review of hospital records for cases selected from the NHIS claims data. Finally, we calculated the incidence of AMI and stroke in the Republic of Korea by applying the weighted positive predictive values (PPVs) and negative predictive values (NPVs) to the 2018 NHIS claims database (Figure 1B).
Epidemiological Indices, Definitions, and Identification Algorithms
To establish the epidemiological indices for stroke and AMI, we conducted an extensive review of the pertinent literature, including reports from the Organisation for Economic Co-operation and Development (OECD), the National Institute for Health and Care Excellence, and the OECD Health Care Quality Indicators project. Based on this review, we identified the incidence rate and proportion of stroke and AMI as the epidemiological indices for our study [24–27].
We defined stroke according to the World Health Organization (WHO) criteria, which delineate stroke as the rapid development of clinical signs indicating a focal or global disturbance of cerebral function lasting more than 24 hours or resulting in death, with no apparent cause other than of vascular origin [28].
Regarding AMI, we employed the fourth universal definition, which characterizes it as the presence of AMI accompanied by clinical evidence of acute myocardial ischemia and abnormal cardiac biomarkers [29].
The subsequent step was the development of identification algorithms. We selected relevant variables from claims data, based on our understanding of AMI and stroke management within the Korean healthcare setting. We designed an identification algorithm for stroke that encompassed both ischemic and hemorrhagic stroke cases, along with nontraumatic subdural hemorrhage. The algorithm was built around key identifiers, including ICD codes (I60–I64) for stroke and claims codes related to stroke diagnosis and treatments (Figure 2A). To apply this algorithm, we considered possible stroke admission episodes as a single claim or as combining claims if the gap between the discharge date of the first claim and the admission date of the second claim was 2 days or less. Claims with an interval of more than 2 days were deemed as separate episodes, irrespective of whether the consecutive claims were from the same hospital.
Similarly, we developed an identification algorithm for AMI. The algorithm’s key identifiers included ICD diagnosis codes (I21–I23) for AMI and claims codes related to AMI diagnosis and treatments (Figure 2B). In applying this algorithm, possible AMI admission episodes were defined as a single claim or by combining claims if the gap between the discharge date of the first claim and the admission date of the second claim was 3 days or less, or if the gap in admission dates between the first and second claims did not exceed 28 days.
The epidemiological indices and identification algorithms with their key identifiers received approval from experts from the Korean Stroke Society, the Korean Society of Cardiology, and the Korean Society for Preventive Medicine (Figure 1B).
Sampling
We applied 2-stage stratified sampling to select cases for the hospital survey [30,31]. Using the 2018 NHIS claims data, we defined admission episodes with and without the corresponding ICD codes for stroke (I60–I64) and AMI (I21–I23), as described earlier. These episodes were used for hospital and case selection.
In the first stage, we selected hospitals. Initially, we chose hospitals eligible for the survey from 6 administrative divisions—namely, Seoul, Gyeonggi, Daegu, Gyeongsangnam-do, Ulsan, and Busan—considering the feasibility of the survey. Subsequently, we determined the number of hospitals to invite for participation, considering the geographic regions and case availability. We also balanced the ratio of tertiary to general hospitals at 8:10 as guided by data from the 8th Acute Stroke Quality Assessment Program (ASQAP) [32]. The chosen hospitals were required to have over 20 episodes of both stroke and AMI in the year 2018. Our initial plan was to include a total of 18 hospitals in the survey. These were distributed across the 6 strata as follows: 3 tertiary hospitals, 4 general hospitals, and 1 other hospital from the capital region, and 3 tertiary hospitals, 4 general hospitals, and 3 other hospitals from the non-capital region (Figure 3; Table S1).
In the second stage, we determined the number of sampled cases with and without the ICD codes for stroke and AMI for each hospital and stratum. We employed the optimal allocation method [33,34] and adhered to these guidelines: (1) the ratio of sampled case volume was 1:1 between capital and non-capital hospitals, and 6.5:2.5:1 between tertiary, general, and other hospitals, based on data from the 8th ASQAP (Figure 3); (2) the ratio between algorithm-positive and algorithm-negative cases was 1:1 within each hospital; and (3) among the algorithm-negative cases in each hospital, 10% had the corresponding ICD codes, while 90% did not. As a result, our goal was to sample and survey a total of 1,200 algorithm-positive cases (600 for stroke and 600 for AMI) and 1,200 algorithm-negative cases (600 for stroke, including 60 with I60–I64 and 540 without, and 600 for AMI, including 60 with I21–I23 and 540 without) from the 18 participating hospitals (Figure 3; Table S1).
Hospital Survey
We enlisted 8 qualified reviewers for the survey with the support of the KDCA. Experts from the stroke and AMI working groups established the protocols for reviewing hospital records (Figure 1A). These reviewers underwent specialized training sessions to master how to evaluate sampled cases within the hospital settings. With the consent and support of the participating hospitals, the trained reviewers carried out the survey, consulting online with stroke and AMI experts as needed. The survey findings were scrutinized and confirmed by the respective stroke and AMI experts.
Validation of the Algorithms
The stroke and AMI algorithms were subjected to rigorous validation using the results derived from the hospital survey. The sensitivity, specificity, accuracy, PPV, and NPVs (NPV1 and NPV2) with their weighted values were estimated for each of the 6 strata by calculating a total sampling rate, taking into account the sampling structure and the corresponding sampling rates. We obtained 2 sampling rates. To calculate the first sampling rate, we divided the number of cases in selected hospitals by the total number of cases (admission episodes) from the 2018 NHIS claims data. We computed the second sampling rate by dividing the number of sampled cases from selected hospitals for the hospital survey by the number of cases in selected hospitals, which was used as a numerator to calculate the first sampling rate. The stratum-specific total sampling rate was obtained by multiplying these 2 sampling rates in each stratum (Figure 4).
During the hospital survey, the number of cases confirmed as either stroke/AMI or not were determined within each stratum across 3 groups: algorithm-positive cases with the ICD codes (used for PPV calculation), algorithm-negative cases with ICD codes (used for NPV1 calculation), and algorithm-negative cases without ICD codes (used for NPV2 calculation). By multiplying the obtained weights (the inverse of the total sampling rates) by the number of confirmed cases in each group within each stratum, we obtained the necessary values to calculate the weighted sensitivity, specificity, accuracy, PPV, and 1-NPV.
Estimation of Stroke and AMI Incidences
To obtain the number of incident stroke and AMI cases in the entire population, as well as to break these cases down by sex, we applied the algorithms on the 2018 NHIS claims data and ascertained the number of cases within individual strata. The number of stroke and AMI cases within each stratum was computed by multiplying the stratum-specific weighted PPV by the number of algorithm-positive cases with ICD codes and the 1-NPV by the number of algorithm-negative cases with the ICD codes (NPV1) and without the ICD codes (NPV2). The resulting numbers were then added within each stratum. Summing up the numbers of stroke and AMI cases across all 6 strata provided an estimate of the incident case number of stroke and AMI in 2018 (Figure 5).
The incidence rate of stroke and AMI was calculated by dividing the total number of new stroke or AMI cases in 2018, including recurrent events, by the total person-time observed in 2018, as shown in the equation below [24–27].
The incidence proportion of stroke and AMI represents the proportion of individuals who developed stroke or AMI among the entire Korean population in 2018. For the incidence proportion, the numerator is the number of patients who experienced stroke or AMI in 2018, while the denominator is the total population at risk in mid-2018, as illustrated in the equation below [24–27]. Recurrent events were not counted.
Finally, the age- and sex-standardized incidence rate and proportion, as well as the age-standardized rate and proportion, were estimated using the mid-year population of the Republic of Korea in 2005 as a benchmark. For age standardization, the WHO standard population in 2000 served as the reference.
Statistical Analysis
In this study, continuous variables were expressed as means and standard deviations, while categorical variables were presented as counts and percentages. The sensitivity, specificity, PPV, and NPV values, as weighted, were provided for the algorithms identifying stroke and AMI. Additionally, 95% confidence intervals (CIs) for incidence rates and incidence proportions were calculated using either the normal approximation or gamma confidence limits when appropriate. All statistical analyses were conducted using SAS ver. 9.4 (SAS Institute Inc.).
Ethics Approval
The Institutional Review Board (IRB) of Seoul National University (IRB No: E-2104-135-1213, E-2109-031-1252, and H-2106-064-1225) approved this study. Additionally, the requirement for informed consent was waived by the IRB because of the retrospective nature of this study.
Results
Study Milestones and Timeline
This study spanned 1 year, running from April 2021 through March 2022, as detailed in Figure 6. While we were on track in the early study period, securing IRB approval, recruiting hospital record reviewers, and hosting their training sessions, challenges arose when it came to case selection for the hospital survey and securing collaboration from the targeted hospitals. These delays impacted our roster of reviewers, necessitating a fresh round of recruitment and training. Moreover, the time constraints that resulted from these delays put a squeeze on our hospital survey duration and our schedule for estimating epidemiological indices by applying the validated algorithms to the 2018 NHIS claims data. Despite these hurdles, we successfully validated our identification algorithms for stroke and AMI, and derived estimates on the epidemiological indices for 2018 in Korea.
Establishment of Epidemiological Indices, Definitions and Identification Algorithms
The stroke identification algorithm incorporated 23 key identifiers pertinent to acute stroke management, such as brain imaging, stroke unit care, reperfusion therapy, antithrombotic therapy, and rehabilitation. This algorithm was primarily based on a previously developed ischemic stroke identification algorithm (Figure 2A; Table S1) [13]. To ensure the detection of cases with hemorrhagic stroke in addition to ischemic stroke, a 2-tiered strategy was employed. Initially, the previously validated algorithm for ischemic stroke was put into action. Following this, another segment of the algorithm, which included the admission route, in-hospital mortality, length of hospitalization, and specific ICD codes, was used to detect hemorrhagic stroke cases. A preliminary survey conducted on 768 cases prior to the main hospital survey demonstrated that the PPV for identifying hemorrhagic stroke stood at an impressive 87.1%.
The AMI identification algorithm was tailored based on whether a hospital had the capacity to perform coronary angiography (CAG) (Figure 2B; Table S1). The categorization of being CAG-capable or not was based on whether the annual number of CAGs performed met or exceeded a threshold of 10. For CAG-capable hospitals, the algorithm included ICD codes, electrocardiogram (ECG) findings, cardiac enzyme levels (such as troponin I or troponin T), and CAG results. For CAG-incapable hospitals, the algorithm included ICD codes, ECG findings, and cardiac enzyme levels. A pilot study carried out before the main hospital survey, which involved a sample size of 757, found the PPV of the algorithm to be 73.3%.
Sampled Cases for the Hospital Survey
Due to the coronavirus disease 2019 (COVID-19)-related regulations and other logistical constraints, the planned hospital survey could not be conducted as initially intended. Instead of the originally planned 18 hospitals, only 14 hospitals participated in the survey. These consisted of 6 hospitals in the capital region (2 tertiary hospitals and 4 general hospitals) and 8 hospitals in the non-capital region (3 tertiary hospitals, 4 general hospitals, and 1 other hospital) (Table S2). The survey could not be carried out in the other hospital stratum of the capital region (Table S3).
There were also challenges in obtaining a sufficient number of algorithm-negative cases without ICD codes. This issue arose from communication gaps between the KDCA and the study teams, compounded by the tight timeline set for the study. As a result, we had to modify our approach to determining the sampling rates. We leaned on the NIHS-National Sample Cohort data regarding algorithm-negative cases without ICD codes. Further details regarding these adjustments can be found in Methods S1 and Figure S1.
In the hospital record review, we assessed 603 algorithm-positive cases and 483 algorithm-negative cases for stroke. Of these algorithm-negative cases, 54 cases had the ICD codes and 429 cases did not. For AMI, we evaluated 568 algorithm-positive cases and 546 algorithm-negative cases, with 60 of the latter having the ICD codes and 486 not having them (Table 1; Table S3). Through this review, we confirmed 578 cases (age, 72.3±11.9 years; men, 51.9%) as stroke and 520 cases (age, 70.7±10.9 years; men, 72.7%) as AMI (Tables 2 and 3). Of those confirmed as stroke, 88% involved ischemic stroke, while 98.5% of the AMI cases had undergone CAG. The in-hospital mortality rate was 6.2% for stroke cases and 6.3% for AMI cases.
Performance of the Algorithms
The results from the hospital survey indicated that the algorithm we developed to identify stroke had a sensitivity of 94.3% and a specificity of 88.6%. For AMI detection, the algorithm exhibited a sensitivity of 97.9% and a specificity of 90.1%. Table 4 details the PPV, NPV1, and NPV2 for both the stroke and AMI identification algorithms in each stratum.
The stroke and AMI identification algorithms had high PPVs, indicating a high proportion of true positive cases identified by the algorithms. However, the NPV1 for algorithm-negative cases with the ICD codes was found to be low for stroke, indicating a higher proportion of false negative cases in this category. Additionally, the PPVs for stroke and AMI were found to be lower in tertiary hospitals located in the capital region than in other types of hospitals in the capital region and the non-capital hospitals (Table 4).
Incidence of AMI and Stroke
By applying the algorithms to the 2018 NHIS claims database, we derived estimates of 150,837 incident stroke cases and 40,529 incident AMI cases. Men accounted for 55.7% of stroke cases and 66.8% of AMI cases. It is worth noting that due to the small sample size for NPV1 (comprising 10% of the total algorithm-negative group), we used the overall NPV1 (38.9% for stroke and 81.7% for AMI) instead of detailing results by each stratum to estimate incident cases. In 2018, the crude incidence rates of stroke and AMI were estimated to be 294.9 and 79.2 cases per 100,000 person-years, respectively. After standardization for age and sex, the incidence rates were calculated as 180.2 cases per 100,000 person-years for stroke and 46.1 cases per 100,000 person-years for AMI. Notably, the age-standardized incidence rates of both stroke and AMI were higher in men than in women (Table 5). When applying the individual NPV1 value obtained in each stratum instead of the overall NPV1, we found that the number of incident cases and the crude incidence rate was 152,241 cases and 297.6 per 100,000 person-years for stroke and 42,801 cases and 83.7 per 100,000 person-years for AMI, respectively.
Excluding recurrent cases, the estimated number of incident cases was 131,347 for stroke and 39,720 for AMI. The crude incidence proportions were 256.0 cases per 100,000 people for stroke and 77.4 cases per 100,000 people for AMI. After age and sex standardization, the incidence proportions were determined to be 154.1 cases per 100,000 people for stroke and 44.4 cases per 100,000 people for AMI. Similar to the incidence rates, the age-standardized incidence proportions of stroke and AMI were higher in men (Table 5).
Discussion
Our study showed that there were 150,837 incident stroke cases and 40,529 incident AMI cases across the Republic of Korea in 2018. When we excluded recurrent cases, the figures decreased to 131,347 for stroke and 39,270 for AMI. Compared to earlier studies in the Republic of Korea, which ranged from 73,501 to 130,025 [16,23,35], our estimated case number for stroke was much higher. Furthermore, our results exceeded those of the 2019 Global Burden of Disease study, which estimated 92,934 stroke cases in 2019 [2]. Likewise, the number of incident AMI cases in our study was also higher than those documented in previous studies in the Republic of Korea, which ranged from 15,893 to 25,531 [15,16,29], but was slightly lower than the figure reported in the first attempt to develop the national surveillance system (50,879 cases in 2004) [23]. Overall, our crude incidence rate and proportion exceeded those reported in previous Korean studies [15,16,23,35]. The marked rise in incident stroke and AMI cases compared to older studies might be attributed to the swift aging of the Korean population [36] coupled with an increased prevalence of traditional vascular risk factors stemming from the westernization of lifestyles [37]. While directly comparing our findings with these earlier studies can be complex due to variation in reference populations, disease definitions, and standardization methods, the upward trend in stroke and AMI case numbers we observed points to possible socioeconomic burdens linked to these health conditions.
The age- and sex-standardized incidence rates and proportions, based on the 2018 mid-year population of Korea, were either comparable or slightly lower than those in the earlier Korean studies between 2004 and 2016 [15,16,23,35]. Our study’s age-standardized incidence proportion of stroke (at 149.1 per 100,000 people using the 2000 WHO standard population) was similar to or slightly higher than the incidence of first-ever stroke observed in high-income countries [1,2,38–42], but lower than that in other Asian countries [43–45]. Regarding the age-standardized incidence proportion of AMI, our finding of 44.2 per 100,000 people was in line with the incidence of first-ever AMI in high-income Asian countries, but lower than the figures reported in Western countries [1,3,45–48].
We employed a 2-stage stratified sampling method to select cases for the hospital survey considering geographic location and hospital size in order to reflect the variability across different medical settings. This method is widely acknowledged for its effectiveness in estimating sensitivity, specificity, PPV, and NPV in national sample surveys while minimizing the standard error [30,31]. The hospital survey was conducted in 14 hospitals from 5 strata, involving a total of 2,200 cases (1,114 AMI and 1,086 stroke cases). Due to the restricted hospital access amid the COVID-19 pandemic and the constrained timeframe of our study, the numbers of cases and hospitals for the hospital survey were inadequate for comprehensive algorithm validation. In particular, the proportion of algorithm-negative cases with the corresponding ICD codes was only 10%, which limited the evaluation. This small sample size resulted in the variability of NPV1 across strata (Table 4) and the adoption of the overall NPV1 instead of the stratum-specific NPV1.
The NPV for cases classified as algorithm-negative with the ICD codes demonstrated lower values in our hospital survey. Specifically, the NPV1 for stroke was notably lower than that for AMI. This discrepancy could be largely attributed to the limited sampling of the algorithm-negative cases with ICD codes for the hospital survey: only 54 cases for stroke and 60 cases for AMI were surveyed (Table 1). This small sample size inherently limited the precision of our evaluation. Moreover, the difference can be attributed to the distinct disease characteristics and the nuances in ICD coding for AMI and stroke. The stroke ICD codes, in particular, clearly distinguish between acute and chronic stages of the condition. An analysis solely based on the ICD codes of the 2018 NHIS claims data revealed an overestimation in the reported numbers of both stroke and AMI compared to our estimates for incident stroke and AMI cases (Table S4). This overestimation seemed to be more apparent for stroke than AMI. Additionally, clinical practices for AMI, including diagnostic testing and treatment, are generally less complex than those for stroke. The complexity of stroke care might have contributed to the lower NPV1 for stroke in our study.
The PPVs in the capital region were lower than those in the non-capital region, and tertiary hospitals in the capital region exhibited the lowest PPVs among the 5 strata. This reduced PPV might be explained by the fact that, in these strata, a considerable number of patients are hospitalized long after the acute phase has passed [49]. The influence of hospital size and geographic location on the PPV and NPV underscores the need for a more comprehensive hospital survey with a larger sample size to procure PPVs and NPVs based on region, size, and other characteristics of our healthcare system.
This study has several limitations. First, we relied on the overall NPV1 rather than the stratum-specific NPV1 due to the limited sample size of algorithm-negative cases with the ICD codes per stratum, which ranged from 1 to 20 for stroke and 2 to 15 for AMI. Second, the use of stratum-specific weighted values, derived from a hospital survey with limited case numbers from 14 hospitals across 5 strata in 6 administrative regions, may impede the generalizability of our findings. This sample might not have adequately represented the diversity of the entire country. Third, our inability to include non-hospitalized incident cases, including pre-hospitalization fatalities, is another limitation. Fourth, the short study period and the small number of patients surveyed restrict our capacity to evaluate the algorithm’s accuracy in determining the exact incidence rate. Fifth, we could not estimate the lifetime first-ever incident cases due to inadequate historical data on previous stroke and AMI from the claims data.
However, the strengths of our study are noteworthy. To our knowledge, this is the first study to validate a developed algorithm, including PPV, NPV, sensitivity, and specificity, through an extensive hospital survey using a 2-staged sampling strategy for national representativeness. Unlike previous research focused mainly on the quality of acute care in AMI and stroke cases, our study uniquely estimated incidence rates and proportions using PPV and NPV stratification. Although few studies have specifically evaluated the validity of identification algorithms, our study demonstrated superior sensitivity, specificity, and PPV than those relying predominantly on ICD codes [17–23].
Additionally, our study highlights the feasibility of establishing a national surveillance system using claims data and identification algorithms for tracking the incidence of stroke and AMI. Such a system is invaluable for ongoing monitoring of these diseases and supporting nationwide epidemiological research. However, to implement effective 2-stage sampling for national hospital surveys and generate comprehensive national statistics, a unified platform for collaboration and streamlined data collection is imperative. Further studies with larger samples and a broader range of hospitals are essential to develop robust sampling strategies, ensuring accurate incidence estimates that reflect diverse healthcare settings.
Looking ahead, this study underscores the necessity for a national surveillance platform to minimize bias through a comprehensive hospital survey spanning the entire country based on claims data. It is important to continue adjusting and updating stroke and AMI identification algorithms based on extensive surveys with sample case numbers and study duration. Developing methods to estimate incident cases, including non-hospitalized ones and fatalities, is also crucial. Expanding the number and diversity of participating hospitals and allowing sufficient time for surveys will enhance the comprehensiveness and representativeness of our statistics. Moreover, the swift adaptation of the ICD-11 classification system is important for improving the accuracy of estimates [49]. Further research and the establishment of a well-organized platform for this surveillance system are essential steps forward.
HIGHLIGHTS
• This study demonstrates the feasibility of creating a national surveillance system using claims data and identification algorithms to estimate the incidence of stroke and acute myocardial infarction.
• The age- and sex-standardized incidence rates stood at 180.2 per 100,000 person-years for stroke and 46.1 per 100,000 person-years for acute myocardial infarction.
• This system facilitates ongoing monitoring of the burden of stroke and cardiovascular disease in the Republic of Korea and aids in expanding nationwide epidemiological research.
Supplementary Material
Supplementary data are available at https://doi.org/10.24171/j.phrp.2023.0248.
Notes
Ethics Approval
The Institutional Review Board (IRB) of Seoul National University (IRB No: E-2104-135-1213, E-2109-031-1252, and H-2106-064-1225) approved this study. Additionally, the requirement for informed consent was waived by the IRB because of the retrospective nature of this study.
Conflicts of Interest
The authors have no conflicts of interest to declare.
Funding
This work was funded by the Korean Disease Control and Prevention Agency.
Availability of Data
All data analyzed during this study are included in this published article. While the datasets are not publicly available, inquiries regarding access to the study data can be directed to the corresponding authors. All authors read and approved the final manuscript.
Authors’ Contributions
Conceptualization: HJK, JoL, HJB; Data curation: SEK, JP, JES, JiL; Formal analysis: TJK, HSL, SEK, JP, JES, JiL; Funding acquisition: HJB; Investigation: TJK, HSL, SEK, JYK, JP, JES, JiL; Methodology: TJK, HSL, SEK, JYK, JiL, JHC, HCK, DHS, HYL, BJK, WKS, JMP, SJL, KHJ; Project administration: JHH, JP, JoL, KH, HJK, HJB; Software: SEK, JP, JES, JiL; Supervision: HJK, SUK, YCH, HSK, JuL, HJB; Validation: TJK, HSL, SEK, JYK, JP, JES, JiL; Visualization: TJK, SEK, JP; Writing–original draft: TJK, HJB; Writing–review & editing: all authors.
Additional Contributions
We wish to extend our sincere gratitude to the National Health Insurance Service and the Health Insurance Review and Assessment Service for their valuable contribution in providing the data for our research.