Skip to main content

Predicting post-COVID-19 condition in children and young people up to 24 months after a positive SARS-CoV-2 PCR-test: the CLoCk study

Abstract

Background

Predicting which children and young people (CYP) are at the highest risk of developing post-COVID-19 condition (PCC) could improve care pathways. We aim to develop and validate prediction models for persistent PCC up to 24 months post-infection in CYP.

Methods

CYP who were PCR-positive between September 2020 and March 2021, with follow-up data up to 24-months post-infection, were analysed. Persistent PCC was defined in two ways, as PCC at (a) 3, 6, 12 and 24 months post-infection (N = 943) or (b) 6, 12 and 24 months post-infection (N = 2373). Prediction models were developed using logistic regression; performance was assessed using calibration and discrimination measures; internal validation was performed via bootstrapping; the final model was adjusted for overfitting.

Results

While 24.7% (233/943) of CYP met the PCC definition 3 months post-infection, only 7.2% (68/943) continued to meet the PCC definition at all three subsequent timepoints, i.e. at 6, 12 and 24 months. The final models predicting risk of persistent PCC (at 3, 6, 12 and 24 months and at 6, 12 and 24 months) contained sex (female), history of asthma, allergy problems, learning difficulties at school and family history of ongoing COVID-19 problems, with additional variables (e.g. older age at infection and region of residence) in the model predicting PCC at 6, 12 and 24 months. Internal validation showed minimal overfitting of models with good calibration and discrimination measures (optimism-adjusted calibration slope: 1.064–1.142; C-statistic: 0.724–0.755).

Conclusions

To our knowledge, these are the only prediction models estimating the risk of CYP persistently meeting the PCC definition up to 24 months post-infection. The models could be used to triage CYP after infection. CYP with factors predicting longer-term symptomology, may benefit from earlier support.

Peer Review reports

Background

Post-COVID-19 condition (PCC), also known as long COVID, is difficult to research with many additional challenges compared to classic epidemiological studies [1]. In England, by June 2022, over 80% of 5-to-18-year-old children had antibodies against SARS-CoV-2, the virus responsible for COVID-19 and the recent pandemic. This likely reflects a combination of widespread (asymptomatic and symptomatic) infection and vaccination. For example, between January and March 2021, prior to vaccination rollout to children and young people (CYP), 85,546 CYP in England had a positive SARS-CoV-2 PCR test [2]. By July 2022, 62.4% of pupils aged 12 to 15 years and 80.5% of pupils aged 16 to 17 years at the start of the 2021/2022 academic year had received at least one dose of a coronavirus (COVID-19) vaccine, 45.3% and 69.8%, respectively, had received at least two doses [3]. What has also become clear is that some CYP report persistent symptoms months after acute SARS-CoV-2 infection, even if they were asymptomatic or had a low symptom burden at the time of infection [4,5,6]. Thus, being able to accurately predict PCC is valuable in order to identify those at highest risk and direct them towards relevant care. Such triaging is particularly relevant post-pandemic when health services are under unprecedented pressure [7].

Two particular challenges associated with researching PCC include the potential waxing and waning of symptoms after infection and the lack of a comparator group of CYP who have never been positive for SARS-CoV-2, due to widespread infection, especially since the emergence of highly transmissible variants. The CLoCk study [8] is a longitudinal cohort of SARS-CoV-2 PCR-positive and matched test-negative CYP, PCR-tested between September 2020 and March 2021 when they were aged 11 to 17 years. Using this data we have previously shown that, whilst the overall prevalence of PCC remains broadly stable up to 12 months post-infection, many CYP are classified as meeting the research definition of PCC [9] for the first time at 6 or 12 months post-infection [10, 11]. Therefore, examining PCC longitudinally and identifying those who persistently meet the PCC research definition is important in terms of characterising and predicting those who are likely to be impacted over a long time-period. We have also previously developed and validated a model to predict PCC in CYP 3 months after PCR testing [12]. In that analysis, we included both PCR test-positive and test-negative CYP and examined predictors of meeting our published consensus PCC research definition once (i.e. at 3 months post-testing) [9]. However, as we can no longer be certain that the original test-negative group remains uninfected, our new analyses will be restricted to the original test-positive group of CYP who were infected when the wild type and Alpha (B.1.1.7) variants were dominant.

Thus, using data from the CLoCk study [8] original test-positive group, we address two broad aims:

  1. 1.

    To describe the characteristics of CYP infected with the wild type or Alpha variants who persistently meet (vs. do not persistently meet) the PCC research definition over a 24-month period post-infection. Specifically, we describe these characteristics in the CLoCk sample who met the PCC research definition at 3, 6, 12 and 24 months post-infection (and similarly in a supplementary analysis, those who met the PCC research definition at 6, 12 and 24 months post-infection).

  2. 2.

    To develop and validate prediction models for persistent PCC in CYP up to 24 months post-infection.

Methods

The CLoCk study, described in detail elsewhere [8], is a cohort study of SARS-CoV-2 PCR-positive CYP, PCR-tested between September 2020 and March 2021 when they were aged 11 to 17 years, matched by month of test, age, sex assigned at birth, and geographical area to SARS-CoV-2 test-negative CYP using the SARS-CoV-2 testing dataset held by the United Kingdom Health Security Agency (UKHSA). After obtaining written informed consent, CYP completed an online questionnaire about their health at the time of their SARS-CoV-2 PCR test (“baseline”; retrospectively reported) and at approximately 3, 6, 12 and 24 months after their index-PCR test (with different numbers of respondents at each time point depending on the time of recruitment into the study relative to their test date). Ethical approval was provided by the Health Research Authority Yorkshire and the Humber – South Yorkshire Research Ethics Committee (REC reference: 21/YH/0060; IRAS project ID: 293,495) and the study is registered with the ISRCTN registry (ISRCTN 34804192). Here, in our main analysis, we examine the sample of original test-positive CYP who responded at 3, 6, 12 and 24 months post-infection (N = 943) and, in a supplementary analysis, those from the original test-positive sample who responded at 6, 12 and 24 months post-infection (N = 2373); see Fig. 1 for details.

Fig. 1
figure 1

Data flow diagrams for the analytical samples under consideration*. *For a detailed overview of the CLoCk sampling strategy see Nugawela et al. Int J Epidemiol 2024 53(1); ** we excluded 2 CYP from the final analytical sample because they had missing data on one of two questions: (i) family history of hospital attendance due to COVID-19 and/or (ii) family history of ongoing problems due to COVID-19

Measures

The CLoCk questionnaire included demographics, elements of the International Severe Acute Respiratory and emerging Infection Consortium (ISARIC) Paediatric COVID-19 questionnaire [13], the Mental Health of Children and Young People in England surveys [14] and, originally, 21 symptoms (mostly assessed as present/absent). Validated health scales including the EQ-5D-Y [15] (as a measure of quality of life and function) were also included. The questionnaire was largely unchanged between study enrolment and subsequent follow-ups: redundant questions (e.g. demographics and symptoms at time of testing) were removed at follow-ups, and questions on additional symptoms (e.g. sleeping difficulties) were added.

Outcome

The Delphi research definition of PCC in CYP [9] was operationalised at the time of questionnaire completion (i.e. at all timepoints from 3 to 24 months post-infection) as experiencing ≥ 1 symptom AND problems with mobility, self-care, doing usual activities or having pain/discomfort or feeling very worried/sad, based on the EQ-5D-Y scale. CYP meeting this operationalised research definition were classified as having PCC at the time of questionnaire completion. To align our main analysis with the WHO clinical case definition of PCC in CYP [16], our main outcome of interest was defined as persistent PCC from 3 to 24 months. This meant meeting the PCC definition at 3, 6, 12 and 24 months. Our supplementary analysis examined meeting the PCC definition at 6, 12 and 24 months in a larger sample of CYP; see Fig. 1.

Potential predictors

Pre-specified potential predictors were chosen based on their distribution in the dataset and their known association with PCC (see Table 1 for details). They included: sex (assigned at birth); age at infection (i.e. age at PCR-testing); ethnicity; region of residence; deprivation (proxied by the Index of Multiple Deprivation (IMD)); history of health conditions (specifically asthma and allergies); learning difficulties at school (pre-pandemic); having an education, health and care plan (EHCP) in place pre-pandemic, family history of hospital visits due to COVID-19 and family history of ongoing problems due to COVID-19. Sex, age, region of residence and IMD were obtained from UKHSA databases. At study enrolment, the following were self-reported: ethnicity, asthma, allergies, learning difficulties and having an EHCP. Family history of hospital visits and ongoing COVID-19 problems were self-reported for up to 24 months.

Table 1 Odds ratios (95% CIs) of associations between participant characteristics and persistent PCC from (a) 3 to 24 months and (b) 6 to 24 months post-infection

Sample size and missing data

We assessed whether our analytical samples were sufficiently powered to estimate the overall observed outcome probability, and how many predictor parameters could be considered before overfitting/precision became a concern [17]. We’ve used the pmsampsize STATA package and considered (i) small overfitting (i.e. a shrinkage factor of predictor effects ≤ 10%), (ii) small absolute difference of 0.05 in the model’s apparent and adjusted Nagelkerke’s R-squared value and (iii) precise estimation within 0.05 of the average outcome risk in the population. For the 3- to 24-month main analysis sample (n = 943), we assumed an outcome prevalence of 7.21% and a C-statistic of 0.75. For the 6- to 24-month supplementary analysis sample (n = 2373), we assumed an outcome prevalence of 11.3% and a C-statistic of 0.75. For the 3- to 24-month sample, the maximum number of parameters that could be estimated during model development was 6 with 11.33 events per candidate predictor parameter. For the 6- to 24-month sample, the maximum number of parameters that could be considered was 23; the event per candidate predictor parameter value was 11.65.

There was no missing data in our main analysis. In our supplementary analysis, 2 CYP had missing data on family history of hospital attendance and/or family history of ongoing problems due to COVID-19; they were dropped from the model-building process (Fig. 1).

Statistical analysis

Characterising CYP infected with COVID-19 who persistently meet the PCC definition over a 24-month period

The prevalence of PCC at 3 months that continued at all follow-up timepoints to 24 months and at 6 months that continued to 24 months was calculated and depicted in bar charts. We also, describe the sociodemographic characteristics of the samples stratified by PCC persistence.

Developing and validating prediction models for persistent PCC up to 24 months post-infection

The following prediction modelling development and validation process for persistent PCC was carried out in both our main and supplementary analytical samples. Univariable associations between each potential predictor and persistent PCC were examined. Next, a multivariable logistic regression model was built using the least absolute shrinkage and selection operator (LASSO) technique to identify all potential predictors [18]. After this step, the model was further refined to ensure it had the required number of parameters (as described in the sample size calculation above). Model calibration, the agreement between observed and predicted probabilities of being classified as having persistent PCC, was assessed using calibration plots, calibration- in-the-large and calibration slope statistics [19]. Model discrimination, the ability of the model to differentiate between CYP who were classified as having persistent PCC and those who did not, was quantified using the C-statistic [19]. Internal validity of the final model was assessed using 100 bootstrap samples (drawn with replacement) [19, 20]. Confidence intervals (CIs) for the performance measures, including calibration-in-the-large, were estimated using bootstrapping. Specifically, 100 bootstrap samples were drawn with replacements from the original dataset. For each bootstrap sample, the performance measures were calculated, and the distribution of these measures across the 100 bootstrap samples was used to derive the 95% CIs. Model overfitting (optimism) was also estimated using the bootstrap samples. We calculated two shrinkage factors to adjust for overfitting: (i) a uniform shrinkage factor (i.e. the optimism-adjusted calibration slope derived using bootstrap samples) and (ii) the Heuristic shrinkage factor [21]. For use in the next step, we selected the shrinkage factor requiring the least adjustment (i.e. closest to one). The original β coefficients were multiplied by this shrinkage factor to obtain the optimism-adjusted coefficients; the model intercept was re-estimated based on these shrunken model coefficients to generate the final model [20, 22]. Data management and analysis were performed using STATA18. We followed guidelines by the Prognosis Research Strategy (PROGRESS) Group [23,24,25] and model development and validation phases followed the suggested methods. [20, 23, 26,27,28] The study is reported according to the Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD) statement [27] (see Additional File 1: Table S1).

Sensitivity analyses: model performance in key subgroups

We assessed, in our main analytical sample, whether the prediction was consistent across key subgroups, by determining model performance metrics (calibration slope, calibration-in-the-large and C-statistic) in subgroups stratified by age at infection (11–14, 15–17 years), sex at birth (male, female) and IMD quintiles (most-to-least deprived).

Role of the funding source

The funders had no role in the study design, collection, analysis, and interpretation of data, nor in the writing of the report and in the decision to submit for publication.

Results

Nine hundred forty-three of 23,048 CYP were included in our 3-to-24-month analytical sample, while 2373 of 55,447 CYP were included in our 6-to-24-month analytical sample (Fig. 1). In general, our analytical samples contained more females and least deprived CYP compared to those invited and envisioned to be in our population (Additional File 1: Table S2).

Characteristics of CYP meeting the PCC research definition up to 24 months post-infection

While 25% (233/943) of CYP met the research definition of PCC at 3 months post-infection, only 7% (68/943) continued to meet the PCC definition persistently to 24 months (Fig. 2, Additional File 1: Table S3). Those who met the PCC definition persistently from 3 to 24 months were more likely to be female (vs. male), older at infection (vs. younger), live in more (vs. least) deprived areas, have a history of asthma, allergy problems, learning difficulties at school, an EHCP, a family history of hospital visits or ongoing problems due to COVID-19 (Table 1, Additional File 1: Table S3). For example, the odds ratio (OR) of meeting the PCC definition at 3, 6, 12 and 24 months was 1.58 (95% CI: 1.41, 1.76) comparing CYP who were 15 to 17 years old at infection to those who were 11 to 14 years old. There were also differences with respect to ethnicity and region of residence. In the supplemental analysis, while 25% (601/2373) met the PCC definition at 6 months post-infection, 11% (268/2373) of CYP continued to meet the PCC definition at all subsequent timepoints, i.e. at 12 and 24 months (Fig. 2, Additional File 1: Table S3). Those who met the PCC definition at all these timepoints had a broadly similar profile to the sub-sample meeting the PCC definition at 3, 6, 12 and 24 months. For example, the OR of meeting the PCC definition at 6, 12 and 24 months was 1.58 (95% CI: 1.21, 2.07) comparing CYP who were 15 to 17 years old to CYP who were 11 to 14 years old at infection.

Fig. 2
figure 2

Number and prevalence (n (%)) of CYP continuing to meet the PCC definition over time. NB: the prevalence of persistent PCC up to 24 months (shown as the bar at 24 months since a PCR positive test) is the outcome that is predicted in the subsequent models

Model development

The final model, predicting persistent PCC from 3 to 24 months included sex, history of asthma, allergy, learning difficulties at school and family history of ongoing problems due to COVID-19 (Additional File 1: Table S4). For the final model predicting persistent PCC from 6 to 24 months, additional predictors were age, region of residence, IMD, an EHCP and family history of hospital visits due to COVID-19 (Additional File 1: Table S4).

Model validation

The original predictive model was well calibrated in the model development data to predict persistent PCC from 3 to 24 months, with an apparent calibration slope of 1.000 (95% CI: 0.999, 1.000) and an apparent calibration-in-the-large of 0.000 (95% CI: − 0.000, 0.000; Additional File 1: Table S5). The discrimination was also good, with a C-statistic of 0.757 (95% CI: 0.699, 0.814). The bootstrapping approach provided a shrinkage factor of 0.9398; the heuristic shrinkage factor was 0.9139. We used the bootstrapping shrinkage factor as it required a smaller adjustment and applied it to the original β coefficients to obtain the optimism-adjusted coefficients before re-estimating the intercept for the final model given in Additional File 1: Table S6. The final shrunken predictive model showed good overall model calibration: confirmed by the calibration-in-the-large (0.000), calibration slope (1.064) and a calibration plot showing narrow confidence intervals and closely aligned predicted and observed probabilities for 10 equally sized risk groups (Additional File 1: Fig. S1). It also showed moderate-to-strong discrimination with a C-statistic of 0.755 (Table 2, Fig. 3a).

Table 2 Model performance statistics of the original/final shrunken models
Fig. 3
figure 3

Area under the curve for the final shrunken models

In the model development data predicting persistent PCC from 6 to 24 months, the original model was well calibrated (calibration slope = 1.000 (95% CI: 0.999, 1.000); calibration-in-the-large = 0.000 (95% CI: − 0.000, 0.000; Additional File 1: Table S5) and showed good discrimination (C-statistic = 0.737 (95% CI: 0.698, 0.766). The bootstrapping approach provided a shrinkage factor of 0.8755; the heuristic shrinkage factor was 0.8725. Using the bootstrapping shrinkage factor we obtained the optimism-adjusted coefficients given in Additional File 1: Table S6. The final shrunken predictive model showed strong discrimination (C-statistic = 0.724; Table 2, Fig. 3b); and, the final shrunken model also showed good overall calibration (calibration slope = 1.142; calibration-in-the-large = 0.005; Table 2).

Sensitivity analyses

Model performance statistics in age, sex at birth and IMD subgroups demonstrated broadly good calibration and discrimination in all subgroups, with the exception of the least deprived IMD group (Additional File 1: Table S7). For example, the C-statistic for age at infection, sex at birth and IMD quintiles 1 to 4 subgroups, ranged from 0.705 to 0.847. In contrast, IMD quintile 5 (i.e. least deprived) had a C-statistic of 0.629.

Worked examples

We demonstrate with hypothetical examples the predicted risk of persistent PCC in Table 3. As an example, the predicted risk of persistent PCC from 3 to 24 months post-infection for a hypothetical male, with a history of asthma and a family history of ongoing problems due to COVID-19 was 8.1% (worked example 3). If a similar boy also had learning difficulties at school pre-pandemic, it would be 22.8% (worked example 4).

Table 3 Hypothetical examples of predicted risk of persistent PCC, using our prediction models

Discussion

This study shows two key findings. Firstly, the data show that many CYP who initially meet the research definition of PCC get better over time. For example, while 25% of CYP met the PCC definition 3 months post-infection, only 7% of CYP continued to meet this definition at 6, 12 and 24 months post-infection. While acknowledging that SARS-CoV-2 infection can have a long-lasting impact on some CYP [4], our finding is in line with previous reports from the CLoCk study [10], and elsewhere [29] and demonstrates that post-infection symptoms in CYP generally improve over time. Second, in our final models, we found that female sex, history of asthma, allergy problems, learning difficulties at school and family history of ongoing COVID-19 problems all predicted persistent PCC from 3 to 24 months and from 6 to 24 months. In addition, older age at the time of infection, living in London or the South West, living in more deprived areas at the time of infection, being allocated an EHCP pre-pandemic and having a family history of hospital visits due to COVID-19 also predicted persistent PCC from 6 to 24 months. To our knowledge, these are the only prediction models estimating the risk of CYP persistently meeting the PCC definition up to 24 months post-infection. The models could be used to triage CYP early after infection to identify those who may benefit from earlier targeted support.

We acknowledge the study limitations. The focus of our study was on model development and internal validation, and therefore we did not conduct external validation in this study. However, external validation is recommended prior to clinical application of these models. In particular, it is important to externally validate these models in different settings and populations [30]. For example, these models can be validated in cohorts of CYP after infection by other variants (for example, the Omicron variant [31]). In addition to the above, selection bias may exist in our study. Approximately 4% of the target population of invited test positives were part of the examined analytical samples, and these CYP may not be representative of the broader population of CYP who PCR-tested positive for COVID-19 in England, between September 2020 and March 2021. Moreover, our models have been developed based on infection in CYP by wild-type or the Alpha (B.1.1.7) variant and may not be transferable to other more recent variants. However, previous studies suggest that post-infection symptom profiles are similar for different variants [11]. Nevertheless, as variant, background infection and vaccination rates differ, so too might predictors of persistent PCC. Therefore, as part of updating and externally validating the models developed here, it is important to apply them in cohorts of CYP that have been infected during subsequent COVID-19 infection waves and have been followed up for (at least) 24 months, when such data become available. Importantly, due to changes in COVID-19 testing policies in England over time, we were unable to account sufficiently for subsequent reinfections in our analytical samples. Therefore, while we are certain of the CYP’s PCR-positive status at baseline, we acknowledge we cannot accurately distinguish whether subsequent symptoms were due to acute reinfection or persistent symptoms from the original (or subsequent) infection. We were also restricted in terms of the potential predictors examined for two reasons. First, because CYP enrolled into our study 3 months post-index-infection, we did not use information that could be subject to recall bias, for example, self-reported physical/mental health at the time of infection. Second, we were limited by questions asked in the CLoCk questionnaire (for example, we did not have information on the severity of index infection), and this may further compromise our prediction model. However, we did use available questionnaire information on variables deemed less likely to be subject to recall bias, for example, having an EHCP in place pre-pandemic. We also used variables collected post-baseline that were likely to be relevant (but were perhaps less prevalent at the time of study enrolment) for example, family history of ongoing problems due to COVID-19. PCC prevalence estimates in CYP vary greatly, with systematic reviews reporting prevalences ranging from 3.67% to 66.49% [32, 33]. Thus, there was no external, gold-standard prevalence to use in our sample size calculations and we used outcome prevalences based on our analytical samples. In addition, our sample size for predicting persistent PCC from 3 to 24 months was relatively small (n = 943) and this had implications in terms of the number of predictors we could include in the model and, therefore, on model performance. Nonetheless, both our final models had good predictive ability, calibration and discrimination. While we are able to characterise and predict PCC up to 24 months post-infection, extending our previous work [12], we are unable to say with confidence whether PCC was present (or not) between data collection sweeps. It may be that our operationalisation of the PCC definition has been too inclusive. However, in the absence of an objective biomarker of PCC, we have had to rely on the consensus Delphi-research definition of PCC [9], which broadly aligns with the WHO definition [16] with the important exception that the latter requires symptoms to have arisen within 3 months from infection. It is for this reason that we developed, and placed most emphasis on, a model predicting persistent PCC from 3 to 24 months, despite data only being available for a small sub-sample at all these timepoints. Finally, as with any prediction model, caution is required for predictions based on data extrapolation/situations where there are only a very small number of observations for different predictor combinations.

Conclusions

Understanding which CYP are at risk of experiencing persistent PCC for months/years after infection is important for decision-making and risk management by the individual, their families and care providers. Using data from a large national cohort study of CYP, we update and extend our previously developed prediction model for experiencing persistent PCC for up to 24 months post-infection [12]. Further studies to determine the clinical and pathophysiological phenotype of PCC are warranted. In the interim, while our models need external validation (in different datasets, countries, etc.), we hope that they will eventually serve as a useful tool for the early identification and management of CYP at risk of persistent PCC.

Data availability

Data is not publicly available. All requests for data will be reviewed by the Children & young people with Long COVID (CLoCk) study team, to verify whether the request is subject to any intellectual property or confidentiality obligations. Requests for access to the participant-level data from this study can be submitted via email to ich.clock@ucl.ac.uk with detailed proposals for approval. A signed data access agreement with the CLoCk team is required before accessing shared data. Code is not made available as we have not used custom code or algorithms central to our conclusions.

Abbreviations

CYP:

Children and young people

CIs:

Confidence intervals

EHCP:

Education, health and care plan

IMD:

Index of Multiple Deprivation

ISARIC:

International Severe Acute Respiratory and emerging Infection Consortium

LASSO:

Least absolute shrinkage and selection operator

PCC:

Post-COVID-19 condition

PROGRESS:

Prognosis Research Strategy

TRIPOD:

Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis

UKHSA:

United Kingdom Health Security Agency

References

  1. Pan D, Pareek M. Toward a universal definition of post–COVID-19 condition—how do we proceed? JAMA Netw Open. 2023;6(4):e235779.

    Article  PubMed  Google Scholar 

  2. Stephenson T, Pinto Pereira SM, Shafran R, de Stavola BL, Rojas N, McOwat K, et al. Physical and mental health 3 months after SARS-CoV-2 infection (long COVID) among adolescents in England (CLoCk): a national matched cohort study. Lancet Child Adolesc Health. 2022;6(4):230–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Office for National Statistics (ONS). Coronavirus (COVID-19) vaccination uptake in school pupils, England: up to 22 July 2022. ONS website. Available from: https://www.ons.gov.uk/peoplepopulationandcommunity/healthandsocialcare/healthandwellbeing/articles/coronaviruscovid19vaccinationuptakeinschoolpupilsengland/upto22july2022. Accessed 2023 Dec 11.

  4. Behnood S, Newlands F, O’Mahoney L, Haghighat Ghahfarokhi M, Muhid MZ, Dudley J, et al. Persistent symptoms are associated with long term effects of COVID-19 among children and young people: Results from a systematic review and meta-analysis of controlled studies. PLoS ONE. 2023;18(12): e0293600.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Molteni E, Sudre CH, Canas LS, Bhopal SS, Hughes RC, Antonelli M, et al. Illness duration and symptom profile in symptomatic UK school-aged children tested for SARS-CoV-2. Lancet Child Adolesc Health. 2021;5(10):708–18.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Castagnoli R, Votto M, Licari A, Brambilla I, Bruno R, Perlini S, et al. Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infection in children and adolescents. JAMA Pediatr. 2020;174(9):882.

    Article  PubMed  Google Scholar 

  7. House of Commons Health and Social Care Committee. Clearing the backlog caused by the pandemic. Available from: https://committees.parliament.uk/publications/8352/documents/85020/default/. Accessed 2023 Dec 11.

  8. Nugawela MD, Pinto Pereira SM, Rojas NK, McOwat K, Simmons R, Dalrymple E, Ford T, Garg S, Hargreaves D, Semple MG, Xu L, Shafran R, Stephenson T, CLoCk Consortium. Data Resource Profile: the Children and Young People with Long COVID (CLoCk) Study. Int J Epidemiol. 2024;53(1):dyad158. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/ije/dyad158.

  9. Stephenson T, Allin B, Nugawela MD, Rojas N, Dalrymple E, Pinto Pereira S, et al. Long COVID (post-COVID-19 condition) in children: a modified Delphi process. Arch Dis Child. 2022;107(7):674–80.

    Article  PubMed  Google Scholar 

  10. Pinto Pereira SM, Shafran R, Nugawela MD, Panagi L, Hargreaves D, Ladhani SN, et al. Natural course of health and well-being in non-hospitalised children and young people after testing for SARS-CoV-2: a prospective follow-up study over 12 months. The Lancet Regional Health - Europe. 2023;25: 100554.

    Article  PubMed  Google Scholar 

  11. Pinto Pereira SM, Mensah A, Nugawela MD, Stephenson T, Ladhani SN, Dalrymple E, et al. Long COVID in children and young after infection or reinfection with the Omicron variant: A prospective observational study. J Pediatr. 2023;259: 113463.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Nugawela MD, Stephenson T, Shafran R, De Stavola BL, Ladhani SN, Simmons R, et al. Predictive model for long COVID in children 3 months after a SARS-CoV-2 PCR test. BMC Med. 2022;20(1):465.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Sigfrid L, Buonsenso D, DunnGalvin A, Garralda E, Jones C, Nicholls DE, et al. ISARIC Global COVID-19 paediatric follow-up. Available from: https://isaric.org/research/covid-19-clinical-research-resources/paediatric-follow-up/. Accessed 2023 Aug 1.

  14. Newlove-Delgado T, Marcheselli F, Williams T, Mandalia D, Davis J, McManus S, et al. Mental Health of Children and Young People in England 2022 - wave 3 follow up to the 2017 survey NHS digital. 2022. Available from: https://digital.nhs.uk/data-and-information/publications/statistical/mental-health-of-children-and-young-people-in-england/2022-follow-up-to-the-2017-survey. Accessed 2023 Aug 1.

  15. Wille N, Badia X, Bonsel G, Burström K, Cavrini G, Devlin N, et al. Development of the EQ-5D-Y: a child-friendly version of the EQ-5D. Qual Life Res. 2010;19(6):875–86.

    Article  PubMed  PubMed Central  Google Scholar 

  16. World Health Organisation (WHO). A clinical case definition for post COVID-19 condition in children and adolescents by expert consensus. 2023. Available from: WHO/2019-nCoV/Post_COVID-19_condition/CA/Clinical_case_definition/2023.1 .

  17. Riley RD, Ensor J, Snell KIE, Harrell FE, Martin GP, Reitsma JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. 2020;368:m441. https://doiorg.publicaciones.saludcastillayleon.es/10.1136/bmj.m441.

  18. Heinze G, Wallisch C, Dunkler D. Variable selection – A review and recommendations for the practicing statistician. Biom J. 2018;60(3):431–49.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Steyerberg EW, Vickers AJ, Cook NR, Gerds T, Gonen M, Obuchowski N, et al. Assessing the performance of prediction models. Epidemiology. 2010;21(1):128–38.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Steyerberg EW. Clinical prediction models. New York, NY: Springer, New York; 2009.

    Book  Google Scholar 

  21. Van Houwelingen JC, Le Cessie S. Predictive value of statistical models. Stat Med. 1990;9(11):1303–25.

    Article  PubMed  Google Scholar 

  22. Collins GS, Reitsma JB, Altman DG, Moons K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med. 2015;13(1):1.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Steyerberg EW, Moons KGM, van der Windt DA, Hayden JA, Perel P, Schroter S, et al. Prognosis Research Strategy (PROGRESS) 3: Prognostic model research. PLoS Med. 2013;10(2): e1001381.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Hingorani AD, Windt DA, Riley RD, Abrams K, Moons KGM, Steyerberg EW, et al. Prognosis research strategy (PROGRESS) 4: Stratified medicine research. BMJ. 2013;346(346(feb05 1)):e5793–e5793.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Altman DG, Vergouwe Y, Royston P, Moons KGM. Prognosis and prognostic research: validating a prognostic model. BMJ. 2009;338:b605–b605.

    Article  PubMed  Google Scholar 

  26. Royston P, Moons KGM, Altman DG, Vergouwe Y. Prognosis and prognostic research: Developing a prognostic model. BMJ. 2009;338:b604–b604.

    Article  PubMed  Google Scholar 

  27. Collins GS, Reitsma JB, Altman DG, Moons K. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD Statement. BMC Med. 2015;13(1):1.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Steyerberg EW, Vergouwe Y. Towards better clinical prediction models: seven steps for development and an ABCD for validation. Eur Heart J. 2014;35(29):1925–31.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Atchison CJ, Whitaker M, Donnelly CA, Chadeau-Hyam M, Riley S, Darzi A, et al. Characteristics and predictors of persistent symptoms post-COVID-19 in children and young people: a large community cross-sectional study in England. Arch Dis Child. 2023;108(7):e12–e12.

    Article  PubMed  Google Scholar 

  30. Moons KGM, Altman DG, Reitsma JB, Ioannidis JPA, Macaskill P, Steyerberg EW, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis Or Diagnosis (TRIPOD): Explanation and elaboration. Ann Intern Med. 2015;162(1):W1-73.

    Article  PubMed  Google Scholar 

  31. Pinto Pereira SM, Nugawela MD, Stephenson T, Foret-Bruno P, Dalrymple E, Xu L, et al. Post-Covid-19 condition (Long Covid) in children and young people 12 months after infection or reinfection with the Omicron variant: a prospective observational study. Sci Rep. 2024;14(1):9957.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. Zheng YB, Zeng N, Yuan K, Tian SS, Yang YB, Gao N, et al. Prevalence and risk factor for long COVID in children and adolescents: A meta-analysis and systematic review. J Infect Public Health. 2023;16(5):660–72.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Pellegrino R, Chiappini E, Licari A, Galli L, Marseglia GL. Prevalence and clinical presentation of long COVID in children: a systematic review. Eur J Pediatr. 2022;181(12):3995–4009.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Michael Lattimore, UKHSA, as Project Officer for the CLoCk study. Olivia Swann and Elizabeth Whittaker designed the elements of the ISARIC Paediatric COVID-19 follow-up questionnaire which were incorporated into the online questionnaire used in this study to which all the CLoCk Consortium members contributed.

Additional Co-Applicants on the grant application and CLoCk Consortium members (alphabetical):

Bianca De Stavola, University College London, b.destavola@ucl.ac.uk (Orcid ID: 0000-0001-7853-0528)

Esther Crawley, University of Bristol, Esther.Crawley@bristol.ac.uk (Orcid ID: 0000-0002-2521-0747)

Kishan Sharma, Manchester University NHS Foundation Trust (sadly deceased)

Malcolm Semple, University of Liverpool, M.G.Semple@liverpool.ac.uk, (Orcid ID: 0000-0001-9700-0418)

Marta Buszewicz, University College London, m.buszewicz@ucl.ac.uk (Orcid ID: 0000-0003-4016-5857)

Michael Levin, Imperial College London, m.levin@imperial.ac.uk (Orcid ID: 0000-0003-2767-6919)

Shruti Garg, University of Manchester, Shruti.Garg@mft.nhs.uk (Orcid ID: 0000-0002-4472-4583)

Dougal Hargreaves, Imperial College London, d.hargreaves@imperial.ac.uk (Orcid ID: 0000-0003-0722-9847)

Vanessa Poustie, University of Liverpool, v.poustie@liverpool.ac.uk (Orcid ID: 0000-0003-2338-8768)

Funding

This work is independent research jointly funded by the National Institute for Health and Care Research (NIHR) and UK Research & Innovation (UKRI) who have awarded funding grant number COVLT0022. All research at Great Ormond Street Hospital Charity NHS Foundation Trust and UCL Great Ormond Street Institute of Child Health is made possible by the NIHR Great Ormond Street Hospital Biomedical Research Centre. The views expressed in this publication are those of the authors and not necessarily those of the NHS, the NIHR or UKRI. SMPP is supported by a UK Medical Research Council Career Development Award (ref: MR/P020372/1) and a Senior Non-clinical fellowship (ref: MR/Y009398/1).

Author information

Authors and Affiliations

Authors

Consortia

Contributions

MDN designed and conducted the statistical analyses for the manuscript, accessed and verified the data and reviewed the manuscript. TS designed the statistical analyses for the manuscript, conceived the idea for the study, submitted the successful grant application and reviewed the manuscript. RS designed the statistical analyses for the manuscript, contributed to the design of the study, submitted the ethics and R&D applications and reviewed the manuscript. TC contributed to the design of the study and reviewed the manuscript. ED contributed to the design of the study and reviewed the manuscript. TF contributed to the design of the study and reviewed the manuscript. LFS reviewed, formatted, and referenced the manuscript. AH contributed to the design of the study and reviewed the manuscript. IH contributed to the design of the study and reviewed the manuscript. SL operationalised the regulatory requirements for the study and reviewed the manuscript. KM adapted the questionnaire for the online SNAP survey platform. RS accessed and verified the data, designed the participant sampling and dataflow. OS contributed to the design of the study and reviewed the manuscript. EW contributed to the design of the study and reviewed the manuscript. SMPP designed and conducted the statistical analyses for the manuscript, accessed and verified the data and drafted the manuscript.

Corresponding author

Correspondence to Snehal M. Pinto Pereira.

Ethics declarations

Ethics approval and consent to participate

The study was approved by Yorkshire and the Humber–South Yorkshire Research Ethics Committee (REC reference: 21/YH/0060; IRAS project ID: 293495).

Consent for publication

None of the funders was involved in study design, data collection, analysis or writing.

Competing interests

TS is Chair of the Health Research Authority and therefore recused himself from the Research Ethics Application. TC is a member of the National Institute for Health and Care Excellence committee for long COVID. She has written self-help books on chronic fatigue and has done workshops on chronic fatigue and post infectious syndromes. RS co-authored a book published in August 2020, titled Oxford Guide to Brief and Low Intensity Interventions for Children and Young People. TF’s research group receives funding for research consultancy from Place2Be, a third sector organisation providing mental health training, support, and interventions to schools across the UK. All remaining authors have no conflicts of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

12916_2024_3708_MOESM1_ESM.docx

Additional File 1: Table S1: TRIPOD checklist for prognostic model development and validation studies. Table S2: Participant characteristics of (a) all invited to enrol into the study at 3-months post-positive PCR-test, (b) those included in the 3-to-24-month analytical sample, (c) all envisioned to take part in the study at 6-months post-positive PCR-test and (d) those included in the 6-to-24-month analytical sample. Table S3: Study participant characteristics, stratified by persistent PCC 3-to-24-months and 6-to-24-months. Table S4: Final multivariable analysis developed models (original coefficients). Table S5: Model performance statistics during internal validation (using 100 bootstrap samples). Table S6: Final model coefficients after adjusting for overfitting. Table S7: Model performance statistics of the final shrunken models in key subgroups (3-to-24-month sample). Figure S1: Calibration plots.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Nugawela, M.D., Stephenson, T., Shafran, R. et al. Predicting post-COVID-19 condition in children and young people up to 24 months after a positive SARS-CoV-2 PCR-test: the CLoCk study. BMC Med 22, 520 (2024). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12916-024-03708-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12916-024-03708-1

Keywords