Andrew M.
Shaw‡
*ab,
Christopher
Hyde‡
*c,
Blair
Merrick‡
*d,
Philip
James-Pemberton
ab,
Bethany K.
Squires
b,
Rouslan V.
Olkhov
ab,
Rahul
Batra
d,
Amita
Patel
d,
Karen
Bisnauthsing
d,
Gaia
Nebbia
d,
Eithne
MacMahon
d,
Sam
Douthwaite
e,
Michael
Malim
e,
Stuart
Neil
e,
Rocio
Martinez Nunez
e,
Katie
Doores
e,
Tan Kia Ik
Mark
e,
Adrian W.
Signell
e,
Gilberto
Betancor
e,
Harry D.
Wilson
e,
Rui Pedro
Galão
e,
Suzanne
Pickering
e and
Jonathan D.
Edgeworth
de
aDepartment of Bioscience, College of Life and Environmental Sciences, University of Exeter, Stocker Road, Exeter, EX4 4QD, UK. E-mail: andrew.m.shaw@exeter.ac.uk
bAttomarker Ltd, Innovation Centre, University of Exeter, Rennes Drive, Exeter, EX4 4RN, UK
cExeter Test Group, College of Medicine and Health, University of Exeter, St Luke's Campus, Heavitree Road, Exeter, EX1 2LU, UK
dCentre for Clinical Infection and Diagnostics Research, Department of Infectious Diseases, Guy's and St Thomas’ NHS Foundation Trust, London SE1 7EH, UK. E-mail: Jonathan.Edgeworth@gstt.nhs.uk
eDepartment of Infectious Diseases, School of Immunology & Microbial Sciences, King's College London, UK
First published on 7th July 2020
An evaluation of a rapid portable gold-nanotechnology measuring SARS-CoV-2 IgM, IgA and IgG antibody concentrations against spike 1 (S1), spike 2 (S) and nucleocapsid (N) was conducted using serum samples from 74 patients tested for SARS-CoV-2 RNA on admission to hospital, and 47 historical control patients from March 2019. 59 patients were RNA(+) and 15 were RNA(−). A serum (±) classification was derived for all three antigens and a quantitative serological profile was obtained. Serum(+) was identified in 30% (95% CI 11–48) of initially RNA(−) patients, in 36% (95% CI 17–54) of RNA(+) patients before 10 days, 77% (95% CI 67–87) between 10 and 20 days and 95% (95% CI 86–100) after 21 days. The patient-level diagnostic accuracy relative to RNA(±) after 10 days displayed 88% sensitivity (95% CI 75–95) and 75% specificity (95% CI 22–99), although specificity compared with historical controls was 100% (95%CI 91–100). This study provides robust support for further evaluation and validation of this novel technology in a clinical setting and highlights challenges inherent in assessment of serological tests for an emerging disease such as COVID-19.
Patients presenting to hospital with clinical and radiological features consistent with COVID-disease usually have a SARS-CoV-2 RNA PCR test performed on upper respiratory tract specimens (e.g. nose and throat swabs) to confirm the diagnosis. Throughout this paper we refer to positive results as RNA(+) and negative as RNA(−). The reliability of PCR swabs are subject to pre-analytical errors such as the quality of sample collection, the technology platform and the primers designed, and for clinical reasons such as infection being localised to the lower respiratory tract.3 Some patients also present late when the viral infection may have passed when symptoms may predominantly be due to immunological, inflammatory and thrombotic processes.4 Comparisons between clinical, radiological and PCR findings illustrate these challenges. In one study 35% of patients with positive CT scan findings were admission RNA(−). Review of serial CT images and clinical findings showed 17% and 12% of admission RNA(−) patients were finally given a COVID-19 diagnosis, and 93% became RNA(+) after further testing over 5 days.3 These observations illustrate the benefit of aggregating information from multiple sources to support the clinical diagnosis from which the many management decisions can take place.
SARS-CoV-2 infection stimulates an antigen specific antibody response. Detecting these antibodies has potential to provide diagnostic information, even though serology is not conventionally used for diagnosis of acute respiratory viral infection such as influenza. Serology may also have a role in population screening, modelling disease spread in the community and staff surveillance, and there may be different required performance criteria in these different settings. There have been a number of reports describing SARS-CoV-2 antibody detection methodologies and technologies, including ELISA assays and lateral flow devices. None is currently considered to have acceptable sensitivity or specificity for diagnosis.5
Here we present a detailed evaluation of a novel gold nanoparticle array technology that provides a quantitative multiplexed 9-dimensional measure of the IgG, IgA and IgM response to SARS-CoV-2 S1, S2 and N proteins. The study was performed using a pre-determined set of samples obtained from a real-world cohort of patients admitted to St Thomas’ Hospital with a suspected clinical diagnosis of COVID-19 on admission and in whom a SARS-CoV-2 RNA PCR was performed. The results of the multiplexed response profile were related to RNA(±) patient classification and time. This robust initial analysis supports proceeding to validation of this technology as a potential serological technology solution for addressing key needs in response to the SARS-CoV-2 pandemic.
Diluted serum (or whole blood) is injected into the device to flow over the array producing a brightness change time (Fig. 1) during a loading step, before undergoing a wash cycle. A detection anti-CRP completes a sandwich assay for the CRP assay and an antibody recognising all human IgG, IgM and IgA detects total serum antibodies binding to each SARS CoV2 antigen (the Rapid Test). Alternatively, antibodies specific to IgG, IgM and IgA in series can be applied to detect individual responses to determine specific IgG, IgM and IgA against each antigen (The Antibody Class Differential Test). The light intensity change in the detection step is integrated for two minutes to produce an area under the curve (AUC) of light intensity seconds (Fig. 1). The test cycle takes 7.5 minutes followed by 2.5-minute regeneration.
The materials used in the course of the experiments were used without further purification, as certified by the suppliers. Sigma-Aldrich supplied phosphate buffered saline in tablet form (Sigma, P4417), phosphoric acid solution (85 ± 1 wt% in water, Sigma 345245), Tween 20 (Sigma, P1379), N-(3-dimethylaminopropyl)-N′-ethylcarbodiimide hydrochloride (commercial grade, E7750), N-hydroxysuccinimide (98%, 130672) and bovine serum albumin (>98%, A3059). SAM molecules, linker HS-C11-EG6-OCH2-COOH (>95%, TH 003-M11.N6) and spacer HS-C11-EG3-OH (>95%, TH 001-M11.N3), were supplied by ProChimia Surfaces. Native human C-reactive protein (>99%, P100-0) and CRP depleted serum (SF100-2) were obtained from BBI Solutions. Glycine (analytical grade, G/0800/48) was provided by Fisher Scientific and recombinant PAG (21186) was supplied by ThermoFisher Scientific. The assay running and dilution buffer was PBS with 0.005 v/v % Tween 20 and the regeneration buffer was 0.1 M phosphoric acid and 0.1 M glycine solution in deionized water.
The recombinant antigens REC31754-100 SARS Coronavirus Envelope Protein (E. coli); REC31753-100 SARS Coronavirus Membrane Protein (Matrix) (E. coli); REC31806-100 SARS-CoV-2 Spike Glycoprotein (S1), SHFc-Tag (HEK293); REC31807-100 SARS-CoV-2 Spike Glycoprotein (S2), SHFc-Tag (HEK293); REC31812-100 SARS-CoV-2 Nucleoprotein His-Tag (E. coli); and PAB21446 Rabbit Anti-Sheep (H + L) were supplied by the Native Antigen Company (Oxford). SAB3700698 Anti-Sheep IgG (Fc specific) polyclonal Antibody produce in rabbit, sourced from Sigma-Aldrich.
Detection reagents were supplied by Bio-Rad, 5211-8004 Goat Anti-Human IgG polyclonal Antibody, STAR141 Goat Anti-Human IgA polyclonal Antibody, STAR145 Goat Anti-Human IgM polyclonal Antibody, STAR 125 Goat Anti-Human IgG polyclonal Antibody, 1707-0189G goat anti human CRP polyclonal antibody; and Randox: IA8354 Immunoturbidimetric Anti-Human IgA Antibody, IG8352 Immunoturbidimetric Anti-Human IgG Antibody, IM8353 Immunoturbidimetric Anti-Human IgM Antibody.
A serum negative standard, CRP depleted serum (BBI SF100-2) acting as CRP(−) control and a SARS-COV-2 antibody negative control. Similarly, a CRP positive control (Level 3 Randox CP2481) is a SARS-COV-2 antibody negative control. A positive patient control was identified early in testing and used as part of the QC process.
A dilution study was performed on a positive patient to establish the antibody titre and CRP concentration (Fig. S2†). It shows that neither assay is saturated indicating a significant dynamic range and the samples with the 100-fold initial dilution are well placed on the calibration curve. Antibody titre is used as a measure of concentration when reference materials are not available; an n-fold dilution of a sample is performed until the signal falls below the detection limit. Relative concentration can then be determined from dilution and absolute concentrations by determining the titres from the NIST reference antibody of WHO CRP reference materials. for all antibody titres. The N antibody titre is 1/6700–1/10000 compared with the CRP at 300 mg L−1 with titre 1/1000. Patient testing was performed under batch control with two refractive index standards to calibrate the brightness change, a serum negative standard, a CRP positive standard and a positive patient standard repeated every 20 patients (Fig. S2†). Degradation of the antigen chip was recorded for each batch under standardised conditions and varied between patient samples applied allowing between 30–50 sample runs per chip before replacement. Chip-to-chip variation was normalised using control samples.
Viral RNA positivity or negativity at any stage during admission was retained as an important explanatory variable for results, alongside others, and percentage agreement calculated with 95% confidence intervals.13 Although these can be considered as early estimates of accuracy (sensitivity and specificity), great caution is required in their interpretation because of the underlying uncertainty about the true disease state, particularly absence of disease. Specificity is thus affected more than sensitivity. In order to avoid bias from setting cut-offs for index test positivity using the same samples as those used for evaluating accuracy, we initially separated samples into Phase 1 (training set) and Phase 2 (evaluation set). Phase 3 was further added using new samples not examined in Phases 1 and 2, to specifically improve the precision of our “sensitivity” estimate. The size of this was informed by a target sample size estimation of 45 patients assuming a true value of 100% and a lower 95% CI of 90%. The measurement of specificity relative to stored samples before the start of the COVID pandemic constituted a fourth phase of the investigation. The size of this was again informed by a sample size estimation of 45 patients assuming a true value of 100% and a lower 95% CI of 90%. Finally, to specifically extend the patient antibody profiles, additional specimens for these patients were purposively sought. These were not included in the analyses of any other phases. For the patient level analysis, where there are multiple samples for the patient, a summary value was calculated as the mean of the sample values, having first confirmed that there was consistency between the values, which was true in all patients. The results were reported according to the STARD reporting framework as the most relevant guideline for the study design undertaken.14
LOS, length of stay. Data was extracted from clinical records by the clinical care team prior to sample anonymisation. | |
---|---|
Cohort size | 74 |
Age mean (SD) | 55.3 (16.7) |
Male | 50 (68%) |
No comorbidity | 10 (14%) |
Multiple comorbidities | 48 (65%) |
Mortality | 10 (14%) |
Discharged at time of analysis | 21 (28%) |
Median LOS not died or discharged (to 30/3/20) (range) | 14 (0 to 38) |
Required mechanical ventilation | 38 (51%) |
Required no respiratory support | 14 (19%) |
Viral RNA positive at any stage | 59 (80%) |
Single sample | 61 (82%) |
Multiple samples | 13 (18%) |
Mean number of samples where multiple | 3.8 |
The combined IgM, IgA and IgG antibody response against N was taken forward as the serological response marker for SARS CoV-2 infection due to it having the highest quantitative response. The N antibody level detected in all 119 serum samples plotted against time from patient-reported first day of symptom onset is shown in Fig. 2. A sample classification was set at <10 days consistent with reported onset of serological response.15 Two serum samples taken from patients with <10 days of reported symptoms had high antibodies against N. Clinical data provided to the laboratory team were reviewed. The upper red circled data point is from a patient admitted with a differential diagnosis of community-acquired pneumonia. They had a negative SARS-CoV-2 nose and throat PCR performed on day 10 and day 18 of symptom onset and a negative PCR on a bronchoalveolar lavage taken on Day 21 while on ITU. The lower red circled data point was from a patient who had dementia and so the first day of symptom onset of interval may be incorrect.
A time-course of the antibody response against N was plotted for 9 patients using serum samples taken between symptom-day 9 and 45 (Fig. 3). This included serum samples provided from patients who remained in hospital beyond the end of the study period. One patient who died on Day 15 had negative N antibody measurements in 4 serum samples taken daily between day 11 and 15.
Classification of a serum sample being either S1, S2 or N antibody positive was made based on RNA(−) phase 1 samples from which cut-off levels for each viral antigen were derived (see Methods). The antibody levels against S1, S2 and N for all samples taken from RNA(−) and RNA(+) patients with their respective cut-offs presented as horizontal black lines are shown in Fig. 4. A Boolean classifier of being S1(+) or S2(+) or N(+) was used to classify patients as seropositive, rather than using N antibody response data alone as presented in Fig. 2 and 3 (Table 2). Using this classification 7(30%) of RNA(−) samples had serum(+) antibody concentrations above the cut off for all three viral antigens (triple RNA(−) false negative). The triple classification assessment identified 22% of all serum positive samples as S1(+) S2(−) N(+), 51% S1(−) S2(+) and N(+) and 18% S1(+) S2(+) and N(+).
Fig. 4 Antigen response for S1(3×), S2 (3×) and N response for RNA(−) and RNA(+) patients in the cohort. The RNA(−) shows outliers in all three antigen distributions and the cut-off is set at the highest value in the distribution (line). The RNA(+) patient group has varying intervals from onset of symptoms with varying rates of recovery. There is a set of patients who have not yet responded or not going to respond (below the cut-off line) that correspond to the early-time patients below 10 days shown in Fig. 1. These are excluded in the recovering +10 days cohort. The biobank data set is included for the specificity analysis. |
Serum (+) | Serum (−) | Total | Serum (+) percentage | |
---|---|---|---|---|
a Time interval not available for two patients. | ||||
RNA(−) | 7 | 16 | 23 | 30% (95% CI 11–48) |
RNA(+) all intervals | 76 | 20 | 96 | 79% (95% CI 71–97) |
Total | 83 | 36 | 119 | |
Interval samples | ||||
RNA(+) < 10 days | 9 | 16 | 25 | 36% (95% CI 17–54) |
RNA(+) 10–20 days | 53 | 16 | 69 | 77% (95% CI 67–87) |
RNA(+) > 21 days | 21 | 2 | 23 | 95% (95% CI 86–100) |
Total | 83 | 34 | 117a |
Classifications of all samples tested in phase 1, 2 and 3 are shown in Table 2 and show a 79% (95% CI 71–97) serum antibody positive rate for the whole clinically heterogenous cohort at all time intervals including the slow-response and patients who died early and those that did not appear to show an antibody response over time. The positivity rate varied by interval from day of first symptom onset with maximum sensitivity of 95% (95% CI 71–97) being observed at 21 days. Finally, a further 47 randomly selected anonymised serum samples from the diagnostic laboratory taken during March 2019 were provided by the clinical team for analysis, and all were serum(−) for all three antibodies against all three viral protein targets (Fig. 4).
Viral RNA(+) | Viral RNA(−) | ||
---|---|---|---|
Phase 2 | |||
S1 or S2 or N ≥ threshold | 6 | 1 | 7 |
S1 or S2 or N < threshold | 0 | 3 | 3 |
Totals | 6 | 4 | 10 |
Phase 3 | |||
S1 or S2 or N ≥ threshold | 43 | ||
S1 or S2 or N < threshold | 6 | ||
49 |
Responses were seen to N, S1 and S2 proteins that were above derived threshold settings. The highest antibody titre was against N and so N responses were used to inform setting the time cut-off classification and to create serological response profiles for 9 patients. The response profiles showed the technology was capable of reproducibly detecting quantitative antibody levels over time on different samples. In contrast only 18% of samples showed a positive result for all three antigens (S1, S2 and N) with 50% having N and S2 responses but only 22% N and S1 responses. The RNA(−) false negative rate was estimated to be 30% (95% CI 10–46) which is consistent with findings elsewhere16 although sensitivity of PCR assays may improve with refinement of primers and technologies.17 Serum positivity was defined as having an antibody response above cut-off against any one of the three antigens, such that 30% (95% CI 11–48) of initially RNA(−) patients were identified as serum positive. For samples taken from RNA(+) patients, the positivity rates were 36% (95% CI 17–54) before 10 days, 77% (95% CI 67–87) between 10 and 20 days and 95% (95% CI 86–100) after 21 days. The patient-level diagnostic accuracy relative to RNA(±) after 10 days gave a sensitivity of 88% (95% CI 75–95) and specificity 75% (95% CI 22–99), although specificity compared with historical controls was 100% (95%CI 91–100).
Our findings are consistent with other published studies15,16,18–25 up to 9/4/20. The main evaluation approach in most studies has been to identify patients with confirmed RNA(+) COVID-19 disease (“cases”) and then compare results with “controls”, which are either healthy normal patients or less commonly patients with suspected COVID-19, but in which disease had been excluded.21,22 Results are then measured in each group to give estimates of sensitivity from results classification in the “cases” and specificity from results classification in the “controls”. Overall sensitivity of 89% and specificity of 91% found by Li et al.21 was typical of the favourable evaluations. The estimate in the unfavourable evaluation was sensitivity 14% and specificity 92%.16 However, time interval since onset of symptoms is an obvious variable15 and most studies thus far have not taken this into account in their accuracy estimates, including that from the unfavourable evaluation, acknowledging that this can be hard to precisely define from patient memory and perception of disease symptomatology.
Other challenges which were often not met include recognising the implications of different reference standards, separating sample and patient levels of analysis and giving measures of uncertainty. We identified only one study16 which attempted to look at classification of real-world populations, where patients had not been specifically pre-separated into “cases” and “controls”. Cohort heterogeneity is a clear and important challenge in any clinical setting. We tried to overcome these challenges in our study, particularly by setting out on our assessment of this technology by first measuring antibody responses in a clinically relevant diverse patient cohort. However, we had to rely on sensitivity and specificity measured in cases and controls in subsequent phases to make the most of the available samples, which is a limitation. Difficulty blinding investigators to the nature of the sample is a further limitation for this component of our study, but was not true where the initial mixed clinical cohort was investigated. Defining target condition by a composite reference standard including clinical profile as well as viral-RNA would have improved our study too. We would have liked to look in more detail at the relationship between false positives and false negatives and clinical characteristics, but these were small in number and our ethics did not permit such detailed clinical evaluations. Concerning the size of the study generally, although the number of patients and samples were modest, the numbers we analysed were guided by sample size calculations. We acknowledge that future studies need greater numbers to further improve precision.
For future research, two-gate designs (the preferred name for diagnostic case-control accuracy studies) should be avoided. Two-gate designs calculate sensitivity in those who definitely have the condition and specificity in those who definitely do not have the condition, at its extreme completely healthy people. Although they are a pragmatic approach, often employed early in the development of a test, they have been shown to exaggerate the performance of tests26,27 by excluding borderline cases which will actually make up a high proportion of patients when the test is actually used in practice. The problem arising from use of a two-gate design may be regarded as an extreme form of spectrum bias, a major concern in the test evaluation.28–30 For these reasons, evaluation is preferred in a realistic spectrum of disease in population where one expects the test to be used with a naturally occurring prevalence and spectrum of disease.
Since we conducted the literature review to inform our study, other influential evaluations of antibody tests have emerged. That by the National COVID Testing Scientific Advisory Panel in the UK has attracted particular attention. They concluded that the performance of current lateral flow immunoassay devices was inadequate for most individual patient applications. However, we observe that our device is different to the lateral flow immunoassay devices5 and performed similarly to the novel ELISA tested: sensitivity against an RT-PCR-confirmed diagnosis of SARS-CoV-2 infection 85% (95%CI 70–94); specificity vs. pre-pandemic controls 100% (95%CI 93–100). They used similar methods and sample sizes to this study. We are currently testing samples from a cohort of known RNA positive and RNA negative patients to assess against the sensitivity and specificity requirements set by regulators.
Footnotes |
† Electronic supplementary information (ESI) available. See DOI: 10.1039/d0an01066a |
‡ Joint first authors. |
This journal is © The Royal Society of Chemistry 2020 |