We have known for a very long time that bilirubin can cause an encephalopathy leading to acute and chronic clinical impacts, the most severe chronic impacts being kernicterus, a disabling movement disorder and nerve deafness. What seems to be the case is that bilirubin bound to albumin doesn’t cross the blood-brain barrier, and that the BBB, which appears to develop early, and be functional by 20 weeks, is easily disrupted in the newborn. Bilirubin staining of the basal ganglia has been shown on autopsy specimens at much lower concentrations than are associated with kernicterus in term babies. This finding led to the performance of the large multicentre NICHD network phototherapy trial, which is very limited by the lack of a funky acronym. Can I suggest that we start calling it the “LITEPOP” trial: Longterm ImpacTs of Early Phototherapy for Outcomes of Preterms?
There have been multiple studies looking at “free” or “unbound” bilirubin and its impacts, which are hindered to some extent in difficulties measuring it. The subject of this post is a new re-analysis of data from that NICHD early phototherapy LITEPOP trial, which finished enrolling in 2005, and finished follow up 2 years later, with the primary publication (Morris BH, et al. Aggressive vs. conservative phototherapy for infants with extremely low birth weight. N Engl J Med. 2008;359(18):1885-96) appearing in 2008. If you remember, or if you have never heard about this, the primary outcome was “death or NDI” (groan…). There were nearly 2000 babies of 501 to 1000 g bwt randomized at 12 to 36 hours of age if they had never had phototherapy. The intervention was to have phototherapy started immediately and continued until it fell below 85 micromol/L (5 in the US units). The control group had phototherapy to keep the bilirubin either below 137 (500-750 g bwt) or below 170 (751-1000 g).
The results are a very apposite demonstration of the problems with such a composite outcome. Mortality was just about equal in the two groups, 209 vs 201 died before discharge, and a further 17 vs 21 died after discharge, or 24 vs 23% total mortality.
The NDI data were presented as among the entire groups (survivors tested plus deaths), i.e. 235/902 vs 275/902, which is statistically significant, 26% vs 30%, (RR 0.86 (95% CI 0.74–0.99)) but because of the 1% higher mortality in the early phototherapy group, the combined outcome of “NDI or death” was not considered significant.
If we recalculate the outcome of “NDI” just among the survivors who were tested, 235/672 vs 275/684, 35% vs 40%, and the proportion with low PDI scores 127/672 vs 152/684, 19% vs 22%, or with severe hearing loss, 9 infants vs 28, or with athetosis, 2 infants vs 10, then the differences all suggest benefit of early phototherapy.
Babies in this study had a blood sample stored at 5 days of age, which was later measured at one of 2 laboratories. Although they used the same method, (the peroxidase method) and the same analyser, there were enough differences between the 2 labs that the results were converted to z-scores for each lab, and are reported in the newest publication as percentiles of the z-score.
Adjusted probability plots and adjusted odds ratio (95% CI) for unbound bilirubin versus proportion of death or NDI, death or CP, death or hearing loss, and death before follow-up in clinically stable and unstable infant
But a correction published 3 years later reveals that they actually show the UB divided by 0.3 “we have standardized the unbound bilirubin by dividing the actual values by the standard deviations which is 0.3. The standardized values (in units) were used in the presentation of data in this figures which appear to be 0–3.0 units. The actual unbound bilirubin values are 0–0.9 mcg/dL”. I don’t understand how this “standardized” the data, it was clearly just a screw up that none of the 89 authors noticed. (I exaggerate).
You may also remember a secondary analysis published 4 years after the original manuscript Tyson JE, et al. Does aggressive phototherapy increase mortality while decreasing profound impairment among the smallest and sickest newborns? J Perinatol. 2012;32(9):677-84. In that publication they performed a subgroup analysis dividing the smaller babies into those ventilated and those who were not. Whether this was a preplanned subgroup or not is unclear; the publication states rather vaguely that they had plans “to relate the risks and benefits of AgPT to baseline risk factors including measures of severity of illness”. In that secondary analysis of a subgroup, using Bayesian analysis, they claimed to show that it was highly likely that phototherapy increased mortality among ventilated infants of 500-750 g. That analysis also shows that there was a probable decrease in mortality among non-ventilated 500-750 g babies (13% vs 25%), a probable decrease among larger babies who were ventilated (16% vs 19%), and a possible increase among larger non-ventilated babies (8% vs 6%). My interpretation of which is that the differences are very likely just random variations between groups. However, the authors of that secondary analysis were much more emphatic that phototherapy was decreasing the anti-oxidant benefits of bilirubin, and was generating oxidant injury and was increasing mortality in the smallest babies, if they were ventilated.
So why have they come back to these data now? (Arnold CC, et al. Unbound bilirubin and risk of severe neurodevelopmental impairment in extremely low birthweight newborns. Pediatr Res. 2025) The authors state that there are new concerns about lipid emulsions leading to increases in Free Fatty Acids and displacing bilirubin from albumin, leading to an increase in UB. But such concerns are not new at all, they date back to the 90’s or further. Perhaps more relevant is that a new method for measuring UB has become available, which might make it more clinically useful, and there are new statistical methods available, including the TMLE (Targeted Maximum Likelihood Estimation), which apparently uses machine learning to give conservative explanations of correlations between variables.
For this analysis they concentrated on “severe NDI” or sNDI, which was “based on the NDI outcomes with the strongest associations with the phototherapy regimen in the NICHD aggressive vs conservative phototherapy” LITEPOP RCT which was : a score on the Bayley II Mental or Psychomotor Developmental Index of 50 or less or a level of 5 for gross motor function (GMF), or needing bilateral hearing aids.
They divided UB into 20 slices, based on percentile of UB, as you can see from the figure below, there seems to be a threshold effect, with sNDI increased substantially for the top 15% of UB concentrations on day 5 of life.
There was very poor correlation between the total bilirubin and UB concentrations, as you can see below. But almost all of the UB concentrations associated with worse outcomes were above the threshold for phototherapy in the early treatment group, only 1 or 2 of the dots are in the upper left quadrant. You can also see, what I was not aware of previously, that far more than half of the dots are above 5 mg/dl, which implies that the phototherapy was not very effective at keeping the total bilirubin below the treatment threshold.
Most of the babies with “sNDI” did not have classic clinical signs of kernicterus, choreo-athetoid cerebral palsy, oculomotor problems, and high-tone nerve deafness. This could be because bilirubin encephalopathy is not so specific in the affected regions in very preterm infants, especially in the presence of acidosis, sepsis etc.
What should we do about this? If UB or “free” bilirubin measurement becomes more widely available, then the lack of correlation with total bilirubin strongly suggests that UB should become the variable of choice for monitoring hyperbilirubinemia in full term infants with jaundice. We could perhaps decrease the proportion of babies who get intervention while improving the prevention of kernicterus, which continues to affect somewhere around 1 in 100,000 babies, and while rare is devastating. The potential benefits among preterm infants are numerically greater, if a new trial (the UNBEATEN trial, UNbound Bilirubin EArly TreatmENt trial, I claim copyright) could show that screening for, and treating, to maintain UB < the 85%ile, improved neurological and developmental outcomes.
At the same time the trial could investigate whether the possible impact of phototherapy on mortality among the subgroup of ventilated extremely preterm infants in the NICHD trial was a real phenomenon or not.
A recurrent problem in the NICU is “bandwagon-jumping”. An intervention with some preliminary positive data, and a possible physiologic rationale is widely adopted and made part of QI bundles with little or no reliable evidence that it is either effective or safe.
One such is the hype surrounding fresh mother’s milk. There are a small number of studies showing the following impacts. (references at the end)
A reduction in laboratory indices of antioxidant activity with freezing and with refrigeration, has been shown, with the longer duration of refrigeration having a greater impact, compared to never refrigerated milk.
An impact of pasteurization, but not freezing and thawing, on Leptin concentrations, but not Adiponectin or total protein
Freezing and thawing reduce glutathione content
No impact on total fat or carbohydrate of freeze/thawing, possible increase in Free Fatty Acids.
Freezing inactivates the majority of CMV, decreasing viral load, but does not eliminate entirely
Vitamin C concentrations are reduced by refrigeration and by freezing
The effects on cellular components of refrigeration and/or freeze thawing are unclear, which is unfortunate as I suppose this is the reason that people want to give “fresh” breast milk. My literature search for this post was unable to find much data on the impacts of freezing and thawing on the endogenous cellular components of breast milk. It has been reported that the viable proportion of white cells in milk after freeze/thawing is 78% or 19%. But exactly how to determine which type of white cell is still active, is unclear to me, the most recent report I found is a brief letter with little methodological detail, which determined viability with “propidium iodide” and measured some cell markers.
And here another word of caution, what is meant by “fresh” isn’t always the same, sometimes it means never refrigerated, sometimes never frozen, sometimes not pasteurized. A recent statement from France, for example differentiates between “Raw” milk, meaning never refrigerated, and “Refrigerated” milk, with “Fresh” meaning either raw or refrigerated but never frozen. That working group developed guidelines to improve provision of raw milk, despite not presenting any clear reliable evidence of a clinical benefit.
Overall then, it seems that there is little adverse impact of freezing and thawing on macronutrients or cellular components of breast milk. There are some individual impacts, especially on vitamin C.
I guess the current enthusiasm for trying to give fresh milk must, therefore, be based on good clinical data of a benefit? (That is an ironic question, I already know the answer!) There is one cohort study (Huang J, et al. Short-term effects of fresh mother’s own milk in very preterm infants. Matern Child Nutr. 2023;19(1):e13430) comparing outcomes between babies who received “Fresh” mothers own milk, which was what the French group would call Raw milk (i.e. never refrigerated) and used within 3 hours, with the comparisons receiving pasteurized MoM. There were 170 babies <32 weeks GA in total, averaging 1150 g and 29 weeks. NEC and LOS were more frequent in the non-randomized control group, 16% and 19%, vs 10% and 7% in the controls, and there were some very small differences in feeding tolerance. The mother had to provide milk every 3 hours during the entire NICU stay.
Another Chinese cohort study (Sun H, et al. Testing the feasibility and safety of feeding preterm infants fresh mother’s own milk in the NICU: A pilot study. Sci Rep. 2019;9(1):941) compared a group of babies <30 weeks GA who received one feed a day of “Fresh” milk (unclear whether it was allowed to be refrigerated during the 4 hours that was permitted between expression and administration) to a group who received only frozen thawed milk. Infants received either MoM or donor milk (DHM), and it seems that some of the control babies could have received only DHM. There were close to 100 babies per group, averaging about 28.5 weeks gestation and some of the fresh milk babies dropped out and were not analyzed. The birth weights in the 2 groups are not mentioned, but they both, unusually, had a birthweight z score averaging about +0.3 (preterm infants usually average below 0). There was a very high rate of late onset sepsis (38%) and stage 2-3 NEC (14%) in the controls that was lower in the fresh milk group (22% and 6%).
One other study that I have seen quoted is a French database analysis which showed less BPD in NICUs that gave Fresh milk, but in that study the comparison NICUs gave pasteurized mother’s milk, as it was the recommended standard at that time in France. It also is not clear if the Fresh milk could have been frozen prior to administration or not. There was less BPD in the fresh NICUs, but no difference in NEC or in LOS, which was extremely frequent in both groups.
Although it seems attractive to some clinicians, there really is no reliable evidence that giving fresh breast milk, however it is defined, has different clinical outcomes, compared to frozen and thawed breast milk. Differences in composition are relatively minor, and impacts on cellular viability or numbers are really unclear.
The logistic implications, on the other hand, are substantial. Enabling the provision of an early fresh milk feed, daily fresh milk feeds, or exclusively fresh milk feeds, will require some resources, and organisation of those resources, to ensure that it could be done reliably and without errors in administration. A randomized controlled trial would be worthwhile to determine the true clinical impact, if any, of avoiding freeze/thawing of MoM, to see if dedicating those resources is worth it. One is planned and hopefully underway (Sun H, et al. A randomized controlled trial protocol comparing the feeds of fresh versus frozen mother’s own milk for preterm infants in the NICU. Trials. 2020;21(1):170), until the results are available, the potential benefits of Fresh, compared to Frozen and thawed, maternal breast milk must be considered Not Proven.
References (in chronologic order, most recent first)
I ran out of energy in trying to add all the hyperlinks to the references, but I guess if you care you can easily find them, all available through PubMed.
I have written frequently about my concerns with “NDI” as an important measure of neonatal outcomes, indeed, it seems to be often thought of as if it were the only important measure. It has very often been included as part of a composite outcome measure “death or NDI”.
So why am I disturbed about the use of NDI as a primary outcome measure? NDI is itself already a composite measurement, including some indicator of delayed development (most commonly one of the various iterations of the Bayley Scales of Infant Development), some severity of motor disorder expected to be permanent, i.e. Cerebral Palsy, some severity of hearing loss, and some severity of visual impairment. It was a composite invented by neonatologists and follow up specialists as a way of trying to quantify the impacts of adverse cerebral impacts of prematurity. There are many problems with this, both in the actual importance of each component of NDI, and also in the permanence of the finding. For example, most infants with low scores on developmental screening tests at 2 years do not have intellectual impairment at follow up. In the follow up of the CAP trial, for example, only 18% of babies who had a low Bayley score at 18 months (version 2 MDI <70) actually had a low IQ at 5 years (WPSII <70). This is unlike CP, for which a diagnosis at 2 years is very accurate (not 100%, but appears to be about 95% PPV) as a predictor of long term motor dysfunction, but the severity of the problem can vary, especially after a diagnosis at 2 to 3 years, where about 1/3 of infants will change their classification on the GMFCS, either to a higher or a lower score. Visual and auditory impairments seem to be more permanent and invariable, but are a much smaller part of the NDI.
And, of course, combining NDI with death as part of a composite outcome implies that they are equally important, and means that an intervention which decreases death may not be found to be significant is there is an increase in low BSID scores in the survivors (for example).
Do parents of babies who are labelled as having NDI think that their infants are impaired? That is the question asked in a new publication from the follow up centres across Canada (Canadian Neonatal Follow-Up Network, CNFUN). Richter LL, et al. Parental and Medical Classification of Neurodevelopment in Children Born Preterm. Pediatrics. 2025. Over 1000 very preterm infants are involved in the study, and their parents were asked if they thought that their child had a developmental impairment when they attended a follow-up clinic appointment, but before they completed the standardised evaluation. They then had their evaluation and were classified as having no NDI or :
“to have a mild-moderate NDI if they had any 1 or more of the following: CP with GMFCS 1 or 2; Bayley-III motor, cognitive, or language composite scores 70 to 84; hearing loss without requirement for hearing devices or unilateral visual impairment. A child was considered to have a severe NDI if they had any 1 or more of the following: CP with GMFCS 3, 4, or 5; Bayley-III motor, cognitive or language composite scores <70; hearing aid or cochlear implant; or bilateral visual impairment.”
As this table shows, there was poor agreement between what the parents thought, and what the standardised evaluation stated. Most of the disagreements were parents considering their infants to not be impaired, or to be less impaired than the standard classification. There were 185 infants with “severe NDI” according to the definition above, only 23 parents thought their child was severely impaired, in contrast, among the 596 with no NDI, there were 11 parents who found their child to have severe impairment, and 104 thought they had mild-moderate impairment.
Some of the details of the analyses are quite interesting, for example, the small number of infants with serious CP, GMFCS 4 or 5, were mostly considered to have moderate or severe impairment by parents. The cognitive scores of infants who agreed that their infant, with CNFUN defined severe NDI, had at least moderate impairment were lower (median 70) than those who disagreed (median 80).
Many problems faced by families with ex-preterm infants are not captured by “NDI”. This is reflected, I think, by those parents who thought their child was impaired despite not satisfying CNFUN definitions, such infants were much more likely to be using technology at home, and more likely to have been referred for occupational therapy, or to see a psychologist or other therapist. Needing re-hospitalisation also made parent more likely to agree that their infant had an impairment.
Because we haven’t measured some of the things that impact families, such as behavioural disturbances, feeding problems, and sleep disruption, we really don’t know if they are affected by any of our NICU interventions. It wouldn’t surprise me if some interventions, ranging from postnatal steroids to skin-to-skin care or light cycling, might have major impacts on those outcomes. We just don’t know.
What should we do about findings such as these newly published data, and others from the Parents’ Voices project? Defining a single ‘yes or no’ outcome variable is the old-fashioned way of designing research and determining the benefit of an intervention. There are much better ways of comparing outcomes between groups, ways which can take into account the variety of outcomes, and the preferences of parents. It takes some extra work to define the kind of ordinal outcomes which reflect the values of parents and the relative importance of each component, but that is hugely preferable to using composite outcomes which implicitly value each component as being equivalent. Being dead, having a Bayley Cognitive composite of 69, having severe visual loss all qualify as “dead or severe NDI”, but the implications are enormously different.
In the future outcomes we measure should focus on how infants function, and should recognize that the answer to the question “how is your child doing?” is not a dichotomous choice.
Just back from a week in Panama, organised by “Wings“, mostly in Darién province at the extreme east of the country. A really amazing experience, with over 300 bird species seen, including just over 200 “lifers” for me. On my other blog, I have posted several posts, a trip report with many photos.
As a taster, this is a Collared Aracari, and below King Vultures.
Sepsis caused by organisms transmitted during, or shortly before, labour is relatively uncommon in higher income countries with GBS screening programs; the incidence is now around 0.3 cases per 1000 live births, but varies, it may be as high as 10 times that, or as low as 0.1/1000. GBS remains the commonest organism, with E Coli close behind, followed by Strep viridans, and then a host of other bugs. Early identification and treatment is the key to good outcomes.
We can think of this in 2 ways; a risk of 1 case for every 2000 births is a very low risk. On the other hand it is an identifiable 72 hours of life with the highest incidence of sepsis that one will ever have! In general terms, about half of the cases have no identifiable risk factors, and about half the cases occur in babies with a risk factor or a combination of risk factors: chorioamnionitis (or “triple-I”); maternal fever; prolonged rupture of membranes; obstetric procedures; a previous baby with GBS.
Various groups have recommended evaluating risk factors at birth in order to determine the clinical approach to be taken. Usually that means that if the risk is considered high enough, then cultures are taken and antibiotics are given, even if the infant is currently well, or perhaps strict frequent physical examination. There may also be an intermediate group, who qualify for more intensive clinical monitoring, or perhaps for lab screening tests, such as a CBC and/or acute phase reactants, CRP, procalcitonin and others.
We should think of the risk-factor-based approach just as we would any screening test for determining interventions in healthy patients. What is the sensitivity, and specificity, and, depending on the incidence in your population, what are the negative and positive predictive values, of the risk factor combination?
Risk factors are often grouped in differing ways, which makes determining sensitivity etc complex, but given that about half of septic babies lack identifiable risk factors, usually sensitivity is around about 50%.
One study from the Netherlands compared the sensitivity of the Kaiser calculator with the 2021 NICE guidelines and the most recent Dutch guidelines. (Snoek L, et al. Neonatal early-onset infections: Comparing the sensitivity of the neonatal early-onset sepsis calculator to the Dutch and the updated NICE guidelines in an observational cohort of culture-positive cases. EClinicalMedicine. 2022;44:101270), in that study, at 4 hours after birth, the Kaiser calculator had a sensitivity of 36% compared to 50% for the Dutch, and 55% for the NICE framework, among the 88 patients eventually determined to have culture positive EOS. Other publications have shown sensitivity between 37% and 76% of the Kaiser calculator, with the CDC approach being reported to have a sensitivity between 50% and 100%. See reference list below. As the babies were followed, in the Snoek study, progressively more of them developed clinical signs satisfying treatment thresholds, so that 72% were eventually “screen-positive” by Dutch and NICE guidelines, compared to 61% by the calculator. What the publication never explains is how the remaining 24 babies were picked up. Did they become sick after 48 hours? Were they screened outside of the Dutch guidelines, and if so on what basis? Did they remain asymptomatic despite positive cultures?
Overall, the calculator and other risk-based systems, in very large cohorts with current EOS incidence, will, therefore, have around about 50% sensitivity within the first 4 hours after birth. In general, more sensitive risk-factor-based approaches will, of course, screen and treat many more infants. But specificity is more variable and difficult to calculate unless you are sure about the population frequency, but if we work with an incidence of 0.5 per 1000 live births, at term or near, and that risk factor screening tags 4% of babies as being at risk, then the specificity of screening calculates to 96%, with a PPV of 0.6 and a NPV of 99.9.
As always in epidemiology, if the risk factors are broader then the sensitivity will increase, and specificity will decrease. this is nicely illustrated by this figure from one of the studies of Escobar and Puopolo (and others), using just maternal risk factors. In this figure, higher risk at birth identified fewer cases (lower sensitivity) but had much higher specificity and fewer false positives. On the other hand it also demonstrates that the majority of EOS cases are NOT identified by their risk factors, sensitivity is <50% in the columns with intermediate and high risk of sepsis.
If we could define risk factors (or another screening test) that were more accurate, with a higher sensitivity AND specificity, then we could reduce both unnecessary intervention and false negatives. But as of now, about half of EOS cases will be among the babies that have a positive screening test result, and about half will be among the remaining infants, whether that screening test is the calculator or another one of the recent risk-factor-based approaches. The older standards were less specific, especially among babies born after chorioamnionitis, without being much more sensitive, which is why the calculator has led to a reduction in unnecessary treatment in most cohorts where it has been introduced.
For a screening test for an uncommon condition, a sensitivity of about 50% is pretty poor, and for any other condition we would probably abandon that approach! Imagine if our screens for hypothyroidism, or for PKU only had a sensitivity of 50%.
Can we make risk-factor-based screening more sensitive AND more specific?
The Kaiser EOS calculator, if strictly applied, usually ends up with about 4% of all babies receiving antibiotics. I have been trawling through recent literature (and there is a lot of it!) and, of unselected populations, the range of babies recommended to start or strongly consider antibiotics, is between 1 and 5%. That obviously also depends on the prevalence of the risk factors, one very important one being chorioamnionitis. Many guidelines that I have found steer clear of trying to define that phenomenon. They either leave it to individuals to decide whether there is a reliable history of chorio, or avoid the term and use maternal temperature instead. That widely-used Kaiser calculator doesn’t mention chorio, just highest maternal temperature. If you enter data into that calculator, the presence of a maternal fever of 39 degrees increases the risk of EOS by about 40 fold compared to a temperature of 37, depending on the other risk factors.
In contrast the NICE framework in the UK refers to a “clinical diagnosis of chorioamnionitis” but without diagnostic criteria; in full, the NICE list of antenatal risk factors is
Red flag risk factor:
Suspected or confirmed infection in another baby in the case of a multiple pregnancy.
Other risk factors:
Invasive group B streptococcal infection in a previous baby or maternal group B streptococcal colonisation, bacteriuria or infection in the current pregnancy.
Pre-term birth following spontaneous labour before 37 weeks’ gestation.
Confirmed rupture of membranes for more than 18 hours before a pre-term birth.
Confirmed prelabour rupture of membranes at term for more than 24 hours before the onset of labour.
Intrapartum fever higher than 38°C if there is suspected or confirmed bacterial infection.
Clinical diagnosis of chorioamnionitis
The NICE guidance recommends cultures and antibiotics if you have one “red flag” or any 2 other risk factors. Which includes those antenatal factors, as well as the following list of clinical indicators
Red flag clinical indicators:
Apnoea (temporary stopping of breathing)
Seizures
Need for cardiopulmonary resuscitation
Need for mechanical ventilation
Signs of shock
Other clinical indicators:
Altered behaviour or responsiveness
Altered muscle tone (for example, floppiness)
Feeding difficulties (for example, feed refusal)
Feed intolerance, including vomiting, excessive gastric aspirates and abdominal distension
Abnormal heart rate (bradycardia or tachycardia)
Signs of respiratory distress (including grunting, recession, tachypnoea)
Hypoxia (for example, central cyanosis or reduced oxygen saturation level)
Persistent pulmonary hypertension of newborns
Jaundice within 24 hours of birth
Signs of neonatal encephalopathy
Temperature abnormality (lower than 36°C or higher than 38°C) unexplained by environmental factors
Unexplained excessive bleeding, thrombocytopenia, or abnormal coagulation
Altered glucose homeostasis (hypoglycaemia or hyperglycaemia)
Metabolic acidosis (base deficit of 10 mmol/litre or greater)
As you can see the Swiss don’t refer to chorioamnionitis at all, but use the ACOG/AAP term of triple I, which stands for “Intrauterine Infection or Inflammation…. or both” and give some criteria for that diagnosis. My only quibble with this list is that Elective Caesarean without labour or ruptured membranes has a zero, rather than low, probability of EOS, and should be listed as a contraindication to doing a sepsis workup!
Once an at-risk patient is identified, the Swiss guideline recommends Serial Physical Examination, SPE. And here it starts to get questionable, for me. For the Swiss guideline that apparently simply means vital signs q4h, which they specify as being heart rate, temperature, peripheral perfusion and skin colour. Exactly how this is meant to be done, by whom, and with what criteria, are not specified; what exactly is meant by “peripheral perfusion”, for example? If this is going to be done for thousands of babies, it is vital that it is clear and unambiguous. As I understand it, babies with a low probability of sepsis should have routine care, which, I suppose therefore, must exclude q4h vital signs.
However, the references that they give to support the SPE approach have all used very different approaches. For example, here is a figure from an Italian group (Berardi A, et al. Serial physical examinations, a simple and reliable tool for managing neonates at risk for early-onset sepsis. World J Clin Pediatr. 2016;5(4):358-64) that have published about their approach a few times. As you can see, they immediately culture everyone with “chorioamnionitis” or intrapartum fever, and, for the remaining at-risk infants, do the SPE on multiple occasions, “in turn by bedside nursing staff, midwives and physicians” using a purpose built form, shown here. The methodology and timing of the SPE is vitally important.
Among the 8000 babies analyzed using this approach, there was 1 true, culture-positive, sepsis (and supposedly 55 “culture-negative sepsis”), and about 1% of infants treated with antibiotics; with no deaths due to sepsis.
The group in Stanford Joshi NS, et al. Clinical Monitoring of Well-Appearing Infants Born to Mothers With Chorioamnionitis. Pediatrics. 2018;141(4):e20172056 have a different approach again, they directly observed the baby with risk factors for 30 minutes, then, in an initial phase, admitted them to the level 2 nursery on a cardiac monitor with q4h vital signs for 24 hours minimum, then transferred to maternal/infant care thereafter. In the second stage of their project, the infants went directly to maternal/infant care, and had “vital signs and clinical nursing assessments with documentation in the electronic health record at 0, 30, 60, 90, and 120 minutes and then every 4 hours for the first 24 hours after birth. After 24 hours of age, vital signs and clinical nurse assessments were performed every 8 hours until discharge. Nurse staffing ratios for all neonates in couplet care in the postpartum unit were 1:3”. The Stanford experience in that 2018 publication only included 277 infants, all of whom were exposed to “chorioamnionitis”. They showed that their approach was safe, but there were 0 cases of culture-positive sepsis, so again, I could suggest that ANY approach would have been safe!
In a much more extensive report from Stanford from 2020, there were 20,000 infants, with 6 cases of GBS sepsis, no other culture positive sepsis (and one probable CSF contaminant with 2 coagulase negative staph species). It seems that all their newborn babies are monitored in the same way, regardless of any risk factors, with the vital signs recorded at those same intervals and a 1:3 nursing ratio, for the 1st 24 hours of life. Despite the reliance on SPE, they still treat about 4% of all the term and near-term infants with antibiotics; or about 150 babies treated for every true case.
In Lausanne (Duvoisin G, et al. Reduction in the use of diagnostic tests in infants with risk factors for early-onset neonatal sepsis does not delay antibiotic treatment. Swiss Med Wkly. 2014;144:w13981) they have done things differently again, in 2 different time periods they first did the following “Vital signs were checked by midwives every 4 hours during the first 24 hours and every 8 hours during the next 24 hours in all infants with risk factors for EOS” during that period they also did CBC and CRP in all the babies with risk factors. During period 2 “in addition to the monitoring of vital signs by midwives, infants with risk factors for EOS were examined by paediatric residents every 8 hours during the first 24 hours” but they stopped doing the routine CBC and CRP.
In Lausanne, there were 11,000 babies in their report, with only 3 proven infections, which were all in the 6000 infants in the first period. None among the 5000 in the second period. Which is an example of why these decisions are so difficult to make. In the second phase of their study they could have done nothing at all, just sent all the babies home without ever examining them, and treating none of them with antibiotics, to make a silly suggestion, but as there were no culture-positive cases of sepsis in 5000 babies during that period, the results would, I propose, have been just as good! About 2% of all babies born received antibiotics.
When neonatal sepsis is so uncommon, at less than 0.5 cases per 1000 live births, we need enormous studies to prove the safety of one approach over another.
In order to really determine the risk/benefit balance of any particular approach, we need to know the real risks of each approach, with the major benefit, of course, being that we prevent death or serious illness from EOS. But, even that benefit is difficult to determine.
Imagine a well-appearing baby with enough risk factors to receive immediate antibiotics by the EOS calculator, but the baby is treated in a centre using an SPE approach without immediate antibiotics. The infant then develops a serious illness at 6 hours of age, and has a culture and antibiotics; how should that be counted? Is that a success of the SPE approach, or an unnecessary clinical deterioration? We cannot just look at sepsis related mortality, as that is extremely rare, and most of the cohorts have between 0 and 1 death.
If we use a risk-factor based approach, which identifies around 6% of babies as being at risk of EOS, and includes about half of the cases, in a population with an EOS incidence of 0.4 per 1000 the incidence among the 6% at risk will be about 0.3%; among the remaining 94% of infants EOS incidence will be 0.2 per 1000. The Kaiser calculator refines this further, dividing into a category at highest risk, including about a quarter the EOS cases. Which means that there are 1.5% of babies at highest risk, and among them the incidence of EOS approaches 1%.
The question then becomes almost philosophical, should we culture and treat those at highest risk, with the hope of preventing deterioration, but with the knowledge that 99% of treated babies will not be infected, and will have the pain, the adverse impacts on parental infant interaction, and the dramatic, long-lasting effects on the intestinal microbiome.
We also cannot eliminate the risk of EOS and a later deterioration among screen-negative babies, so must ensure adequate parental education, and some sort of surveillance of even low-risk babies.
Perhaps the best approach would be to ensure intensive clinical surveillance of the 1 to 2% of infants who are highest risk, in a way which has the least adverse impact on the infant and the family; an intermediate form of surveillance for the 4 or 5% who at intermediate risk, while ensuring that the system is fail-safe, and babies are not missed. But we must not forget the remaining 95% of infants among whom half of the EOS cases occur, and ensure some form of surveillance for that group with an incidence of maybe 0.1 per 1000. Whether that should include occasional nursing evaluation, parental education, increased sensitivity to the problem of the population at large, or some combination, I do not know. I don’t think it is feasible in most health care systems to follow the Stanford approach of “enhanced clinical monitoring” in all newborns, with frequent vital signs, nursing examinations and 1:3 nurse to baby ratio. Creating the right safety net for low risk babies will detect as many cases as screening or SPE among the high risk.
Additional References (without URL links, I ran out of steam!)
Gyllensvard J, et al. Antibiotic Use in Late Preterm and Full-Term Newborns. JAMA Netw Open. 2024;7(3):e243362. Kuzniewicz MW, et al. Update to the Neonatal Early-Onset Sepsis Calculator Utilizing a Contemporary Cohort. Pediatrics. 2024;154(4). Dimopoulou V, et al. Antibiotic exposure for culture-negative early-onset sepsis in late-preterm and term newborns: an international study. Pediatr Res. 2024. Harrison ML, et al. Beyond Early- and Late-onset Neonatal Sepsis Definitions: What are the Current Causes of Neonatal Sepsis Globally? A Systematic Review and Meta-analysis of the Evidence. Pediatr Infect Dis J. 2024. Achten NB, et al. Sepsis calculator implementation reduces empiric antibiotics for suspected early-onset sepsis. Eur J Pediatr. 2018;177(5):741-6. Coleman C, et al. A comparison of Triple I classification with neonatal early-onset sepsis calculator recommendations in neonates born to mothers with clinical chorioamnionitis. J Perinatol. 2020. Morris R, et al. Comparison of the management recommendations of the Kaiser Permanente neonatal early-onset sepsis risk calculator (SRC) with NICE guideline CG149 in infants >/=34 weeks’ gestation who developed early-onset sepsis. Arch Dis Child Fetal Neonatal Ed. 2020:fetalneonatal-2019-317165. Goel N, et al. Screening for early onset neonatal sepsis: NICE guidance-based practice versus projected application of the Kaiser Permanente sepsis risk calculator in the UK population. Arch Dis Child Fetal Neonatal Ed. 2019. Sloane AJ, et al. Use of a Modified Early-Onset Sepsis Risk Calculator for Neonates Exposed to Chorioamnionitis. J Pediatr. 2019;213:52-7. Gong CL, et al. Early onset sepsis calculator-based management of newborns exposed to maternal intrapartum fever: a cost benefit analysis. J Perinatol. 2019. Carola D, et al. Utility of Early-Onset Sepsis Risk Calculator for Neonates Born to Mothers with Chorioamnionitis. J Pediatr. 2017.
In order to put this in context, I have just been trying to search the literature for all the prior trials, knowing that there were 2 trials from Yeh’s group in Taiwan, the 2nd being a very positive trial performed in 3 centres, 2 in Tawian and 1 in Chicago. There have been several systematic reviews published, the most recent of which was published in a journal called “Respiratory Medicine and Research” and dates from June this year. In that recent SR I saw many more trials than I was aware of, and I must say, this foray was extremely disturbing. There is an enormous amount of very questionable research being published and/or referenced. Below is the Forest plot which includes both studies of early budesonide inhalation, as well as studies of budesonide instillation with surfactant. Bassler 2015 is of course, the NEUROSIS trial, which showed slightly higher mortality with budesonide inhalation, but less BPD among survivors. This Forest plot shows BPD among survivors from all the early inhaled/instilled steroid trials.
I tried to find some of the other trials. Heo2020 is actually a retrospective chart review, with no consent, and no mention of randomization. Wen2016 is from “World Latest Medicne Information” (sic) and impossible to find. Among other studies included in the SR were Cao2018, published in 2018 in the “Journal of Pediatric Pharmacy” which, as far as I can tell, stopped production in 2017, Zhou2019 was also apparently published in the same ex-journal in 2019. Kou2019 is also untraceable, with the link in the reference list of the SR leading to a completely different article.
Some of the real articles in the SR include Pan2017 (Pan J, et al. [Clinical efficacy of pulmonary surfactant combined with budesonide for preventing bronchopulmonary dysplasia in very low birth weight infants]. Zhongguo Dang Dai Er Ke Za Zhi. 2017;19(2):137-41, the abstract is in English, for the rest I used Google translate) 15 babies were randomized to budesonide with their surfactant, and 15 to surfactant alone. The incidence of BPD (defined as needing oxygen at 28 days) was 1 vs 6, which the authors claim was “significant and statistically significant”; of course, it is far from being statistically significant, by Fisher Exact, p= 0.08. Ping2019 also seems to exist, it appears to be a report of a prospective RCT, and seems to show a reduction in moderate to severe BPD, which is undefined in the abstract, which is all I can get hold of. Ke2016 is a 4 group RCT, with 46 subjects per group, one of the groups being budesonide plus surfactant, and one being surfactant alone, the others being budesonide by inhalation using 2 different methods. Yao2021 was published in “American Journal of Translation Research” and purports to be a trial using nebulized budesonide.
It is impossible to know how much of this research was actually ever performed, how much is fictional, how much was invented by AI, how much is re-quoting of numbers from other reviews without adequate checking of the original.
This is very disturbing to anyone wanting to provide high-quality medical care, based on reliable evidence, to sick babies. One way to deal with all this is that, when we write or review a Systematic Review for publication in a reliable journal, we must insist on only prospectively registered trials, with publicly available data. We will have to perform more independent IPD meta-analyses, where the review authors share, and can examine each others anonymised individual data, if we are to be able to rely on the best evidence for our patients.
The background, therefore, to PLUSS, was of a single high-quality trial of moderately large size (n=265 total), Yeh 2016, which doesn’t even appear in the Forest plot above (probably because they did not publish the incidence of BPD among survivors) but which showed a substantial decrease in oxygen needs at 36 weeks, from 50 to 29 %, which was due to a reduction in both moderate BPD (31 to 20%) and severe BPD, (19 to 9%). The RR for “BPD or death” was therefore 0.63, 95% CI 0.5 to 0.8.
If we recalculate the primary outcome of PLUSS as a relative risk of the primary outcome, it was 0.96, with 95% CI of 0.9 to 1.03. In other words, not only was there likely no benefit, with the residual possibility of a small benefit or a minor harm, but no overlap with the CI of the previous trial either. Also of note, there was no impact of budesonide in any of the prespecified subgroups either. Only a post hoc subgroup analysis of the infants with an FiO2 >50% at treatment showed some hint of a benefit.
So why is PLUSS so different to Yeh et al? PLUSS used a different surfactant (poractant rather than beractant in the previous study) had more immature infants, (average GA 25.6 weeks compared to 26.5 weeks), who needed less oxygen on average at intervention (average of 30%, compared to a mean FiO2 in Yeh et al of 60%) and had a maximum of 2 doses of budesonide, compared to a maximum of 6 doses in Yeh’s study.
Yeh had lower mortality, and less BPD among survivors than PLUSS, in the control groups. Mortality in the controls was 16% in Yeh’s trial, and 20% in PLUSS, while BPD in survivors was 55% vs 72%.
In PLUSS, infants who had already received one dose of surfactant were eligible to be randomized at the time of the second dose, if they were still < 48 hours of age. In fact about half of the infants (in both groups) had already received a dose of surfactant without budesonide prior to being randomized. But one of the pre-planned subgroup analyses was restricted to infants getting their first dose of surfactant, and that also showed no benefit of budesonide, as you can see from the figure above, “Surfactant therapy” means previous surfactant without budesonide prior to randomization. So the ‘No’ results are for babies getting their first dose, and there is an almost identical primary outcome (Death or BPD) among these babies. Because prior surfactant treatment was permitted in PLUSS, the average age of administration of the study drug was later, at 5 hours, compared to 2 hours in Yeh et al.
The new study was much larger than Yeh, and allowed surfactant via LISA/MIST, compared to all the babies being intubated in the previous trial. The differences in outcome, with this new high-quality trial showing no benefit with budesonide, could be because of the differences I have discussed, or some other currently uncertain effect, or simply because of random variation. As I already mentioned, the confidence intervals of the primary outcome of PLUSS do include the possibility of a small benefit of budesonide instillation, but do not overlap with the CI of the trial of Yeh.
Post hoc subgroup analyses, such as the one performed here of above and below 50% O2, should always be treated with great suspicion. Introducing new, unplanned analyses after the data have accumulated is very risky, and inflates the possibility of a type 1 error, suggesting a difference which is not real. In addition, the interaction term was not significant, which means that, although the subgroups with FiO2 above and below 50% had different results, there is not a statistically significant difference between those subgroups. For all these reasons, budesonide instillation with early surfactant cannot be recommended based on this trial, even for babies with very high oxygen requirements. It may be that another trial enrolling only infants with very high oxygen requirements could possibly show a benefit, but that is a possibility that can only be answered by doing such a trial.
The BIB trial, currently underway in the USA does not, however, restrict enrolment to infants with a high FiO2, and is in many ways similar to PLUSS, with the main difference being a restriction to the first dose of surfactant, and the inclusion of infants up to 28 weeks and 6 days gestation. BIB has completed enrolment, and, you never know, perhaps we will see some results at PAS next spring. Once we have those results, we should be able to make an evidence-based decision about routine budesonide instillation with surfactant, and we will be able to ignore all the very small, potentially unreliable or perhaps fictitious trials in the current Systematic Reviews. Perhaps, if someone has the time and the funding, we could do an IPD meta-analysis, and only those investigators who are able to supply the individual patient data, from pre-registered trials, could collaborate to give the best possible information for future babies, and for future trials.
As usual for a TED talk this is 18 minutes long, and, in that brief time, Annie recounts her trajectory: learning from families of kids with trisomy 13 and trisomy 18; our own experience with counselling for threatened extreme preterm birth, from both sides of the conversation; and the Parents Voices Project. The only thing I don’t like about this video is the still that was chosen by their algorithm to appear with the title, which has a weird expression on Annie’s face.
The description below the video of Annie’s TEDx talk, which she presented in Fort Lauderdale in September this year, at the Delphi conference of neonatal innovation, notes:
In this heartfelt TEDx talk, Dr. Annie Janvier, Professor of Pediatrics and Clinical Ethics at the University of Montreal, shares profound insights from her work as a neonatologist and ethicist. Drawing on her personal experiences as a mother to children with healthcare needs, including one born at just 24 weeks, Dr. Janvier explores the lessons that premature and critically ill babies can teach us about resilience, life, and the complexities of healthcare. Her unique perspective sheds light on the challenges families face and the ethical questions that arise in critical care, encouraging us to view life and health through a compassionate lens. Dr Annie Janvier is a Professor of pediatrics and clinical ethics at University of Montreal. She is neonatologist and clinical ethicist at the Sainte-Justine University health center. She holds a PhD in Bioethics and co-directs the Masters and PhD programs in clinical ethics at University of Montreal. She works first hand with patients and families confronted with difficult decisions such as end-of-life issues, and decision-making in the face of uncertainty. She is also the parent of three children who had or have healthcare needs, one of which was born at 24 weeks
The most recent publications from the Parents’ Voices Project include
There are only a few randomized trials in neonatal surgery, but those that exist have had a major impact. This new publication may do the same, even though the sample size is relatively modest, and there are some issues with the article. You can see some of the difficulties in doing such trials from the fact that it took 12 hospitals 9 years to recruit 80 patients, 58 of which came from only 3 of those hospitals, Sick Kids, GOSH, and University Children’s Hospital in Belgrade, the others having between 1 and 5 patients each. (Eaton S, et al. STAT trial: stoma or intestinal anastomosis for necrotizing enterocolitis: a multicentre randomized controlled trial. Pediatr Surg Int. 2024;40(1):279).
The investigators enrolled infants with NEC prior to surgery, and randomized them during the procedure if they felt that either enterostomy or primary anastomosis were both reasonable surgical options. After elimination of one subject who had been randomized twice, they had 79 patients enrolled, with 42 having a stoma and 37 a primary anastomosis. This appears to be a very reasonable trial approach, where the individual surgeon has equipoise at the moment of surgery.
The first problem with the trial is that there were 6 patients lost “to all follow up”. I don’t understand how it is possible that the authors don’t even know if they survived or not. If there were big problems with record-keeping in some centres maybe the primary outcome variable, ‘days on parenteral nutrition’ might be difficult to find years later. But this was a prospectively registered trial, so it is not clear to me how simple data, easily available during the initial hospitalisation, can be missing on 7% of the subjects.
There are other significant problems with the manuscript. The primary outcome was, as mentioned, days on TPN, but how the investigators dealt with mortality in this outcome is not at all clear, the results present the primary outcome among survivors initially, which was a median of 51 days (range 3 days to 4 babies still on TPN at the end of follow up 310 days) among the stoma babies, and 30 days (range 4-105). They then present an analysis by an adjusted Cox regression, censored for mortality. As noted, they state in the text that 4 stoma babies were still on TPN at the end of follow up, but the figure shows everyone as being off TPN before 300 days.
The survival curves are also strange. There were only 4 deaths in the primary anastomosis group, but the curve seems to show about 13 steps down in the survival curve, and both curves continue to show deaths up to 200 days, this isn’t because some patients haven’t yet reached the final follow-up as that was completed in January 2020, and there were no further losses to follow-up in the anastomosis group at 3 months or at 1 year. The curves seem to suggest that the final proportion of the stoma patients surviving was 0.8, but there were 4 deaths of 35 with known outcome, which leaves a proportion of 0.88 surviving.
While keeping all of these concerns in mind, the results do seem to show major advantages of primary anastomosis, with the duration of TPN among survivors being shorter, and there being more survivors, 8 deaths in the stoma group vs 4 after anastomosis.
It would be interesting to see whether the experience with the stoma patients in the trial is similar to other patients having a stoma outside of the trial. It seems to me that the fact that 10% of the stoma patients were still on TPN at one year of age, when they were already a selected group, who were relatively stable and with relatively restricted disease, is a surprisingly poor outcome. There were substantially more complications in the survivors in the stoma group, with more wound infections, and, of course, more stoma complications, than among the survivors in the anastomosis group.
The study report may have a lot of big problems, but it does at least suggest that immediate primary anastomosis is a very viable approach in infants who are stable enough to have it considered, and in whom the distribution of disease makes it feasible. There may well be fewer post-op complications with anastomosis, and the number of later strictures were similar between groups.
As some of you may have noticed, I left Twitter (X), apart from sending a link to each new post on this blog. I never any longer visit the website and I have deleted the App on my phone.
A great alternative is the nicuverse, nicuverse.org a Mastodon community with many similar functionalities that we could use to chat with each other, discuss research and clinical approaches. You will see also that there are links there to another on-line community 99NICU. They recently organised a conference in Portugal, much of which is available on-line, and they have a YouTube channel which has numerous videos of presentations from that conference. Several of which I have watched, and they are all of high quality.
For example, in keeping with some of my recent posts, there is a very good presentation about the use of antibiotics and the evidence-base for our current practices.
Another, about what on earth should we do about the PDA? Has 3 stellar presenters rather than just one!
One of the reasons for mentioning this now, is the upcoming series of Webinars about, and by, adults born preterm. You can register for them, or, I am sure, watch them later on-line. One of them is embedded below.
Rethinking the definition of clinical research, and how we approach the essential protections of research subject is clearly needed.
In my NICU, if I have a 3 week old preterm baby who develops signs of septic shock with poor perfusion and a low blood pressure, in general my approach would be to give a bolus of normal saline (10 mL/kg) over a fairly short time, then to start a norepinephrine infusion, and usually start low dose steroids very early. I would start norepinephrine at about 0.1 mcg/kg/min, often the response is slow, and I may increase to as much as 0.4 mcg/kg/min before, often, being able to back off fairly quickly as the steroids and antibiotics have their effect.
Were those publications “research”? We described our practice and the outcomes, in the hope that others will evaluate their own practice and compare with ours, and we had local ethics approval to go through the databases and publish our results. In another NICU, they might routinely give multiple fluid boluses, followed by dopamine, and only use steroids as a last resort. Also without a good evidence base. As each baby is so different to the last, observational studies have limited value. It is, indeed, very hard to imagine how comparing these approaches and determining best practice is possible, even with all the fancy statistical adjustment in the world, including the latest version of propensity score matching, unless large randomized trials are performed.
I can treat babies in this non-evidence-based way, and then compare with other approaches, or change my practice and then re-evaluate, with almost no oversight, and certainly no prior IRB approval. What if I wanted to do something much more scientifically valid, such as trying to standardize therapy as much as possible, then prospectively comparing the outcomes from my centre to other centres that have also tried to standardize their treatment approaches? Then, even if every baby gets exactly the same therapy as they would have done anyway, does this suddenly become a research study? The CNN, Canadian neonatal network, are actually doing such a study, in which centres who usually use norepinephrine for septic shock, and those who use dopamine try to standardize their initial treatment, and we will compare outcomes.
A much more valid study, and one which would cost much more, and be substantially more difficult to organise, gain approval, and perform, would individually randomise eligible babies to one approach or the other. An intermediate type of study, where NICU’s are randomized, has some advantages in terms of, for example, ensuring the team is familiar with the approach over time, and not requiring masked vials of study drug; but has major impacts on sample size. Either a cluster-randomized or individually randomized trial will require prior IRB approval.
One big question for an IRB is how to determine the risk of such a study. In the past, because there is a substantial risk of death, and of major complications, this should be labelled as a high risk study. The risks of a participant in the study are much greater than the risks of daily life, even of daily life of a preterm baby in the NICU. Surely, though, the risks of the research project should be a comparison of the risks of adverse outcome if you are in the trial compared to being not in the trial. What matters is whether there is additional risk over not being a study participant. Clearly, in the scenario I have described, the risks are identical to not being in the study. It should be considered as “no additional risk”.
The new Declaration of Helsinki does not go into any detail about which risks to include in evaluating a research project. It states that a project must be “preceded by careful assessment of predictable risks and burdens to the individuals and groups involved in the research in comparison with foreseeable benefits to them and to other individuals or groups affected by the condition under investigation.”
The discussion by Kass et al puts it this way: “Our specific recommendation here, with relevance for comparative effectiveness and other types of research on approved or widely used clinical approaches, is for oversight bodies to make their risk-level determinations, including whether a study qualifies as minimal risk, on the marginal or additional risks and burdens of research participation, relative to those of ordinary usual care.”
An accompanying editorial in JAMA points out how these proposals differ from current rules. According to OHRP current rules, “in general the reasonably foreseeable risks of research in a study include the already identified risks of the standards of care being evaluated as a purpose of the research when the risks being evaluated are different from the risks some of the subjects would be exposed to outside of the study”. In other words all possible risks of being a preterm baby in the NICU with septic shock undergoing intensive care are considered study risks. Which means that all possible adverse outcomes of the extremely high-risk situation are included, death, IVH, renal failure, and so on. That interpretation of the OHRP rules makes intensive care research just about impossible, as it requires the IRB to ascribe all the risks of being critically ill to the trial, even if there are no additional risks compared to not being in the study. I think that many IRBs already interpret the risks of a trial as being those additional risks, and I can only hope that this interpretation will become widespread.
Another question is about consent. Normally, I do not share with parents all the many the uncertainties regarding the best approach to septic shock, I do not ask them to make an informed choice between norepinephrine and dopamine, I don’t counsel them about the uncertain role and timing of steroids, and get them to sign a consent form. If I am planning to prospectively collect the data, with the explicit intention of comparing outcomes with dopamine centres, does that now require specific informed consent? Even though I am doing exactly what I would have done in any case? Indeed, so are my comparison centres, who use a different approach.
I think that this does not require specific informed consent, I am not even sure that we should obtain consent to contribute the baby’s data to a truly anonymous database for future comparative analysis.
What if I decide, for a while, to switch my approach to using dopamine, and collect the data, because I understand the uncertainty involved, and I think that dopamine is an equally valid choice? And my comparison centres will also switch to using norepinephrine. According to this new discussion article:
(Informed consent is required if)… the research removes from patients the ability to make a decision that may be meaningful to them—a decision that, in ordinary clinical care, they may have been invited to make.
They go on to note :some commentators support the view that consent can, ethically, be waived when risks are low and when consent is typically not sought for either approach being studied, others oppose complete waivers of consent requirements in some or all circumstances. Importantly, most of these critics of complete waivers support streamlined approaches to consent for these lower-risk, randomized studies.
In this situation, parents would not ordinarily be asked whether they prefer norepinephrine or dopamine, so the first criterion does not apply. However, my initial intervention choices are being determined by a written protocol, I think this should require that I tell the parents that their infant is getting a therapy determined by a protocol, a therapy which is, nevertheless, within the usual range of good clinical practice.
I don’t think it is appropriate to perform prospective comparative research, even with zero additional risk, without informing the parents. It seems to me to be hiding important information from the parents, and that is likely to create distrust.
I propose for such a study that consent should be altered to focus on informing parents of the important details of the study, and giving them an opportunity to opt-out of data collection. A 2017 article refers to this as “targeted consent”, whereby patients who are individually randomized to one of two reasonable, clinically appropriate treatments have a verbal information sharing session, and a brief targeted consent form outlining the patient’s rights. I don’t think that a cluster-randomized trial should be considered to be morally different, if all patients admitted in January have one approach, then in February we switch to another approach, from the patient’s point of view this is identical. Or if NICU A is randomized to dopamine, and NICU B is randomized to norepinephrine, and then switched over after 6 months. In each situation we are performing pre-planned research, and parents should know about it, and have the chance to opt-out of information collection, and even from non-essential aspects of the care protocol if there are any.
Of course, as always, if I have a valid reason to believe that this particular patient would benefit from norepinephrine, then I am morally obligated to do so, and treat them as a protocol violation, or a non-enrolled subject.
If I was to introduce levosimendan, however, a drug with almost zero use in the NICU, and no experience in the preterm infant with septic shock, then there would be a possibility of a major impact on important clinical outcomes, and the study should have completely different, and more extensive requirements and consent process.
After the dopamine and norepinephrine study finishes, it may be found that mortality is higher in one group or the other, or the incidence of a major complication differs between groups. I don’t think we can then, in retrospect, suggest that the risk of the trial changes.
This is what happened with the SUPPORT trial, where the 2 oxygen saturation target ranges, both considered to be acceptable within usual clinical practice were prospectively randomly compared. Outside of the trial and infant could have been treated with either of the 2 approaches, so, according to the standards that I am promoting here, this would be considered a trial with no additional risk. In fact, as we all know, we found a higher mortality with the lower target range, which created a storm, and was largely responsible for the OHRP guidance which I quoted above.
But, if it turns out that norepinephrine babies have a lower mortality in our study, even after correcting for every baseline imbalance possible, that does not change the a priori risk assessment of the study. To think otherwise requires some sort of magical thinking, that things which are unknowable because such a trial has never been done, should have been known before the facts existed.
Building cluster randomized comparative effectiveness trials on top of existing databases, such as the CNN, will allow us to improve therapy for future babies, with no additional risk to our current patients, and the chance of major benefits for the next cohort.