Neonatal Research Shorts : April 2025

I used to do a series of shorter posts called “weekly updates” but I ran out of steam and have concentrated on longer posts in recent years. The last couple of weeks, with clinical service, I have had less time for a longer post, but a few things piqued my attention:

Alexe PI, et al. Prematurity and Congenial Diaphragmatic Hernia: Revisiting Outcomes in a Contemporary Cohort. J Pediatr. 2025:114545. I thought I would write about this article before the title got corrected, (unless, that it, they really are discussing “congenial” diaphragmatic hernia). Right now if you do a search on “congenial” in pubmed, you get this hit and a couple of auto-mis-corrected titles!

There was also an article about congenial cataracts, and several others about congenial heart disease. The pre-print of the article has the error right on its front page, it has probably been seen by hundreds of pairs of eyes, and everyone’s brain auto-re-corrected it back to congenital! I am not writing about this just to poke fun, however, it really is a very interesting piece of work from an international consortium, reporting analysis of over 13000 cases. It demonstrates very clearly the major impact of prematurity on management and outcomes of CDH. Survival of all cases progressively increases up to 40 weeks gestation.

For infants of <34 weeks gestation, large numbers never have a surgical repair, nearly half, and those that do, get their surgery much later, at 11 days mean, compared to about 1 week for the early term and term groups. ECMO was used in about a third of the cases after 34 weeks, but only 7% of the <34 week infants. ECMO survival was 30% for the <34 weeks, 40% for the late preterm, and just over 50% for the term infants. All of the measures were more marked for the very preterm, and worse again for the extremely preterm.

Within the database there are many infants who delivered after fetal intervention (FETO), which was associated with lower survival. This was clearly because they were a much higher risk group, which is why they were selected for FETO, overall survival was only 54% after FETO, compared to 71% among the remaining infants, and almost all of the FETO babies needed a patch repair. FETO was also associated with increased prematurity, and among those who did deliver preterm, survival was poor, about 40%, but was identical with or without a prior FETO.

Take Home Message : Prematurity is very bad news for infants with CDH.

Franz AR, et al. Effects of liberal versus restrictive transfusion strategies on intermittent hypoxaemia in extremely low birthweight infants: secondary analyses of the ETTNO randomised controlled trial. Arch Dis Child Fetal Neonatal Ed. 2025. This is a secondary analysis of the ETTNO trial, the primary publication of which which showed no overall benefit of a higher versus a lower transfusion threshold in the very preterm infant. This analysis focused on any potential impact of transfusion threshold on apnoea spells, which are the usual pathophysiology behind intermittent hypoxia. Multichannel recording and analysis of apnoea patterns, which I have done, and published, many hundreds of times and published is much more difficult and time consuming than just recording oximeter saturation data. In addition, unless you record airflow as well as just respiratory movement and heart rate, you gain very little, and measuring airflow is tricky. Most recent studies therefore have measured, and discuss, intermittent hypoxia.

I actually started an RCT of transfusion therapy for apnoea when I was in San Diego, we enrolled only about 20 patients prior to me leaving UCSD, and the fellow also graduated, so the trial ground to a halt unfortunately. I did have a poster at an SPR about our preliminary findings, but never felt that it was worth a publication. Basically we performed multi-channel recordings, including an expired CO2 detector next to the nose of the infants, among convalescent preterm babies with apnoea and a hemoglobin <100. We randomized them to get an immediate transfusion if the attending physician thought it was reasonable to transfuse, or to not be transfused for at least 72 hours, and did physiological recordings for 72 hours. The trial was of course therefore very underpowered, but there was no impact of transfusion on the respiratory patterns of the babies.

We did that trial because, at the time, recurrent apnoeic spells were often as an indication for transfusion in anaemic preterm infants. There were also a few observational cohorts, which suggested that apnoea becane less frequent after transfusion, but those cohorts were all uncontrolled. Control groups are essential for many things, but especially with a condition such as apnoea of prematurity, which always gets better with time anyway! Without a control group, any intervention that you can imagine will be followed by decreased apnoea, especially when you factor in that babies are usually enrolled in a trial when their apnoea frequency is at its highest. I was rather sceptical about apnoea as a transfusion indication, which was the justification for the RCT.

This new publication has compared intermittent hypoxia episodes, occurring between days 8 and 49, between infants in the higher transfusion threshold group to the lower group. All the babies were less than 1kg, and there were over 250 per group.

There was no difference between the groups in numbers of IH episodes, duration of episodes, more severe episodes, or the median duration of episodes. Or indeed, as they have previously published, on survival or neurological or developmental outcomes. There was also no interaction between IH, transfusion strategy, and outcomes.

Take Home Message : Intermittent hypoxia, and/or apnoea of prematurity is not an indication for blood transfusion.

Cheung PY, et al. Dose-related systemic and cerebral hemodynamic effects of norepinephrine in newborn piglets with hypoxia-reoxygenation. Pediatr Res. 2025. Po-Yin Cheung and the group in Edmonton refer to the lab where they do their animal work, as my old lab; I set up many of the processes and bought some of the equipment that they are still using 30 years later, including the Transonic flow probes! Po-Yin, and Georg Schmolzer, and their colleagues, have gone far beyond what little I accomplished there, however. This new publication is in acutely instrumented anesthetised newborn piglets, and investigates the haemodynamic and cerebral circulatory effects of norepinephrine infusions. The model they use is one developed by Po-Yin, of hypoxia and re-oxygenation, and is designed to be a model of infants after perinatal asphyxia.

Our group in Montreal have been using norepinephrine for circulatory support for a few years now, mostly for septic shock or for infants with PPHN who need support. In both circumstances there is a small amount of animal data, and a very small amount of human neonatal data suggesting advantages over other agents. One thing that has been missing is a good idea of what the agent does to the cerebral circulation, being very difficult to accurately measure cerebral perfusion in the newborn infant.

The model used in this study leads to a severe metabolic acidosis, lactate of 20 and Base Deficit of 20, and 2 hours later, prior to the study medications begin administered, the acidosis was a little improved. Animals then got either saline, or epinephrine infusion at 0.1 mcg/kg/min or one of 3 doses of NorEpinephrine (0.05, 0.1 or 0.2 mcg/kg/min). Without going to all the details of the findings, NE at 0.1 mcg/kg/min had the biggest positive impact on cardiac function and cardiac output, With cardiac output increasing by 100%, due to a small increase in heart rate, of about 20%, and a major increase in stroke volume. At the same time systemic blood pressure increased by about 20%.

In addition carotid blood flow increased by nearly 20% leading to an increase in cerebral oxygen delivery and brain oxygenation.

Clearly, one cannot directly extrapolate these results to the human newborn, and particularly not specific dose responses, but they do give some reassurance that norepinephrine does not vasoconstrict the neonatal cerebral circulation.

Take Home Message: (with lots of caveats) norepinephrine infusions may lead to improved cerebral oxygenation.

Fox L, et al. A Pilot Randomized Control Trial of Holding During Hypothermia and Effects on Maternal and Infant Salivary Cortisol Levels. Adv Neonatal Care. 2025;25(2):173-80. Many centres, including ours, have instituted “cool cuddles” (câlins-frisquets in Quebecois), whereby infants with HIE being cooled are allowed to be cuddled by their mothers (or fathers!). We have demonstrated, for our own purposes, that this doesn’t adversely impact temperature control. We still have a few restrictions, for infants with multiple central lines it gets quite difficult to ensure safety. Although it seems like the right thing to do, from first principles, this new randomized trial sought signs of infant stability and stress, and maternal stress from their salivary cortisol levels. Mothers in the cuddling group had lowered cortisol, as did their infants, who also showed lower, stable heart rate and respiratory rate, without desaturation, or temperature problems.

Take Home Message : Cuddling during therapeutic hypothermia reduces maternal stress.

Posted in Neonatal Research | Tagged , , , , , | Leave a comment

Early hydrocortisone in shock?

A few years ago, we published our experience with the use of hydrocortisone in newborn infants in septic shock (Altit G, et al. Corticosteroid Therapy in Neonatal Septic Shock-Do We Prevent Death? Am J Perinatol. 2018;35(2):146-51), like many such studies the numbers were small, but the 39 babies who received hydrocortisone in addition to their inotropic support had rapid haemodynamic improvement, within 6 hours of starting hydrocortisone, mean BP had increased by over 30%, urine output was better, and inotropic support was being weaned. They were, however, somewhat less likely to survive the episode (59% survival vs 78% without hydrocortisone) and to survive until discharge, neither of which comparisons were “statistically significant”, and the babies who got hydrocortisone were sicker, less mature and more likely to have NEC, than the septic babies who did not get hydrocortisone.

Those data suggested that there was a major rapid haemodynamic response to hydrocortisone in newborn infants with septic shock, but they clearly could not answer the important question about whether they improved survival.

A new, single-centre prospectively-registered RCT has examined this issue. (Dudeja S, et al. Early hydrocortisone verses placebo in neonatal shock- a double blind Randomized controlled trial. J Perinatol. 2025). Newborn infants with shock, of any aetiology, were enrolled in 2017 and 2018 (not clear why the delay in publishing) and randomised to receive early hydrocortisone, or placebo. It is also not clear why they bothered with the placebo, as the bedside nurse was not masked to study group assignment. The main problem with the trial is the definition of shock, which was either A or B criteria “(A) either systolic or diastolic blood pressure (BP) less than 5th centile; (B) Any two of the following five criteria- capillary refill time >4sec, core-periphery temperature difference >3 °C, urine output <0.5mL/kg/hr in previous 6 h, base excess worse than −5.0mmol/L and lactate >5mmol/L”.

The use of BP alone to define shock is problematic, at any moment 5% of healthy babies are below the 5th percentile, and therefore over 5% of all babies are in shock at any time according to this definition. In addition, the definition allows cuff, non-invasive, BP to be used, which are very unreliable, and they used standards for BP which are limited (Zubrow AB, et al. Determinants of blood pressure in infants admitted to neonatal intensive care units: a prospective multicenter study. Philadelphia Neonatal Blood Pressure Study Group. J Perinatol. 1995;15(6):470-9). Many infants had a large PDA, which will lead to low diastolic pressures satisfying this definition, but without necessarily having other features of shock. Nevertheless, these were babies that the clinicians felt warranted vasopressor/inotrope intervention. To be eligible for the trial the infants had to be “fluid-resistant”, that is not responding to 1 to 2 10 mL/kg fluid boluses, they also excluded infants with NEC, I am unsure why.

Infants were started on vaso-active drugs at the same time as the study drug, and had functional cardiac echography. The primary outcome was survival to 14 days of age, rescue hydrocortisone was allowed if the infant became unresponsive to their vaso-active drugs.

Survival was very poor in both groups, 17% with placebo, and 28% with early hydrocortisone. and 80% of the controls eventually received open label hydrocortisone (compared to 70% of the early HC group).

There are a couple of reasons for this post, the first being that I think this is the first report of an RCT of steroid use in the treatment of circulatory collapse in the newborn, which points out how poor the evidence is to support anything that we do. And also because the survival is reported just as being “not significant”. I mentioned Bayesian analysis in a previous post, and I think that a small trial such as this, which was very unlikely to have a “statistically significant” result, because of low power, is nevertheless informative, and is a good candidate for Bayesian analysis. Using an on-line Bayesian calculator, it seems that likelihood that the early hydrocortisone group is better is about 88% (I am certainly no expert in these calculations, and I may be in error!)

Another way of presenting these results is with the relative risk and its confidence intervals, which is 0.87, 95% CI 0.69 – 1.10. Which means that the results are consistent, with that level of confidence, with no difference in mortality, a small increase in mortality, and a major decrease of 31% in mortality.

The implication of this is that there is certainly enough suggestion of a substantial benefit that further trials are warranted.

I am not against publishing such small trials, I think they can give valuable pointers for the future, but we should be very careful how the results are presented. The abstract of this article just states “we did not observe a significant reduction in 14-day mortality”, which is actually misleading. It would be more accurate to state “we observed a major reduction in mortality with early HC use, but it was not statistically significant, and may have been due to chance effects in a small underpowered trial”. Maybe we should ban the use of the word “significant” from the medical literature. It is often used, as in this abstract, to mean “statistically significant”, but is often interpreted as “real”, “meaningful”, or “important”.

What are we currently doing for this problem? A questionnaire study among Canadian neonatologists was published last year, (Kharrat A, et al. Corticosteroid use in neonatal hypotension: A survey of Canadian neonatologists. Pediatr Neonatol. 2024;65(5):451-6 they were asked if they used HC for hypotension, and if so when. The results showed the variability that you would expect in a situation where there is almost no reliable data.

I can’t remember is I participated in this questionnaire, but, if I had, I would be one of the “other” group! I don’t give fluid boluses for hypotension, unless I have good reason to suppose that the infant is hypovolemic. Early hypotension in the preterm is now much less common, because of delayed cord clamping, and is almost never due to hypovolemia. I will sometimes giving a fluid bolus in shock, if I think that the baby needs more preload, and I tend to start HC very early when I feel that vasopressor/inotrope therapy is needed. In suspected septic shock, for example, I will usually start HC at the same time as the norepinephrine. I have no idea if that is the right thing to do, but 100% of the respondents to the survey reported using HC for hypotensive babies, at least some of the time.

There is clearly a need for further trials; the question of timing of HC use in babies with shock is clearly very important, and could have an important impact on mortality.

Posted in Neonatal Research | Tagged , , , , , , | Leave a comment

Is it a duck? Is it a mole? Platipus and Trial Design Innovations.

When the platypus was first captured, killed, skinned and the pelt sent to a zoologist in London, George Shaw in 1799, he thought he was being scammed. He tried to find the stitches that had been used to cobble together the specimen. Being unable to do so, he then concluded that it was probably real, and was unique. It was what we would now consider an evolutionary innovation, it had features of previous organisms, such as laying eggs, and several new features, being venomous and producing milk to feed their young, once they have hatched. Here is a photo I took in March 2023 at the Woonoowooran national park.

Brett Manley and the group from Melbourne are doing something analogous with their PLATIPUS (Manley BJ, et al. Adapt to survive and thrive: the time is now for adaptive platform trials for preterm birth. Lancet Child Adolesc Health. 2025;9(2):131-7) adaptive platform. Taking features of standard RCTs, and evolving a new way to do trials in perinatal medicine.

Adaptive designs first went mainstream during the COVID epidemic, with the dramatically effective RECOVERY platform. The trials were able to be launched, and to rapidly provide clinically vital results within a very short period of time, while the pandemic was still at its height. RECOVERY was able to rapidly prove the value of standard dose steroid treatment for improving outcomes, and then showed that higher dose dexamethasone was a bad idea, and should not be used. They have also shown that some antivirals are ineffective, that routine azithromycin made no difference, and have to this point enrolled nearly 50,000 patients, having now expanded to find treatments for complicated influenza and non-viral community acquired pneumonia.

Another platform design has, just this last week, produced some important results in ALS, also known as Lou Gehrig’s disease in the USA, and Motor Neurone Disease in the UK. There are 4 simultaneous comparisons published, with 3 different interventions compared to a common control group. None of the interventions showed a clear benefit, but there were some suggestions of some improvements, which allows, within the platform design, to proceed rapidly to adjusted protocols, perhaps in subgroups. An editorial article accompanying the publications discusses the limitations, and the major advantages of such trials. In ALS, the Healey platform was funded by, and named after, a very wealthy man who has the condition himself.

The first comparisons being investigated by PLATIPUS are, for the maternal domain, different antibiotic regimens for preterm PROM, with 3 different regimens being investigated. I must say I thought this was relatively settled, among the antibiotics in the trial, and that it was already fairly clear that azithromycin plus amoxycillin was the way to go, but it seems there is still some doubt about their relative efficacy, and the platform will allow the future investigation of other regimes with minimal adjustment. For the neonatal domain of the platform, they are investigating 3 different dosage regimes of caffeine, the standard dose that we used in the CAP trial, (20 mg/kg of caffeine citrate load, then 10 mg/kg/d, which they call the low-dose group) to a medium dose group which is 50% higher, and a high dose group which is double the standard dose.

The figure below is a schematic of how it might proceed.

The other part of the innovative nature of this trial is the outcome measure that is being used, which is an ordinal composite adjudicated at 42 weeks PMA. The outcome is scored from 1 to 15, with 15 being the worst outcome, death. Levels 1 to 15 are described as follows:

1= Well, liveborn infant; 2= Neonatal unit admission for <48 hours; 3= Neonatal unit admission for >/= 48 hours; 4= Non-invasive respiratory support or oxygen therapy for ≥ 4 hours & < 5 days; 5= Non-invasive respiratory support or oxygen therapy >/= 5 days; 6= Mechanical ventilation via endotracheal tube for ≥ 4 hours & <7 days; 7= Mechanical ventilation via endotracheal tube for >/=7 days; 8= Moderate respiratory morbidity; 9=Necrotising enterocolitis AND/OR Sepsis; 10= Severe Respiratory Morbidity; 11= Major Surgery; 12= Brain Injury; 13= TWO of severe respiratory morbidity OR major surgery OR brain injury; 14= Severe respiratory morbidity & major surgery & brain injury; 15 = Death

There is obviously a lot to pick apart and discuss here, but my first response is Bravo!! At last a composite outcome in a neonatal trial that recognizes that all competing outcomes do not have the same value! The definitions of each of these outcomes are on the clinicaltrials.gov website. For example “severe respiratory morbidity” is respiratory support at 40 weeks PMA, or discharge home on oxygen or respiratory support. Which seems to me to be a definition which is both consistent with parental concerns and reasonably predictive of longer term respiratory outcomes.

As for Brain Injury, I have a few more concerns, the definition given is : Major intraventricular haemorrhage, unilateral or bilateral, defined as i. Papile Grade 3 or 4 AND/OR ii. Moderate-severe periventricular haemorrhagic infarction b. Cystic periventricular leukomalacia, unilateral or bilateral c. Moderate or severe white matter injury on MRI at near-term or term equivalent age d. Any cerebellar haemorrhage e. Other major ischaemic injury such as arterial stroke or hypoxic ischaemic injury f. Post haemorrhagic hydrocephalus requiring drainage.

The problems with this part of the outcome are 1. using the Papile classification 2. including “any” cerebellar haemorrhage. 3. White matter injury on the MRI.

For this particular part of the composite outcome, I am not sure how much this matters, but if the high-dose caffeine group, for example, had more cerebellar haemorrhages, but less severe respiratory morbidity, then that group would be considered to be worse off, even if the haemorrhages were all tiny and had no long term impact. However, I am not sure if you could really tease all that out as part of an ordinal composite like this one, if there is a difference in brain injury, as defined in the protocol, between groups, that would probably be a reason for choosing between interventions, even if some of the haemorrhages had little long term impact.

For an adaptive platform trial trial like this the outcomes have to be determined rapidly after the intervention; 2 year neurological/developmental outcomes, or other long term outcomes of interest to parents cannot be the primary. Hopefully, all the infants randomized in the platform will have longer term outcomes evaluated and reported in addition to the primary short term outcome. I’m sure you are all aware that caffeine therapy in the CAP trial did have some minor short term benefits (infants came out of oxygen a little faster, so were less likely to be diagnosed with BPD, and less likely to have treatment for a PDA), but substantial, very long lasting benefits on follow-up.

Analysis of the trial results is also a major issue, the RECOVERY trials have used a fairly standard frequentist approach, with the first results showing advantages of standard dose dexamethasone being presented as the proportion of deaths in each group, the relative risk (0.83) and confidence intervals (0.75 to 0.93) and the p-value P<0.001. Platipus is planned with a Bayesian approach, which is less familiar to many clinicians, but has potential advantages. The Bayesian approach clarifies that no RCT really proves that one approach is better than another, which is suggested by our dichotomous “significant or not significant” current typical presentation of results, but gives an estimate of how likely it is that one intervention is better than another.

The Platipus platform will allow on-going adjustments, other comparisons, addition of other groups, and other interventions. It could become the default model for future trials in perinatal medicine, and should enable us to get results much faster, but just as reliably.

Platipus, just like the Platypus, will give us major scientific insights for the future. It should allow us to advance the care of newborns more rapidly than in the past.

Posted in Neonatal Research | Tagged , , | 3 Comments

A lower limit for restrictive transfusion strategies? Not directly neonatology…

Several trials of liberal versus more restrictive transfusion practices have been published, and overall, it seems that being very restrictive in transfusions has no negative impacts on clinical outcomes, and depending on the trial, some positive benefits of avoiding transfusion.

One of the important studies of the Canadian Critical Care Trials Group was the TRICC trial, which enrolled critically ill adults with a hemoglobin less than 9 g/dl, who were randomized to a target hemoglobin of 7-9 vs 10-12 g/dl. In that trial the adults with the lower target did better, with a lower in-hospital mortality of 22%, vs 28% with more liberal transfusion. In the results of that trial there was a suggestion of a different outcome in the subgroup with myocardial ischemia, so the group did another analysis in adults with cardiovascular disease, which showed no big differences in outcome, but still suggested a possible benefit of higher hemoglobin in those with unstable angina or an acute MI. Other studies have still, as far as I can see, not clearly answered the question in this subgroup of adults, and the most recent systematic review that I found basically stated “we don’t know”. As a result further trials are planned.

It could be that in patients with coronary artery disease, the ability to vasodilate and maintain myocardial oxygen delivery in the face of anaemia is limited, hence it is physiologically feasible that this subgroup would have different outcomes to those without serious coronary artery disease.

The same might be true for another subgroup, under-represented in the original TRICC and other trials, those with acute neurological injury. In this new trial in JAMA, adults with either traumatic brain injury, or a sub-arachnoid haemorrhage, or an intra-cerebral haemorrhage, who had a haemoglobin <9 g/dl, were randomized to a target of maintaining haemoglobin >9 compared to >7.

The rationale being that these groups of adults have a limited ability to increase brain perfusion in response to anaemia, and may benefit from higher haemoglobin, and that the previous studies have included very few such patients, and have also not measured longer term functional outcomes. The primary outcome of this trial was the extended Glasgow Outcome Scale after 6 months. We used a paediatric version of this scale in our recent publication (Boutillier B, et al. Survival and Long-Term Outcomes of Children Who Survived after End-of-Life Decisions in a Neonatal Intensive Care Unit. J Pediatr. 2023;259:113422. as it is a scale of functional capacities, in our study it was used in a descriptive fashion. In the new trial it was dichotomised into bad outcomes (1 to 5, 1 being death) and acceptable outcomes (6 to 8). The following table describes those scores

I will restrain myself from one of my usual rants about it being inappropriate to dichotomize continuous outcomes just for the simplicity of research design, or of equating death with being unable to use public transport! Designing and analyzing the trial using the GOS-E as an ordinal outcome would have been entirely possible.

800 patients, mostly in Europe and South America, were randomized, and outcomes were better in the higher hemoglobin group. The median GOS-E in both groups was 4, in other words large numbers of the patients had quite poor outcomes. As you can see from the graphical abstract above, the proportion with scores <6 was lower, 63%, with the higher HgB threshold compared to the higher threshold, 73%. When analyzed using an outcome of GOS-E<5, the higher HgB group still had an advantage, 50% vs 60%.

Why am I discussing this adult trial? I guess it is the first evidence, that I know of, that a higher hemoglobin threshold in critically ill patients improves outcomes. And that, in this group of patients we may have found the lower limit of acceptable HgB.

One thing we do not know in preterm babies is the extent to which they are able to vasodilate the cerebral circulation in the face of anaemia to maintain cerebral oxygen delivery. I guess this is why the people who designed the protocols that have been tested in neonatal transfusion trials have all used different thresholds during the first days after birth compared to later life. Or maybe it was just completely arbitrary!

The neonatal trials which have been published, which I have discussed previously used the following transfusion thresholds (converted to Hemoglobin in g/100mL).

The definition of “sick” varied between the trials. In none of the trials was there any sign of harm from the lower transfusion threshold.

I think we are therefore probably well above the threshold of harm, but the new trial does show that under certain circumstances a higher threshold may be better, and that, at least in adults with acute brain injury, a higher threshold may improve outcomes.

Might this be true in preterm infants? I’m not sure if there has been any subgroup analysis of infants in the RCTs who had a brain injury, obviously the pathophysiology of such injuries is dramatically different in preterm infants than the adults in the new JAMA RCT, and the physiology of the control of cerebral perfusion is different also. But those results do suggest that we should consider the possibility that there is a lower limit to transfusion thresholds, which may differ among different groups of babies.

Posted in Neonatal Research | Tagged , | Leave a comment

Bilirubin is Bad for the Brain! Who knew?

We have known for a very long time that bilirubin can cause an encephalopathy leading to acute and chronic clinical impacts, the most severe chronic impacts being kernicterus, a disabling movement disorder and nerve deafness. What seems to be the case is that bilirubin bound to albumin doesn’t cross the blood-brain barrier, and that the BBB, which appears to develop early, and be functional by 20 weeks, is easily disrupted in the newborn. Bilirubin staining of the basal ganglia has been shown on autopsy specimens at much lower concentrations than are associated with kernicterus in term babies. This finding led to the performance of the large multicentre NICHD network phototherapy trial, which is very limited by the lack of a funky acronym. Can I suggest that we start calling it the “LITEPOP” trial: Longterm ImpacTs of Early Phototherapy for Outcomes of Preterms?

There have been multiple studies looking at “free” or “unbound” bilirubin and its impacts, which are hindered to some extent in difficulties measuring it. The subject of this post is a new re-analysis of data from that NICHD early phototherapy LITEPOP trial, which finished enrolling in 2005, and finished follow up 2 years later, with the primary publication (Morris BH, et al. Aggressive vs. conservative phototherapy for infants with extremely low birth weight. N Engl J Med. 2008;359(18):1885-96) appearing in 2008. If you remember, or if you have never heard about this, the primary outcome was “death or NDI” (groan…). There were nearly 2000 babies of 501 to 1000 g bwt randomized at 12 to 36 hours of age if they had never had phototherapy. The intervention was to have phototherapy started immediately and continued until it fell below 85 micromol/L (5 in the US units). The control group had phototherapy to keep the bilirubin either below 137 (500-750 g bwt) or below 170 (751-1000 g).

The results are a very apposite demonstration of the problems with such a composite outcome. Mortality was just about equal in the two groups, 209 vs 201 died before discharge, and a further 17 vs 21 died after discharge, or 24 vs 23% total mortality.

The NDI data were presented as among the entire groups (survivors tested plus deaths), i.e. 235/902 vs 275/902, which is statistically significant, 26% vs 30%, (RR 0.86 (95% CI 0.74–0.99)) but because of the 1% higher mortality in the early phototherapy group, the combined outcome of “NDI or death” was not considered significant.

If we recalculate the outcome of “NDI” just among the survivors who were tested, 235/672 vs 275/684, 35% vs 40%, and the proportion with low PDI scores 127/672 vs 152/684, 19% vs 22%, or with severe hearing loss, 9 infants vs 28, or with athetosis, 2 infants vs 10, then the differences all suggest benefit of early phototherapy.

Babies in this study had a blood sample stored at 5 days of age, which was later measured at one of 2 laboratories. Although they used the same method, (the peroxidase method) and the same analyser, there were enough differences between the 2 labs that the results were converted to z-scores for each lab, and are reported in the newest publication as percentiles of the z-score.

The first publication reporting these data, however, was in 2010, Oh W, et al. Influence of clinical status on the association between plasma total and unbound bilirubin and death or adverse neurodevelopmental outcomes in extremely low birth weight infants. Acta Paediatr. 2010;99(5):673-8. The reporting of the free or Unbound Bilirubin (UB) in that first publication was weird. The graphs appear to show UB concentrations in the microg/dl range,

Adjusted probability plots and adjusted odds ratio (95% CI) for unbound bilirubin versus proportion of death or NDI, death or CP, death or hearing loss, and death before follow-up in clinically stable and unstable infant

But a correction published 3 years later reveals that they actually show the UB divided by 0.3 “we have standardized the unbound bilirubin by dividing the actual values by the standard deviations which is 0.3. The standardized values (in units) were used in the presentation of data in this figures which appear to be 0–3.0 units. The actual unbound bilirubin values are 0–0.9 mcg/dL”. I don’t understand how this “standardized” the data, it was clearly just a screw up that none of the 89 authors noticed. (I exaggerate).

You may also remember a secondary analysis published 4 years after the original manuscript Tyson JE, et al. Does aggressive phototherapy increase mortality while decreasing profound impairment among the smallest and sickest newborns? J Perinatol. 2012;32(9):677-84. In that publication they performed a subgroup analysis dividing the smaller babies into those ventilated and those who were not. Whether this was a preplanned subgroup or not is unclear; the publication states rather vaguely that they had plans “to relate the risks and benefits of AgPT to baseline risk factors including measures of severity of illness”. In that secondary analysis of a subgroup, using Bayesian analysis, they claimed to show that it was highly likely that phototherapy increased mortality among ventilated infants of 500-750 g. That analysis also shows that there was a probable decrease in mortality among non-ventilated 500-750 g babies (13% vs 25%), a probable decrease among larger babies who were ventilated (16% vs 19%), and a possible increase among larger non-ventilated babies (8% vs 6%). My interpretation of which is that the differences are very likely just random variations between groups. However, the authors of that secondary analysis were much more emphatic that phototherapy was decreasing the anti-oxidant benefits of bilirubin, and was generating oxidant injury and was increasing mortality in the smallest babies, if they were ventilated.

So why have they come back to these data now? (Arnold CC, et al. Unbound bilirubin and risk of severe neurodevelopmental impairment in extremely low birthweight newborns. Pediatr Res. 2025) The authors state that there are new concerns about lipid emulsions leading to increases in Free Fatty Acids and displacing bilirubin from albumin, leading to an increase in UB. But such concerns are not new at all, they date back to the 90’s or further. Perhaps more relevant is that a new method for measuring UB has become available, which might make it more clinically useful, and there are new statistical methods available, including the TMLE (Targeted Maximum Likelihood Estimation), which apparently uses machine learning to give conservative explanations of correlations between variables.

For this analysis they concentrated on “severe NDI” or sNDI, which was “based on the NDI outcomes with the strongest associations with the phototherapy regimen in the NICHD aggressive vs conservative phototherapy” LITEPOP RCT which was : a score on the Bayley II Mental or Psychomotor Developmental Index of 50 or less or a level of 5 for gross motor function (GMF), or needing bilateral hearing aids.

They divided UB into 20 slices, based on percentile of UB, as you can see from the figure below, there seems to be a threshold effect, with sNDI increased substantially for the top 15% of UB concentrations on day 5 of life.

There was very poor correlation between the total bilirubin and UB concentrations, as you can see below. But almost all of the UB concentrations associated with worse outcomes were above the threshold for phototherapy in the early treatment group, only 1 or 2 of the dots are in the upper left quadrant. You can also see, what I was not aware of previously, that far more than half of the dots are above 5 mg/dl, which implies that the phototherapy was not very effective at keeping the total bilirubin below the treatment threshold.

Most of the babies with “sNDI” did not have classic clinical signs of kernicterus, choreo-athetoid cerebral palsy, oculomotor problems, and high-tone nerve deafness. This could be because bilirubin encephalopathy is not so specific in the affected regions in very preterm infants, especially in the presence of acidosis, sepsis etc.

What should we do about this? If UB or “free” bilirubin measurement becomes more widely available, then the lack of correlation with total bilirubin strongly suggests that UB should become the variable of choice for monitoring hyperbilirubinemia in full term infants with jaundice. We could perhaps decrease the proportion of babies who get intervention while improving the prevention of kernicterus, which continues to affect somewhere around 1 in 100,000 babies, and while rare is devastating. The potential benefits among preterm infants are numerically greater, if a new trial (the UNBEATEN trial, UNbound Bilirubin EArly TreatmENt trial, I claim copyright) could show that screening for, and treating, to maintain UB < the 85%ile, improved neurological and developmental outcomes.

At the same time the trial could investigate whether the possible impact of phototherapy on mortality among the subgroup of ventilated extremely preterm infants in the NICHD trial was a real phenomenon or not.

Posted in Neonatal Research | Tagged , , , | 3 Comments

Breast is Best; is Fresher Better?

A recurrent problem in the NICU is “bandwagon-jumping”. An intervention with some preliminary positive data, and a possible physiologic rationale is widely adopted and made part of QI bundles with little or no reliable evidence that it is either effective or safe.

One such is the hype surrounding fresh mother’s milk. There are a small number of studies showing the following impacts. (references at the end)

  1. A reduction in laboratory indices of antioxidant activity with freezing and with refrigeration, has been shown, with the longer duration of refrigeration having a greater impact, compared to never refrigerated milk.
  2. An impact of pasteurization, but not freezing and thawing, on Leptin concentrations, but not Adiponectin or total protein
  3. Freezing and thawing reduce glutathione content
  4. No impact on total fat or carbohydrate of freeze/thawing, possible increase in Free Fatty Acids.
  5. Freezing inactivates the majority of CMV, decreasing viral load, but does not eliminate entirely
  6. Vitamin C concentrations are reduced by refrigeration and by freezing

The effects on cellular components of refrigeration and/or freeze thawing are unclear, which is unfortunate as I suppose this is the reason that people want to give “fresh” breast milk. My literature search for this post was unable to find much data on the impacts of freezing and thawing on the endogenous cellular components of breast milk. It has been reported that the viable proportion of white cells in milk after freeze/thawing is 78% or 19%. But exactly how to determine which type of white cell is still active, is unclear to me, the most recent report I found is a brief letter with little methodological detail, which determined viability with “propidium iodide” and measured some cell markers.

It looks however, like there is probably little effect on the bacterial components of breast milk. This study, Marin ML, et al. Cold storage of human milk: effect on its bacterial composition. J Pediatr Gastroenterol Nutr. 2009;49(3):343-8, for example, showed the same species and the same number of organisms in fresh and freeze/thawed milk.

And here another word of caution, what is meant by “fresh” isn’t always the same, sometimes it means never refrigerated, sometimes never frozen, sometimes not pasteurized. A recent statement from France, for example differentiates between “Raw” milk, meaning never refrigerated, and “Refrigerated” milk, with “Fresh” meaning either raw or refrigerated but never frozen. That working group developed guidelines to improve provision of raw milk, despite not presenting any clear reliable evidence of a clinical benefit.

Overall then, it seems that there is little adverse impact of freezing and thawing on macronutrients or cellular components of breast milk. There are some individual impacts, especially on vitamin C.

I guess the current enthusiasm for trying to give fresh milk must, therefore, be based on good clinical data of a benefit? (That is an ironic question, I already know the answer!) There is one cohort study (Huang J, et al. Short-term effects of fresh mother’s own milk in very preterm infants. Matern Child Nutr. 2023;19(1):e13430) comparing outcomes between babies who received “Fresh” mothers own milk, which was what the French group would call Raw milk (i.e. never refrigerated) and used within 3 hours, with the comparisons receiving pasteurized MoM. There were 170 babies <32 weeks GA in total, averaging 1150 g and 29 weeks. NEC and LOS were more frequent in the non-randomized control group, 16% and 19%, vs 10% and 7% in the controls, and there were some very small differences in feeding tolerance. The mother had to provide milk every 3 hours during the entire NICU stay.

Another Chinese cohort study (Sun H, et al. Testing the feasibility and safety of feeding preterm infants fresh mother’s own milk in the NICU: A pilot study. Sci Rep. 2019;9(1):941) compared a group of babies <30 weeks GA who received one feed a day of “Fresh” milk (unclear whether it was allowed to be refrigerated during the 4 hours that was permitted between expression and administration) to a group who received only frozen thawed milk. Infants received either MoM or donor milk (DHM), and it seems that some of the control babies could have received only DHM. There were close to 100 babies per group, averaging about 28.5 weeks gestation and some of the fresh milk babies dropped out and were not analyzed. The birth weights in the 2 groups are not mentioned, but they both, unusually, had a birthweight z score averaging about +0.3 (preterm infants usually average below 0). There was a very high rate of late onset sepsis (38%) and stage 2-3 NEC (14%) in the controls that was lower in the fresh milk group (22% and 6%).

One other study that I have seen quoted is a French database analysis which showed less BPD in NICUs that gave Fresh milk, but in that study the comparison NICUs gave pasteurized mother’s milk, as it was the recommended standard at that time in France. It also is not clear if the Fresh milk could have been frozen prior to administration or not. There was less BPD in the fresh NICUs, but no difference in NEC or in LOS, which was extremely frequent in both groups.

Although it seems attractive to some clinicians, there really is no reliable evidence that giving fresh breast milk, however it is defined, has different clinical outcomes, compared to frozen and thawed breast milk. Differences in composition are relatively minor, and impacts on cellular viability or numbers are really unclear.

The logistic implications, on the other hand, are substantial. Enabling the provision of an early fresh milk feed, daily fresh milk feeds, or exclusively fresh milk feeds, will require some resources, and organisation of those resources, to ensure that it could be done reliably and without errors in administration. A randomized controlled trial would be worthwhile to determine the true clinical impact, if any, of avoiding freeze/thawing of MoM, to see if dedicating those resources is worth it. One is planned and hopefully underway (Sun H, et al. A randomized controlled trial protocol comparing the feeds of fresh versus frozen mother’s own milk for preterm infants in the NICU. Trials. 2020;21(1):170), until the results are available, the potential benefits of Fresh, compared to Frozen and thawed, maternal breast milk must be considered Not Proven.

References (in chronologic order, most recent first)

Surmeli Onay O, et al. Evaluation of the Effect of Breast Milk Storage Conditions on the Viability of Cells in Breast Milk: A Pilot Study. Indian J Pediatr. 2024;91(10):1084.
Stinson LF, et al. Effects of Different Thawing and Warming Processes on Human Milk Composition. J Nutr. 2024;154(2):314-24.
Putri DK, et al. The Effects of Different Storage Conditions on Leukocytes in Human Breast Milk. Sultan Qaboos Univ Med J. 2024;24(1):91-8.
Binder C, et al. Human Milk Processing and Its Effect on Protein and Leptin Concentrations. Nutrients. 2023;15(2).
Edwards CA, et al. A systematic review of breast milk microbiota composition and the evidence for transfer to and colonisation of the infant gut. Beneficial microbes. 2022;13(5):365-82.
Volder C, et al. Transmission of cytomegalovirus in fresh and freeze-thawed mother’s own milk to very preterm infants: a cohort study. J Perinatol. 2021;41(8):1873-8.
Schwab C, et al. Characterization of the Cultivable Microbiota in Fresh and Stored Mature Human Breast Milk. Front Microbiol. 2019;10:2666.
Paduraru L, et al. Total antioxidant status in fresh and stored human milk from mothers of term and preterm neonates. Pediatr Neonatol. 2018;59(6):600-5.
Ahrabi AF, et al. Effects of Extended Freezer Storage on the Integrity of Human Milk. J Pediatr. 2016;177:140-3.
Handa D, et al. Do thawing and warming affect the integrity of human milk? J Perinatol. 2014;34(11):863-6.
Slutzah M, et al. Refrigerator storage of expressed human milk in the neonatal intensive care unit. J Pediatr. 2010;156(1):26-8.
Silvestre D, et al. Effect of pasteurization on the bactericidal capacity of human milk. J Hum Lact. 2008;24(4):371-6.
Hanna N, et al. Effect of storage on breast milk antioxidant activity. Arch Dis Child Fetal Neonatal Ed. 2004;89(6):F518-20.
Lawrence RA. Storage of human milk and the influence of procedures on immunological components of human milk. Acta Paediatr. 1999;88:14-8.
Jocson MAL, et al. The Effects of Nutrient Fortification and Varying Storage Conditions on Host Defense Properties of Human Milk. Pediatrics. 1997;100(2):240-3.
Williamson MT, Murti PK. Effects of Storage, Time, Temperature, and Composition of Containers on Biologic Components of Human Milk. J Hum Lact. 1996;12(1):31-5.
Friend BA, et al. The Effect of Processing and Storage on Key Enzymes, B Vitamins, and Lipids of Mature Human Milk I. Evaluation of Fresh Samples and Effects of Freezing and Frozen Storage. Pediatr Res. 1983;17(1):61-4.
Reynolds GJ, et al. A simplified system of human milk banking. Early Hum Dev. 1982;7(3):281-92.
Pittard WB, Bill K. Human Milk Banking: Effect of Refrigeration on Cellular Components. Clin Pediatr (Phila). 1981;20(1):31-3.

I ran out of energy in trying to add all the hyperlinks to the references, but I guess if you care you can easily find them, all available through PubMed.

Posted in Neonatal Research | Tagged , , , , , | 1 Comment

Neurodevelopmental Impairment, who decides what it is?

I have written frequently about my concerns with “NDI” as an important measure of neonatal outcomes, indeed, it seems to be often thought of as if it were the only important measure. It has very often been included as part of a composite outcome measure “death or NDI”.

So why am I disturbed about the use of NDI as a primary outcome measure? NDI is itself already a composite measurement, including some indicator of delayed development (most commonly one of the various iterations of the Bayley Scales of Infant Development), some severity of motor disorder expected to be permanent, i.e. Cerebral Palsy, some severity of hearing loss, and some severity of visual impairment. It was a composite invented by neonatologists and follow up specialists as a way of trying to quantify the impacts of adverse cerebral impacts of prematurity. There are many problems with this, both in the actual importance of each component of NDI, and also in the permanence of the finding. For example, most infants with low scores on developmental screening tests at 2 years do not have intellectual impairment at follow up. In the follow up of the CAP trial, for example, only 18% of babies who had a low Bayley score at 18 months (version 2 MDI <70) actually had a low IQ at 5 years (WPSII <70). This is unlike CP, for which a diagnosis at 2 years is very accurate (not 100%, but appears to be about 95% PPV) as a predictor of long term motor dysfunction, but the severity of the problem can vary, especially after a diagnosis at 2 to 3 years, where about 1/3 of infants will change their classification on the GMFCS, either to a higher or a lower score. Visual and auditory impairments seem to be more permanent and invariable, but are a much smaller part of the NDI.

And, of course, combining NDI with death as part of a composite outcome implies that they are equally important, and means that an intervention which decreases death may not be found to be significant is there is an increase in low BSID scores in the survivors (for example).

Do parents of babies who are labelled as having NDI think that their infants are impaired? That is the question asked in a new publication from the follow up centres across Canada (Canadian Neonatal Follow-Up Network, CNFUN). Richter LL, et al. Parental and Medical Classification of Neurodevelopment in Children Born Preterm. Pediatrics. 2025. Over 1000 very preterm infants are involved in the study, and their parents were asked if they thought that their child had a developmental impairment when they attended a follow-up clinic appointment, but before they completed the standardised evaluation. They then had their evaluation and were classified as having no NDI or :

“to have a mild-moderate NDI if they had any 1 or more of the following: CP with GMFCS 1 or 2; Bayley-III motor, cognitive, or language composite scores 70 to 84; hearing loss without requirement for hearing devices or unilateral visual impairment. A child was considered to have a severe NDI if they had any 1 or more of the following: CP with GMFCS 3, 4, or 5; Bayley-III motor, cognitive or language composite scores <70; hearing aid or cochlear implant; or bilateral visual impairment.”

As this table shows, there was poor agreement between what the parents thought, and what the standardised evaluation stated. Most of the disagreements were parents considering their infants to not be impaired, or to be less impaired than the standard classification. There were 185 infants with “severe NDI” according to the definition above, only 23 parents thought their child was severely impaired, in contrast, among the 596 with no NDI, there were 11 parents who found their child to have severe impairment, and 104 thought they had mild-moderate impairment.

Some of the details of the analyses are quite interesting, for example, the small number of infants with serious CP, GMFCS 4 or 5, were mostly considered to have moderate or severe impairment by parents. The cognitive scores of infants who agreed that their infant, with CNFUN defined severe NDI, had at least moderate impairment were lower (median 70) than those who disagreed (median 80).

Many problems faced by families with ex-preterm infants are not captured by “NDI”. This is reflected, I think, by those parents who thought their child was impaired despite not satisfying CNFUN definitions, such infants were much more likely to be using technology at home, and more likely to have been referred for occupational therapy, or to see a psychologist or other therapist. Needing re-hospitalisation also made parent more likely to agree that their infant had an impairment.

Because we haven’t measured some of the things that impact families, such as behavioural disturbances, feeding problems, and sleep disruption, we really don’t know if they are affected by any of our NICU interventions. It wouldn’t surprise me if some interventions, ranging from postnatal steroids to skin-to-skin care or light cycling, might have major impacts on those outcomes. We just don’t know.

What should we do about findings such as these newly published data, and others from the Parents’ Voices project? Defining a single ‘yes or no’ outcome variable is the old-fashioned way of designing research and determining the benefit of an intervention. There are much better ways of comparing outcomes between groups, ways which can take into account the variety of outcomes, and the preferences of parents. It takes some extra work to define the kind of ordinal outcomes which reflect the values of parents and the relative importance of each component, but that is hugely preferable to using composite outcomes which implicitly value each component as being equivalent. Being dead, having a Bayley Cognitive composite of 69, having severe visual loss all qualify as “dead or severe NDI”, but the implications are enormously different.

In the future outcomes we measure should focus on how infants function, and should recognize that the answer to the question “how is your child doing?” is not a dichotomous choice.

Posted in Neonatal Research | 1 Comment

Not neonatology: trip to Panama

Just back from a week in Panama, organised by “Wings“, mostly in Darién province at the extreme east of the country. A really amazing experience, with over 300 bird species seen, including just over 200 “lifers” for me. On my other blog, I have posted several posts, a trip report with many photos.

As a taster, this is a Collared Aracari, and below King Vultures.

Collared Aracari
King Vulture, immature and adult
Posted in Neonatal Research | Leave a comment

Term and Near-Term babies at Risk of Early Onset Sepsis: Best Practice?

Sepsis caused by organisms transmitted during, or shortly before, labour is relatively uncommon in higher income countries with GBS screening programs; the incidence is now around 0.3 cases per 1000 live births, but varies, it may be as high as 10 times that, or as low as 0.1/1000. GBS remains the commonest organism, with E Coli close behind, followed by Strep viridans, and then a host of other bugs. Early identification and treatment is the key to good outcomes.

We can think of this in 2 ways; a risk of 1 case for every 2000 births is a very low risk. On the other hand it is an identifiable 72 hours of life with the highest incidence of sepsis that one will ever have! In general terms, about half of the cases have no identifiable risk factors, and about half the cases occur in babies with a risk factor or a combination of risk factors: chorioamnionitis (or “triple-I”); maternal fever; prolonged rupture of membranes; obstetric procedures; a previous baby with GBS.

Various groups have recommended evaluating risk factors at birth in order to determine the clinical approach to be taken. Usually that means that if the risk is considered high enough, then cultures are taken and antibiotics are given, even if the infant is currently well, or perhaps strict frequent physical examination. There may also be an intermediate group, who qualify for more intensive clinical monitoring, or perhaps for lab screening tests, such as a CBC and/or acute phase reactants, CRP, procalcitonin and others.

We should think of the risk-factor-based approach just as we would any screening test for determining interventions in healthy patients. What is the sensitivity, and specificity, and, depending on the incidence in your population, what are the negative and positive predictive values, of the risk factor combination?

Risk factors are often grouped in differing ways, which makes determining sensitivity etc complex, but given that about half of septic babies lack identifiable risk factors, usually sensitivity is around about 50%.

One study from the Netherlands compared the sensitivity of the Kaiser calculator with the 2021 NICE guidelines and the most recent Dutch guidelines. (Snoek L, et al. Neonatal early-onset infections: Comparing the sensitivity of the neonatal early-onset sepsis calculator to the Dutch and the updated NICE guidelines in an observational cohort of culture-positive cases. EClinicalMedicine. 2022;44:101270), in that study, at 4 hours after birth, the Kaiser calculator had a sensitivity of 36% compared to 50% for the Dutch, and 55% for the NICE framework, among the 88 patients eventually determined to have culture positive EOS. Other publications have shown sensitivity between 37% and 76% of the Kaiser calculator, with the CDC approach being reported to have a sensitivity between 50% and 100%. See reference list below. As the babies were followed, in the Snoek study, progressively more of them developed clinical signs satisfying treatment thresholds, so that 72% were eventually “screen-positive” by Dutch and NICE guidelines, compared to 61% by the calculator. What the publication never explains is how the remaining 24 babies were picked up. Did they become sick after 48 hours? Were they screened outside of the Dutch guidelines, and if so on what basis? Did they remain asymptomatic despite positive cultures?

Overall, the calculator and other risk-based systems, in very large cohorts with current EOS incidence, will, therefore, have around about 50% sensitivity within the first 4 hours after birth. In general, more sensitive risk-factor-based approaches will, of course, screen and treat many more infants. But specificity is more variable and difficult to calculate unless you are sure about the population frequency, but if we work with an incidence of 0.5 per 1000 live births, at term or near, and that risk factor screening tags 4% of babies as being at risk, then the specificity of screening calculates to 96%, with a PPV of 0.6 and a NPV of 99.9.

As always in epidemiology, if the risk factors are broader then the sensitivity will increase, and specificity will decrease. this is nicely illustrated by this figure from one of the studies of Escobar and Puopolo (and others), using just maternal risk factors. In this figure, higher risk at birth identified fewer cases (lower sensitivity) but had much higher specificity and fewer false positives. On the other hand it also demonstrates that the majority of EOS cases are NOT identified by their risk factors, sensitivity is <50% in the columns with intermediate and high risk of sepsis.

If we could define risk factors (or another screening test) that were more accurate, with a higher sensitivity AND specificity, then we could reduce both unnecessary intervention and false negatives. But as of now, about half of EOS cases will be among the babies that have a positive screening test result, and about half will be among the remaining infants, whether that screening test is the calculator or another one of the recent risk-factor-based approaches. The older standards were less specific, especially among babies born after chorioamnionitis, without being much more sensitive, which is why the calculator has led to a reduction in unnecessary treatment in most cohorts where it has been introduced.

For a screening test for an uncommon condition, a sensitivity of about 50% is pretty poor, and for any other condition we would probably abandon that approach! Imagine if our screens for hypothyroidism, or for PKU only had a sensitivity of 50%.

Can we make risk-factor-based screening more sensitive AND more specific?

The Kaiser EOS calculator, if strictly applied, usually ends up with about 4% of all babies receiving antibiotics. I have been trawling through recent literature (and there is a lot of it!) and, of unselected populations, the range of babies recommended to start or strongly consider antibiotics, is between 1 and 5%. That obviously also depends on the prevalence of the risk factors, one very important one being chorioamnionitis. Many guidelines that I have found steer clear of trying to define that phenomenon. They either leave it to individuals to decide whether there is a reliable history of chorio, or avoid the term and use maternal temperature instead. That widely-used Kaiser calculator doesn’t mention chorio, just highest maternal temperature. If you enter data into that calculator, the presence of a maternal fever of 39 degrees increases the risk of EOS by about 40 fold compared to a temperature of 37, depending on the other risk factors.

In contrast the NICE framework in the UK refers to a “clinical diagnosis of chorioamnionitis” but without diagnostic criteria; in full, the NICE list of antenatal risk factors is

  • Red flag risk factor:
  • Suspected or confirmed infection in another baby in the case of a multiple pregnancy.
  • Other risk factors:
  • Invasive group B streptococcal infection in a previous baby or maternal group B streptococcal colonisation, bacteriuria or infection in the current pregnancy.
  • Pre-term birth following spontaneous labour before 37 weeks’ gestation.
  • Confirmed rupture of membranes for more than 18 hours before a pre-term birth.
  • Confirmed prelabour rupture of membranes at term for more than 24 hours before the onset of labour.
  • Intrapartum fever higher than 38°C if there is suspected or confirmed bacterial infection.
  • Clinical diagnosis of chorioamnionitis

The NICE guidance recommends cultures and antibiotics if you have one “red flag” or any 2 other risk factors. Which includes those antenatal factors, as well as the following list of clinical indicators

Red flag clinical indicators:

  • Apnoea (temporary stopping of breathing)
  • Seizures
  • Need for cardiopulmonary resuscitation
  • Need for mechanical ventilation
  • Signs of shock

Other clinical indicators:

  • Altered behaviour or responsiveness
  • Altered muscle tone (for example, floppiness)
  • Feeding difficulties (for example, feed refusal)
  • Feed intolerance, including vomiting, excessive gastric aspirates and abdominal distension
  • Abnormal heart rate (bradycardia or tachycardia)
  • Signs of respiratory distress (including grunting, recession, tachypnoea)
  • Hypoxia (for example, central cyanosis or reduced oxygen saturation level)
  • Persistent pulmonary hypertension of newborns
  • Jaundice within 24 hours of birth
  • Signs of neonatal encephalopathy
  • Temperature abnormality (lower than 36°C or higher than 38°C) unexplained by environmental factors
  • Unexplained excessive bleeding, thrombocytopenia, or abnormal coagulation
  • Altered glucose homeostasis (hypoglycaemia or hyperglycaemia)
  • Metabolic acidosis (base deficit of 10 mmol/litre or greater)


Why have I come back to this issue after 2 other recent blog posts? It’s because of the publication of new Swiss guidelines (Stocker M, et al. Management of neonates at risk of early onset sepsis: a probability-based approach and recent literature appraisal : Update of the Swiss national guideline of the Swiss Society of Neonatology and the Pediatric Infectious Disease Group Switzerland. Eur J Pediatr. 2024. That guideline includes the following table which includes antenatal risk factors.

As you can see the Swiss don’t refer to chorioamnionitis at all, but use the ACOG/AAP term of triple I, which stands for “Intrauterine Infection or Inflammation…. or both” and give some criteria for that diagnosis. My only quibble with this list is that Elective Caesarean without labour or ruptured membranes has a zero, rather than low, probability of EOS, and should be listed as a contraindication to doing a sepsis workup!

Once an at-risk patient is identified, the Swiss guideline recommends Serial Physical Examination, SPE. And here it starts to get questionable, for me. For the Swiss guideline that apparently simply means vital signs q4h, which they specify as being heart rate, temperature, peripheral perfusion and skin colour. Exactly how this is meant to be done, by whom, and with what criteria, are not specified; what exactly is meant by “peripheral perfusion”, for example? If this is going to be done for thousands of babies, it is vital that it is clear and unambiguous. As I understand it, babies with a low probability of sepsis should have routine care, which, I suppose therefore, must exclude q4h vital signs.

However, the references that they give to support the SPE approach have all used very different approaches. For example, here is a figure from an Italian group (Berardi A, et al. Serial physical examinations, a simple and reliable tool for managing neonates at risk for early-onset sepsis. World J Clin Pediatr. 2016;5(4):358-64) that have published about their approach a few times. As you can see, they immediately culture everyone with “chorioamnionitis” or intrapartum fever, and, for the remaining at-risk infants, do the SPE on multiple occasions, “in turn by bedside nursing staff, midwives and physicians” using a purpose built form, shown here. The methodology and timing of the SPE is vitally important.

The group from Stavanger, Norway, (Vatne A, et al. Reduced Antibiotic Exposure by Serial Physical Examinations in Term Neonates at Risk of Early-onset Sepsis. Pediatr Infect Dis J. 2020;39(5):438-43) have also published widely on this issue, their protocol is for hourly structured examination, of infants deemed at risk, by the nursing staff for 24 to 48 hours, recorded on a special form, and taking note of the signs shown in the figure below. As far as I can tell, they did not routinely treat infants with antenatal chorioamnionitis or intrapartum fever, but used the same approach for them also.

Among the 8000 babies analyzed using this approach, there was 1 true, culture-positive, sepsis (and supposedly 55 “culture-negative sepsis”), and about 1% of infants treated with antibiotics; with no deaths due to sepsis.

The group in Stanford Joshi NS, et al. Clinical Monitoring of Well-Appearing Infants Born to Mothers With Chorioamnionitis. Pediatrics. 2018;141(4):e20172056 have a different approach again, they directly observed the baby with risk factors for 30 minutes, then, in an initial phase, admitted them to the level 2 nursery on a cardiac monitor with q4h vital signs for 24 hours minimum, then transferred to maternal/infant care thereafter. In the second stage of their project, the infants went directly to maternal/infant care, and had “vital signs and clinical nursing assessments with documentation in the electronic health record at 0, 30, 60, 90, and 120 minutes and then every 4 hours for the first 24 hours after birth. After 24 hours of age, vital signs and clinical nurse assessments were performed every 8 hours until discharge. Nurse staffing ratios for all neonates in couplet care in the postpartum unit were 1:3”. The Stanford experience in that 2018 publication only included 277 infants, all of whom were exposed to “chorioamnionitis”. They showed that their approach was safe, but there were 0 cases of culture-positive sepsis, so again, I could suggest that ANY approach would have been safe!

In a much more extensive report from Stanford from 2020, there were 20,000 infants, with 6 cases of GBS sepsis, no other culture positive sepsis (and one probable CSF contaminant with 2 coagulase negative staph species). It seems that all their newborn babies are monitored in the same way, regardless of any risk factors, with the vital signs recorded at those same intervals and a 1:3 nursing ratio, for the 1st 24 hours of life. Despite the reliance on SPE, they still treat about 4% of all the term and near-term infants with antibiotics; or about 150 babies treated for every true case.

In Lausanne (Duvoisin G, et al. Reduction in the use of diagnostic tests in infants with risk factors for early-onset neonatal sepsis does not delay antibiotic treatment. Swiss Med Wkly. 2014;144:w13981) they have done things differently again, in 2 different time periods they first did the following “Vital signs were checked by midwives every 4 hours during the first 24 hours and every 8 hours during the next 24 hours in all infants with risk factors for EOS” during that period they also did CBC and CRP in all the babies with risk factors. During period 2 “in addition to the monitoring of vital signs by midwives, infants with risk factors for EOS were examined by paediatric residents every 8 hours during the first 24 hours” but they stopped doing the routine CBC and CRP.

In Lausanne, there were 11,000 babies in their report, with only 3 proven infections, which were all in the 6000 infants in the first period. None among the 5000 in the second period. Which is an example of why these decisions are so difficult to make. In the second phase of their study they could have done nothing at all, just sent all the babies home without ever examining them, and treating none of them with antibiotics, to make a silly suggestion, but as there were no culture-positive cases of sepsis in 5000 babies during that period, the results would, I propose, have been just as good! About 2% of all babies born received antibiotics.

When neonatal sepsis is so uncommon, at less than 0.5 cases per 1000 live births, we need enormous studies to prove the safety of one approach over another.

In order to really determine the risk/benefit balance of any particular approach, we need to know the real risks of each approach, with the major benefit, of course, being that we prevent death or serious illness from EOS. But, even that benefit is difficult to determine.

Imagine a well-appearing baby with enough risk factors to receive immediate antibiotics by the EOS calculator, but the baby is treated in a centre using an SPE approach without immediate antibiotics. The infant then develops a serious illness at 6 hours of age, and has a culture and antibiotics; how should that be counted? Is that a success of the SPE approach, or an unnecessary clinical deterioration? We cannot just look at sepsis related mortality, as that is extremely rare, and most of the cohorts have between 0 and 1 death.

If we use a risk-factor based approach, which identifies around 6% of babies as being at risk of EOS, and includes about half of the cases, in a population with an EOS incidence of 0.4 per 1000 the incidence among the 6% at risk will be about 0.3%; among the remaining 94% of infants EOS incidence will be 0.2 per 1000. The Kaiser calculator refines this further, dividing into a category at highest risk, including about a quarter the EOS cases. Which means that there are 1.5% of babies at highest risk, and among them the incidence of EOS approaches 1%.

The question then becomes almost philosophical, should we culture and treat those at highest risk, with the hope of preventing deterioration, but with the knowledge that 99% of treated babies will not be infected, and will have the pain, the adverse impacts on parental infant interaction, and the dramatic, long-lasting effects on the intestinal microbiome.

We also cannot eliminate the risk of EOS and a later deterioration among screen-negative babies, so must ensure adequate parental education, and some sort of surveillance of even low-risk babies.

Perhaps the best approach would be to ensure intensive clinical surveillance of the 1 to 2% of infants who are highest risk, in a way which has the least adverse impact on the infant and the family; an intermediate form of surveillance for the 4 or 5% who at intermediate risk, while ensuring that the system is fail-safe, and babies are not missed. But we must not forget the remaining 95% of infants among whom half of the EOS cases occur, and ensure some form of surveillance for that group with an incidence of maybe 0.1 per 1000. Whether that should include occasional nursing evaluation, parental education, increased sensitivity to the problem of the population at large, or some combination, I do not know. I don’t think it is feasible in most health care systems to follow the Stanford approach of “enhanced clinical monitoring” in all newborns, with frequent vital signs, nursing examinations and 1:3 nurse to baby ratio. Creating the right safety net for low risk babies will detect as many cases as screening or SPE among the high risk.

Additional References (without URL links, I ran out of steam!)

Gyllensvard J, et al. Antibiotic Use in Late Preterm and Full-Term Newborns. JAMA Netw Open. 2024;7(3):e243362.
Kuzniewicz MW, et al. Update to the Neonatal Early-Onset Sepsis Calculator Utilizing a Contemporary Cohort. Pediatrics. 2024;154(4).
Dimopoulou V, et al. Antibiotic exposure for culture-negative early-onset sepsis in late-preterm and term newborns: an international study. Pediatr Res. 2024.
Harrison ML, et al. Beyond Early- and Late-onset Neonatal Sepsis Definitions: What are the Current Causes of Neonatal Sepsis Globally? A Systematic Review and Meta-analysis of the Evidence. Pediatr Infect Dis J. 2024.
Achten NB, et al. Sepsis calculator implementation reduces empiric antibiotics for suspected early-onset sepsis. Eur J Pediatr. 2018;177(5):741-6.
Coleman C, et al. A comparison of Triple I classification with neonatal early-onset sepsis calculator recommendations in neonates born to mothers with clinical chorioamnionitis. J Perinatol. 2020.
Morris R, et al. Comparison of the management recommendations of the Kaiser Permanente neonatal early-onset sepsis risk calculator (SRC) with NICE guideline CG149 in infants >/=34 weeks’ gestation who developed early-onset sepsis. Arch Dis Child Fetal Neonatal Ed. 2020:fetalneonatal-2019-317165.
Goel N, et al. Screening for early onset neonatal sepsis: NICE guidance-based practice versus projected application of the Kaiser Permanente sepsis risk calculator in the UK population. Arch Dis Child Fetal Neonatal Ed. 2019.
Sloane AJ, et al. Use of a Modified Early-Onset Sepsis Risk Calculator for Neonates Exposed to Chorioamnionitis. J Pediatr. 2019;213:52-7.
Gong CL, et al. Early onset sepsis calculator-based management of newborns exposed to maternal intrapartum fever: a cost benefit analysis. J Perinatol. 2019.
Carola D, et al. Utility of Early-Onset Sepsis Risk Calculator for Neonates Born to Mothers with Chorioamnionitis. J Pediatr. 2017.

Posted in Neonatal Research | 1 Comment

Surfactant plus Budesonide? Not a PLUSS for outcomes

Many of us were already aware of the general direction of the results of this trial, a multi-centre RCT of surfactant with or without budesonide (Manley BJ, et al. Intratracheal Budesonide Mixed With Surfactant for Extremely Preterm Infants. JAMA. 2024), disappointingly negative results, with no impact on survival or on any index of lung injury.

In order to put this in context, I have just been trying to search the literature for all the prior trials, knowing that there were 2 trials from Yeh’s group in Taiwan, the 2nd being a very positive trial performed in 3 centres, 2 in Tawian and 1 in Chicago. There have been several systematic reviews published, the most recent of which was published in a journal called “Respiratory Medicine and Research” and dates from June this year. In that recent SR I saw many more trials than I was aware of, and I must say, this foray was extremely disturbing. There is an enormous amount of very questionable research being published and/or referenced. Below is the Forest plot which includes both studies of early budesonide inhalation, as well as studies of budesonide instillation with surfactant. Bassler 2015 is of course, the NEUROSIS trial, which showed slightly higher mortality with budesonide inhalation, but less BPD among survivors. This Forest plot shows BPD among survivors from all the early inhaled/instilled steroid trials.

I tried to find some of the other trials. Heo2020 is actually a retrospective chart review, with no consent, and no mention of randomization. Wen2016 is from “World Latest Medicne Information” (sic) and impossible to find. Among other studies included in the SR were Cao2018, published in 2018 in the “Journal of Pediatric Pharmacy” which, as far as I can tell, stopped production in 2017, Zhou2019 was also apparently published in the same ex-journal in 2019. Kou2019 is also untraceable, with the link in the reference list of the SR leading to a completely different article.

Some of the real articles in the SR include Pan2017 (Pan J, et al. [Clinical efficacy of pulmonary surfactant combined with budesonide for preventing bronchopulmonary dysplasia in very low birth weight infants]. Zhongguo Dang Dai Er Ke Za Zhi. 2017;19(2):137-41, the abstract is in English, for the rest I used Google translate) 15 babies were randomized to budesonide with their surfactant, and 15 to surfactant alone. The incidence of BPD (defined as needing oxygen at 28 days) was 1 vs 6, which the authors claim was “significant and statistically significant”; of course, it is far from being statistically significant, by Fisher Exact, p= 0.08. Ping2019 also seems to exist, it appears to be a report of a prospective RCT, and seems to show a reduction in moderate to severe BPD, which is undefined in the abstract, which is all I can get hold of. Ke2016 is a 4 group RCT, with 46 subjects per group, one of the groups being budesonide plus surfactant, and one being surfactant alone, the others being budesonide by inhalation using 2 different methods. Yao2021 was published in “American Journal of Translation Research” and purports to be a trial using nebulized budesonide.

It is impossible to know how much of this research was actually ever performed, how much is fictional, how much was invented by AI, how much is re-quoting of numbers from other reviews without adequate checking of the original.

This is very disturbing to anyone wanting to provide high-quality medical care, based on reliable evidence, to sick babies. One way to deal with all this is that, when we write or review a Systematic Review for publication in a reliable journal, we must insist on only prospectively registered trials, with publicly available data. We will have to perform more independent IPD meta-analyses, where the review authors share, and can examine each others anonymised individual data, if we are to be able to rely on the best evidence for our patients.

The background, therefore, to PLUSS, was of a single high-quality trial of moderately large size (n=265 total), Yeh 2016, which doesn’t even appear in the Forest plot above (probably because they did not publish the incidence of BPD among survivors) but which showed a substantial decrease in oxygen needs at 36 weeks, from 50 to 29 %, which was due to a reduction in both moderate BPD (31 to 20%) and severe BPD, (19 to 9%). The RR for “BPD or death” was therefore 0.63, 95% CI 0.5 to 0.8.

If we recalculate the primary outcome of PLUSS as a relative risk of the primary outcome, it was 0.96, with 95% CI of 0.9 to 1.03. In other words, not only was there likely no benefit, with the residual possibility of a small benefit or a minor harm, but no overlap with the CI of the previous trial either. Also of note, there was no impact of budesonide in any of the prespecified subgroups either. Only a post hoc subgroup analysis of the infants with an FiO2 >50% at treatment showed some hint of a benefit.

So why is PLUSS so different to Yeh et al? PLUSS used a different surfactant (poractant rather than beractant in the previous study) had more immature infants, (average GA 25.6 weeks compared to 26.5 weeks), who needed less oxygen on average at intervention (average of 30%, compared to a mean FiO2 in Yeh et al of 60%) and had a maximum of 2 doses of budesonide, compared to a maximum of 6 doses in Yeh’s study.

Yeh had lower mortality, and less BPD among survivors than PLUSS, in the control groups. Mortality in the controls was 16% in Yeh’s trial, and 20% in PLUSS, while BPD in survivors was 55% vs 72%.

In PLUSS, infants who had already received one dose of surfactant were eligible to be randomized at the time of the second dose, if they were still < 48 hours of age. In fact about half of the infants (in both groups) had already received a dose of surfactant without budesonide prior to being randomized. But one of the pre-planned subgroup analyses was restricted to infants getting their first dose of surfactant, and that also showed no benefit of budesonide, as you can see from the figure above, “Surfactant therapy” means previous surfactant without budesonide prior to randomization. So the ‘No’ results are for babies getting their first dose, and there is an almost identical primary outcome (Death or BPD) among these babies. Because prior surfactant treatment was permitted in PLUSS, the average age of administration of the study drug was later, at 5 hours, compared to 2 hours in Yeh et al.

The new study was much larger than Yeh, and allowed surfactant via LISA/MIST, compared to all the babies being intubated in the previous trial. The differences in outcome, with this new high-quality trial showing no benefit with budesonide, could be because of the differences I have discussed, or some other currently uncertain effect, or simply because of random variation. As I already mentioned, the confidence intervals of the primary outcome of PLUSS do include the possibility of a small benefit of budesonide instillation, but do not overlap with the CI of the trial of Yeh.

Post hoc subgroup analyses, such as the one performed here of above and below 50% O2, should always be treated with great suspicion. Introducing new, unplanned analyses after the data have accumulated is very risky, and inflates the possibility of a type 1 error, suggesting a difference which is not real. In addition, the interaction term was not significant, which means that, although the subgroups with FiO2 above and below 50% had different results, there is not a statistically significant difference between those subgroups. For all these reasons, budesonide instillation with early surfactant cannot be recommended based on this trial, even for babies with very high oxygen requirements. It may be that another trial enrolling only infants with very high oxygen requirements could possibly show a benefit, but that is a possibility that can only be answered by doing such a trial.

The BIB trial, currently underway in the USA does not, however, restrict enrolment to infants with a high FiO2, and is in many ways similar to PLUSS, with the main difference being a restriction to the first dose of surfactant, and the inclusion of infants up to 28 weeks and 6 days gestation. BIB has completed enrolment, and, you never know, perhaps we will see some results at PAS next spring. Once we have those results, we should be able to make an evidence-based decision about routine budesonide instillation with surfactant, and we will be able to ignore all the very small, potentially unreliable or perhaps fictitious trials in the current Systematic Reviews. Perhaps, if someone has the time and the funding, we could do an IPD meta-analysis, and only those investigators who are able to supply the individual patient data, from pre-registered trials, could collaborate to give the best possible information for future babies, and for future trials.

Posted in Neonatal Research | Tagged , , , , | 2 Comments