Clinical examination and treatment of septic shock. Not neonatology.

Andromeda-Shock-2 Investigators for the Andromeda Research Network SSoAR, et al. Personalized Hemodynamic Resuscitation Targeting Capillary Refill Time in Early Septic Shock: The ANDROMEDA-SHOCK-2 Randomized Clinical Trial. JAMA. 2025.

This is a very interesting trial evaluating the usefulness of clinical assessment of the circulation in adults with septic shock in a large international multicentre trial. Patients with suspected sepsis, who required norepinephrine after 1 litre fluid bolus, and had an elevated serum lactate, were randomized. A standardized method of measuring capillary refill time was agreed upon,

CRT was assessed by applying firm pressure to the ventral surface of the distal phalanx of a finger, using a glass microscope slide. The pressure was increased until the skin was blank, maintained for 10 seconds, and then released. The time required to return to the normal skin color was measured with a chronometer and a refill time longer than 3 seconds was defined as abnormal

and the algorithms were activated if the cap filling time was abnormal in the CRT-PHR (cap refill time- personalized haemodynamic resuscitation) group.

As you can see, if the CRT was >3 seconds, you first check the pulse pressure, and if it is >40 mmHg, then you check the diastolic BP, which may lead to increasing norepinephrine dose; the next stage may be to give more fluid to see if there is a response, and then progress to bedside echocardiography, which may lead to specific treatments, or more fluid, or eventually to low dose dobutamine.

The control group had “standard care”, CRT was recorded but the algorithm was not followed.

The primary outcome was a hierarchical composite: (1) all-cause mortality within 28 days, (2) duration of vital support (vasoactives, mechanical ventilation, and kidney replacement therapy) truncated at day 28, and (3) length of hospital stay truncated at day 28.

The trial was analyzed by the Win Ratio. 1400 patients were randomized, as it was not a paired study (one way of using the Win Ratio), but individually randomized, they stratified the patients by APACHE score, then, within strata, every patient in group 1 was compared with every patient in group 2, to determine if they won or lost. There were therefore 244 000 paired comparisons. The CRT-PHR group won 49% of the comparisons, compared to 42% of the control, usual care group. The remaining 9% were exact ties.

This exceeded the limits for statistical significance; mortality was identical at 26.5%, but there were more ICU free days, and shorter hospital stays in the CRT group. The table of interventions shows that more of the CRT group received vasopressin, more received dobutamine, and they received less fluid; at 6 hours of treatment, their CRT was shorter, and serum lactate was lower.

The analysis is illustrated below in the 2 strata of the Apache Score (a higher score predicting higher mortality); this showed a greater difference in the sicker patients.

I found this fascinating. In terms of the intervention being investigated, trial design, and analysis methodology.

Many of my readers will know of my concerns about the way we analyse composite outcomes in neonatology. Comparing “death or BPD”, “death or NDI”, “death or hiccups”, between randomized groups, as if they were of equal importance, and as if we were always sure that they would change in the same direction with an intervention. This trial is one of a growing trend to using hierarchical composites, with death being given the greatest weight in the analysis, followed by other clinical outcomes in descending order of importance. Clearly an example to be followed in neonatology.

As for the intervention, I remember from my years as a staff in the PICU that one of the best predictors of a good outcome after major cardiac surgery is that the nurse tells you the infant’s toes are warm! It is a better predictor of survival than measuring cardiac output, for example. (Here is a recent example Cruz G, et al. Capillary Refill Time and Serum Lactate as Predictors of Mortality and Postoperative Extracorporeal Membrane Oxygenation Requirement in Congenital Heart Surgery. Children (Basel). 2023;10(5)).

I don’t think that capillary filling is quite as useful in preterm infants, kept in incubators, with their immature control of vascular tone, but there is still some correlation between cap filling and blood flow measured by doppler in the SVC. (Osborn DA, et al. Clinical detection of low upper body blood flow in very premature infants using blood pressure, capillary refill time, and central-peripheral temperature difference. Arch Dis Child Fetal Neonatal Ed. 2004;89(2):F168–73). As you can see below there is a lot of scatter, but a higher SVC flow is correlated with shorter CRT.

Combining such clinical signs with the direction of change in serum lactate (the absolute value doesn’t help much in the first couple of days as it is often high after birth), urine output (also not much use immediately after birth), level of activity etc, seems to me to be likely to be important in determining treatment in septic babies also. But we have very few good randomized trials of treatment approaches in septic newborns.

This trial gives us some pointers of how we could reasonably design such a trial, with a structured algorithm of interventions, including clinical pointers and targeted functional echocardiography in some patients, and how to design and analyse the primary outcome. We could develop a consensus algorithm (it couldn’t really be evidence-based) and test against usual care, with a hierarchical composite outcome including death and brain injury and duration of intensive care support, for example.

Posted in Neonatal Research | Tagged , , , , | Leave a comment

Lactoferrin supplementation does not prevent late-onset sepsis in the preterm… or is it more complicated that that?

Many of my readers will remember the impressive results of the high-quality study by Paolo Manzoni, Manzoni P, et al. Bovine lactoferrin supplementation for prevention of late-onset sepsis in very low-birth-weight neonates: a randomized trial. JAMA. 2009;302(13):1421–8) which showed that routine supplementation of preterm infants with bovine lactoferrin (bLF) dramatically reduced late-onset sepsis.

Many of us were quite excited with this finding, and launched our own studies, I performed a pilot in my NICU, hoping to use the data to get funding for a confirmatory trial (Barrington KJ, et al. The Lacuna Trial: a double-blind randomized controlled pilot trial of lactoferrin supplementation in the very preterm infant. J Perinatol. 2016;36(8):666–9), and at about the same time ELFIN was started in the UK. LIFT then took place in Australia/NZ, and, more recently, a Canadian version of LIFT was performed to increase study numbers and power.

Unfortunately all of the large confirmatory studies have been completely null, without a hint of a benefit. Including LIFT-Canada, which is in submission so I won’t go into any details, but I can say that we did not show a benefit of bLF.

There continue to be some trials which do seem to show an effect of bLF, including this very new trial (Plaza-Astasio V, et al. Preventing Sepsis in Preterm Infants with Bovine Lactoferrin: A Randomized Trial Exploring Immune and Antioxidant Effects. Nutrients. 2025;17(19)). Just over 100 VLBW infants were randomized to bLF supplementation or control, prior to 72 hours of age, and followed for LOS, as well as lab tests of antioxidant and immunologic effects. LOS was defined as “Laboratory confirmed sepsis” after 72 hours. The authors followed the NeoKisses definitions, which, as far as I can tell, include so-called “clinical sepsis” without a positive blood culture, but in the supplementary materials of this new study there are the same number of organisms listed as the episodes of sepsis, that is 11 in the bLF group and 21 in the placebo group. In other words they showed a reduction in culture-positive sepsis.

The authors note that their breast feeding rates were lower than some of the other large trials, at around 75% compared to over 90% in the large trials, and suggest this as a possible explanation for the difference of their results compared to the larger RCTs. That seems to me doubtful, if bLF was only effective in formula fed babies, then they could not have shown such a large decrease. Ochoa and her collaborators have published an IPD meta-analysis of the VLBW infants enrolled in their 2 trials (see below) which suggested that the impact of bLF was much greater among babies with low human milk intake (11% bLF, 21% controls). Although they do indeed show that, what is strange is that their analysis shows that LOS was much more frequent in babies with a high human milk intake, either with bLF (35%) or in their controls (39%), which is hard to understand. Another secondary analysis, of the data from ELFIN and the original Manzoni trial, showed similar reductions in LOS by bLF among breast-milk fed and formula fed, or mixed feeds babies. The reductions in LOS by bLF were very small and consistent with random variation in ELFIN. The interaction term was not significant, suggesting that the reduction in LOS was similar regardless of feed type.

The authors of the new study also note that their control frequency of sepsis was high, which is again true, a 40% incidence of LOS in a group of infants with a mean GA of 30 wks is extremely high. Having a higher baseline frequency of an abnormality will generally tend to make the impact of an intervention seem greater (see my recent posts on regression to the mean), but that doesn’t mean that such an impact would disappear completely when the incidence is lower.

One other difference that they do not mention is the source of bLF; the newly published trial used DicoPharm, just as did Manzoni. Akin’s study used the same product and also showed a reduction in culture-positive sepsis. Theresa Ochoa in her 2 studies used a product from Tatua ™ in the first study, derived from pasteurized milk, which had no effect on culture-positive sepsis, and a product from Friesland Campina in the other trial, which seems to be extracted by freeze-drying and not heat treated. The second trial showed a decrease in culture positive sepsis (from 11 to 8%, NS) not shared by the first study. Other studies either don’t mention the source of the bLF (Kaur et al) or I cannot obtain them as they aren’t in PubMed, or any other database that I can access (Liu, Tang, Dai). Another new study, from Egypt, randomized only formula fed infants (Ellakkany N, et al. Influence of bovine lactoferrin on feeding intolerance and intestinal permeability in preterm infants: a randomized controlled trial. Eur J Pediatr. 2024;184(1):30). They had an enormously high rate of LOS in the controls (60%) and a lower, but still extremely high, incidence in the bLF treated infants, 43.3%. The preparation they used was produced in Egypt, and I can’t find any details of how it was prepared. Finally I found one other trial, performed in Pakistan in infants with an average GA of about 34 weeks, (Ariff S, et al. Evaluation of Bovine Lactoferrin for Prevention of Late-Onset Sepsis in Low-Birth-Weight Infants: A Double-Blind Randomized Controlled Trial. Nutrients. 2025;17(11)). with a product from Hilmar in the USA which appears to have been prepared from freeze-dried milk (and perhaps not heat treated), they had an 8% incidence of culture-positive LOS in controls, and a combined 6% in the 2 treatment groups (with 2 different doses of bLF); total n of about 300.

There are lab studies showing that pasteurization decreases the biologic activity of bLF. bLF is degraded by heat treatment, it aggregates, and bind iron less well (Remadevi R, Mead D. A Study on the Bioavailability of Lactoferrin under Pasteurisation at Different Conductivities and Solid Contents. Journal of Food Research. 2025;14(2)). It could well be that heat-treatment of milk, prior to extraction of bLF, causes sufficient structural changes in the molecule for it to no longer have the multiple beneficial effects on bacterial proliferation that have been documented. This might be one reason why donor human milk (which is always pasteurized, usually by Holder pasteurization, the only method approved by HMBANA) is less effective at decreasing NEC than Mothers own Milk.

There are, however, known to be major differences in the biologic activity of different sources of bLF. One study examined 10 different bLF sources, and compared several different aspects of structure and activity between them, as well as their own bLF and human LF (Lonnerdal B, et al. Biological activities of commercial bovine lactoferrin sources. Biochem Cell Biol. 2021;99(1):35–46). There were major differences between bLF sources. As one example, they examined uptake of the LF by Caco-2 cells, and whether the LF transported iron into those cells

The details of what that means are not that important here (partly because my own understanding is limited, but also because it isn’t certain what this particular aspect has to do with their biologic effect of decreasing infections), but what this does show is that different sources of bLF are extremely different. They also found very variable degrees of contamination of the bLF product with other proteins, the Hilmar product, as one example “contains a relatively low concentration of Lf and relatively high concentrations of a-S1-casein, a-S2-casein, and J domain- containing protein”, whereas the Dicopharm product had lots of LF and relatively less of the other proteins.

I think, before we give up completely on bLF supplementation as a potential way to decrease LOS in the preterm, there is room for another study, investigating specifically the Dicopharm product, which has been consistently associated with decreases in culture-positive LOS. It may be that the story of bLF to prevent LOS still has a twist in the tale.

Posted in Neonatal Research | Tagged , , | 1 Comment

Return to the Regression to the Mean

After my post on regression to the mean, and its importance in studies of apnoea therapy, I was thinking of other examples. Some which have been most evident to me are those which impact on areas of medicine that I have researched myself. One example, from many years ago now, looked at the haemodynamic effects of dopamine in sick preterm infants. Seri I, et al. Regional hemodynamic effects of dopamine in the sick preterm neonate. J Pediatr. 1998;133(6):728–34.

This study was performed during the 1st 2 days of life, a period when blood pressure normally gradually increases, and when renal vascular resistance falls dramatically. These known baseline changes are an additional confounder in the results of non-controlled studies. The subjects were preterm infants with what they termed “compensated shock”, that is they had a BP between the 10th and 90th percentiles, but were oliguric (<0.6 ml/kg/h of urine) and/or had slow capillary filling. They were all given dopamine, with echographic indices performed before and after.

What you can see is that overall mean BP increased, after doses of dopamine between 2.5 and 7.5 microg/kg/min

And an index of renal vascular resistance, the pulsatility index in the renal artery, decreased

These are actually changes that you would expect over time in the first hours of life. The time difference between the 2 measurement was relatively short, at about 30 minutes, one could argue, perhaps, that the changes are too quick to just be postnatal adjustment. Maybe they were caused by the dopamine?

Interestingly, the authors also presented results after, post hoc, dividing the infants into responders (who had a >10% increase in mean BP) and non-responders.

This shows that, the “responders”, panel A, had a lower mean BP before dopamine treatment, of about 35, and it increased to about 43 afterwards. The “non-responders”, panel D, had a mean BP, before and after dopamine, of just over 40 mmHg.

This is exactly what you would see if the results are entirely due to regression to the mean. Those with lower BP than average will tend to have an increase after any treatment, including placebo. It would be surprising, in an observational study such as this, for them to have given dopamine to babies with a higher BP than average.

Having said that, dopamine will in some circumstances, I think it is clear, increase BP, probably not by much at a dose of 2.5, but there is enormous variability in dopamine kinetics (and pharmacodynamics); some infants might have an increase in BP at low doses, and some have no effect at very high doses. Dopamine is, however, an effective vasoconstrictor, and any increase in BP is entirely due to vasoconstrition in the newborn. In this study, however, both “responders” and “non-responders” had a decrease in renal vascular resistance, why would this be? As I mentioned above, renal vascular resistance is known to decrease dramatically after birth; this study, for example shows an 88% decrease in RVR over the first 2 weeks of life, most of which is in the first 2 to 3 days.

Additionally, there is no newborn animal model in which dopamine causes renal vasodilatation; indeed the “dopaminergic effect” on renal blood flow has only really been shown in healthy adult dogs! In newborn mammals, in contrast, only renal vasoconstriction has been shown with dopamine. Here is one of my studies, from many years ago (Pearson RJ, et al. Dopaminergic receptor-mediated effects in the mesenteric vasculature and renal vasculature of the chronically instrumented newborn piglet. Crit Care Med. 1996;24(10):1706–12). The piglets I used were relatively insensitive to dopamine, requiring more than 16 microg/kg/min to have a BP increase, but there was no decrease in RVR at any dose.

I also subjected the animals to an infusion of Fenoldopam, as selective agonist of vascular dopamine receptors, which showed absolutely no renal vasodilatation.

These examples demonstrate, yet again, that one has to be very sceptical about the results of observational studies of the responses to an intervention. Whenever we treat a baby who has a problem which varies in intensity, be it apnoea, low blood pressure, oxygen requirements, oliguria, or anything else that you can think of, unless you randomize and treat only half of the infants, one can never know if any changes which are seen are due to the intervention, or just regression to the mean. Babies with BP lower than average will always tend to have higher BP the next time you measure it. Babies with low urine output will always tend to have higher urine output after an interval.

Controls, controls, controls. Preferably randomized controls. They are essential for determining the impacts, efficacy and safety of our interventions.

Posted in Neonatal Research | Tagged , , , | 1 Comment

Beware of meta-analysis factories

I mentioned in my previous post, an issue with meta-analyses; there have been several I have read recently which are very problematic. They seem to be produced by groups that have little concern for the quality of their product.

This recent meta-analysis, for example, of transfusion thresholds in the preterm, stated that they found 12 RCTs (Fu et al Ann Hematol 2023). But it includes a trial (Widness JA, et al. Reduction in Red Blood Cell Transfusions Among Preterm Infants: Results of a Randomized Trial With an In-Line Blood Gas and Chemistry Monitor. Pediatrics. 2005;115(5):1299–306.) which was not a trial of differing transfusion thresholds, it was a trial of an in-line device to reduce blood sampling. It also includes data twice from the PINT trial, from the original publication and then includes them again from the long-term follow up publication. Some of the data appear to be made up, PINT never published the “hemoglobin after transfusion” data which are in the first Forest plot, and for which they give different results for the primary PINT publication and the long term outcome publication! According to their fictitious numbers, the Hgb after transfusion was almost identical in the 2 groups (124.9 vs 125.7), whereas, in reality, PINT achieved a clear difference in Hgb concentrations between randomized groups. There were 5 very small Chinese trials included in that SR/MA, only 1 of which I can access (Chen H-L, et al. Effect of Blood Transfusions on the Outcome of Very Low Body Weight Preterm Infants under Two Different Transfusion Criteria. Pediatr Neonatol. 2009;50(3):110–6). The others have no entry in PubMed, and I can’t find them when searching the Chinese national database.

This article is free access, which means that someone paid about $4600 US to put this misleading nonsense on line. Springer journals commonly publish poor quality articles under their pay-to-publish model, and really, if any peer reviewer worth his salt had read this, it is immediately evident that there are huge issues. Just to take one other minor example, they state that the ETTNO trial did not describe the means of randomization, I guess the SR authors just didn’t read the methods which actually describes them in more than the usual detail : “The random sequence was computer generated with variable block size (2-10) using the software RandList version 2.1 (DatInf)”.

They also have weighted some outcomes in a way that the small, unobtainable Chinese trials (n of between 70 and 180) have much more weight in the analysis of duration of oxygen (for example) than the large ETTNO trial (n>1000). This is presumably because of the minuscule SD of the data from those trials, for example Wang 2013 apparently showed a duration of oxygen in the liberal transfusion group of 14 days (SD2) compared to 18 days (SD3). This study of 86 babies has a 40% weight in the analysis as a result, compared to ETTNO, given an 11% weight, probably because the SD of the duration of oxygen therapy is realistic, 50 days (SD33). My guess is that the supposed SD of duration of O2 therapy in Wang, and the other trials with extremely narrow distributions, is actually an SEM, but as the articles are inaccessible there is on way to check that.

To explain further, continuous outcomes in meta-analyses are usually weighted by the inverse of the variance. This is done so that articles with more precision in their estimates (usually the larger trials) have more impact on the calculated overall mean effect. When the variance (however it is reported) is very small, then the article might have an outsized impact on the MA, which is why it is so important to be sure that the data are reliable, and that the reported variability in the data is really a SD, and not a SEM.

If the analyses were redone, giving appropriate weight to the larger trials, then there would be no impact of transfusion threshold on respiratory outcomes.

This matters. Individual carers could give transfusions to preterm infants with the expectation that they will shorten the duration of oxygen therapy, or positive pressure respiratory support, based on this erroneous SR/meta-analysis.

Recently, when I do a lit search, I often find more Systematic Reviews and Meta-Analyses that there are original trials. I think there are academics who think its so much easier to just recycle the results of someone else’s research than to perform a trial themselves.

Reputable journals should be very careful about publishing SR/MA. They should ensure that the SR was registered, and follows PRISMA guidelines and ensure that they are not just re-performing reviews that have already been well done. They should require that the authors provide pdf copies of the original trial publications with the submission, so that peer reviewers can verify the accuracy of what is being presented. Peer reviewers should ensure that the articles included really exist, that they are trials of the intervention being evaluated, and that the results are accurately analysed.

A related issue is the question of whether the original data are reliable or not. I have read, and reviewed, articles which seem to have been written by AI, and which are probably entirely fictitious. Others have probably skewed their results to be more positive, or have reported different outcomes to those planned when they found something interesting post hoc. A new tool has been developed to try and counter these issues, called INSPECT-SR, which is available as a preprint. (Wilkinson J, et al. INSPECT-SR: a tool for assessing trustworthiness of randomised controlled trials. medRxiv. 2025:2025.09.03.25334905). The tool gives multiple checks to perform when writing an SR, as an attempt to eliminate data which are not reliable. It is a very sad that the integrity of published trials has to be questioned, but it is a reality of our current state of affairs.

Determining the integrity of a Systematic review is even more difficult, as one often does not have access to the original trials, to see if they have been accurately interpreted. The 2 SRs that I have recently criticized, one about Caffeine in the newborn, and this one about transfusions, are both addressing issues for which I was an author of one of the major included trials. My involvement made it immediately obvious to me that there were serious errors in interpretation, and that the SR/MA was very flawed. Systematic reviews of other issues, that I have had less direct involvement with, may have been just as flawed, but it could have escaped my notice.

There is much pressure in some academic circles to “publish or perish”, and to get something, anything, in to print. In some countries medical students are expected to publish an article prior to being awarded their MD degree. In others, junior academics cannot advance unless the combined weight of their output,when printed, exceeds a certain number of kg (or at least it seems that way). Journals now have a major interest in publishing anything that is submitted along with a cheque. Springer seem to be particularly egregious among the older established publishers, but some newer groups, like the Frontiers journals and MDPI have an extremely uneven profile, some of their titles being clearly predatory pay-to-publish journals, and others having higher standards.

It is incumbent on us, in the present day, to be sceptical of everything that we read, primary research and SRs. Pre-registration of trials and SRs, and data sharing are essential to ensure the integrity of the research on which we base our clinical decisions for critically sick babies.

Posted in Neonatal Research | Tagged , | Leave a comment

Do blood transfusions treat apnoea of prematurity?

There has for a long time been a thought that anemic babies with many apnoeas could benefit from a blood transfusion which would decrease their apnoeic spells. This idea has never been directly tested by an RCT. That is, a trial in which infants with apnoea were randomized to receive a transfusion or control, and the response accurately determined. I actually started such a trial when I was in San Diego, but only enrolled a tiny number of babies before leaving to return to Canada; the fellow who was involved finished at about the same time as me, and the project was sadly terminated. I admit that it is not ethical to randomize infants to a trial, with all the stress imposed, and the goodwill of parents involved, and then not have a mechanism to complete the trial, and I apologize to the parents and families involved.

The evidence we do have, therefore, comes from observational studies of various kinds and from secondary analysis of RCTs of blood transfusions at differing thresholds, in which the impacts on apnoea or on intermittent hypoxia (IH) have been recorded. Just to remind my readers, most IH is caused by apnoea, prolonged recordings of saturation are much easier than the prolonged multichannel recordings required to objectively quantify apnoea. I will use the terms interchangeably here, partly because the harms of multiple recurrent apnoea spells, are probably because of frequent desaturation leading to hypoxic injury, and resaturation, with consequent oxidative injury.

Interestingly, the 2 types of evidence give contradictory results. Observational studies tend to show a reduction in apnoea, or IH, after a transfusion, whereas RCTs don’t show a difference in apnoea, or IH, by randomized group; those with a haemoglobin maintained at a higher level do not have less apnoea/IH than those allowed to have lower Hgb.

This is an object lesson in the hazards of observational studies, especially for a condition which is very variable in severity (between patients and between days) and which eventually improves.

I actually use this example when I teach statistics and research design to fellows!

The figure below shows two columns of randomly distributed numbers which I generated, each connected by row number, with a mean of 0 and a SD of 1. If we take “0” to mean the overall average frequency of IH per hour in the sample, (it could be 4, for example), the numbers on the vertical axis are the number of Standard Deviations above and below the mean, this figure could be the number of IH per hour on day 7 and on day 14 of life of 200 preterm babies.

If one decides to give a treatment only to those babies who have more IH than the mean, which means you are selecting the babies in the top half of the distribution, then measure IH frequency again after the treatment. Then, even if the treatment had no effect whatsoever on IH frequency, the result you would get is shown below.

This is regression to the mean. There are very many examples of this, as a potential explanation of positive results in observational studies. Let me give you one example, of an uncontrolled study of of an apnoea treatment. Marlier L, et al. Olfactory Stimulation Prevents Apnea in Premature Newborns. Pediatrics. 2005;115(1):83–8. In this study, babies having recurrent apnoea were exposed to a nice smell, lavender wafted through their incubator, and they evaluated apnoea frequency afterwards. There were fewer apnoeas when the babies had a pleasant odour in the incubator! Without a randomized control group such data are worthless. Any variable condition will tend to get better if you start treating it when it is worse than average, whether you use something effective or give a placebo.

If you tend to give transfusions to anaemic babies who are having more apnoeas, then you are immediately creating exactly this situation. If transfusions have no impact on apnoea, then an observational study will show a significant reduction in apnoea frequency following transfusion.

Also striking is what happens if you ask the question, “do babies who have the most apnoeas have the greatest benefit from transfusion?”, then plot the initial apnoea frequency against the improvement after the treatment, using the randomly generated numbers in the figure above, this gives a correlation coefficient of 0.56 and a p-value of <0.0001.

Remember, these are from entirely random numbers, after an intervention with no real impact whatsoever on apnoea frequency! All you have to do is select the most severely affected babies to treat, they will seem to improve the most.

We can see this, as a potential explanation of results which claim to show a benefit of transfusion on IH, in several studies, such as this one (Kovatis KZ, et al. Effect of Blood Transfusions on Intermittent Hypoxic Episodes in a Prospective Study of Very Low Birth Weight Infants. J Pediatr. 2020;222:65–70). In that study, they examined IH before and after blood transfusion, as well as before and after transfusion of other blood products.

IH decreased from 5.3/h to 3.6/h after blood transfusion, and was unchanged at 4.6/h before and after the small number of transfusions of other blood products.

This is exactly the result you would expect if caregivers were more likely to transfuse anaemic preterm infants when they were having a greater than average number of IH, but give plasma, platelet, or other transfusions for reasons that have nothing to do with apnoea; even if there were absolutely no effect of transfusions on apnoea incidence or IH.

I am not picking this study out as a particularly egregious example, in fact, it is a better study than most, as it at least had the non-RBC controls. You would also see something similar if there was a real impact of RBC transfusions on IH.

If we look at the RCTs of blood transfusions in the preterm, which have compared different transfusion thresholds, there is no apparent impact on IH or on apnoea. This includes the most recent publication, which is a secondary analysis of an RCT. (Franz AR, et al. Effects of liberal versus restrictive transfusion strategies on intermittent hypoxaemia in extremely low birthweight infants: secondary analyses of the ETTNO randomised controlled trial. Arch Dis Child Fetal Neonatal Ed. 2025). The ETTNO trial was a randomized comparison of differing transfusion thresholds in infants <1000g birthweight that I have already discussed, which showed no impact on the primary outcome of “survival without NDI”. There was no impact in survival or on developmental progress, as measured by the Bayley version 2 MDI, results of which were identical between groups. The transfusion thresholds were a bit complicated in ETTNO, the high threshold group had 3 different thresholds according to postnatal age in stable babies, and 3 higher thresholds in “critically ill” babies. The low transfusion group had a different matrix of 6 transfusion thresholds according to postnatal age and being stable or not. Of note, one of the indications for being considered “critically ill” were 6 or more apnoeas/day requiring nursing intervention, or IH to <60% saturation >4 times per day.

This new secondary analysis compared IH frequency and severity according to randomized group. About 50% of the babies had good enough recordings of saturation for analysis, a subgroup who seemed representative of the whole sample.

There were no differences in any index of IH between groups.

In the PINT study, one of our secondary outcomes was how many babies in the low vs high transfusion threshold transfusion groups had “apnea requiring treatment”, which was 55% in the lower Hgb group and 60% in the higher Hgb group, in other words, the small difference was in the opposite direction and not in favour of an impact of Hgb on apnoea frequency.

I don’t think there are any secondary outcome data on apnoea or IH from the TOP trial, if anyone knows of any, please let me know. A much older trial (1984) of only 56 preterm infants reported apnoea frequency among infants randomized to either have their Hgb kept above 100 g/l, or to be transfused only for clinical indications, including surgery, but also including severe apnoea not responding to theophylline as a clinical indication. There were no differences in recorded apnoea frequency despite differing Hgb concentrations. The only controlled data which show a possible impact on apnoea are from the Iowa trial, of 100 preterm infants <1300 g (Bell EF, et al. Randomized trial of liberal versus restrictive guidelines for red blood cell transfusion in preterm infants. Pediatrics. 2005;115(6):1685–91) which had more apnoea spells in the lower threshold group, and more apnoea requiring nursing intervention, 0.4/day compared to 0.2/day. In that trial, infants were allowed to receive a transfusion, even if they were in the low threshold group, if they had multiple apnoeas, which is a possible confounder in analysing the meaning of that result.

I looked for systematic reviews of transfusion in the preterm to see if any had analysed the impact on apnoea, and was unable to find any other reliable data, but read my next post to see what I did find.

My take home message is that there are few reliable data to show that apnoea or IH is more frequent in infants with lower Hgb, nor any reliable evidence that RBC transfusion reduces apnoea or IH in the preterm.

If you transfuse babies who are having more apnoea, or more IH, than average they will usually have a reduction in their episodes.

But :

If you don’t transfuse babies who are having more apnoea, or more IH, than average they will usually have a reduction in their episodes.

The only way to resolve the issue would be to do a trial similar to the one that I started years ago. Enrol anaemic infants with apnoea or IH, randomize them to transfusion or control and obtain objective recordings of their responses. I have a strong feeling, based on my evaluation of the currently available data, that both groups will show a reduction in apnoea/IH, and that there would be little or no difference between the two groups.

Posted in Neonatal Research | Tagged , , , , | 1 Comment

Non-invasive high-frequency oscillation; worth the hassle?

Non-invasive HFOV can be delivered by a variety of different equipment and interfaces. The high flows and upper airway turbulence probably have an impact on gas exchange; It appears that the effective dead space of the oro-nasopharynx is washed out (De Luca D, Dell’Orto V. Non-invasive high-frequency oscillatory ventilation in neonates: review of physiology, biology and clinical data. Arch Dis Child Fetal Neonatal Ed. 2016;101(6):F565–F70), but how much transmission of the oscillatory pressures to the lung occurs is uncertain. Transmission does occur under some circumstances, however, as several groups have shown. In this cross-over study, for example, (Gaertner VD, et al. Transmission of Oscillatory Volumes into the Preterm Lung during Noninvasive High-Frequency Ventilation. Am J Respir Crit Care Med. 2021;203(8):998–1005) nHFOV was applied starting with a pressure amplitude of 20 cmH2O, then adjusted to give either a PCO2 of 40-60, or, if, the baby was already normocapnic, adjusted to the lowest pressure that gave visible chest wall oscillations, the article doesn’t state what were the eventual pressure amplitudes received. Nevertheless, using transthoracic impedance tomography, they were able to detect chest wall movements which were about 1/5 the amplitude of the babies tidal volume movements. They also showed that when the oscillations were switched on, there was a decrease in the amplitude of the infant’s own tidal respiratory movements, which I presume was a reflex reduction, secondary to an increase in CO2 clearance by the HFO, which would decrease endogenous respiratory drive.

The figure shows the amplitudes of impedance changes over a period of CPAP compared to nHFOV, i the upper panels, and the lower coloured pictures show that the oscillations were preferentially transmitted to the right lung, especially the central and “upper” regions (the babies were in ventral positioning).

It appears likely, then, that there are some pressure oscillations in the distal airways during nHFOV, which might lead to some gas exchange. It is possible, therefore that respiratory support of nHFOV may have some advantage over CPAP. Any possible advantage over nIPPV (with NAVA or fixed pressures and/or non-synchronised) is not so clear, and requires some clinical trials to confirm.

There have been a few recent trials of nHFOV, both as a mode of routine support post-extubation, and for primary respiratory support after birth in preterm infants. There are also now several systematic reviews and meta-analyses, there seem to be some SR/MA factories churning these things out, some of which appear to have been written with AI, and sometimes include non-existent references. One needs to be really careful these days, both as a reader and as a peer-reviewer. I now check much more carefully than in the past when I am peer-reviewing an article (both primary research and review articles) to be sure that the key references actually exist. Unfortunately I don’t read Chinese, and many references only seem to appear in Chinese databases. In this SR of post-extubation nHFOV, for example, Prasad R, et al. Noninvasive high-frequency oscillation ventilation as post- extubation respiratory support in neonates: Systematic review and meta-analysis. PLoS One. 2024;19(7):e0307903 there are numerous included RCTs for which I cannot find the original article, some appear to only show up when searching the Chinese medical publication database, so it is impossible for me to check the definitions of treatment failure, for example, or the characteristics of included babies.

That review shows a reduction in extubation failure when nHOV was compared to nCPAP

and a smaller advantage of nHOV compared to nIPPV

That review showed no difference in any other clinically important outcomes, such as lung injury as measured by BPD at 36 weeks, IVH, or the other usual neonatal complications.
It also is not clear if all the babies benefited from an optimal approach to extubation, with caffeine pre-treatment, evaluation of spontaneous breathing, and higher CPAP levels in those still requiring oxygen, all of which can reduce extubation failure rates, and might eliminate a possible advantage of nHFOV, or not. They are also all tiny or modestly sized trials, only one of which was individually significant, which inflates the probability of a chance finding.

In addition, as mentioned, there are some problems with availability of these publications. The reference list of this SR/MA has no entry for Zhu 2019, it is supposed to be reference 37, but reference 37 is a different publication, a non-controlled report of nHFOV use, and I can’t find any link to a trial of nHFOV authored by Zhu in 2019, including from searching the Chinese database, CNKI. Also Liang 2019, and several other references, does not have a URL link; I searched the CNKI database and found a link to the Liang article, which had an abstract in English (the things I do for my readers!), but the full publication is behind a paywall, so I have no idea how extubation failure was defined in that study.

The systematic review seems to show that nHFOV post-extubation leads to a decrease in extubation failure from an overall rate in these studies of 1 in 4 with CPAP, (which seems very high) to about 1 in 9. In the studies comparing with nIPPV, the overall rate decreases from 1 in 6 with nIPPV to about 1 in 9 with nHFOV. But, if you eliminate the trials that I cannot find, and the tiny trial with an extremely high failure rate in controls, there is no clear advantage to post-extubation nHFOV compared with nIPPV in extubation failure.

Failing extubation and being re-intubated is, in itself, an important outcome to parents (and I would guess to the babies!). If there were no other advantage in terms of lung injury, VAP, or length of stay, then that in itself would be worth the hassle. For now, I am not convinced of the advantage of nHFOV compared to nIPPV for routine support post-extubation in the very preterm infant (who has the highest chance of being re-intubated). Other interventions, including NIV-NAVA (Tome MR, et al. NIV-NAVA versus non-invasive respiratory support in preterm neonates: a meta-analysis of randomized controlled trials. J Perinatol. 2024;44(9):1276–84) also are unconvincing as evidence for routine use.

The current evidence shows that prior to extubation babies we should ensure therapeutic caffeine treatment, perform a Spontaneous Breathing Test, and if the baby passes the test, very preterm infants should receive either nIPPV, nHFOV, or NIV-NAVA, with a PEEP of 6 if in 21% oxygen, and 8 or 9 if they have residual oxygen needs.

Decreasing extubation failure, and the need for reintubation, with its resultant trauma to the airways, and trauma to the parents hopes, is an important goal for research, even if longer term pulmonary complications are not affected.


Posted in Neonatal Research | Tagged , , , , , , | 2 Comments

Avoiding IVs in moderately preterm babies

A new very large (for neonatology) RCT has just been published. Ojha S, et al. Full exclusively enteral fluids from day 1 versus gradual feeding in preterm infants (FEED1): a open-label, parallel-group, multicentre, randomised, superiority trial. Lancet Child Adolesc Health. 2025. Mothers were approached prior to preterm delivery, and babies were enrolled if they delivered between 30 and >33 weeks gestation, and were deemed clinically stable, prior to 3 hours of age. Prior to delivery the mothers either gave full written signed consent, or they verbally agreed to the study, in which case they had a full written consent later. This allowed babies to be enrolled early in life, at an average of about 1.5 hours. Twins were therefore randomized as a unit to the same group, there were 2088 babies from 1761 mothers.

About half of the babies already had an IV when they were enrolled. In the intervention group, babies then received “full-milk-feeds”, starting at 60-80 mL/kg/d (the published protocol states 60 mL/kg/d), whereas the control group had a maximum of 30 mL/kg/d enteral liquids on day 1.

This is described thus : “For infants in the full milk group, we started milk feeds within 3 h of birth at 60–80 mL/kg per day via a gastric tube and continued milk feeds without intravenous fluids or parenteral nutrition”. But this just isn’t true, 80% of the full milk group infants did have intravenous fluids on day 1, as their own figure shows. I think this is really just a question of wording, the full milk group babies were supposed to attempt full enteral nutrition, but large numbers of babies in this GA group will have expectant IV antibiotics for possible early onset sepsis, so they required IV access for good clinical care. 50% of the babies had an IV at enrolment (there is no mention anywhere in the publication or the supplementary materials of IV antibiotics). This also means that 300 babies in the full feeds group had an IV inserted, within the first few hours of birth, after they were assigned to the full enteral feeding group. This is never explained, or even mentioned or discussed.

They also never state what was done with feeding volumes for infants who had an IV running; were the IV fluids included in the fluid calculation? If a baby weighed 1.2 kg, for example, and had an IV running at 2 mL/h for their antibiotics, was the 40 mL/kg/d of IV liquid in addition to the enteral liquids? Or was that volume deducted from their feeds? This is an important detail given that 80% of the “full enteral” group had IV fluids.

After the initiation of the trial, the local care team could do whatever they wanted, in terms of increasing feed rate, or the source of additional feeds (donor milk or formula), or defining feed intolerance, or measuring gastric residuals(!), or timing of fortification. Full feeds were defined as at least 140 mL/kg/d for 3 consecutive days.

Forty percent of the “full enteral feeding” group had more than 24 hours of IV fluids, but again we have no idea how much of this was due to IV antibiotic use. The babies were all preterm, many had respiratory distress, more than a fifth had ruptured membranes for >24 hours, so I am sure that many had (and needed) IV antibiotics. On the other hand, there were 71% who were delivered by Cesarean section, and babies delivered by CS with intact membranes don’t need antibiotics.

I am sure some also needed IV dextrose for treating hypoglycaemia; we are told that the incidence of hypoglycaemia was the same between groups, but how many had an IV for low blood sugar on day 1 is not reported. In the supplemental data we learn that there were about 7% of babies in the full feeds group who “did not adhere to the protocol”, i.e. had intravenous fluids after 24 hours of age, who were in that situation because of hypoglycaemia. Also, 4% of the full feeds group had an IV after 24 hours of age for “other clinical reasons”, which I guess must include IV antibiotics, but that seems extremely low to me. 12% if them had an IV for not tolerating full feeds, and, as mentioned, 7% for hypoglycemia, In other words, nearly 90% of the babies did tolerate full feeds from birth.

The primary outcome was duration of hospitalisation, which was determined according to local practice. There was no impact of study group on the primary, just over 32 days in each group. There were also no differences in the secondary outcomes of NEC (which was rare, 4 cases vs 6 cases), or late-onset sepsis (which was uncommon, 3% vs 2%). Among gestational age subgroups, the primary and these secondary outcomes were similar.

There were differences in TPN use, number and duration of central line use, and the numbers of peripheral IVs inserted, as you would guess, these were all reduced in the early enteral feeds group.

My take away from this trial, and several other smaller trials, is that full nutritional support can be given, from birth, by the enteral route in a large proportion of preterm infants of 30 to 34 weeks, and if they have no other clinical indication for an IV access, one can completely avoid IVs. Infants who need IV antibiotics can usually have their antibiotics discontinued at 36 hours of age (because most of them have negative cultures), after which most of them can be on full enteral fluids. A number of recent trials, some of which I have discussed in the blog have shown the toxicity of TPN. These new data show that a large proportion of the 30 to 34 weeks babies (the majority of preterm babies in the NICU) can be managed without ever receiving TPN. They can also avoid the pain of IV insertion attempts, and the discomfort of IV infiltration episodes.

There does not seem to be any good reason to start at less than 60 mL/kg/d of enteral milk feeds in this group of babies on day 1. Some babies will have difficulty tolerating this, especially infants with IUGR, and in those babies you may need to slow down feeding advancement, or even sometimes to back down to smaller volumes or temporarily stop feeds. Some will also need IV glucose, but I can’t see any good reason for not at least trying to give full enteral nutrition from birth in these babies, even if they need IV access, and a small volume of crystalloid solution, for antibiotic administration.

Posted in Neonatal Research | Tagged , , , , | Leave a comment

Bob Bartlett RIP

I just learned of the very recent death of Dr Robert Hawkes Bartlett, May 8, 1939 – October 20, 2025. He was a surgeon who had been developing extracorporeal oxygenation systems for cardiothoracic surgery who realised that extracorporeal circulation could be used for prolonged support, and was willing to try it out for a baby who was dying.

He told the story in his Presidential address to the American Society of Artificial Internal Organs in 1985 Bartlett RH. Esperanza. Presidential address. Trans Am Soc Artif Intern Organs. 1985;31:723–6, of the first baby who received ECMO treatment.

“That child, treated in 1975 was.. a little girl. Her mother was just a girl herself. A Mexican peasant girl living in Baja who could neither read nor write and who realized, when she became pregnant in 1974, that her baby, if it lived at all, would fare no better. We all have hopes and dreams, and when we become parents our most fervent hope is that our children will live well, grow up bright and beautiful, and exceed the station of their parents, whatever that is. Poor Mexican mothers know that they can give the gift of opportunity to their new offspring in the form of United States citizenship by having the child born in this country. So it was that this young mother, consumed with the wish for a better life for her unborn child, crossed the border and set out for Los Angeles when her labor pains began. But as fate.. would have it, her water broke on the freeway and she took the next off-ramp to Orange County Medical Center. The baby was born – a perfect little girl- but something was wrong. The delivery had been difficult. The neonatologist tried to explain, “Mal respire. Mal grande. Intubation. Ventilator, Oxygen. Pressure. Hypoxia, Seizures.”

“The neonatologist knew that we were working with ECMO (rather unsuccessfully) with adult patients. Would we give it a try? The babe was dying. The arterial PO2 was 12. In the middle of the night, with the aid of a flashlight so as not to disturb the other patients, we tried to explain to the mother through an interpreter the ultimate in high tech procedures which had never been used successfully for an infant. She signed the consent form with an X, scared to death for her little girl and more scared that the official-looking form would bring recognition, deportation, perhaps imprisonment. She went in to see her baby girl, cyanotic, on a ventilator, with tense nurses and residents standing about. And the next day she disappeared, leaving her baby 2 gifts : a US citizenship and a name – Esperanza- Hope.

…we ligated the patent ductus arterosus and placed a catheter to monitor pressure in the pulmonary artery. This established the diagnosis of persistent pulmonary hypertension of the newborn. When the spasm finally relaxed and the blood flowed through the lung, our patient could be weaned off bypass, and off the ventilator. Soon she had a foster family.

The baby survived, and Ann Arbor started a program of offering ECMO for full term infants who were expected to die because of cardiorespiratory failure, usually hypoxic secondary to PPHN. They developed predictive criteria which were reasonably good at predicting which hypoxic babies under full intensive care would die, with over an 80% accuracy. But with ECMO they had over 80% survival.

Bob was criticized for not doing a randomized controlled trial, when introducing this new life-saving technology. Which could be likened to doing an RCT of parachute use when falling out of a plane (Yeh RW, et al. Parachute use to prevent death and major trauma when jumping from aircraft: randomized controlled trial. BMJ. 2018;363:k5094); but nevertheless there were many sceptics in many parts of the world who thought they could have saved these babies without ECMO. He listened to them, and designed a study which minimized the number of potential deaths (Bartlett RH, et al. Extracorporeal circulation in neonatal respiratory failure: a prospective randomized study. Pediatrics. 1985;76(4):479–87). The “randomized play the winner” trial was a unique approach to a trial design, where potential adverse outcomes (death) were extremely likely. In essence, the first baby was randomized, and depending on whether they survived or died, the successive randomizations were weighted to increase the chance that a baby would be in the group with survivors, or decrease the chance of being in a group where the previous infant had died.

This design was likened to randomizing by pulling a ball from a sack, within the sack one starts with a black ball (ECMO) and a white ball (standard care). If a baby was randomized, to ECMO for example, and then survived, then an extra black ball was added to the sack prior to the next randomization. Likewise if the baby was randomized to ECMO and died, then an extra white ball would be added, or if they were randomized to standard care and survived. That way the previous “winner” group would have more chance of being the group assignment for the next baby. As it happened, the first baby was randomized to ECMO and survived (so a 2nd black ball was added) the second baby was randomized to routine care and died (so a 3rd black ball was added). This progressively increased the chances of a subsequent baby being in the ECMO group, and another 10 babies were randomized to ECMO who all survived. This reached the pre-specified success criterion, and the trial was terminated.

If this had been a standard RCT then 0/1 compared to 11/11 would not be “statistically significant”; by Fisher exact test the p value is 0.08. But it wasn’t designed as such a trial, and the results did exceed the pre-specified criterion for advantage of ECMO, without consigning large numbers of babies to the inferior treatment, or, to put it less politely, to die.

The observational data reported prior to this trial were already convincing enough for Neil Finer in our centre, and he went off for a few months to train in ECMO, then returned to Edmonton to start the first Canadian ECMO program, a process I was delighted to have a small part in.

A couple of years later we held an ECMO conference in Lake Louise, at which I got to meet Bob Bartlett, a delightful, thoughtful, humble man, you can detect those characteristics in the kindness of his description above of the dilemma of the mother of Esperanza.

The conference we held was in the winter, and the schedule was designed so that we could go skiing in the afternoons. Bob was a much better skier than I was, and I remember him skiing down the slopes, of the most beautiful scenery on earth, with his Sony walkman playing his favourite tunes as he skied.

Dr Bartlett was a thoracic surgeon whose dedication to improving patient care saved tens of thousands of newborn babies. A page on the ELSO website is dedicated to his memory, and includes a link to a fairly recent video about the development of ECMO.

Posted in Neonatal Research | Tagged | Leave a comment

Neonatal Research Shorts : October 2025

Afifi J, et al. Atropine Versus Placebo for Neonatal Nonemergent Intubation: A Randomized Clinical Trial. J Pediatr. 2025;286:114719

I had thought this was a settled issue, Neil Finer showed many years ago that atropine alone decreased bradycardias during intubation. But as the authors of this new study point out, there is very little (or no) data about atropine as part of an intubation cocktail in the newborn. I have a bit of a beef with the introduction which suggests that the Kelly and Finer trial mentioned above was limited, as it did not “follow recommended premedication protocols”. But, when Neil Finer and Mark Kelly performed that study, there were no premedication protocols, and everyone in the world was intubating babies awake, and un-premedicated. Apart from this minor wording issue, the rationale for the study is reasonable. All the babies received fentanyl (1-2 microg/kg) and succinlycholine (2 mg/kg) premedication, and they were randomized to additional atropine (20 microg/kg) or placebo.

The primary outcome was a dichotomous, occurrence of severe bradycardia (<80 bpm for >10 seconds), there were 73 intubations with quite a large imbalance between the size of the groups, 49 placebo and 24 atropine. The randomization schedule was blocked, so I am not sure why there is such a big difference, which has an impact on the power of the study, compared to having 35 or so in each group. There was much more severe bradycardia among the controls, much more bradycardia <100, and much longer median duration of bradycardia, than among the atropine babies.

The premedication cocktail used is probably optimal, except for the dose of fentanyl; different doses of which have never been adequately compared. I am not sure that 1 microg/kg is adequate analgesia, my feeling is that larger doses, 2 to 5 microg/kg are needed for a procedure which is quite painful; but that feeling should be better investigated, and analysing pain in babies who have received a muscle relaxant is rather tricky! Alternatives to atropine, specifically glycopyrrolate, an analogue which has a greater safety profile as it doesn’t cross the blood-brain barrier, could well be preferable, but has not been adequately studied in the newborn.

Take Home Message: Premedication for neonatal endotracheal intubation should include atropine.

Ambalavanan N, et al. Early Intratracheal Budesonide to Reduce Bronchopulmonary Dysplasia in Extremely Preterm Infants: The Budesonide in Babies (BiB) Randomized Clinical Trial. JAMA. 2025.

This is the type of study that would normally warrant an entire post on its own, but it has the misfortune to appear after the similar PLUSS trial.

Differences to the PLUSS trial include that all the babies were intubated, there was no enrolment of babies receiving surfactant by LISA/MIST. In addition, all the babies were enrolled prior to their first dose of surfactant. They were all <29 weeks GA and <50 hours of age. They could receive a maximum of 2 doses, if they were retreated with surfactant <50 h of age.

Sample size was similar to PLUSS at 641 total, the initial plan was for 1160 infants, but the trial was stopped after around 50% for futility. In general, I think it is a mistake to stop for futility, but in this case, with the null results of PLUSS, which the investigators would have been aware of, the chances of finding important effects of budesonide became very unlikely.

There was a tiny difference in mortality, 15% budesonide vs 13% placebo, and no differences in BPD, 63% of survivors per group. The combined outcome was therefore close to identical in the 2 groups. Unlike the secondary analyses of PLUSS, there was no difference in outcomes in any subgroup, including the subgroup of babies with more severe lung disease (>50% FiO2).

Other secondary outcomes were also similar between groups, there was a small shift in severity of BPD; there was no difference in severe BPD, but a slightly fewer moderate and more mild BPD in the budesonide group compared to placebo, but all the 95% CIs included no difference. I think this puts the nail in the coffin of routine budesonide supplementation in very preterm infants. We can’t overcome the adverse impacts of preterm birth and ventilatory support with exogenous steroid treatment.

Take Home Message : There is no rôle for routine addition of budesonide to early surfactant replacement.

Singh G, et al. Dopamine versus epinephrine for neonatal septic shock: an open labeled, randomized controlled trial. J Perinatol. 2025. This is a single centre trial among term and late preterm infants (>34 wk) with septic shock, it was registered 1st of March 2023, first patient enrolled the 6th of March, and enrolled 80 babies with fluid refractory septic shock, that is they continued to have signs of shock (defined in the publication) despite up to 60 mL/kg of fluid in term babies and up to 30 mL/kg in the preterm. Enrolment seems to have been completed in under 2 years, with 206 cases of septic shock, of whom 108 were “fluid-refractory”, 28 had exclusions and 80 were randomized. It is hard for me to imagine an NICU with over 100 cases a year of septic shock among term and near-term infants. The authors give no bacteriology, but the infants were diagnosed with pneumonia (60%), meningitis (30%) or “parenteral diarrhea” whatever that is (7%). Infants who received epinephrine were more likely to have reversal of their shock at 60 minutes (78% vs 63%), and more likely to have reversal of shock within 40 minutes (65% vs 43%). However, almost all the babies died, with no difference between groups (85% vs 88%).

Take Home Message : There are very few data on which to base choice of inotropes in newborns with septic shock. Clinical outcomes are poor, but there are some indications of better haemodynamics with epinephrine than dopamine.

Rochow N, et al. Individualized Target Fortification of Breast Milk with Protein, Carbohydrates, and Fat for Preterm Infants: Effect on Neurodevelopment. Nutrients. 2025;17(11). A couple of years ago, the group from McMaster published a randomized trial of individualized fortification of mother’s milk, compared to standard fortification, among VLBW babies and showed improved growth outcomes. Milk was analyzed 3 times a week, and fortification adjusted according to the results of that analysis. When I compare those older results with our local practice, they had rather poorer growth in their comparison, standard fortification group (mean 2.3 kg at 36 weeks), than we do, Lapointe M, et al. Preventing postnatal growth restriction in infants with birthweight less than 1300 g. Acta Paediatr. 2016;105(2):e54–9, with an approach where we routinely target 165 mL/kg/day of milk fortified to 81 kcal/100mL. We rapidly increase volumes or fortification in case of poor growth, and the mean discharge weight, at a mean of 37.9 weeks, was 2.88 kg.

The advantage of the individualized fortification was greatest, as one might expect, in the subgroup of infants whose maternal breast milk had lower than average protein content. The babies with MoM protein content that was higher than the average had relatively modest impacts of the individualized approach.

Of course, our approach means that babies who need to have their fortification adjusted will have passed a period of poor growth, and babies may pass several days with inadequate protein or energy intakes. In contrast, the individualized approach could be considered more “prophylactic”, by adjusting fortification according to measured breast milk composition, one can ensure that recommended intakes are received throughout the hospitalisation, and hopefully avoid periods of poor growth. Or, at least, that would be the case if the intervention started early after birth, unfortunately in this study, the intervention started at an average of 24 days of life.

I have a lot of sympathy with the ideas behind the individualized fortification approach, based on the known variability of breast milk content, for women who deliver at term or preterm (for a very nice review see Gates A, et al. Review of Preterm Human-Milk Nutrient Composition. Nutr Clin Pract. 2021;36(6):1163–72). But, it is time-consuming and costly, and needs some specialized equipment, i.e. an infra-red analyser for protein and fat, and a lab system for the lactose. Do the short term impacts translate to longer term outcomes? 69 of the original sample of 103 infants were examined at 18 months corrected age with neurologic examinations and Bayley version 3.

As you can see here, all the Bayley scores were a little higher in the intervention (individualized fortification) group. The differences were actually smaller between the low protein controls, and the low protein intervention infants. You can also see that there are only 51 babies with Bayley scores in this table, I don’t know where the results went for the other 18 babies, who, according to the CONSORT flow diagram, had their BSID evaluations. In a table a bit further on in the publication, which shows the proportions of infants below certain threshold scores of their BSID scores, the lost babies re-appear.

Take Home Message : These results suggest that individualized fortification might have some long term benefits, but that is not yet proven, there were at least no adverse impacts shown.

Posted in Neonatal Research | Tagged , , , , , , , , , | 2 Comments

Predicting neurological and developmental outcomes. Why? How?

There are a huge number of publications correlating medium term outcomes (by which I mean outcomes around 1 to 2 years of age) with findings in the neonatal period. Most have concerned various approaches to brain imaging, although other studies have evaluated EEG, NIRS, early structured physical examinations, counting how many complications the baby had, the type of feeding they received…. I am sure my readers could construct a longer list.

There are several recent publications that have triggered this post, in the extremely preterm infant.

In the preterm infant, brain injury on imaging is very common, yet most preterm babies actually function very well. Why, therefore have we spent so much effort trying to refine brain imaging, to find better ways to predict outcomes?

The main point I want to make is the following:

Finding a statistically significant correlation is not the same as an individually useful prediction. Investigations seeking answers about the cause of neurological and developmental problems in the newborn might be satisfied to find a statistically significant correlation, that could be a reasonable research goal. But, just because there is a significant correlation doesn’t mean that a finding is useful to predict an individual child’s prognosis.

Let me give one older, illustrative, example, Tusor N, et al. Punctate White Matter Lesions Associated With Altered Brain Development And Adverse Motor Outcome In Preterm Infants. Sci Rep. 2017;7(1):13250. This study quantified the punctate white matter lesions (PWML) on MRI at term equivalent age, in a multicentre cohort of 500 preterm infants of <33 weeks. They examined the infants at 20 months corrected age, performed Bayley version 3 developmental screening and a neurological exam. 114 infants had PWML and a neuro exam, of whom 10 had CP with a GMFCS grade 2-5; 281 infants had no PWML and a neuro exam, of whom 2 had CP of those grades.

The incidence of CP among all those who showed up for a neurological examination, therefore, was 3.5%. If they had PWML on the MRI at term the incidence was 9%. So the difference in CP incidence was statistically significant, with an Odds Ratio of 6.6 (95% CI did not include 1, 2-22). But for an individual baby the finding was completely useless as a prediction!

If you found PWML on the MRI you could say with over 90% confidence to the parents, that the infant would not have disabling CP!

Even among those with PWML in the cerebro-spinal tracts, or those with large numbers of lesions, the individual prediction of CP was always under 30%.

Some years ago, an abnormal finding of what was called DEHSI, an abnormal white matter appearance of Diffuse Excessive High Signal Intensity, was reported as a frequent occurrence at term equivalent age in very preterm infants (Maalouf EF, et al. Magnetic resonance imaging of the brain in a cohort of extremely preterm infants. J Pediatr. 1999;135(3):351–7) it was initially thought to be a poor prognostic factor. However, further study suggested that it was not strongly associated with worse developmental outcomes. This new study, from a multicentre cohort of about 340 babies from Cincinnati of 32 weeks GA or less, (Derbie AY, et al. Diffuse white matter abnormality is independently predictive of neurodevelopmental outcomes in preterm infants. Arch Dis Child Fetal Neonatal Ed. 2025) used automated objective measurement of the signal intensity in the Centrum Semiovale. If you remember from your neuroanatomy, this is a large region of subcortical white matter superior to the lateral ventricles and the corpus callosum, which on each side of the brain is roughly the shape of half an oval! Increased signal intensity on T2 imaging, calculated by their algorithm as being more than 1.8SD higher than the mean density, they called Diffuse White Matter Abnormality DWMA. You can see from the outlined regions in the MRI below, you wouldn’t have spotted those regions by eye, it really requires their computerised algorithm.

These babies were then followed, with a good percentage returning for evaluation (nearly 90%), at 2 years for Bayley’s and neuro examination; at 3 years (also around 90%) they had cognitive testing done using a tool that is well validated, and somewhat predictive of later academic performance “the Differential Ability Scales, second edition (DAS-II) General Conceptual Ability (GCA) score”.

There was a strong correlation between several different factors, including the MRI findings, and motor scores, diagnosis of CP, and cognitive scores.

This kind of study always leaves me ambivalent. On the one hand, the finding that the extent of white matter injury correlates with motor, and, less strongly, with cognitive outcomes, is an unsurprising confirmation of the importance of abnormal brain development in very preterm infants. This is a very well done study, with infants from a wide range of gestational ages, excellent high quality follow up, and extensive statistical analysis.

On the other hand, what on earth are we supposed to do about it? The outcomes measured are largely outcomes that have little importance for parents. As the parents voices project has confirmed, they don’t care about Bayley scores, and CP with a GMFCS=3 is not considered by parents to be a serious adverse outcome.

In addition, the correlation with the outcomes is not even very close. The following figure shows the contribution of various factors to the cognitive outcomes of the infants. It shows that having antenatal corticosteroids (ACS), maternal breast milk (MMDD) and being a girl (SEX) had a substantial positive impact, whereas High-Risk Social Status (HRSS) had a major adverse impact, especially if combined with moderate or serious Brain Abnormalities (shown on the figure as msBA, defined by a Kidokoro score of over 7 on the term-equivalent MRI).

Those other factors were much more strongly correlated with the outcomes, than the DWMA. For example the relative impact of receiving antenatal steroids gave a ß value of +13.5 compared to about -2 for the DWMA, receiving maternal milk at discharge had a ß of +8 (which I think means that, after correcting for other factors, the cognitive score was on average 8 points higher among those who received maternal milk compared to those that did not, this is huge impact from something that we can do something about! Or, depending on how the statistics were done, it might be 0.8 SD higher: 1 SD was 20 points on the cognitive score, so 0.8 SD would be 16 points higher) As you can see from that graph above, the DWMA had a relatively weak association with cognitive score (all the results were similar for the motor composite score). The ß of about 2 means that, for each 1 SD increase in DWMA volume, there was an average of a 0.2 SD lower score on the Cognitive Composite, or 4 points lower.

Unlike some other markers on TEA brain imaging, the absence of DWMA does not appear to be very predictive of absence of CP, but I can’t find enough data to calculate the Negative Predictive Value in this publication.

I also find the way the results are presented to be questionable. The subtitles of the various sections of the results all use the term “prediction” such as in “prediction of motor performance”, “prediction of cognitive performance”, “prediction of CP”. These all presuppose that the prediction is useful, whereas they all were, in reality, very poorly predictive. It would have been better to title the sections “correlation with…”

There is a really good paper describing the methodology, goals, and interpretation of prognostic studies that I have been reading (Kent P, et al. A conceptual framework for prognostic research. BMC Med Res Methodol. 2020;20(1):172). It makes the important distinction between prognostic determinants and prognostic markers, and between various stages of exploratory and confirmatory research projects. This project would therefore be an exploratory study. Although the authors of this study have shown that there is a relatively weak association between the extent of DWMA and lower scores on the developmental screening tests at 2 to 3 years, we can’t tell from this whether the DWMA is a marker or is causative, and whether interventions aimed at reducing DWMA will improve outcomes.

To come back to the sentence in bold type above, the authors have shown a statistically significant correlation between having more white matter injury, using their algorithm on TEA brain MRI, and poorer motor scores at 2 years, lower cognitive scores at 3 years, and a diagnosis of CP (stage 1 or more). But, is that association useful in prognosis for an individual? They have shown that other factors are much more strongly associated with those outcomes, some of which are potentially modifiable on a group basis (working harder to ensure that mothers get steroids before delivery) or on an individual basis (maternal milk intake).

Another very recent study, from the same group, (Mahabee-Gittens EM, et al. Severity of punctate white matter lesions in preterm infants: antecedents and cerebral palsy prediction. Pediatr Res. 2025) analysed PWML, (the punctate lesions discussed above) and divided the 28 who had PWML into terciles of extent of lesions. 39 of the 339 infants in the study (12%) developed CP, of whom 6 were among those who had PWML.

To put it another way, the large majority of the babies who developed CP (33 of 39) did not have PWML. The majority of those with PWML, 22 of 28, did not develop CP.

This study also shows no significant correlation between the PWML and Bayley scores, on any of the 3 domains, including the motor composite.

This is a very unimpressive predictive capacity for the individual baby. Nevertheless, an accompanying editorial somehow uses those results to push the idea that all ex-preterm babies should have MRIs. They claim that doing so would allow early intervention, not mentioning that targeting early intervention to those with PWML on the MRI would fail to include the large majority of babies with CP. In fact, as usual, the study showed that the strongest predictor of poor developmental scores was Social Status, followed, in this analysis, by chorioamnionitis.

A much better way of targeting early intervention then, according to these data, is not to perform routine MRI, but to forget the MRI and routinely enrol infants with poor social status into early intervention programs.

Here’s another idea. Why not take the money that would be spent on routine MRI, and just give it to the poorest parents? (Bouchelle ZM, et al. Unconditional cash transfers to low-income preterm infants and their families: a pilot randomized controlled trial. J Perinatol. 2025) This pilot trial is investigating the impacts of giving unconditional money transfers to families that are at highest risk of having a baby with developmental delay. That is, according to the MRI study above, those with poor social status. Previous studies of giving money to families, with babies mostly born at term, (Gennetian LA, et al. Effects of a monthly unconditional cash transfer starting at birth on family investments among US families with low income. Nat Hum Behav. 2024;8(8):1514–29) showed that “mothers spent more time engaged in cognitively stimulating activities with their children. In addition, ~25% of the value of the cash gift was used on children’s books, toys, activities, clothing, diapers, and children’s electronic items/devices” which are all things likely to improve infant development, and which poor families struggle to obtain.

For 100 babies eligible for a TEA MRI, if they cost, say, $500 each, that $50,000 could instead be divided among the 10 families at highest social risk and the impacts on infant and family well-being might be dramatic. I think it would be ethically justifiable to have some minor conditions attached, such as participation in an early intervention program, which have been shown to improve cognitive and motor function in infancy (see the latest Cochrane Review Orton J, et al. Early developmental intervention programmes provided post hospital discharge to prevent motor and cognitive impairment in preterm infants. Cochrane Database Syst Rev. 2024;2(2):CD005495).

Instead of performing routine MRIs and searching for abnormalities with a weak association with CP or developmental delay, we should focus on ways to improve those delays and improve outcomes. The individual predictive ability of any finding on MRI at term-equivalent age is between low and extremely low. Such studies may well have value for research, but for improving outcomes of former very preterm babies they are useless.

Posted in Neonatal Research | Tagged , | Leave a comment