Statistically Significant Bacteria shifts seen in ME/CFS

Statistics is fun because there many paths. Most studies using the microbiome uses the easy, but naïve, path of computing averages and standard deviation. As my dataset has grown, I have been travelling some less traveled path, for example: Visual Exploration of Odds Ratios, and a patent pending method termed “Kaltoft-Moltrup”.

One of the frequent decisions that I see in studies is to limit examination of bacteria that have a high frequency in the samples. This allows the researchers to keep to familiar and classic statistics. Using frequency of observation in the control group and the condition group is one of these much less travelled paths. It usually require big sample sizes and many studies have a sample size of 30 (sufficient for the mean and standard deviation approach).

I just completed code to compute Chi2 using Biomesight data for users reporting ME/CFS.

  • Total Population: 3525
  • ME/CFS: 280

Chi2 can be converted to probability (p) of happening at random with the following table

Seen too Rarely(Want to increase)

We see one bacteria available as a probiotic Bifidobacterium adolescentis. The rest would need to be altered by diet.

tax_nameTAX
RANK
Chi2ObservedExpectedShift
Segatella oulorumspecies13.91638Under-Represented
Bifidobacterium cuniculispecies10.83357Under-Represented
Brenneriagenus10.5112Under-Represented
Bifidobacterium adolescentis strain10.35077Under-Represented
Prevotella veroralisspecies10.3620Under-Represented
Aggregatibactergenus10.14672Under-Represented
Hoylesella shahiispecies10.11228Under-Represented
Segatella paludivivensspecies9.64064Under-Represented
Actinobacillus pleuropneumoniaespecies8.63354Under-Represented
Slackia heliotrinireducensspecies8.2820Under-Represented
Prevotella micansspecies8.1313Under-Represented
Actinobacillusgenus8123157Under-Represented
[Actinobacillus] rossiispecies7.179105Under-Represented
Pasteurellaceae incertae sedisno rank7.179105Under-Represented
[Pasteurella] aerogenes-[Pasteurella] mairii-[Actinobacillus] rossii complexspecies group7.179105Under-Represented
Aggregatibacter aphrophilusspecies7413Under-Represented
Mitsuokella multacidaspecies6.8312Under-Represented

Seen too Often (Want to decrease)

We see 118 bacteria over a Chi2 of 6.635 ( P < 0.01 or 1 change in 100 of being a false detection). The list below are for genus and higher taxonomy orders.

Tax NameRankChi2ObservedExpectedShift
Dethiobacteraceaefamily20.97545Over-Represented
Hungateiclostridiaceaefamily16.17145Over-Represented
Tepidimicrobiaceaefamily14.6167Over-Represented
unclassified Burkholderialesfamily12.62916Over-Represented
Corynebacteriaceaefamily11.2159123Over-Represented
Halanaerobiaceaefamily10.17654Over-Represented
Alcanivoracaceaefamily9.65335Over-Represented
Halanaerobialesorder9.68158Over-Represented
Desulfonatronovibrionaceaefamily9.15840Over-Represented
Acetobacteraceaefamily8.712496Over-Represented
Hyphomonadaceaefamily82918Over-Represented
Tissierellaceaefamily7.52011Over-Represented
Rhizobiaceaefamily7.19876Over-Represented
Nannocystineaesuborder6.93624Over-Represented
Rubritaleaceaefamily6.912499Over-Represented

Bottom Line

The next step is to compute similar tables for all symptoms and incorporate these findings into a new algorithm. I say new, because I do not know if it is better than the existing ones. Conceptually, it would be added as a 5th set of suggestions to the existing consensus view on Microbiome Prescription.