Improving epigenetic clocks’ signal-to-noise ratio

This 2021 computational study investigated several methods of improving epigenetic clock reliability:

“Epigenetic clocks are widely used aging biomarkers calculated from DNA methylation data. Unfortunately, measurements for individual CpGs can be surprisingly unreliable due to technical noise, and this may limit the utility of epigenetic clocks.

Noise produces deviations up to 3 to 9 years between technical replicates for six major epigenetic clocks. Elimination of low-reliability CpGs does not ameliorate this issue.

We present a novel computational multi-step solution to address this noise, involving performing principal component analysis (PCA) on the CpG-level data followed by biological age prediction using principal components as input. This method extracts shared systematic variation in DNAm while minimizing random noise from individual CpGs.

Our novel principal-component versions of six clocks show agreement between most technical replicates within 0 to 1.5 years, equivalent or improved prediction of outcomes, and more stable trajectories in longitudinal studies and cell culture. This method entails only one additional step compared to traditional clocks, does not require prior knowledge of CpG reliabilities, and can improve the reliability of any existing or future epigenetic biomarker.

PC-based clocks showed greatly improved agreement between technical replicates, with 90+% agreeing within 1-1.5 years. The median deviation ranged from 0.3 to 0.8 years, whereas CpG clocks ranged from 0.9-2.4 years.

PCPhenoAge vs. PhenoAge

The most dramatic improvement was in PhenoAge. CpG-trained PhenoAge has a median deviation of 2.4 years, 3rd quartile of 5 years, and maximum of 8.6 years. In contrast, PCPhenoAge has a median deviation of 0.6 years, 3rd quartile of 0.9 years, and maximum of 1.6 years. PCPhenoAge was trained directly on phenotypic age based on clinical biomarkers rather than DNAm.

Correlations between different PC clocks was stronger than between CpG clocks. This may be partly due to the shared set of CpGs used to train PCs, or due to the reduction of noise that would have biased correlations towards the null. Correlations between PC clocks and CpG clocks tended to be stronger compared to correlations between CpG clocks and CpG clocks, consistent with a reduction of noise.

PC clocks preserve relevant aging signals unique to each of their CpG counterparts. They reduce technical variance but maintain relevant biological variance.

PCA is a commonly used tool and does not require specialized knowledge. High reliability of principal component-based epigenetic clocks will make them particularly useful for applications in personalized medicine and clinical trials evaluating novel aging interventions.” “A Computational Solution to Bolster Epigenetic Clock Reliability for Clinical Trials and Longitudinal Tracking”

I appreciate that a coauthor – who is the originator of PhenoAge – is open to evidence and improvements. There’s a fun do-it-yourself demo of PCA at

I found this study from it citing a 2021 review: “Aging biomarkers and the brain” (not freely available)

I found that review from it citing a 2020 study: “Human Gut Microbiome Aging Clock Based on Taxonomic Profiling and Deep Learning”

Maybe this last study could be improved from its “mean absolute error of 5.91 years” with PCA? See Part 2 for another view.


4 thoughts on “Improving epigenetic clocks’ signal-to-noise ratio

  1. I love your site and the information that you provide on broccoli sprouts, thank you!

    I have been following your microwave method for the past year. However, I will be traveling for work/vacation over the next month with limited access to a microwave or stove. Do you know if this daily method would be sufficient during that time: (1) mix a tbsp of broccoli sprout seeds with a tbsp of brown mustard seeds; (2) grind them in a handheld coffee grinder; (3) let them sit for 5-10 mins; (4) consume?

    I also considered just chewing up the mixture (e.g., instead of doing steps (2) and (3)) but I can’t chew as efficiently as grinding. Just wondering if you had any ideas.

    • Sounds like a winner, Carl! Whatever you can do to keep up an intake.
      I was impressed by the Eat broccoli sprouts every day rodent study. Those researchers intentionally destroyed myrosinase enzyme and sulforaphane in cooked broccoli.
      Subjects still benefited by keeping their gut microbiota engaged with hydrolyzing broccoli glucoraphanin.

      • Thank you for the response and great info! One last question – do you have any suggestions as to what seed supplier/vendor is best? I’ve typically just bought the brand that is highest rated on amazon. However, you would know more about what’s best/worst when it comes to SFN yield.

        Thank you!

        • Haha, I wish, Carl. Vendors are missing a marketing opportunity in that none of the ones I’ve seen have credible claims for sulforaphane yield.
          The latest broccoli seed purchase I made was a 5 lb. can at $17.50 / lb. from a vendor who has no idea of their product’s value.

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

This site uses Akismet to reduce spam. Learn how your comment data is processed.