Circulating cell-free DNA (cfDNA) methylation sequencing is becoming a key technology for liquid biopsy and early cancer detection. By analyzing methylation patterns in plasma DNA fragments, researchers can identify tissue-of-origin (TOO) signals and detect cancer-associated epigenetic changes.

Figure 1. Concept of cfDNA methylation sequencing and nucleosome-associated fragmentation.

However, one important technical factor is often overlooked: the library preparation strategy.

Two main approaches are commonly used:

  • Double-stranded DNA (dsDNA) library preparation
  • Single-stranded DNA (ssDNA) library preparation

These methods can lead to different methylation measurements and biological interpretations.

1. dsDNA vs ssDNA Library Prep: The Core Difference

The biggest difference between dsDNA and ssDNA library prep lies in how fragmented DNA ends are processed.

dsDNA library prep: Requires enzymatic end repair, which adds unmethylated nucleotides and reduces CpG methylation at fragment ends.

ssDNA library prep: Skips this step, preserving native ends and true methylation signals.

2. Experimental Workflow

Sample Preparation

Plasma cfDNA was extracted from whole blood. DNA concentration was measured using a Qubit fluorometer, and fragment size distribution was assessed using the Agilent Technologies TapeStation system.

Libraries were constructed from 1 ng cfDNA using either double-stranded (dsDNA) or single-stranded (ssDNA) methylation library preparation methods.

Library Preparation Overview

Parameter

dsDNA Library

ssDNA Library

Library strategy

Double-stranded DNA

Single-stranded DNA

Recommended kit

Yeasen-dsDNA Methyl Library Prep Kit for Illumina™ (Cat#12214ES)

Methyl-seq ssDNA Library Prep Kit

Input DNA

1 ng cfDNA

1 ng cfDNA

PCR cycles

11 cycles

11 cycles

Sequencing and Data Processing

Libraries were sequenced on the Illumina NextSeq 2000 platform using 2 × 66 bp paired-end reads.

Bioinformatics Pipeline

Step

Tool

Purpose

Adapter trimming

SeqPrep2

Remove adapter sequences

Alignment

bwa-meth

Map reads to reference genome

Duplicate removal

Picard

Mark PCR duplicates

Methylation calling

MethylDackel

Extract CpG methylation

Filtering criteria:

  • Minimum mapping quality ≥ 20
  • Minimum sequencing depth requirement

Nucleosome-Associated Methylation Analysis

Nucleosome positioning information was obtained from NucPosDB and converted to 1 bp dyad resolution.

Using bedtools, the distance between CpG sites and nucleosome centers was calculated to evaluate methylation patterns relative to nucleosome positioning. 

 Tissue-of-Origin (TOO) Inference

Tissue-of-origin prediction used a methylation reference atlas from Loyfer et al., obtained from the Gene Expression Omnibus dataset GSE186458.

The reference included 13 cfDNA-relevant tissues, such as endothelial cells, granulocytes, and hepatocytes.

TOO Analysis Strategy

Parameter

Description

Algorithm

Non-negative least squares (NNLS)

Training/Test split

70% / 30%

Iterations

100

Evaluation metrics

R² and Mean Squared Error (MSE)

 End-Trimming Correction Analysis

To evaluate correction of methylation bias, dsDNA sequencing reads were trimmed by 20 bp and 40 bp from read ends.

Methylation levels and tissue-of-origin prediction accuracy were then reanalyzed.

3. Results

Two cfDNA samples were used to generate paired ssDNA and dsDNA libraries. Key findings are summarized below.

Table 1. Basic Characteristics of the Two Library Preparation Methods

Table 1. Basic Characteristics of the Two Library Preparation Methods
Figure 1. CpG Methylation Levels Across Read Length

Figure 1. CpG Methylation Levels Across Read Length

3.1 dsDNA Libraries Show Methylation Bias

Observation

dsDNA Library

ssDNA Library

Read-level methylation pattern

CpG methylation decreases after ~40 bp

Uniform across reads

Reverse read 3′ end

Sharp methylation loss

Stable methylation

Nucleosome-associated pattern

Strand-specific hypomethylation

Symmetric methylation

The bias in dsDNA libraries is consistent with end-repair filling of 5′ overhangs with unmethylated cytosines.

3.2 dsDNA Libraries Show Lower Global Methylation

Metric

dsDNA

ssDNA

Global CpG methylation

Reduced

Higher

Estimated difference

~5–7% lower

Baseline

This reduction overlaps with nucleosome-associated methylation valleys, amplifying measurement bias.

3.3 ssDNA Libraries Improve Tissue-of-Origin Prediction

Metric

dsDNA

ssDNA

Lower

Higher

MSE

Higher

Lower

Informative CpG subsets

Reduced signal

Strong signal

The improvement is particularly evident for high-information CpG sites:

methylation < 0.2 or > 0.8

methylation < 0.1 or > 0.9

For uninformative unmethylated CpGs, both methods showed similar performance.

3.4 End Trimming Partially Corrects dsDNA Bias

Trimming length

Effect

20 bp

Partial correction

40 bp

Methylation closer to ssDNA

After trimming:

  • R² increased
  • MSE decreased

However, trimming also introduces limitations:

  • Reduced unique sequencing coverage
  • Loss of fragment end coordinates
  • Reduced ability to perform fragmentomics analysis

4. Conclusion

Key Finding

Impact

End-repair in dsDNA library prep introduces unmethylated cytosines

Causes methylation underestimation

ssDNA library prep avoids end repair

Preserves native fragment ends

ssDNA libraries retain authentic methylation signals

Improves cfDNA methylation analysis

ssDNA library preparation better preserves native cfDNA methylation signals. Meanwhile, dsDNA workflows remain widely used due to their simplicity, robustness, and scalability for routine sequencing applications.

Related Product

Product Name

Cat. No.

Application

HieffTM Superfast DNA Methylation Bisulfite Kit

12228ES

Column-based DNA methylation conversion

HieffTM Mag Superfast DNA Methylation Bisulfite Kit

12229ES

Magnetic bead-based DNA methylation conversion

Hieff NGSTM Methyl-seq ssDNA Library Prep Kit for Illumina V2

12221ES

SsDNA methylation library prep for Illumina

Hieff NGSTM Methyl-seq ssDNA Library Prep Kit for MGI V2

12226ES

SsDNA methylation library prep for MGI

Hieff NGSTM dsDNA Methyl Library Prep Kit for Illumina

12214ES

DsDNA methylation library prep for Illumina

HieffTM UltraShear

12218ES

Enzyme-based methylation fragmentation

Hieff CanaceTM Uracil+ High-Fidelity DNA Polymerase Mix

12928ES

Post-bisulfite PCR amplification

 

Reference
[1] GROTH T E, MISHIN A A, RAO V, et al. End-repair causes methylation underestimation in cell-free DNA sequencing libraries[EB/OL]. bioRxiv Preprint, 2025[2025-12-17]. https://doi.org/10.64898/2025.12.15.694439.

Расследование