Skip to Content
CLI Usagechr3d digest

chr3d digest

chr3d digest [OPTIONS] FASTA

Generate restriction fragment BED from a genome FASTA.

In silico digest a genome FASTA with a restriction enzyme and output a BED file of fragments. This BED file can be passed to bulk-hic / sn-hic via --fragment-bed to enable fragment-aware pair parsing with pairtools.

Examples

Single Enzyme

chr3d digest -e MboI -o hg38_MboI.bed /data/genomes/hg38.fa

Dual Enzyme (Arima Kit)

chr3d digest -e MboI -e GATC^ -o arima_frags.bed /data/genomes/hg38.fa

Custom Recognition Site

chr3d digest -e A^AGCTT -o fragments.bed /data/genomes/hg38.fa

With Size Filtering

chr3d digest -e HindIII --min-size 50 --max-size 500000 -o fragments.bed genome.fa

Supported Enzymes

EnzymeRecognition Site
HindIIIA^AGCTT
DpnII^GATC
MboI^GATC
BglIIA^GATCT
Sau3AI^GATC
Hinf1G^ANTC
NlaIIICATG^
AluIAG^CT
EcoRIG^AATTC
BamHIG^GATCC
PstICTGCA^G
SalIG^TCGAC
XbaIT^CTAGA

Or pass the raw recognition site with cut position, e.g., A^AGCTT.

Arguments

ArgumentDefaultDescription
FASTAGenome FASTA file (plain or gzipped)
-e, --enzyme NAME_OR_SITEEnzyme name or recognition site. Repeat flag for multiple enzymes (Arima kit)
-o, --output BEDOutput BED file path
--min-size INT20Minimum fragment size to keep (bp)
--max-size INT10000000Maximum fragment size to keep (bp)
-v, --verboseEnable DEBUG-level logging
--log-file FILEWrite log to FILE
Last updated on