nucleosome-mapping

Eddie Cano Gamez

Oct 11, 2024

de1298a · Oct 11, 2024

Name	Name	Last commit message	Last commit date
parent directory ..
1_create-sliding-windows.sh	1_create-sliding-windows.sh	Updating paths	Feb 6, 2024
2_calculate-WPS.sh	2_calculate-WPS.sh	Adding help message and correcting paths	Feb 14, 2024
3_fetch-WPS-per-chromosome.sh	3_fetch-WPS-per-chromosome.sh	Updating R module versions	Oct 11, 2024
README.md	README.md	Updating name	Oct 11, 2024
fetch-WPS-per-gene.R	fetch-WPS-per-gene.R	Updating name	Oct 11, 2024

README.md

Mapping of nucleosome positions around genes using cfDNA

Author: Kiki Cano-Gamez

Email: kiki.canogamez@well.ox.ac.uk

Overview

This directory contains codes to infer nucleosome positions around TSS regions.

To do so, 5 kb windows centred at the TSS are constructed for each transcript in gencode. Next, a sliding window approach is used to generate windows of size k (e.g. 120 bp) that can be used to scan the entire region.

Fragmentomic information (i.e. fragment sizes for each paired-end read) is first used to identify mononucleosomal cfDNA fragments (i.e. 120 - 200 bp). Next, the intersection between these fragments and each sliding window is quantified.

Nucleosome positioning is finally inferred using windowed protection scores (WPS), an approach proposed by Snyder et al. (https://doi.org/10.1016/j.cell.2015.11.050). In brief, the WPS of a window is defined as the number of cfDNA fragments completely encompassing that region minus the number of cfDNA fragments with breakpoints (i.e. beginning or end sites) within the same region. High WPS values indicate a region is protected from nuclease cutting, which indicates a nucleosome is positioned on it. Low WPS values indicate higher nuclease cutting rates, which suggest the genomic position in question is not bound by a nucleosome.

Repository structure

The codes contained within this repository are written in bash and ordered as follows:

./
 |-- 1_create-sliding-windows.sh	Creates sliding windows of size 'k', with an sliding step size 's' for the region around the TSS of each transcript reported in gencode.
 |-- 2_calculate-WPS.sh			Calculates WPS scores for each sliding window using bedtools intersect. This is done by substracting the number of intersections with 100% overlap (i.e. -f 1) minus the number of incomplete intersections (i.e. overlap < 100%)
 `-- 3_fetch-WPS-per-chromosome.sh	Takes the outputs from step 2 and summarises them by chromosome and gene. This code parallelises the analysis on a per chromosome basis, with each chromosome being analysed using the 'fetch-WPS-per-gene.R' code in this directory.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Files

nucleosome-mapping

nucleosome-mapping

README.md

Mapping of nucleosome positions around genes using cfDNA

Overview

Repository structure

Collapse file tree

Files

nucleosome-mapping

Directory actions

More options

Directory actions

More options

Latest commit

History

nucleosome-mapping

Folders and files

parent directory

README.md

Mapping of nucleosome positions around genes using cfDNA

Overview

Repository structure