ATAC Peaks Annotation
File: [sample_name]_peak_annotation.tsv.gz
The peak annotation file is a tab-separated, gzipped file that connects each peak with nearby genes, or classifies the peak as intergenic. In addition to the peak coordinates and the gene symbols connected to the peak, each connection is classified as promoter or distal, and the distance to the nearest transcription start site for that gene is listed.
The promoter region is -1000 to +100 base pairs from the nearest transcription start site (TSS).
The distal region is outside of the promoter region, but within 20,000 base pairs upstream or downstream of the TSS.
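The two definitions above can be sketched as a small rule on a signed distance to the TSS. This is only an illustration, not the pipeline's implementation: the function name `classify_connection` is hypothetical, and it assumes that negative distances mean upstream of the TSS, that the distance is a single base-pair offset, and that the window boundaries are inclusive.

```python
from typing import Optional

# Window limits taken from the definitions above (in base pairs).
PROMOTER_UPSTREAM = -1000
PROMOTER_DOWNSTREAM = 100
DISTAL_LIMIT = 20_000


def classify_connection(signed_distance: int) -> Optional[str]:
    """Classify one peak-gene pair by its signed distance to the TSS.

    Assumptions (not confirmed by the documentation): negative values are
    upstream of the TSS, and the window boundaries are inclusive.
    Returns None when the pair falls outside the distal window.
    """
    if PROMOTER_UPSTREAM <= signed_distance <= PROMOTER_DOWNSTREAM:
        return "promoter"
    if abs(signed_distance) <= DISTAL_LIMIT:
        return "distal"
    return None  # no connection between this peak and this gene
```

Under this sketch, a peak for which no gene yields a connection would be the intergenic case.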
This peak annotation is an estimate of peak-to-gene connections and does not suggest that each connection is experimentally verified.
The columns are:
Column | Name | Description |
---|---|---|
1 | peak | Peak name in form of chrom:start-end |
2 | gene | List of genes connected to the peak, separated by ; |
3 | distance | List of distances from the peak to each gene's closest TSS, separated by ; |
4 | peak_type | List of classifications of the peak relative to each gene, separated by ; |
5 | Chromosome | Chromosome where peak is located |
6 | Start | Start coordinate of the peak |
7 | End | End coordinate of the peak |
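Because columns 2 to 4 hold parallel semicolon-separated lists, it is often convenient to expand each row into one record per peak-gene connection. The following is a minimal sketch using only the Python standard library. The function name `read_peak_gene_links` is hypothetical, and the code assumes that the three lists in a row have the same length and order, and that the file may or may not start with a header row. Distances are kept as strings, since the exact encoding for intergenic peaks is not described here.

```python
import csv
import gzip

# Column names as listed in the table above.
COLUMNS = ["peak", "gene", "distance", "peak_type", "Chromosome", "Start", "End"]


def read_peak_gene_links(path):
    """Yield one dict per peak-gene connection from a peak annotation file."""
    with gzip.open(path, "rt", newline="") as handle:
        for row in csv.reader(handle, delimiter="\t"):
            # Skip blank lines, comment lines, and an optional header row.
            if not row or row[0].startswith("#") or row[0] == "peak":
                continue
            record = dict(zip(COLUMNS, row))
            genes = record["gene"].split(";")
            distances = record["distance"].split(";")
            peak_types = record["peak_type"].split(";")
            # Assumption: the three lists are aligned position by position.
            for gene, distance, peak_type in zip(genes, distances, peak_types):
                yield {
                    "peak": record["peak"],
                    "gene": gene,
                    "distance": distance,  # left as a string on purpose
                    "peak_type": peak_type,
                    "Chromosome": record["Chromosome"],
                    "Start": int(record["Start"]),
                    "End": int(record["End"]),
                }
```

For example, `list(read_peak_gene_links("sample_peak_annotation.tsv.gz"))` (with a placeholder file name) would give a flat list that can be filtered by `peak_type` or grouped by `gene`.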
This information is also preloaded in the H5MU output as unstructured information in the muon object.