Parameters
input_file_cond1: 20101125_LUjt8_1_hg19_ensembl59_G_iCLIP_TDP-43_SH-SY5Y_bedGraph-cDNA-hits-in-genome.bed.gz
input_file_cond1_res_type: single hits in whole-genome, cDNA counts
input_file_cond2: 20101125_LUjt8_4_hg19_ensembl59_G_iCLIP_TDP-43_BrainC6_bedGraph-cDNA-hits-in-genome.bed.gz
input_file_cond2_res_type: single hits in whole-genome, cDNA counts
mapped_to: hg19
annot_ver: ensembl59
window_size: 350
MA_window_size: 10

Differential analysis

MA plots are used for differential analysis of condition 1 (cond1) vs. condition 2 (cond2).
Data (bedGraphs) from two conditions are quantile-normalized first, then put into a MA-plot.
A sliding window of 10% of A axis span is used to calculate z-scores (of M).

Positive z-score indicates over-representation of counts in condition 1.
Negative z-score indicates over-representation of counts in condition 2.

Because bedGraph format uses the sign to indicate strand, results are split into two bedGraphs. Magnitude equals to the absolute value of z-score and sign indicates strand:

A third bedGraph is also produced showing only windows centered at peaks of high-density smoothed absolute z-values.

MA plots

M=log(count_cond1/count_cond2), A=log(count_cond1)+log(count_cond2).


Top 30 regions with highest density of difference (z-scores) in counts

Click on region to automatically upload the two bedGraphs with z-scores of differential analysis and start exploring in UCSC browser.

smoothed_abs_z_scorez_scoreAMnormalized count in condition 1normalized count in condition 2regiongene_namesense relative
to window
seg_typeseg_biotype
98.7513.693.557.10137.51chr1-:204194386..204195086PLEKHA6sameintronprotein_coding
55.0313.003.416.81112.51chr10+:43589940..43590640RETsameintronprotein_coding
40.711.829.411.04973472.5chr1+:109642572..109643272snoU2-25samencRNAsnoRNA
37.792.945.351.607123.5chrX-:73047366..73048066XISTsamencRNAprocessed_transcript
31.463.705.071.966617chr7-:100805600..100806300VGFsame3UTRprotein_coding
31.02-10.071.26-4.520.511.5chr14+:101312607..101313307MEG3samencRNAprocessed_transcript
30.14-7.813.62-3.623.543chr10-:116193698..116194398ABLIM1same3UTRprotein_coding
27.6810.942.444.8829.51chr1-:204193550..204194250PLEKHA6sameintronprotein_coding
27.352.115.071.21510chr1-:204194796..204195496PLEKHA6sameintronprotein_coding
27.172.405.001.355120chrX-:73046829..73047529XISTsameintronprocessed_transcript
26.712.695.061.495620chr2-:72356609..72357309CYP26B1same3UTRprotein_coding
25.73-8.951.79-3.58112chr3+:173545093..173545793NLGN1sameintronprotein_coding
24.199.013.404.7955.52chr10+:43604036..43604736RETsameintronprotein_coding
22.612.284.061.3426.510.5chr9-:14276979..14277679NFIBsameintronprotein_coding
22.381.388.550.84502279.5chr7+:98479058..98479758TRRAPsameintronprotein_coding
22.05-1.737.58-0.58156.5234.5chr3-:160232566..160233266snoU90samencRNAsnoRNA
22.018.841.883.7513.51chr1+:67007730..67008430SGIP1sameintronprotein_coding
21.32-5.382.70-2.22314chr2+:149880731..149881431KIF5CsameORFprotein_coding
20.24-7.701.58-3.1719chr16+:58546746..58547446NDRG4same3UTRprotein_coding
18.63-7.461.54-3.0918.5chr3+:41281136..41281836CTNNB1same3UTRprotein_coding
18.257.101.583.1791chr1+:211474011..211474711RCOR3sameintronprotein_coding
18.075.839.152.891543208.5chr11+:65271557..65272257intersameinter
17.94-3.873.14-1.63515.5chr2-:127811348..127812048BIN1sameintronprotein_coding
17.722.603.581.5520.57chr9-:66456255..66456955intersameinter
17.382.315.531.3072.529.5chr15-:37391069..37391769MEIS2same5UTRprotein_coding
17.28-8.511.04-4.090.58.5chr9+:124094610..124095310GSNsame3UTRprotein_coding
17.25-6.661.40-2.8117chr5-:81031639..81032339SSBP2sameintronprotein_coding
16.94-7.113.20-3.22328chr2+:45886852..45887552PRKCEsameintronprotein_coding
16.73-7.520.85-3.700.56.5chr8+:91968838..91969538NECAB1same3UTRprotein_coding
16.463.275.191.75670chr1-:204193927..204194627PLEKHA6sameintronprotein_coding
Download full list in tab-delimited format.

Other measures of (dis)similarity

x-links in condition 1: 165524
x-links in condition 2: 17741
All x-link positions (union): 182831
Positions shared by both conditions (intersection): 434
Jaccard index (intersection/union): 0.0024

x-link position similarity

For each x-link in condition 1 (condition 2), look for nearest x-link in condition 2 (condition 1). Draw histogram of distances to closest neighbors.

Average distances (full span), all include distance 0: Average distances (-200..200 window), all include distance 0: [eps]
Download graphs in tab-delimited format.

Count similarity

For each x-link position with specific value stored in input bedGraph for condition 1 (condition 2) look if same position in bedGraph for condition 2 (condition 1) has a non-zero value.

[eps]
Download graphs in tab-delimited format.

k-mers similarity

For k-mer analyses performed with same parameters on both conditions, report on similarity of results.
Similarity is measured by correlation of z-scores of all k-mers, and number of top k-mers shared by both conditions.

Available k-mer analyses:
comparedanalysis parametersconditionanalysis id
Trueintervals=[(-30, -10), (10, 30)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']condition 152299
Trueintervals=[(-30, -10), (10, 30)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']condition 252326
Trueintervals=[(-50, 50)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']condition 156690
Trueintervals=[(-50, 50)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']condition 256699
Compared (marked as compared=True above) are only results from analyses with exactly same parameter values used to analyse both conditions.

There are 2 such pairs of analyses:
  1. intervals=[(-30, -10), (10, 30)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']
    k-mers with computed scores in condition 1: 1024
    k-mers with computed scores in condition 2: 1024
    Pearson correlation of all k-mers with score in both conditions: 0.954

    top 15 k-mers in condition 1: GTGTG, TGTGT, TGTGC, CGTGT, ATGTG, GTATG, GTGTA, GTGCG, TGCGT, TGTAT, GAATG, TGTGA, TATGT, TTGTG, GCGTG
    top 15 k-mers in condition 2: GTGTG, TGTGT, GTGTA, CGTGT, ATGTG, TGTAT, GTGCG, TGTGC, GTATG, GCGTG, TGCGT, TATGT, AGTGT, GTGTT, TGTGA
    intersection among top 15 k-mers: (13) GTGTG, TGTGT, CGTGT, ATGTG, GTGTA, TGTGC, GTATG, GTGCG, TGTAT, TGCGT, GCGTG, TATGT, TGTGA
    intersection among top 100 k-mers: 78

    Top 15 k-mers with best average rank:
    average rank
    condition 1 & 2
    kmerrank
    condition 1
    z-score
    condition 1
    rank
    condition 2
    z-score
    condition 2
    2GTGTG2375.0287.5
    3TGTGT3354.9386.6
    5CGTGT5129.1530.5
    6ATGTG6123.3626.3
    6GTGTA8116.9431.2
    6.5TGTGC4132.5921.5
    8.5GTATG7123.11021.3
    8.5GTGCG9114.2822.3
    9TGTAT1198.0724.4
    11TGCGT10104.01220.6
    13.5GCGTG1677.31120.6
    13.5TATGT1480.91319.5
    14.5TGTGA1385.11616.4
    16TTGTG1580.01715.6
    17.5GAATG1290.62313.4
  2. intervals=[(-50, 50)], k=5, random_perms=100, valid_region_types=['ncRNA', 'ORF', '3UTR', '5UTR', 'intron']
    k-mers with computed scores in condition 1: 1024
    k-mers with computed scores in condition 2: 1024
    Pearson correlation of all k-mers with score in both conditions: 0.928

    top 15 k-mers in condition 1: GTGTG, TGTGT, GTATG, GTGTA, ATGTG, GAATG, CGTGT, TGTAT, TGCGT, TGTGC, TGAAT, GTGCG, TATGT, TTGTG, GTGTT
    top 15 k-mers in condition 2: GTGTG, TGTGT, GTGTA, CGTGT, GTATG, TGCGT, GTGCG, GTGTT, ATGTG, TGTAT, TATGT, GCGTG, TGTTT, TTGTG, GAATG
    intersection among top 15 k-mers: (13) GTGTG, TGTGT, GTGTA, GTATG, CGTGT, ATGTG, TGCGT, TGTAT, GTGCG, GAATG, GTGTT, TATGT, TTGTG
    intersection among top 100 k-mers: 78

    Top 15 k-mers with best average rank:
    average rank
    condition 1 & 2
    kmerrank
    condition 1
    z-score
    condition 1
    rank
    condition 2
    z-score
    condition 2
    2GTGTG2344.4263.9
    3TGTGT3299.7352.5
    4.5GTGTA5195.9449.3
    5GTATG4233.1639.4
    6.5CGTGT8165.6540.4
    8ATGTG6183.61031.5
    8.5TGCGT10145.6736.7
    10TGTAT9154.81131.0
    10.5GTGCG13133.5834.9
    11.5GAATG7175.91625.6
    12.5GTGTT16121.7934.0
    13TATGT14131.01230.3
    14.5TGTGC11144.11822.5
    15TTGTG15122.51526.3
    16TGAAT12142.02021.9
Download full list(s) in tab-delimited format.

RNAmaps similarity

For RNAmap analyses performed with same parameters on both conditions, report on similarity of results.
RNAmaps are compared by normalizing them first to sum 1.0 and then taking the average squared absolute difference (units are squared abs. difference per 1000 positions).

Available RNAmap analyses:
comparedanalysis parametersconditionanalysis id
TruemaxD_flanking=1000, maxD_within=300, maxU_flanking=-1000, maxU_within=-300, smoothing=0condition 152300
TruemaxD_flanking=1000, maxD_within=300, maxU_flanking=-1000, maxU_within=-300, smoothing=0condition 252327
Compared (marked as compared=True above) are only results from analyses with exactly same parameter values used to analyse both conditions.

There are 1 such pairs of analyses:
  1. maxD_flanking=1000, maxD_within=300, maxU_flanking=-1000, maxU_within=-300, smoothing=0
    Top 10 RNAmap variables ranked by increasing difference between two conditions.
    squared abs differenceRNAmapvariablemap in condition 1map in condition 2
    0.00493-inter[x-link sites] (same)linklink
    0.00503-internormalized [x-link sites] (same)linklink
    0.00503-inter[single hits in whole-genome, cDNA counts] (same)linklink
    0.00523-internormalized [single hits in whole-genome, cDNA counts] (same)linklink
    0.0054exon-intron[x-link sites] (same)linklink
    0.0058exon-intron[single hits in whole-genome, cDNA counts] (same)linklink
    0.00653UTR-inter[x-link sites] (same)linklink
    0.00663UTR-internormalized [x-link sites] (same)linklink
    0.00663UTR-inter[single hits in whole-genome, cDNA counts] (same)linklink
    0.00673UTR-internormalized [single hits in whole-genome, cDNA counts] (same)linklink
Download full list in tab-delimited format.