Compare TF Sites across Sequences from Genomic Variants

Take in the genomic coordinates of variants to map how ref/alt sequence changes impact the function of TF binding sites. In biomedical applications, comparisons can be made between reference and alternate alleles that are associated with diseases or changes in gene expression.
Links: Tutorial Video Documentation

Parameters

Provide a name for this analysis.
Genome* Pickled genome file that corresponds to the genomic coordinates provided. This is used to extract the sequences to be compared. Start by selecting one of our genomes or uploading your own .
Tab-separated file containing the list of genomic coordinates to analyze. There should be at least 5 columns in this file, including the chromosome, position, reference allele, alternate allele, and hypothesis (values can include gof/lof/both/na).
chrom* pos* ref* alt* hypothesis*
1 if the provided position coordinates are one-indexed and 0 if they are zero-indexed.
Length of the binding sites that are being analyzed. This will be used to determine the number of nucleotides to include on each side of a variant when extracting the surrounding sequence
*Input one of the following TF datasets:

All the information for the transcription factors being analyzed, including its name, binding site definition, desired color on the plot, any PBM relative affinity data, and any PFM relative score data. You can generate an affinity dataset with the Normalize TF-DNA Affinity Data tool.
TF Name* Core Site* Affinity Data create an affinity dataset download example
or
* Enter the minimum binding change threshold for analysis.