Best Database Of Transcription Factor Binding Sites
8
9
Entering edit mode
13.7 years ago
Dataminer ★ 2.8k

(a) Which is the best database for transcription factor binding site, TRANSFAC or JASPAR? Why?

(b) Is it still valid to assume, "transcription factor binding sites are only present in 5'UTR and in promoter region"?

Note: kindly put references in your answer.

transcription binding database • 18k views
ADD COMMENT
0
Entering edit mode

I think it is pretty optimistic to think that there is a reference to document that one specific database of TF binding sites is better than another. How would you even define "best" in this case?

ADD REPLY
0
Entering edit mode

Changed title to be more descriptive

ADD REPLY
0
Entering edit mode

True, I agree, but what should be the preference while selecting one database and not the other.

ADD REPLY
15
Entering edit mode
13.7 years ago
Farhat ★ 2.9k
  1. defining 'best' is a tricky task, both JASPAR and TRANSFAC are good, though one advantage of JASPAR is that it is open vs. TRANSFAC which is not. The free version of TRANSFAC is quite behind the times.
  2. No. TFBSs can occur far away from the TSS as on enhancers http://en.wikipedia.org/wiki/Enhancer_(genetics) or even in introns (see, http://ieeexplore.ieee.org/stamp/stamp.jsp?arnumber=05479005)
ADD COMMENT
9
Entering edit mode
12.8 years ago

This very much depends on what you are trying to accomplish.

TRANSFAC and JASPAR are databases of transcription factor binding models. Using experimental techniques like SELEX and now increasingly ChIP-Seq we can identify the kinds of sequences that a particular transcription factor (TF) has affinity for. This affinity is typically represented with something like a position weight matrix (PWM) or position-specific scoring matrix (PSSM). Those models can be used to scan a genome for "putative TF binding sites" that have a high similarity to the model. Whether those sites are actually bound by a TF and play a functional role in transcription and under what conditions must still be determined experimentally be traditional molecular techniques like promoter bashing, reporter gene assays, ChIP experiments, etc.

Databases like ORegAnno and PAZAR have attempted to collect and curate individual experimentally proven TFBSs from the literature and sets from ChIP-Seq and other genome-wide experiments. Technologies like ChIP-Seq can give us a genome-wide profile of all the binding sites for a particular transcription factor under a specific condition. Many of these TF binding sites (TFBSs) are being deposited into species-specific databases (FlyBase, SGD, etc) and as tracks in the UCSC browser. In particular, see efforts by ENCODE. A good starting point would be to look through the tracks under 'Regulation' in the UCSC Genome Browser for your species of interest.

ADD COMMENT
3
Entering edit mode
13.7 years ago

I just happened to ask how JASPAR compares to Transfac to members of the JASPAR team present at a meeting in Montpellier, I think about 2 years ago. The answer I got was "Transfac (the Pro version) covers more, but our quality is better".

If you mention Transfac Pro I think you should also mention Genomatix. They stem from the same basic Transfac development in Germany. There were two commercial lines started from this project and they both developed upon the original open Transfac. They were in fact early endeavors to see whether government funded investments in databases could be made sustainable through offering commercial services.

Academic users can use most of parts of the Genomatix toolset and database with a free account. Access to (part of?) the commercial content of the Transfac Pro is possible through websites that use it behind the scenes.

ADD COMMENT
3
Entering edit mode
12.8 years ago
Qdjm 1.9k

You might also want to consider the UniPROBE database which contains PSSM models based on protein-binding microarrays (PBMs).

If you are doing your analysis in yeast, you should consider the YeTFaSCo database which is a curated collection of PSSM models for most of the yeast TFs.

Hundreds of transcription factor (TF) binding preferences have already been queried using PBMs, soon thousands will be. JASPAR and TRANSFAC are a bit slow in keeping up with these new data, it may be worth it to keep an eye on the work of Tim Hughes and Martha Bulyk who are the ones actively generating TF binding preference data using PBMs.

ADD COMMENT
1
Entering edit mode
10.3 years ago

Just to show that the free version 3.2 of transfac is really old: Version 3.2 is from 1997.

ftp://ftp.ebi.edu.au/pub/databases/transfac/tfdoc32.txt

ADD COMMENT
1
Entering edit mode
8.6 years ago

For PSSMs/PWMs of human/mouse TFs you may also try HOCOMOCO (http://hocomoco.autosome.ru).

ADD COMMENT
0
Entering edit mode
8.1 years ago
jin ▴ 80

You can access the TF binding sites in Plants at PlantRegMap, which are derived from ChIP-seq and DNase I genomic footprinting.

ADD COMMENT
0
Entering edit mode
22 months ago
nta • 0

Genexplain recently uploaded a white paper explaining the difference between TRANSFAC and JASPAR. It seems when we use TRANSFAC there are chances that we don't miss important matrices that play a role in regulation of gene of interest. The white paper link can be found here:

https://genexplain.com/wp-content/uploads/2023/01/TRANSFAC-revealed-IRF3-site-in-SNPs-associated-with-high-severity-of-Covid-19-missed-by-JASPAR.pdf

Hopefully this is still helpful to someone.

ADD COMMENT

Login before adding your answer.

Traffic: 1528 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6