Gene transcriptions/TATA binding proteins/Associated factors

From Wikiversity
Jump to navigation Jump to search
The diagram illustrates the approximate application and location of TBP during gene transcription. Credit: Robert Tjian.

When there is no TATA box nucleotide sequence in the gene core promoter region of the DNA next to a gene, say A1BG of the human genome, a TATA binding protein associated factor (TAF) will bind sequence specifically and force the TATA box binding protein to bind non-sequence specifically to the DNA in the core promoter.

Notations[edit | edit source]

Notation: let the symbol TAF stand for TATA binding protein associated factor. Notation: let the symbol TBP stand for TATA binding protein.

Genetics[edit | edit source]

This is an image of Bob, the guinea pig. Credit: selbst.
This guinea pig has gorgeous long hair and was a prize winner at the Puyallup, WA fair. Credit: Christine from Washington State, USA.

Genetics involves the expression, transmission, and variation of inherited characteristics.

Def. a "branch of biology that deals with the transmission and variation of inherited characteristics, in particular chromosomes and DNA"[1] is called genetics.

Gene transcriptions[edit | edit source]

DNA is a double helix of interlinked nucleotides surrounded by an epigenome. On the basis of biochemical signals, an enzyme, specifically a ribonucleic acid (RNA) polymerase, is chemically bonded to one of the strands (the template strand) of this double helix. The polymerase, once phosphorylated, begins to catalyze the formation of RNA using the template strand. Although the catalysis may have more than one beginning nucleotide (a start site) and more than one ending nucleotide (a stop site) along the DNA, each nucleotide sequence catalyzed that ultimately produces approximately the same RNA is part of a gene. The catalysis of each RNA representation from the template DNA is a transcription, specifically a gene transcription. The overall process is also referred to as gene transcription.

Theoretical TBP associated factors[edit | edit source]

Here's a theoretical definition:

Def. any factor associating with the TATA binding protein (TBP) is called a TBP associated factor (TAF).

TATA boxes[edit | edit source]

This image is a drawing of Haloquadratum walsbyi. Credit: Rotational.

A TATA box is a common type of core promoter sequence in eukaryotes which is a short DNA sequence.

The TATA box (also called Goldberg-Hogness box)[2] is a DNA sequence (cis-regulatory element) found in the promoter region of genes in archaea and eukaryotes;[3] approximately 24% of human genes contain a TATA box within the core promoter.[4]

The TATA box is a binding site of either general transcription factors or histones.

In the direction of transcription along the DNA strand, the TATA box has the core DNA sequence [3'-TATAAA-5'] or a variant, which is usually followed by three or more adenine [(A)] bases, specifically [3'-TATAAA(A)AAA-5' on the template strand].

"[M]ost of the diversity within metazoan core promoters appears to involve the variable occurrence of consensus or near-consensus TATA, Inr, and DPE elements."[5]

The TATA box can be an AT-rich sequence "located at a fixed distance upstream of the transcription start site".[3]

TATA binding proteins[edit | edit source]

The TATA-binding protein (TBP) is a general transcription factor that binds specifically to a DNA sequence called the TATA box. This DNA sequence is found about 30 base pairs upstream of the transcription start site in some eukaryotic gene promoters.[6] TBP, along with a variety of TBP-associated factors, make up the TFIID, a general transcription factor that in turn makes up part of the RNA polymerase II preinitiation complex.[7] As one of the few proteins in the preinitiation complex that binds DNA in a sequence-specific manner, it helps position RNA polymerase II over the transcription start site of the gene.

Initiator elements[edit | edit source]

Notation: let the symbol Inr denote an initiator element.

Notation: let the symbol +1 designate the nucleotide that is the transcription start site (TSS).

Most human genes lack a TATA box and use an Inr or downstream promoter element instead. As in other metazoans, for genes lacking a TATA box, the Inr is functionally analogous, with a base pair (bp) consensus 5'-YYA+1NWYY-3', to direct transcription initiation.[8] On the template strand (used as a template for RNA synthesis), the consensus sequence is 3'-YYA+1NWYY-5'.

The Inr is the only element in metazoan protein-encoding genes known to be a functional analog of the TATA box, in that it is sufficient for directing accurate transcription initiation in genes that lack TATA boxes.[9] An Inr for mammalian RNA polymerase II can be defined as a DNA sequence element that overlaps a TSS and is sufficient for

  1. determining the start site location in a promoter that lacks a TATA box and
  2. enhancing the strength of a promoter that contains a TATA box.[10]

"Although any isolated TAF may not exhibit sequence-specific interactions at the Inr element in the absence of a TATA-box, a combination of TAFs may bind sequence specifically to the Inr element regardless of the TATA-box and/or DPE (Chalkley and Verrijzer, 1999)."[11] Bold added.

TAF1[edit | edit source]

GeneID: 6872, TAF1 "binds to core promoter sequences encompassing the transcription start site. It also binds to activators and other transcriptional regulators, and these interactions affect the rate of transcription initiation."[12] TAF1 "is part of a complex transcriptional unit (TAF1/DYT3)".[12]

"Yeast TAF1 can be divided into four regions including a putative histone acetyltransferase domain and TBP, TAF, and promoter binding domains."[13]

"TAF1 [has] been systematically dissected into ... functional domains: an N-terminal TBP-binding domain termed TAND, a TAF-TAF interaction domain, a putative histone acetyltransferase (HAT) domain, ... a promoter recognition domain [(PB1), and] a ... domain that interacts with TAF7"[13]. The promoter recognition domain is approximately at one end of the gene for TAF1.[13]

On human chromosome X (number 23, NC_000023), specifically NC_000023.10, TAF1 is located 3'-70586113[-70685855]-5', 99,742 nt.[14] The 3'-UTR begins at 70586114.[14]

TAF1 isoform 1[edit | edit source]

Isoform 1 (variant 1) "represents the longer transcript and encodes the longer isoform".[12]

TAF1 isoform 2[edit | edit source]

Isoform 2 (variant 2) "uses an alternate in-frame splice site, compared to variant 1, resulting in a shorter isoform (2) that lacks an internal 21 aa segment, compared to isoform 1."[12]

TAF1/DYT3[edit | edit source]

TAF1/DYT3 is "a complex transcript system that is composed of at least 43 exons. Thirty-eight exons code for [TAF1] ... Five downstream exons (d1-d5) ... can either form transcripts with TAF1 exons or be transcribed independently."[15] "Transcripts including exons d1, d3, d4, d5, plus TAF1 exons make up transcript "variant 1." Major "variant 2" is composed of various TAF1 exons plus exons d3 and d4. Alternately, d exons can generate transcripts independent of TAF1 exons 1-38. Exons d2, d3, and d4 make up "variant 3" and exons d3 and d4 constitute "variant 4.""[15] The "additional five exons are located 3' to exon 38 ... ("downstream" exons 1-5)"[15].

TAF1/TAF2[edit | edit source]

"[A] [TAF1]-[TAF2] complex selects sequences that match the Initiator (Inr) consensus."[16]

TAF1A[edit | edit source]

GeneID: 9015, TAF1A, TATA box binding protein (TBP)-associated factor for RNA polymerase I. It has two isoforms.

TAF1B[edit | edit source]

GeneID: 9014, TAF1B, TATA box binding protein (TBP)-associated factor for RNA polymerase I.

TAF1C[edit | edit source]

GeneID: 9013, TAF1C, TATA box binding protein (TBP)-associated factor for RNA polymerase I. It has two isoforms.

TAF1L[edit | edit source]

GeneID: 138474, TAF1L "is expressed in male germ cells, and the product has been shown to function interchangeably with the TAF1 product."[17]

TAF2[edit | edit source]

GeneID: 6873, TAF2 "is stably associated with the TFIID complex. It contributes to interactions at and downstream of the transcription initiation site, interactions that help determine transcription complex response to activators."[18]

TAF3[edit | edit source]

GeneID: 83860, TAF3 is part of the "set of TBP-associated factors (TAFs) [which] contribute to promoter recognition and selectivity and act as antiapoptotic factors"[19]

TAF4[edit | edit source]

GeneID: 6874, TAF4 "has been shown to potentiate transcriptional activation by retinoic acid, thyroid hormone and vitamin D3 receptors. In addition, [it] interacts with the transcription factor CREB, which has a glutamine-rich activation domain, and binds to other proteins containing glutamine-rich regions."[20]

TAF4B[edit | edit source]

GeneID: 6875, TAF4B is "a cell type-specific TAF that may be responsible for mediating transcription by a subset of activators in B cells."[21]

TAF5[edit | edit source]

GeneID: 6877, TAF5 is "an integral subunit of TFIID associated with all transcriptionally competent forms of that complex. [It] interacts strongly with two TFIID subunits that show similarity to histones H3 and H4, and it may participate in forming a nucleosome-like core in the TFIID complex."[22]

TAF6[edit | edit source]

GeneID: 6878, TAF6 "binds weakly to TBP but strongly to TAF1".[23] It has four isoforms.

TAF7[edit | edit source]

GeneID: 6879, TAF7 "interacts with the largest TFIID subunit, as well as multiple transcription activators. [It] is required for transcription by promoters targeted by RNA polymerase II."[24]

TAF7L[edit | edit source]

GeneID: 54457, TAF7L "could be a spermatogenesis-specific component of the DNA-binding general transcription factor complex TFIID."[25] It has two isoforms.

TAF8[edit | edit source]

GeneID: 129685, TAF8 "contains an H4-like histone fold domain, and interacts with several subunits of TFIID including TBP and the histone-fold protein TAF10."[26]

TAF9[edit | edit source]

GeneID: 6880, TAF9 "binds to the basal transcription factor GTF2B as well as to several transcriptional activators such as p53 and VP16."[27] It has four isoforms.

TAF10[edit | edit source]

GeneID: 6881, TAF10 "is associated with a subset of TFIID complexes. Studies with human and mammalian cells have shown that this subunit is required for transcriptional activation by the estrogen receptor, for progression through the cell cycle, and may also be required for certain cellular differentiation programs."[28]

TAF11[edit | edit source]

GeneID: 6882, TAF11 "is present in all TFIID complexes and interacts with TBP. This subunit also interacts with another small subunit, TAF13, to form a heterodimer with a structure similar to the histone core structure."[29]

TAF12[edit | edit source]

GeneID: 6883, "TAF12 interacts directly with TBP as well as with TAF2I [(TAF11)]."[30]

TAF13[edit | edit source]

GeneID: 6884, TAF13 "interacts with TBP and with two other small subunits of TFIID, TAF10 and TAF11."[31]

TAF15[edit | edit source]

GeneID: 8148, TAF15 is in "a subunit of TFIID present in a subset of TFIID complexes. Translocations involving chromosome 17 and chromosome 9, where the gene for the nuclear receptor CSMF is located, result in a gene fusion product that is an RNA binding protein associated with a subset of extraskeletal myxoid chondrosarcomas."[32]

TAF15 has two isoforms.

General transcription factor II Ds[edit | edit source]

Before the start of transcription, the transcription factor II D (TFIID) complex, binds to the core promoter of the gene.

Hypotheses[edit | edit source]

  1. TAFs are not involved in the transcription of A1BG.

See also[edit | edit source]

References[edit | edit source]

  1. genetics. San Francisco, California: Wikimedia Foundation, Inc. April 16, 2014. https://en.wiktionary.org/wiki/genetics. Retrieved 2014-05-07. 
  2. Lifton RP, Goldberg ML, Karp RW, Hogness DS (1978). "The organization of the histone genes in Drosophila melanogaster: functional and evolutionary implications". Cold Spring Harb Symp Quant Biol 42: 1047–51. PMID 98262. 
  3. 3.0 3.1 Stephen T. Smale and James T. Kadonaga (July 2003). "The RNA Polymerase II Core Promoter". Annual Review of Biochemistry 72 (1): 449-79. doi:10.1146/annurev.biochem.72.121801.161520. PMID 12651739. http://www.lps.ens.fr/~monasson/Houches/Kadonaga/CorePromoterAnnuRev2003.pdf. Retrieved 2012-05-07. 
  4. C Yang, E Bolotin, T Jiang, FM Sladek, E Martinez (March 2007). "Prevalence of the initiator over the TATA box in human and yeast genes and identification of DNA motifs enriched in human TATA-less core promoters". Gene 389 (1): 52–65. doi:10.1016/j.gene.2006.09.029. PMID 17123746. PMC 1955227. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1955227/?tool=pubmed. 
  5. Stephen T. Smale (October 1, 2001). "Core promoters: active contributors to combinatorial gene regulation". Genes & Development 15 (19): 2503-8. doi:10.1101/gad.937701. http://genesdev.cshlp.org/content/15/19/2503.full. Retrieved 2012-04-28. 
  6. RD Kornberg (2007). "The molecular basis of eukaryotic transcription". Proc. Natl. Acad. Sci. U.S.A. 104 (32): 12955–61. doi:10.1073/pnas.0704138104. PMID 17670940. PMC 1941834. //www.ncbi.nlm.nih.gov/pmc/articles/PMC1941834/. 
  7. TI Lee , RA Young (2000). "Transcription of eukaryotic protein-coding genes". Annu. Rev. Genet. 34: 77–137. doi:10.1146/annurev.genet.34.1.77. PMID 11092823. 
  8. DR Liston, PJ Johnson (March 1999). "Analysis of a Ubiquitous Promoter Element in a Primitive Eukaryote: Early Evolution of the Initiator Element". Molecular and Cellular Biology 19 (3): 2380-8. PMID 10022924. 
  9. ST Smale (March 1997). "Transcription initiation from TATA-less promoters within eukaryotic protein-coding genes". Biochimica & Biophysica Acta 1351 (1-2): 73-88. doi:10.1016/S0167-4781(96)00206-0. PMID 9116046. 
  10. R. Javahery, A. Khachi, K. Lo, B. Zenzie-Gregory, S. T. Smale (January 1994). "DNA Sequence Requirements for Transcriptional Initiator Activity in Mammalian Cells". Molecular and Cellular Biology 14 (1): 116-27. PMID 8264580. 
  11. Ananda L. Roy (August 2001). "Biochemistry and biology of the inducible multifunctional transcription factor TFII-I". Gene 274 (1-2): 1-13. doi:10.1016/S0378-1119(01)00625-4. http://bioinformaticaupf.crg.cat/2006/projectes06/3.14/article2.pdf. Retrieved 2012-04-06. 
  12. 12.0 12.1 12.2 12.3 HGNC:11535 (March 24, 2012). TAF1 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 250kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6872. Retrieved 2012-04-09. 
  13. 13.0 13.1 13.2 Jordan D. Irvin and B. Franklin Pugh (March 10, 2006). "Genome-wide Transcriptional Dependence on TAF1 Functional Domains". The Journal of Biological Chemistry 281 (10): 6404-12. doi:10.1074/jbc.M513776200. PMID 16407318. http://www.ncbi.nlm.nih.gov/pubmed/16407318. Retrieved 2012-04-14. 
  14. 14.0 14.1 Ncbi. Homo sapiens chromosome X, GRCh37.p5 Primary Assembly. Rockville, MD: National Center for Biotechnology Information, US National Library of Medicine. http://www.ncbi.nlm.nih.gov/projects/sviewer/?id=NC_000023. Retrieved 2012-04-25. 
  15. 15.0 15.1 15.2 Thilo Herzfeld, Dagmar Nolte and Ulrich Müller (2007). "Structural and functional analysis of the human TAF1/DYT3 multiple transcript system". Mammalian Genome 18 (11): 787-95. doi:10.1007/s00335-007-9063-z. http://www.springerlink.com/content/f75q704142277344/. Retrieved 2012-04-25. 
  16. Gillian E. Chalkley and C. Peter Verrijzer (September 1, 1999). "DNA binding site selection by RNA polymerase II TAFs: a TAFII250-TAFII150 complex recognizes the Initiator". The EMBO Journal 18 (17): 4835-45. PMID 10469661. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC1171555/pdf/004835.pdf. Retrieved 2012-04-26. 
  17. HGNC:18056 (March 10, 2012). TAF1L RNA polymerase II, TATA box binding protein (TBP)-associated factor, 210kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/138474. Retrieved 2012-04-09. 
  18. HGNC:11536 (March 10, 2012). TAF2 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 150kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6873. Retrieved 2012-04-09. 
  19. HGNC:17303 (March 24, 2012). TAF3 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 140kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/83860. Retrieved 2012-04-09. 
  20. HGNC:11537 (March 10, 2012). TAF4 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 135kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6874. Retrieved 2012-04-09. 
  21. HGNC:11538 (March 17, 2012). TAF4b RNA polymerase II, TATA box binding protein (TBP)-associated factor, 105kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6875. Retrieved 2012-04-09. 
  22. HGNC:11539 (March 10, 2012). TAF5 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 100kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6877. Retrieved 2012-04-09. 
  23. HGNC:11540 (March 24, 2012). TAF6 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 80kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6878. Retrieved 2012-04-09. 
  24. HGNC:11541 (March 10, 2012). TAF7 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 55kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6879. Retrieved 2012-04-09. 
  25. HGNC:11548 (March 10, 2012). TAF7-like RNA polymerase II, TATA box binding protein (TBP)-associated factor, 50kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/54457. Retrieved 2012-04-09. 
  26. HGNC:17300 (March 10, 2012). TAF8 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 43kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/129685. Retrieved 2012-04-09. 
  27. HGNC:11542 (March 31, 2012). TAF9 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 32kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6880. Retrieved 2012-04-09. 
  28. HGNC:11543 (March 24, 2012). TAF10 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 30kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6881. Retrieved 2012-04-09. 
  29. HGNC:11544 (March 10, 2012). TAF11 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 28kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6882. Retrieved 2012-04-09. 
  30. HGNC:11545 (March 17, 2012). TAF12 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 20kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6883. Retrieved 2012-04-09. 
  31. HGNC:11546 (March 24, 2012). TAF13 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 20kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/6884. Retrieved 2012-04-09. 
  32. HGNC:11547 (March 17, 2012). TAF15 RNA polymerase II, TATA box binding protein (TBP)-associated factor, 68kDa. Bethesda, Maryland: NCBI. http://www.ncbi.nlm.nih.gov/gene/8148. Retrieved 2012-04-09. 

Further reading[edit | edit source]

External links[edit | edit source]

{{Gene project}}