Deoxyribonucleic acid (DNA) is a polymer composed of nucleic acids linked together by a sugar-phosphate backbone.
The nucleic acids are inorganic acids with phosphoric acid as the only acid.
Attached to each sugar is a nucleobase.
- 1 Polyphosphoric acids
- 2 Nitrogenous bases
- 3 Sugars
- 4 Sugar phosphates
- 5 Nucleosides
- 6 Nucleotides
- 7 Nucleic acids
- 8 Mitochondrial DNA
- 9 Noncoding DNA
- 10 Non-coding repetitive sequences
- 11 Non-coding RNA sequences
- 12 Pseudogenes
- 13 Genes
- 14 Telomeres
- 15 Centromeres
- 16 Introns
- 17 Human DNA
- 18 Hypotheses
- 19 See also
- 20 References
- 21 External links
Def. "[a] colourless liquid, H3PO4" is called phosphoric acid, orthophosphoric acid, or monophosphoric acid.
"[A]n orthophosphoric acid molecule can dissociate up to three times, giving up an H+ each time, which typically combines with a water molecule, H2O, as shown in these [chemical] reactions:"
- H3PO4(s) + H2O(l) ↔ H3O+(aq) + H2PO4−(aq) Ka1= 7.25×10−3
- H2PO4−(aq)+ H2O(l) ↔ H3O+(aq) + HPO42−(aq) Ka2= 6.31×10−8
- HPO42−(aq)+ H2O(l) ↔ H3O+(aq) + PO43−(aq) Ka3= 4.80×10−13
"The anion after the first dissociation, H2PO4−, is the dihydrogen phosphate anion. The anion after the second dissociation, HPO42−, is the hydrogen phosphate anion. The anion after the third dissociation, PO43−, is the phosphate or orthophosphate anion. For each of the dissociation reactions shown above, there is a separate acid dissociation constant, called Ka1, Ka2, and Ka3 given at 25 °C. Associated with these three dissociation constants are corresponding pKa1=2.12, pKa2=7.21, and pKa3=12.67 values at 25 °C. Even though all three hydrogen (H) atoms are equivalent on an orthophosphoric acid molecule, the successive Ka values differ since it is energetically less favorable to lose another H+ if one (or more) has already been lost and the molecule/ion is more negatively charged."
"For a given total acid concentration [A] = [H3PO4] + [H2PO4−] + [HPO42−] + [PO43−] ([A] is the total number of moles of pure H3PO4 which have been used to prepare 1 liter of solution), the composition of an aqueous solution of phosphoric acid can be calculated using the equilibrium equations associated with the three reactions described above together with the [H+][OH−] = 10−14 relation and the electrical neutrality equation. Possible concentrations of polyphosphoric molecules and ions is neglected. The system may be reduced to a fifth degree equation for [H+] which can be solved numerically, yielding:"
|[A] (mol/L)||pH||[H3PO4]/[A] (%)||[H2PO4−]/[A] (%)||[HPO42−]/[A] (%)||[PO43−]/[A] (%)|
"For large acid concentrations, the solution is mainly composed of H3PO4. For [A] = 10−2, the pH is close to pKa1, giving an equimolar mixture of H3PO4 and H2PO4−. For [A] below 10−3, the solution is mainly composed of H2PO4− with [HPO42−] becoming non negligible for very dilute solutions. [PO43−] is always negligible. Note that the above analysis does not take into account ion activity coefficients; as such, the pH and molarity of a real phosphoric acid solution may deviate substantially from the above values."
Def. typically two to twenty, or three to seven, linked monophosphoric acids, or orthophosphoric acids, is called an oligophosphoric acid.
Def. "any of a class of inorganic polymers containing linked phosphate groups", or more than five linked phosphoric acids, is called a polyphosphate, or polyphosphoric acid.
Nitrogenous bases, found in cell nuclei, are nucleobases.
"In normal spiral DNA the bases form pairs between the two strands: [Adenine] A with [Thymine] T and [Cytosine] C with [Guanine] G. Purines pair with pyrimidines mainly for dimensional reasons - only this combination fits the constant width geometry of the DNA spiral."
Nitrogenous bases include purines: adenine, guanine, hypoxanthine, isoguanine, xanthine, and 7-methylguanine; and pyrimidines: cytosine, thymine, and isocytosine.
Def. "[a] derivative of the pentose sugar ribose in which the 2' hydroxyl (-OH) is reduced to a hydrogen (H)" is called deoxyribose.
In the diagram of DNA at the page top, the pentose sugar deoxyribose is in a cyclic, furanose (5-membered ring) form. D-deoxyribose or L-deoxyribose is not demonstrated, but is determined by the hydroxyls being primarily below (D) or above (L) the plane of the ring. In Earth-based DNA, deoxyribose is in the dextro (D) configuration.
Deoxyribose may also occur in a pyranose (six-membered ring) form.
There are other pentose sugars including aldopentoses: apiose, arabinose, xylose, and lyxose, and ketopentoses: ribulose and xylulose. These may have a deoxy-form: deoxyapiose, deoxyarabinose, deoxyxylose, deoxylyxose, deoxyribulose, and deoxyxylulose. They may occur as a levo or dextro sugar and as a furanose or pyranose.
To occur in an Earth-like DNA, each of these six deoxypentoses and perhaps other sugars need to be dextro furanoses. Each is a DNA, for example, deoxyapionucleic acid.
Any of the various sugars can have one or more phosphates attached as in glucose-6-phosphate diagrammed on the right.
Def. "an organic molecule in which a nitrogenous heterocyclic base (or nucleobase), which can be either a double-ringed purine or a single-ringed pyrimidine, is covalently attached to a five-carbon pentose sugar (deoxyribose in DNA or ribose in RNA)" is called a nucleoside.
"All five nucleotides (including the RNA base "uracil") are synthesized through complex metabolic pathways involving several multi-subunit enzymes. The pathways differ for both purines and pyrimidines (uracil falling in the pyrimidine category since it is thymine's counterpart.)"
"Synthetic genetics is a subdiscipline of synthetic biology that aims to develop artificial genetic polymers (also referred to as xeno-nucleic acids or XNAs) that can replicate in vitro and eventually in model cellular organisms."
Def. any "acidic, chainlike biological macromolecule consisting of multiply repeat units of phosphoric acid, sugar and purine and pyrimidine bases" occurring in cell nuclei is called a nucleic acid.
Def. a nucleic acid "in which the sugar component is threose" is called threose nucleic acid, or threonucleic acid (TNA).
Additional DNAs may be
- deoxyapionucleic acid,
- deoxyarabinonucleic acid,
- deoxyxylonucleic acid (dXyNA),
- deoxylyxonucleic acid,
- deoxyribulonucleic acid, and
- deoxyxylulonucleic acid.
Synthesis of deoxyapionucleic acid has been accomplished.
Deoxyxylonucleic acid and xylose nucleic acid have been produced.
"[X]ylonucleic acid (XyloNA) [contains] a potentially prebiotic xylose sugar (a 3′-epimer of ribose) in its backbone."
A "number of sugar-modified nucleic acid variants has been revealed as new genetic polymers, (2) some of them are endowed with catalytic activity (for e.g. FANA and HNA) (3). The structure of these artificial nucleic acids, however, mimics natural nucleic acid helicity (4)."
"Although helices display a distinct pitch and curvature, they feature ca. 11–12 base pairs per turn, and χ/δ covariance plots indicate that the backbones of XNA:RNA or XNA:DNA heteroduplexes adopt an architecture that is either closely related to the A-form, as in the case of [1,5-anhydrohexitol nucleic acid (HNA)] HNA:RNA (96), [locked nucleic acid (LNA)] LNA:RNA (83), [cyclohexene nucleic acid (CeNA)] CeNA:RNA (85) and PNA:RNA (59), or between the A- and B-forms, as seen in the structures of DNA:RNA (97), [arabinonucleic acid (ANA)] ANA:RNA (79), [2′-deoxy-2′-fluoro-arabinonucleic acid (FANA)]FANA:RNA (79) and [peptide nucleic acid (PNA)] PNA:DNA (98)."
Additional XNAs include bridged nucleic acid (BNA) glycol nucleic acid (GNA), FANA and peptide nucleic acid (PNA).
On the right is a diagram displaying various artificial and natural nucleic acid polymers.
"Representative structures illustrate the structural diversity and plasticity of natural and artificial nucleic acid (XNA) backbones. Structures are shown in alphabetic order. (A) Natural genetic polymers: B-form DNA (black), DNA:RNA hybrid and A-form RNA (gray). (B) Representative structures of XNA heteroduplexes with RNA or DNA. The RNA strand is shown in gray, the DNA strand in black and the orientation of the XNA strand is indicated. (C) XNA homoduplexes. Homo-XNA duplexes adopt a variety of structures. (D) Representative XNA-only heteroduplexes. FAF:FAF stands for FANA(F)-ANA(A)-FANA(F) XNA:XNA heteroduplex. Alt and chim indicate the alternated or chimeric order of FANA-segments in the duplex sequences respectively. The depicted duplexes have the following PDB ID codes in the Protein Data Bank (http://www.rcsb.org): B-DNA (3BSE); DNA:RNA (1EFS); A-RNA (3ND4); ANA(purple):RNA (2KP3); CeNA(blue):RNA (3KNC); FANA(violet):RNA (2KP4); HNA(yellow):RNA (2BJ6); LNA(cyan):RNA (1H0Q); PNA(orange):DNA (1PDT); PNA(orange):RNA (176D); CeNA:CeNA (blue, 2H0N); hDNA:hDNA (sky blue, 2H9S); FRNA:FRNA (magenta, 3P4A); GNA:GNA (red, 2XC6); HNA:HNA (yellow, 481D); LNA:LNA (cyan, 2×2Q); PNA:PNA (orange, 2K4G), TNA:TNA (green, coordinates not deposited in the PDB [...]); dXyNA:dXyNA (brown, coordinates not deposited in the PDB [...]); XyNA:XyNA (light green, 2N4J); FAF:FAF (FANA in violet, ANA in purple, 2LSC), FRNA:FANA (alt) (FRNA in magenta, FANA in violet, 2M8A); FRNA:FANA (chim) (FRNA in magenta, FANA in violet, 2M84)."
The H (heavy, outer circle) and L (light, inner circle) strands are given with their corresponding genes.
There are 22 transfer RNA (TRN) genes for the following amino acids: F, V, L1 (codon UUA/G), I, Q, M, W, A, N, C, Y, S1 (UCN), D, K, G, R, H, S2 (AGC/U), L2 (CUN), E, T and P (white boxes).
There are 2 ribosomal RNA (RRN) genes: S (small subunit, or 12S) and L (large subunit, or 16S) (blue boxes).
There are 13 protein-coding genes: 7 for NADH dehydrogenase subunits (ND, yellow boxes), 3 for cytochrome c oxidase subunits (COX, orange boxes), 2 for ATPase subunits (ATP, red boxes), and one for cytochrome b (CYTB, coral box).
Two gene overlaps are indicated (ATP8-ATP6, and ND4L-ND4, black boxes).
The control region (CR) is the longest non-coding sequence (grey box). Its three hyper-variable regions are indicated (HV, green boxes).
"Non-coding DNA sequences do not code for amino acids. Most non-coding DNA lies between genes [intergenic] on the chromosome [...]. Other non-coding DNA, called introns, is found within genes. [...] Non-coding DNA [...] represents 98 percent of our genome sequence and it does all sorts of things, like regulate those genes to figure out where they should turn on, where they should turn off, how much we should turn on certain genes, how are we going to pack up the DNA into chromosomes, and so forth."
Over 80% of human DNA "serves some purpose, biochemically speaking".
Non-coding repetitive sequences
Over 50% of human DNA consists of non-coding repetitive sequences.
Non-coding RNA sequences
"An abundant form of noncoding DNA in humans are pseudogenes, which are copies of genes that have been disabled by mutation. These sequences are usually just molecular fossils, although they can occasionally serve as raw genetic material for the creation of new genes through the process of gene duplication and divergence."
Def. "[a] unit of heredity; a segment of DNA or RNA that is transmitted from one generation to the next, and that carries genetic information such as the sequence of amino acids for a protein" is called a gene.
"The genetic information in a genome is held within genes, and the complete set of this information in an organism is called its genotype. A gene is a unit of heredity and is a region of DNA that influences a particular characteristic in an organism. Genes contain an open reading frame that can be transcribed, as well as regulatory sequences such as promoters and enhancers, which control the transcription of the open reading frame."
"Some noncoding DNA sequences [such as telomeres and centromeres] play structural roles in chromosomes."
Centromeres are chromosomal loci that ensure delivery of a copy of a chromosome to each daughter upon cell division. On the Spindle Apparatus, chromosome movement is run and maintained by the centromere during meiosis and mitosis.
"An intron is any nucleotide sequence within a gene that is removed by RNA splicing while the final mature RNA product of a gene is being generated. The term intron refers to both the DNA sequence within a gene and the corresponding sequence in RNA transcripts."
There are "several families of internal nucleic acid sequences that are not present in the final gene product, including inteins, untranslated sequences ([Untranslated region] UTR), and nucleotides removed by RNA editing, in addition to introns."
"[I]ntrons are extremely common within the nuclear genome of higher vertebrates (e.g. humans and mice), where protein-coding genes almost always contain multiple introns".
"[S]ome introns themselves encode specific proteins or can be further processed after splicing to generate noncoding RNA molecules. Alternative splicing is widely used to generate multiple proteins from a single gene. Furthermore, some introns represent mobile genetic elements and may be regarded as examples of selfish DNA."
"[T]he human genome contains an average of 8.4 introns/gene (139,418 in the genome)".
"Some introns are known to enhance the expression of the gene that they are contained in by a process known as intron-mediated enhancement (IME)."
"[H]uman DNA has millions of on-off switches and complex networks that control the genes' activities. ... [A]t least 80% of the human genome is active, which opposed the previously held idea that most of the DNA are useless."
"DNA contains genes, which hold the instructions for [life. But, these] take up only about 2 percent of the genome ... The human genome is made up of about 3 billion “letters” along strands that make up the familiar double helix structure of DNA. Particular sequences of these letters form genes, which tell cells how to make proteins. People have about 20,000 genes, but the vast majority of DNA lies outside of genes. ... [A]t least three-quarters of the genome is involved in making RNA [...] it appears to help regulate gene activity."
- As both ribose and deoxyribose nucleic acids exist, each pentose or hexose sugar should be usable to make a nucleic acid.
- "phosphoric acid, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. November 10, 2012. Retrieved 2013-04-19.
- "Phosphoric acid, In: Wikipedia". San Francisco, California: Wikimedia Foundation, Inc. April 9, 2013. Retrieved 2013-04-19.
- "polyphosphate, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. August 30, 2012. Retrieved 2013-04-19.
- "Nucleobase, In: Wikipedia". San Francisco, California: Wikimedia Foundation, Inc. February 22, 2013. Retrieved 2013-04-20.
- "deoxyribose, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. April 1, 2013. Retrieved 2013-04-19.
- "nucleoside, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. November 10, 2013. Retrieved 2014-06-04.
- Biochemistry~enwikiversity (2 March 2011). "Nucleotide Synthesis, In: Wikiversity". San Francisco, California USA: Wikimedia Foundation, Inc. Retrieved 2017-01-21.
- Irina Anosova, Ewa A. Kowal, Matthew R. Dunn, John C. Chaput, Wade D. Van Horn1, and Martin Egli (15 December 2015). "The structural diversity of artificial genetic polymers". Nucleic Acids Research. doi:10.1093/nar/gkv1472. http://nar.oxfordjournals.org/content/early/2015/12/15/nar.gkv1472.full. Retrieved 2016-01-21.
- "nucleic acid, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. January 12, 2013. Retrieved 2013-04-19.
- "threose nucleic acid, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. November 14, 2012. Retrieved 2013-04-19.
- Mayumi Kataoka, Yasuo Kouda, Kousuke Sato, Noriaki Minakawaa and Akira Matsuda (14 August 2011). "Highly efficient enzymatic synthesis of 3′-deoxyapionucleic acid (apioNA) having the four natural nucleobases". Chemical Communications 47 (30): 8700-2. doi:10.1039/C1CC12980E. http://pubs.rsc.org/en/content/articlelanding/2011/cc/c1cc12980e#!divAbstract. Retrieved 2016-01-19.
- Mohitosh Maiti, Munmun Maiti, Christine Knies, Shrinivas Dumbre, Eveline Lescrinier, Helmut Rosemeyer, Arnout Ceulemans and Piet Herdewijn (13 July 2015). "Xylonucleic acid: synthesis, structure, and orthogonal pairing properties". Nucleic Acids Research 43: 7189-200. doi:10.1093/nar/gkv719. https://nar.oxfordjournals.org/content/early/2015/07/13/nar.gkv719.full. Retrieved 2016-01-21.
- Bryan Sykes (10 September 2003). "Mitochondrial DNA and human history". The Human Genome. Wellcome Trust. Retrieved 5 February 2012.
- S. Anderson, A. T. Bankier, B. G. Barrell, M. H. L. de Bruijn, A. R. Coulson, J. Drouin, I. C. Eperon, D. P. Nierlich, B. A. Roe, F. Schreier, P. H. Sanger, A. J. H. Smith, R. Staden, I. G. Young (1981). "Sequence and organization of the human mitochondrial genome". Nature 290 (5806): 457–65. doi:10.1038/290457a0. PMID 7219534.
- Elliott Margulies (22 January 2017). "Non-Coding DNA". Bethesda, Maryland USA: National Institutes of Health, National Human Genome Research Institute. Retrieved 2017-01-22.
- Elgar G, Vavouri T (July 2008). "Tuning in to the signals: noncoding sequence conservation in vertebrate genomes". Trends Genet. 24 (7): 344–52. doi:10.1016/j.tig.2008.04.005. PMID 18514361.
- E. Pennisi (September 2012). "Genomics. ENCODE project writes eulogy for junk DNA". Science 337 (6099): 1159, 1161. doi:10.1126/science.337.6099.1159. PMID 22955811.
- Wolfsberg T, McEntyre J, Schuler G (2001). "Guide to the draft human genome". Nature 409 (6822): 824–6. doi:10.1038/35057000. PMID 11236998.
- The ENCODE Project Consortium (2007). "Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project". Nature 447 (7146): 799–816. doi:10.1038/nature05874. PMID 17571346. PMC 2212820. //www.ncbi.nlm.nih.gov/pmc/articles/PMC2212820/.
- "DNA, In: Wikipedia". San Francisco, California: Wikimedia Foundation, Inc. December 4, 2012. Retrieved 2012-12-13.
- Harrison P, Hegyi H, Balasubramanian S, Luscombe N, Bertone P, Echols N, Johnson T, Gerstein M (2002). "Molecular Fossils in the Human Genome: Identification and Analysis of the Pseudogenes in Chromosomes 21 and 22". Genome Res 12 (2): 272–80. doi:10.1101/gr.207102. PMID 11827946. PMC 155275. //www.ncbi.nlm.nih.gov/pmc/articles/PMC155275/.
- Harrison P, Gerstein M (2002). "Studying genomes through the aeons: protein families, pseudogenes and proteome evolution". J Mol Biol 318 (5): 1155–74. doi:10.1016/S0022-2836(02)00109-2. PMID 12083509.
- "gene, In: Wiktionary". San Francisco, California: Wikimedia Foundation, Inc. December 13, 2012. Retrieved 2012-12-13.
- Wright W, Tesmer V, Huffman K, Levene S, Shay J (1997). "Normal human chromosomes have long G-rich telomeric overhangs at one end". Genes Development 11 (21): 2801–9. doi:10.1101/gad.11.21.2801. PMID 9353250. PMC 316649. //www.ncbi.nlm.nih.gov/pmc/articles/PMC316649/.
- Nugent C, Lundblad V (1998). "The telomerase reverse transcriptase: components and regulation". Genes Dev 12 (8): 1073–85. doi:10.1101/gad.12.8.1073. PMID 9553037.
- Pidoux A, Allshire R (2005). "The role of heterochromatin in centromere function". Philos Trans R Soc Lond B Biol Sci 360 (1455): 569–79. doi:10.1098/rstb.2004.1611. PMID 15905142. PMC 1569473. //www.ncbi.nlm.nih.gov/pmc/articles/PMC1569473/.
- Don W Cleveland, Yinghui Mao, Kevin F Sullivan, Centromeres and Kinetochores: From Epigenetics to Mitotic Checkpoint Signaling, Cell, Volume 112, Issue 4, 21 February 2003, Pages 407-421, ISSN 0092-8674, http://dx.doi.org/10.1016/S0092-8674(03)00115-6. (http://www.sciencedirect.com/science/article/pii/S0092867403001156)
- Alberts, Bruce (2008). Molecular biology of the cell. New York: Garland Science. ISBN 0-8153-4105-9.
- Stryer, Lubert; Berg, Jeremy Mark; Tymoczko, John L. (2007). Biochemistry. San Francisco: W.H. Freeman. ISBN 0-7167-6766-X.
- Kinniburgh, Alan; mertz, j. and Ross, J. (July 1978). "The precursor of mouse β-globin messenger RNA contains two intervening RNA sequences". Cell 14 (3): 681–693. doi:10.1016/0092-8674(78)90251-9. PMID 688388. http://www.cell.com/abstract/0092-8674(78)90251-9#.
- "Intron, In: Wikipedia". San Francisco, California: Wikimedia Foundation, Inc. December 10, 2012. Retrieved 2012-12-13.
- Rearick D, Prakash A, McSweeny A, Shepard SS, Fedorova L, Fedorov A (March 2011). "Critical association of ncRNA with introns". Nucleic Acids Res. 39 (6): 2357–66. doi:10.1093/nar/gkq1080. PMID 21071396. PMC 3064772. //www.ncbi.nlm.nih.gov/pmc/articles/PMC3064772/.
- Lambowitz AM, Belfort M (1993). "Introns as mobile genetic elements". Annu. Rev. Biochem. 62: 587–622. doi:10.1146/annurev.bi.62.070193.003103. PMID 8352597.
- Bryan McBournie (September 6 2012). "Human genome study could unlock the biology of disease". Sigma Xi. Retrieved 2012-09-06.
- Malcolm Ritter (September 6 2012). "Far from being mostly junk, human DNA is ‘a jungle’ of complex activity, huge project shows". The Washington Post. Retrieved 2012-09-06.
- African Journals Online
- Bing Advanced search
- GenomeNet KEGG database
- Google Books
- Google scholar Advanced Scholar Search
- Home - Gene - NCBI
- Lycos search
- NCBI All Databases Search
- NCBI Site Search
- Office of Scientific & Technical Information
- PubChem Public Chemical Database
- Questia - The Online Library of Books and Journals
- SAGE journals online
- Scirus for scientific information only advanced search
- Taylor & Francis Online
- The Talking Glossary of Genetic Terms
- WikiDoc The Living Textbook of Medicine
- Wiley Online Library Advanced Search
- Yahoo Advanced Web Search
Learn more about Deoxyribonucleic acid