Open reading frame

Sample sequence showing three different possible reading frames. Start codons are highlighted in purple, and stop codons are highlighted in red.

In molecular biology, reading frames are defined as spans of DNA sequence between the start and stop codons. Usually, this is considered within a studied region of a prokaryotic DNA sequence, where only one of the six possible reading frames will be "open" (the "reading", however, refers to the RNA produced by transcription of the DNA and its subsequent interaction with the ribosome in translation). Such an ORF may[1] contain a start codon (usually AUG in terms of RNA) and by definition cannot extend beyond a stop codon (usually UAA, UAG or UGA in RNA).[2] That start codon (not necessarily the first) indicates where translation may start. The transcription termination site is located after the ORF, beyond the translation stop codon. If transcription were to cease before the stop codon, an incomplete protein would be made during translation.[3]

In eukaryotic genes with multiple exons, introns are removed and exons are then joined together after transcription to yield the final mRNA for protein translation. In the context of gene finding, the start-stop definition of an ORF therefore only applies to spliced mRNAs, not genomic DNA, since introns may contain stop codons and/or cause shifts between reading frames. An alternative definition says that an ORF is a sequence that has a length divisible by three and is bounded by stop codons.[1][4] This more general definition can be useful in the context of transcriptomics and metagenomics, where a start or stop codon may not be present in the obtained sequences. Such an ORF corresponds to parts of a gene rather than the complete gene.

  1. ^ a b Sieber P, Platzer M, Schuster S (March 2018). "The Definition of Open Reading Frame Revisited". Trends in Genetics. 34 (3): 167–170. doi:10.1016/j.tig.2017.12.009. PMID 29366605.
  2. ^ Brody LC (2021-08-25). "Stop Codon". National Human Genome Research Institute. National Institutes of Health. Retrieved 2021-08-25.
  3. ^ Slonczewski J, Foster JW (2009). Microbiology: An Evolving Science. New York: W.W. Norton & Co. ISBN 978-0-393-97857-5. OCLC 185042615.
  4. ^ Claverie JM (1997). "Computational methods for the identification of genes in vertebrate genomic sequences". Human Molecular Genetics. 6 (10): 1735–44. doi:10.1093/hmg/6.10.1735. PMID 9300666.

Developed by StudentB