Evolution
Intelligent Design
Evolutionary Obstacles to the Origin of Introns

A major obstacle to any explanation of the origins of eukaryotes is the emergence of the non-coding regions of genes known as introns which are largely absent from bacteria. Before we discuss how the origin of introns poses a significant challenge to evolution, it is first necessary to give a short introduction as to what introns are.
What Are Introns?
Introns (which, unlike exons, do not code for proteins) can be of considerable length in higher eukaryotes, even spanning many thousands of bases and sometimes comprising some 90 percent of the precursor mRNA.1 In contrast, lower eukaryotes such as yeast possess fewer and shorter introns, which are typically fewer than 300 bases in length. Since introns are the non-coding segments of genes, they are removed from the mRNA before it is translated into a protein. The introns are excised from the pre-mRNA transcript, and the exons are spliced together by a riboprotein complex called the spliceosome.2,3 This remarkable process is illustrated by the animation below:
What Functional Role Do Introns Play?
Introns, upon their discovery, were generally assumed to be non-functional junk DNA.4,5,6,7 Francis Crick described these sequences as “‘nonsense’ stretches of DNA interspersed within the sense DNA.”8 This view of introns has, however, not aged well. Indeed, “introns in contemporary species fulfill a broad spectrum of functions, and are involved in virtually every step of mRNA processing.”9 Introns often show high levels of sequence conservation (particularly within classes of organisms), suggesting function.10,11
We also now know that introns play an essential role in alternative splicing where the exons of pre-mRNA transcripts are spliced in different configurations producing multiple protein isoforms from the same gene.12,13 According to one paper, the majority of alternatively spliced exons in humans and mice are flanked by introns, the sequences of which are highly conserved, implying an important role in alternative splicing.14
Another study documented that a six-nucleotide intronic sequence is “frequently located adjacent to tissue-specific alternative exons in the human genome.”15 Across taxa as widely divergent as dogs, rats, chickens, mice, and humans, this sequence exhibits conservation. These features, according to the paper, “mark it as a critical component of splicing switch mechanism(s) designed to activate a limited repertoire of splicing events in cell type-specific patterns.” In addition, direct evidence has been documented showing that introns contain other codes involved in the regulation of alternative splicing.16,17,18,19,20
Genes for the majority of microRNAs (which are required for the expression of mRNAs during development) and small nucleolar RNAs (which play important roles in the processing of ribosomal RNAs) reside in introns.21 One paper reported that,
Mirtrons are alternative precursors for microRNA biogenesis that were recently described in invertebrates. These short hairpin introns use splicing to bypass Drosha cleavage, which is otherwise essential for the generation of canonical animal microRNAs. Using computational and experimental strategies, we now establish that mammals have mirtrons as well.22
Another study also reports that,
314 miRNAs (51%) were located in introns of protein-coding genes, 231 miRNAs (38%) were located in intergenic regions, while only 43 miRNAs (7%) were located in exons of noncoding RNAs or UTR of protein coding genes. Interestedly, 23 miRNAs (4%) were located in either an exon or an intron depending on alternative splicing of the host transcript.23
It has also been established that the promoter sequences for those RNA genes occur in introns.24Furthermore, as many as 52 percent of transcripts for chromatin-associated RNA, which play important roles in the structural organization of nuclear chromatin25, are found in introns.26
It is likely that the length of introns (and, consequently, the time taken to transcribe them) can contribute to timing mechanisms during development.27 Since long introns take longer to transcribe, this can cause genes to respond more slowly to transcriptional activators or repressors compared to shorter genes, resulting in sequential gene activation, where genes with longer introns are expressed later even if they receive the same activating signal at the same time.
Explaining the Origins of Introns Is a Challenge to Evolution
How could introns arise in evolution without simultaneously having a mechanism in hand to facilitate their removal from the pre-mRNA transcript? Without the presence of such a mechanism, the introns would scramble the coding regions of the gene, and themselves be translated into a sequence of amino acids by the ribosome. Moreover, if the length of an intron is not a multiple of three, it could also potentially introduce a damaging frame shift into the coding region.
Self-Splicing Introns?
The most common explanation of the origins of eukaryotic introns is that they arose from self-splicing precursors, known as group II introns found in bacteria.28,29,30,31 Self-splicing introns employ a mechanism very similar to eukaryotic pre-mRNA splicing, involving two sequential transesterification reactions. The first step is for a conserved “branchpoint” adenosine to initiate a nucleophilic reaction at the 5’ splice site, leading to cleavage of the RNA backbone and formation of a lariat structure. In the second stage, the 3’ splice site is attacked by the terminal 3’ hydroxyl group of the exon at the 5’ end. The result is the excision of the lariat intron and ligation of the flanking exons.
A few different lines of evidence are offered in support of the hypothesis that eukaryotic spliceosomal introns arose from group II introns. For example, a simpler way to achieve splicing presumably would be to bring the splice sites together in one step to directly cleave and rejoin them. The proposed scenario would explain the use of a lariat intermediate, since a lariat is generated by group II RNA intron sequences.32,33 Moreover, it is argued that this hypothesis helps to clarify why RNA molecules play such an important part in the splicing process. Examples of self-splicing RNA introns still exist today, such as in the nuclear rRNA genes of the ciliate Tetrahymena.34,35,36,37
It is unlikely that group II introns arose within protein-coding genes without pre-existing splicing mechanisms. For one thing, group II introns fold into a conserved six-domain secondary structure, which is required for their self-replicative capability.38 The proper folding depends upon specific intra- and inter-domain base-pairing. Thus, key regions are quite sequence sensitive. The most highly conserved domain is domain V, which forms the catalytic center of the ribozyme.39,40 Even single-nucleotide mutations can abolish or significantly reduce splicing activity. Domain I contains exon binding sites that have to form complementary base pairs with intron binding sites on the exons. There are also specific sequences that mediate long-range interactions, which are essential for proper folding. Given the need for sequence specificity, the right sequences would need to be generated by chance mutations — but one would expect the introduction of additional sequences within protein-coding genes to be detrimental to the organism, and thereby removed by purifying selection.
Because of this, it is generally assumed that group II introns must have first arisen in non-coding regions, which provided a safer “sandbox” for these elements to evolve the complex secondary structure and self-splicing capabilities, without immediate harmful effects. However, bacterial genomes are typically gene-dense with relatively little non-coding DNA. This raises the question of how much “safe space” would have been available for the origination of group II introns prior to their insertion into coding regions. For example, the E. coli genome has only approximately 10-12% non-coding DNA (~460,000-550,000 base pairs of DNA). An average group II intron is ~2,500 base pairs long. This only leaves room for a few dozen full length group II introns to exist in non-coding DNA without disrupting host fitness.
Even on this hypothesis, however, it seems implausible that, in a non-coding region, there would be selective pressure to preserve such highly constrained sequence features until the full intron, complete with its self-splicing capability, was fully realized. Furthermore, for splicing to be successful, group II introns must accurately recognize exon-intron boundaries and bind complementary sequences in the flanking exons. But the sandbox hypothesis would have us believe that introns evolved completely within non-coding sequences, without the necessary exonic contexts to evolve this coordinated interaction. Thus, even supposing that a fully operational group II intron were to arise in a non-coding region, its insertion into a coding gene would be highly disruptive unless precise splicing was already operational and accurately matched to the flanking exons.
There also is no plausible scenario for transitioning from group II introns to spliceosomal introns. The spliceosome machinery is far more complex and sophisticated than autocatalytic ribozymes, involving not just five RNAs but nearly one hundred core spliceosome proteins found in the simplest eukaryotes.
Eukaryotes and Prokaryotes Likely Do Not Share a Common Ancestor
As argued previously the disparity in mode of cell division between eukaryotes and prokaryotes evinces their separate origins.41 The abundance of introns in eukaryotic genomes, largely absent from prokaryotes, provides yet a further reason to suspect that eukaryotes and prokaryotes had independent origins. Current proposals speculate that introns in a proto-eukaryote may have arisen following infection by an alpha-proteobacterial endosymbiont (the hypothesized mechanism by which mitochondria and chloroplasts originated in eukaryotes).42 Group II intron retroelements originating from the endosymbiont could have profusely inserted themselves within the early host genome. This scenario is highly unlikely as spliceosomal introns of eukaryotes show negligible similarity with group II introns and would already require operative spliceosomes at this early evolutionary phase for their removal from mRNA. Thus, the origins of these diverse introns and the multitude of spliceosome components required for their removal are insufficiently described by this prominent evolutionary model. An alternative consideration to intron origins describes seven mechanisms that appear to give rise to rare, isolated introns but admission is presented that this falls woefully short of accounting for the vast majority of spliceosomal introns in eukaryotes.43 Undoubtedly, this latter viewpoint more accurately communicates what we have learned regarding potential evolutionary explanations of introns.
Conclusion
In view of the problems outlined in the foregoing, the origin of spliceosomal introns represents a significant challenge to an evolutionary account of their origins. While the hypothesis that group II introns from bacterial endosymbionts gave rise to eukaryotic introns is the prevailing explanation, it faces a series of substantial obstacles. The complexity of intron splicing mechanisms suggest that their origin is better accounted for by a different paradigm than unguided evolutionary processes — one that allows for the rapid emergence of coordinated, information-rich systems. The only type of explanation that adequately accounts for this type of phenomenon is a goal-directed, or intelligent, cause.
Notes
- Kim SW, Taggart AJ, Heintzelman C, Cygan KJ, Hull CG, Wang J, Shrestha B, Fairbrother WG. Widespread intra-dependencies in the removal of introns from human transcripts. Nucleic Acids Res. 2017 Sep 19;45(16):9503-9513. doi: 10.1093/nar/gkx661. PMID: 28934498; PMCID: PMC5766209.
- Wilkinson ME, Charenton C, Nagai K. RNA Splicing by the Spliceosome. Annu Rev Biochem. 2020 Jun 20;89:359-388. doi: 10.1146/annurev-biochem-091719-064225. Epub 2019 Dec 3. PMID: 31794245.
- Matera AG, Wang Z. A day in the life of the spliceosome. Nat Rev Mol Cell Biol. 2014 Feb;15(2):108-21. doi: 10.1038/nrm3742. Erratum in: Nat Rev Mol Cell Biol. 2014 Apr;15(4):294. PMID: 24452469; PMCID: PMC4060434.
- Doolittle WF, Sapienza C. Selfish genes, the phenotype paradigm and genome evolution. Nature. 1980 Apr 17;284(5757):601-3. doi: 10.1038/284601a0. PMID: 6245369.
- Orgel LE, Crick FH. Selfish DNA: the ultimate parasite. Nature. 1980 Apr 17;284(5757):604-7. doi: 10.1038/284604a0. PMID: 7366731.
- Orgel LE, Crick FH, Sapienza C. Selfish DNA. Nature. 1980 Dec 25;288(5792):645-6. doi: 10.1038/288645a0. PMID: 7453798.
- Cavalier-Smith T. Selfish DNA and the origin of introns. Nature. 1985 May 23-29;315(6017):283-4. doi: 10.1038/315283b0. PMID: 2987701.
- Crick F. Split genes and RNA splicing. Science. 1979 Apr 20;204(4390):264-71. doi: 10.1126/science.373120. PMID: 373120.
- Chorev M, Carmel L. The function of introns. Front Genet. 2012 Apr 13;3:55. doi: 10.3389/fgene.2012.00055. PMID: 22518112; PMCID: PMC3325483.
- Sun L, Li Y, McCullough AK, Wood TG, Lloyd RS, Adams B, Gurnon JR, Van Etten JL. Intron conservation in a UV-specific DNA repair gene encoded by chlorella viruses. J Mol Evol. 2000 Jan;50(1):82-92. doi: 10.1007/s002399910009. PMID: 10654262.
- Sorek R, Ast G. Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res. 2003 Jul;13(7):1631-7. doi: 10.1101/gr.1208803. PMID: 12840041; PMCID: PMC403736.
- Petrillo E. Do not panic: An intron-centric guide to alternative splicing. Plant Cell. 2023 May 29;35(6):1752-1761. doi: 10.1093/plcell/koad009. PMID: 36648241; PMCID: PMC10226583.
- Li H, Ding Z, Fang ZY, Long N, Ang HY, Zhang Y, Fan YJ, Xu YZ. Conserved intronic secondary structures with concealed branch sites regulate alternative splicing of poison exons. Nucleic Acids Res. 2024 Jun 10;52(10):6002-6016. doi: 10.1093/nar/gkae185. PMID: 38499485; PMCID: PMC11162794.
- Sorek R, Ast G. Intronic sequences flanking alternatively spliced exons are conserved between human and mouse. Genome Res. 2003 Jul;13(7):1631-7. doi: 10.1101/gr.1208803. PMID: 12840041; PMCID: PMC403736.
- Minovitsky S, Gee SL, Schokrpur S, Dubchak I, Conboy JG. The splicing regulatory element, UGCAUG, is phylogenetically and spatially conserved in introns that flank tissue-specific alternative exons. Nucleic Acids Res. 2005 Feb 3;33(2):714-24. doi: 10.1093/nar/gki210. PMID: 15691898; PMCID: PMC548355.
- Ladd AN, Cooper TA. Finding signals that regulate alternative splicing in the post-genomic era. Genome Biol. 2002 Oct 23;3(11):reviews0008. doi: 10.1186/gb-2002-3-11-reviews0008. Epub 2002 Oct 23. PMID: 12429065; PMCID: PMC244920.
- Hui J, Hung LH, Heiner M, Schreiner S, Neumüller N, Reither G, Haas SA, Bindereif A. Intronic CA-repeat and CA-rich elements: a new class of regulators of mammalian alternative splicing. EMBO J. 2005 Jun 1;24(11):1988-98. doi: 10.1038/sj.emboj.7600677. Epub 2005 May 12. PMID: 15889141; PMCID: PMC1142610.
- Nakaya HI, Amaral PP, Louro R, Lopes A, Fachel AA, Moreira YB, El-Jundi TA, da Silva AM, Reis EM, Verjovski-Almeida S. Genome mapping and expression analyses of human intronic noncoding RNAs reveal tissue-specific patterns and enrichment in genes related to regulation of transcription. Genome Biol. 2007;8(3):R43. doi: 10.1186/gb-2007-8-3-r43. PMID: 17386095; PMCID: PMC1868932.
- Hastings ML, Wilson CM, Munroe SH. A purine-rich intronic element enhances alternative splicing of thyroid hormone receptor mRNA. RNA. 2001 Jun;7(6):859-74. doi: 10.1017/s1355838201002084. PMID: 11421362; PMCID: PMC1370135.
- Wagner EJ, Baraniak AP, Sessions OM, Mauger D, Moskowitz E, Garcia-Blanco MA. Characterization of the intronic splicing silencers flanking FGFR2 exon IIIb. J Biol Chem. 2005 Apr 8;280(14):14017-27. doi: 10.1074/jbc.M414492200. Epub 2005 Jan 31. PMID: 15684416.
- Hoeppner MP, White S, Jeffares DC, Poole AM. Evolutionarily stable association of intronic snoRNAs and microRNAs with their host genes. Genome Biol Evol. 2009 Nov 5;1:420-8. doi: 10.1093/gbe/evp045. PMID: 20333211; PMCID: PMC2817437.
- Berezikov E, Chung WJ, Willis J, Cuppen E, Lai EC. Mammalian mirtron genes. Mol Cell. 2007 Oct 26;28(2):328-36. doi: 10.1016/j.molcel.2007.09.028. PMID: 17964270; PMCID: PMC2763384.
- Qi Y, Tu J, Cui L, Guo X, Shi Z, Li S, Shi W, Shan Y, Ge Y, Shan J, Wang H, Lu Z. High-throughput sequencing of microRNAs in adenovirus type 3 infected human laryngeal epithelial cells. J Biomed Biotechnol. 2010;2010:915980. doi: 10.1155/2010/915980. Epub 2010 Jun 9. PMID: 20634878; PMCID: PMC2896912.
- Monteys AM, Spengler RM, Wan J, Tecedor L, Lennox KA, Xing Y, Davidson BL. Structure and activity of putative intronic miRNA promoters. RNA. 2010 Mar;16(3):495-505. doi: 10.1261/rna.1731910. Epub 2010 Jan 14. PMID: 20075166; PMCID: PMC2822915.
- Rodríguez-Campos A, Azorín F. RNA is an integral component of chromatin that contributes to its structural organization. PLoS One. 2007 Nov 14;2(11):e1182. doi: 10.1371/journal.pone.0001182. PMID: 18000552; PMCID: PMC2063516.
- Mondal T, Rasmussen M, Pandey GK, Isaksson A, Kanduri C. Characterization of the RNA content of chromatin. Genome Res. 2010 Jul;20(7):899-907. doi: 10.1101/gr.103473.109. Epub 2010 Apr 19. PMID: 20404130; PMCID: PMC2892091.
- Swinburne IA, Silver PA. Intron delays and transcriptional timing during development. Dev Cell. 2008 Mar;14(3):324-30. doi: 10.1016/j.devcel.2008.02.002. PMID: 18331713; PMCID: PMC2825037.
- Valadkhan S. The spliceosome: a ribozyme at heart? Biol Chem. 2007 Jul;388(7):693-7. doi: 10.1515/BC.2007.080. PMID: 17570821.
- Toor N, Keating KS, Taylor SD, Pyle AM. Crystal structure of a self-spliced group II intron. Science. 2008 Apr 4;320(5872):77-82. doi: 10.1126/science.1153803. PMID: 18388288; PMCID: PMC4406475.
- Costa M, Walbott H, Monachello D, Westhof E, Michel F. Crystal structures of a group II intron lariat primed for reverse splicing. Science. 2016 Dec 2;354(6316):aaf9258. doi: 10.1126/science.aaf9258. PMID: 27934709.
- Galej WP, Toor N, Newman AJ, Nagai K. Molecular Mechanism and Evolution of Nuclear Pre-mRNA and Group II Intron Splicing: Insights from Cryo-Electron Microscopy Structures. Chem Rev. 2018 Apr 25;118(8):4156-4176. doi: 10.1021/acs.chemrev.7b00499. Epub 2018 Jan 29. PMID: 29377672.
- Lambowitz AM, Zimmerly S. Group II introns: mobile ribozymes that invade DNA. Cold Spring Harb Perspect Biol. 2011 Aug 1;3(8):a003616. doi: 10.1101/cshperspect.a003616. PMID: 20463000; PMCID: PMC3140690.
- Vogel J, Börner T. Lariat formation and a hydrolytic pathway in plant chloroplast group II intron splicing. EMBO J. 2002 Jul 15;21(14):3794-803. doi: 10.1093/emboj/cdf359. PMID: 12110591; PMCID: PMC126105.
- Hagen M, Cech TR. Self-splicing of the Tetrahymena intron from mRNA in mammalian cells. EMBO J. 1999 Nov 15;18(22):6491-500. doi: 10.1093/emboj/18.22.6491. PMID: 10562561; PMCID: PMC1171712.
- Price JV, Kieft GL, Kent JR, Sievers EL, Cech TR. Sequence requirements for self-splicing of the Tetrahymena thermophila pre-ribosomal RNA. Nucleic Acids Res. 1985 Mar 25;13(6):1871-89. doi: 10.1093/nar/13.6.1871. PMID: 4000946; PMCID: PMC341122.
- Price JV, Cech TR. Determinants of the 3′ splice site for self-splicing of the Tetrahymena pre-rRNA. Genes Dev. 1988 Nov;2(11):1439-47. doi: 10.1101/gad.2.11.1439. PMID: 3209068.
- Kruger K, Grabowski PJ, Zaug AJ, Sands J, Gottschling DE, Cech TR. Self-splicing RNA: autoexcision and autocyclization of the ribosomal RNA intervening sequence of Tetrahymena. Cell. 1982 Nov;31(1):147-57. doi: 10.1016/0092-8674(82)90414-7. PMID: 6297745.
- Lehmann K, Schmidt U. Group II introns: structure and catalytic versatility of large natural ribozymes. Crit Rev Biochem Mol Biol. 2003;38(3):249-303. doi: 10.1080/713609236. PMID: 12870716.
- Konforti BB, Abramovitz DL, Duarte CM, Karpeisky A, Beigelman L, Pyle AM. Ribozyme catalysis from the major groove of group II intron domain 5. Mol Cell. 1998 Feb;1(3):433-41. doi: 10.1016/s1097-2765(00)80043-x. PMID: 9660927.
- Gordon PM, Piccirilli JA. Metal ion coordination by the AGC triad in domain 5 contributes to group II intron catalysis. Nat Struct Biol. 2001 Oct;8(10):893-8. doi: 10.1038/nsb1001-893. PMID: 11573097.
- McLatchie J (2024) Phylogenetic Challenges to the Evolutionary Origin of the Eukaryotic Cell Cycle. BIO-Complexity 2024 (4):1–19 doi:10.5048/BIO-C.2024.4.
- Rogozin IB, Carmel L, Csuros M, Koonin EV Origin and evolution of spliceosomal introns. Biology Direct 2012, 7:11 doi: 10.1186/1745-6150-7-11. PMCID: 3488318
- Yenerall P, Zhou L. Identifying the mechanisms of intron gain: progress and trends, Biology Direct 2012, 7:29.doi.org/10.1186/1745-6150-7-29. PMID: 22963364.