U2 spliceosomal snRNAs are a species of small nuclear RNA molecules found in the major spliceosomal machinery of virtually all-eukaryotic organisms. In vivo, U2 snRNA along with its associated polypeptides assemble to produce the U2 small nuclear ribonucleoprotein, an essential component of the major spliceosomal complex. The major spliceosomal-splicing pathway is occasionally referred to as U2 dependent, based on a class of Sm intron—found in mRNA primary transcripts—that are recognized exclusively by the U2 snRNP during early stages of spliceosomal assembly. In addition to U2 dependent intron recognition, U2 snRNA has been theorized to serve a catalytic role in the chemistry of pre-RNA splicing as well. Similar to ribosomal RNAs, Sm snRNAs must mediate both RNA:RNA and RNA:protein contacts and hence have evolved specialized, highly conserved, primary and secondary structural elements to facilitate these types of interactions. Shortly after the discovery that mRNA primary transcripts contain long, non-coding intervening sequences by Sharp and Roberts, Joan Steitz began work to characterize the biochemical mechanism of intron excision. The curious observation that a sequence found in the 5´ region of the U1 snRNA exhibited extensive base pairing complementarity with conserved sequences across 5´ splice junctions in hnRNA transcripts prompted speculation that certain snRNAs may be involved in recognizing splice site boundaries through RNA:RNA contacts. Only recently have atomic crystal structures revealed demonstrably that the original conjecture was indeed correct, even if the complexity of these interactions were not fully realized at the time.
U2 snRNA Recognition Elements
In Saccharomyces cerevisiae the U2 snRNA is associated with 18 polypeptides, seven of which are structural proteins common to all Sm class snRNPs. These non-specific structural proteins associate with Sm snRNAs through a highly conserved recognition sequence located within the RNA called Sm-binding sites. Two other proteins, A´ and B´´, are U2-specific and require structural elements unique to U2 snRNA—specifically two 3´ stem loops—for snRNP assembly. The three-subunit SF3a and six-subunit SF3b protein complexes also associate with the U2 snRNA. U2 snRNA is implicated in intron recognition through a 7-12 nucleotide sequence between 18-40 nucleotides upstream of the 3´ splice site known as the branch point sequence. In yeast, the consensus BPS is 7 nucleotide residues in length and the complementary recognition sequence within the U2 snRNA is 6 nucleotides. Duplex formation between these two sequences results in bulging of a conserved adenosine residue at position 5 of the BPS. The bulged adenosine residue adopts a C3´-endo conformation that with the help of splicing factors Cwc25, Yju2 and Isy1 aligns a 2´ OH for an inline attack of a phosphorus atom at the 5´ splice site. Nucleophilic attack initiates the first of two successive transesterification reactions that splices out the intron—through an unusual 2´-5´-3´ linked lariat intermediate—where the second transesterification involves ligation of the two flanking exons.
Although the sequence length of U2 snRNAs can vary by up to an order of magnitude across all eukaryotic organisms, all U2 snRNAs contain many phylogenetically constant regions particularly within the first 80 nucleotides downstream of the 5´ end where 85% of the positions are conserved. Moreover, several secondary structural elements are also conserved including stem loops I, II, III, IV, and some of the single stranded regions linking these domains. Stem loop II in yeast U2 snRNA, contains an unusual sheared GA base pair leading into a characteristic U-turn loop motif that shares a geometric conformation similar to that of tRNA anti-codon loops. All U2 snRNAs possess a terminal stem loop with a 10-16 base pair helix and a conserved 11 nucleotide loop with the consensus sequence 5´-UYGCANUURYN-3´. U2 snRNAs are the most extensively modified of all the small nuclear RNAs. While the exact locations of these post-transcriptional modifications can vary from organism to organism, emerging evidence suggests there is a strong correlation between U2 snRNA modification and biological function. Modifications include the conversion of some uridine residues to pseudouridine, 2´-O-methylation, nucleobase methylation, and conversion of 5´-monomethylated guanosine cap to a 2,2,7-trimethylated guanosine cap. Many of these modifications reside in a 27-nucleotide region on the 5´ end of the molecule.
Conformational Dynamics
The spliceosome is a dynamic molecular machine that undergoes several conformational rearrangements throughout assembly and splicing. Although many of the biochemical details surrounding spliceosomal rearrangements remains unclear, recent studies have visualized the formation of a critical folding complex between U2 and U6 snRNAs immediately proceeding the first step of the splicing reaction. This folding event facilitates the formation of a four-helix junction, which is believed to provide scaffolding for the critical components of the active site including aligning the 5´ splice site with the branch point adenosine for inline attack by the 2´ OH and coordinating two Mg2+ ions to stabilize negative charge formation in the proceeding steps.
Evolutionary Origins
A notable characteristic of the U2-U6 fold is its structural similarity to that of domain V in self-splicing group II introns. The AGC triad found in U6 snRNA is conserved in group II introns and has been found to favor the same tertiary stacking interactions as well. The formation of a GU wobble pair early in the U2-U6 folding event is also observed in the formation of the catalytic core of group II introns. Finally, it is likely the spliceosome utilizes the same two-metal ion mechanism as group II introns given the structural conservation of metal binding sites found within the U2-U6 fold. The extent of both secondary and tertiary structure conservation between group II introns and the U2-U6 fold in the active site of the spliceosome strongly suggests both group II introns and the spliceosome share a common evolutionary origin.