Leucine-rich repeat neuronal protein 3, also known as neuronal leucine-rich repeat protein 3, is a protein that in humans is encoded by the LRRN3gene.
Gene
The LRRN3 is located on human chromosome 7, at 7q31.1. It contains 6 distinct gt-ag introns, and transcription produces five different mRNAs that appear to differ by truncation of the 3' end. There are only three main transcript variants that actually encode for the LRRN3 protein, with the longest transcript variant being 3744 base pairs in length. All three of these transcript variants have differing lengths and number of exons, but they all have the exon that includes the entire coding sequence for the LRRN3 protein. There are also two paralogs to the LRRN3 gene. These include the LRRN1 and LRRN2 genes, both of which have the same leucine-rich repeats that are characteristic to this family of genes.
Protein
The LRRN3 protein is 708 amino acids in length. The molecular weight of this protein is 79,424 Daltons, with an isoelectric point of 8.02. It is known to be a single-pass type I membrane protein because it spans the membrane once, with its N-terminus on the extracellular side of the membrane, and its signal sequence is removed. The LRRN3 protein contains 12 leucine-rich repeats, along with an LRRNT and an LRR_RI domain. Leucine-rich repeats are unusually rich in the hydrophobic amino acid leucine. They are common in protein-protein interaction motifs and are typically 20-29 amino acids in length. All major classes of LRRs are known to have a curved horseshoe structure with a parallel beta sheet on the concave side and mostly helical elements on the convex side. The LRRN3 protein also has an immunoglobulin domain, a fibronectintype III domain, and a transmembrane region toward the end of the protein. The composition of this protein is shown below.
Gene Conservation
The LRRN3 gene has been shown to be extremely highly conserved. There are 21 orthologs and homologs for this gene going back to zebra fish. This gene has not been found in any invertebrates or plant species. A multiple sequence alignment has shown this very high conservation of the LRRN3 gene among many different species. All 12 of the leucine-rich repeats, along with the LRRNT and the LRR_RI regions, are highly conserved in all vertebrates for which orthologs of this protein have been obtained. The Ig and FN3 domains also show high conservation in all of the orthologous sequences for mammals and birds, but are not as highly conserved for the rest of the homologous sequences. The high conservation of the leucine-rich repeats, the Ig domain, and the FN3 domain show that these regions must be of importance to the functionality of the LRRN3 gene.
Gene Expression
data has shown that the LRRN3 gene is expressed at very high levels in humans, about 2.3 times the average gene. It is most highly expressed in brain, heart, and testes tissues. It is also slightly expressed in kidney, muscle, pharynx, placental, and thymus tissue. The highest expression of the LRRN3 gene for the developmental stages is the fetal stage, but it is also expressed in the infant, juvenile, and adult stages, as can be seen in the EST profile. The expression data for the LRRN3 gene shows that it has its highest expression in the brain. Overall, the LRRN3 gene seems to have high to moderate expression throughout the majority of brain tissue. The highest expression being found in the cerebral cortex, the hippocampal formation, the cerebellar cortex, and the paraflocculus. All of these regions play a role in some key cognitive function, involving the processing of language and sensory stimuli, memory, and oculomotor behavior. The high expression of the LRRN3 gene in these regions of the brain could show that the LRRN3 gene has some importance in these cognitive function.
Structure
The characteristic shape of leucine-rich repeat proteins is an arc or horseshoe shape. This horseshoe shape of the protein is created by a parallel beta sheet on the concave side and mostly helical elements on the convex side. Eleven residue segments of the LRRs, corresponding to the beta-strand and adjacent loop regions are conserved in LRR proteins, whereas the remaining parts of the repeats may be different. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 3D structures of some LRR protein-ligand complexes show that the concave surfact of the LRR domain is ideal for interaction with alpha-helix, thus supporting the conclusions that the elongated and curved LRR structure provides a framework for achieving diverse protein-protein interactions. Comparison of the LRRN3 protein with the chain A of the crystal structure of the LINGO1ectodomain shows that the LRRN3 protein takes on the characteristic horseshoe shape of most leucine-rich repeat proteins. The LINGO1 ectodomain also has a very long stretch of leucine-rich repeats which is the region that has the best alignment with the LRRN3 protein. This similar region shows that the LRRN3 protein has a structure that includes mostly beta-strands that are connected by loops, with a few alpha helices throughout.