SNED1

SNED1 is a human protein expressed at low levels in a wide range of tissues. The protein is soluble and found in circulating blood and the conceptually translated protein has four domains of interest. These domains include a nidgen domain, three fibronectin type III domains, several calcium binding EGF-like domains, and one complement control protein domain. The gene is found on chromosome 2, locus q37.3. The mRNA was isolated from the spleen and is 6834bp in length. The conceptually translated protein is 1178aa long. This protein is predicted to interact with somatostatin, spermidine synthase and TMEM132C.

Gene

Locus

SNED1 is located on the plus strand of chromosome 2 at locus 2q37.3. The Refseq identification number is The genomic DNA sequence of SNED1 contains 96,729bp and the longest spliced mRNA as predicted by AceView is 7048bp and contains 31 exons. There are 9 splice variants of SNED1 that exhibited protein structure matches using the Phyre 2 database which is discussed under "Tertiary and Quaternary Structure".

Common aliases

SNED1 is an acronym for "sushi, nidogen, and EGF-like domains". Aliases for SNED1 include Snep, SST3, and IRE-BP1.

Homology/evolution

Homologs and phylogeny

SNED1 is very highly conserved throughout evolutionary history and is shown to exhibit this conservation across a wide range of taxa, from mammals, to vertebrates, to invertebrates.It may be worth noting that the abundance of cysteine residues appear to be very highly conserved, suggesting that the cysteine richness is a very important feature of this protein.

Paralogs

SNED1 has a number of paralogs within the human genome, which cover small portions of the entire peptide sequence. There was no BLAST result that provided a hit that covered 100% of the query. Most hits fell in the 50-70% of query coverage and Max identity did not exceed 65%. Endogenous genes that are similar to the conserved domains in SNED 1 include; neurogenic locus notch homolog isoforms, protein-jagged precursors, protein eyes shut homolog isoforms, protein crumbs homolog isoforms, delta and notch-like epidermal growth factor receptor, sushi von Wilebrand factor A, and slit homolog 3 protein.

Protein

Primary sequence

The peptide sequence with the longest ORF was found by creating a conceptual translation using the SIXFRAME tool at the SDSC biology workbench website and this was the sequence used for most analyses. The full sequence obtained by an ncbi BLAST search can be accessed with the reference ID . One presumably important feature of this protein that is worth noting is that it is extraordinarily cysteine rich, with 106 cysteines total, giving an overall cysteine composition of 9.0%.

Domains and motifs

There are various interesting domains in this protein. The first in the annotated sequence above shown in pink, is an extracellular domain of unknown function within the Nidogen-1 domain, also known as Entactin. The second regions of interest shown by an underline are calcium-binding EGF domain. There are many of these domains in the sequence and they are often present in a large number of membrane bound and extracellular proteins. These EGF-CA domains may suggest a "sticky" nature to this protein as oftentimes extracellular matrix proteins require calcium cations to form homo and heterodimeric complexes between other ECM proteins. The complement control protein motif is annotated in green in the figure and this domain has been identified in many proteins involved in the complement system. Other aliases for this domain include short consensus repeats and the Sushi domain, from which the protein gets its name. The Fibronectin type III domain is annotated in blue and the presence of this domain may suggest one of the properties of this protein as being involved in cell adhesion. This FN3 domain contains internal repeats that are present in the plasma protein fibronectin. This particular domain contains the RGD sequence important in the binding of ECM proteins to integrins found in cell membranes, an important feature in cellular adhesion.

Post-translational modifications

There was only a few post-translational kinase dependant phosphorylation sites worth noting that resulted in a score of >0.8 by the NetPhosK program in the ExPASy Bioinformatics suite proteomics tools. These sites are annotated with yellow highlight in the conceptual translation above. All of these sites are predicted to be phosphorylated by either Protein kinase A or Protein kinase C.

Secondary structure

The amino acid sequence of the longest variant is incredibly cysteine rich, presumably resulting in a large amount of di-sulfide bond formation. There is not an organized profound string of alpha helices, but there is a cluster of alpha helices toward the C-terminus. The beta sheets are annotated as purple text in the conceptual translation and the alpha-helices are annotated as red text.

Tertiary and quaternary structure

The program Phyre2 was used to construct predictions of both the conserved domain regions NIDO, CCP, and FN3, as well as each of the splice variants. There were some interesting results consistent with the proposed function of an extracellular "sticky" protein possibly involved in cell-cell adhesion or in clotting. Protein matches found in Phyre2 comprise an array of proteins with functions of; clotting, hydrolysis, plasminogen activation, hormone/growth factor, protein binding, cell-adhesion, and ECM proteins.
Splice variants a, b, and e, in Figures 5 and 6 have >99% structural similarity to the protein neurexin 1-alpha. Neurexins are cell adhesion molecules and often contain EGF binding domains, enhancing intracellular junction forming between cells. NRXN1 is also proposed to play a role in angiogenesis. Alpha-neurexins interact with neurexophilins and possibly function in the synaptic junctions of the vertebrate nervous system. Alpha neurexins often utilize alternate promoters and splice sites, resulting in many different transcripts from one gene, may be an explanation of this gene's abundance of alternative transcripts.
Splice variant d has a 100% structural match to Low density lipoprotein receptor-related protein 4. This protein is involved in SOST-mediated bone formation inhibition and inhibition of Wnt signaling. LRP4 plays an important role in the formation of neuromuscular junctions.
Splice variants f and g have >99% similarity to fibrillin-1, an ECM protein that is a structural component of calcium binding microfibrils.
Splice variant i and conserved domain CCP are >99% structurally similar to t-plasminogen activator. PLAT is secreted by vascular endothelial cells and acts as a serine protease that converts plasminogen to plasmin. Plasmin is a fibrolytic enzyme that aids in the breakdown of blood clots and is used clinically for that exact purpose.
The conserved domain NIDO, was >99% similar to coagulation factor IX, also known as Factor IX. F9 is a secreted coagulation factor involved in the clotting cascade that required activation by multiple other coagulation factors within the cascade.
The 3 consecutive conserved FN3 domains together are >100% similar with 100% coverage to anosmin 1. Anosmin-1 is an ECM glycoprotein responsible for normal neural development of the brain, spinal chord and kidney.

Interacting proteins

The STRING-Known and Predicted Protein Interaction database was used to determine proteins that may be interacting and the following proteins were candidates for interaction: somatostatin, somatostatin receptor 2 as well as a variety of other somatostatin receptors, spermine synthase, and TMEM132C. All of the somatostatin related proteins are involved in the inhibition of hormones. There is very little known about TMEM132C and all publications related to the protein are mass genome screens. The protein expression profiles of TMEM132C and SNED1 are very similar to SNED1, with protein abundance found in blood plasma, platelets, and liver. All of the interacting proteins described are expressed in these three common areas.

Expression

SNED1 is ubiquitously expressed at intermediate levels, making it unclear from RNA expression profiles, which cells are secreting SNED1. The protein expression profiles of SNED1 predicted with MOPED-Multi-Omics Profiling Expression Database and PaxB-Protein Abundance Across Organisms database indicate that the protein is found in blood serum, blood plasma, blood T-lymphocytes, platelets, kidney Hek-293 cells, liver, and low levels in the brain.

Transcript variants

The program Aceview was used to predict transcript variants, shown in Figure 6. There are 9 spliced forms and 3 unspliced forms. Three of the transcript variants, b, c, and e, contain green regions that represent uORFs which indicate that they contain regulatory elements within the coding region of the transcript. All of the spliced transcript variants a-i were analyzed with the Phyre2 server to predict protein structure. See, "Tertiary and Quaternary Structure".

Promoter

The promoter was predicted and analyzed for transcription factor binding sites using the ElDorado software on the Genomatix software suite. There were alternative promoters downstream of the selected 845bp promoter.

Transcription factors

The following transcription factors were found with a matrix similarity of 1.00 and the entire binding domain was matched in the ElDorado predicted promoter.

Matrix Family	Detailed Family Information	Matrix	Detailed Matrix information	Strand	Matrix similarity	Sequence
BRAC	Brachyury gene, mesoderm developmental factor	TBX20.01	T-box transcription factor TBX20		1.00	gcatcgcggAGGTgtgcgggcgg
TF2B	RNA polymerase II transcription factor II B	BRE.01	Transcription factor II B recognition element		1.00	ccgCGCC
XCPE	Activator-, mediator-, and TBO-dependent core promoter element for RNA polymerase II transcription from TATA-less promoter	XCPE1.01	X gene core promoter element 1		1.00	ggGCGGgaccg
ZF02	C2H2 zinc finger transcription factors 2	ZKSCAN3.01	Zinc finger with KRAB and SCAN domains 3		1.00	catggCCCCaccacagggcgcgc
SP1F	GC-Box factors SP1/GC	SP1.03	Stimulating protein 1, ubiquitous zinc finger transcription factor		1.00	cggggGGGCggggccat
PLAG	Pleomorphic adeoma gene	PLAG1.02	Pleomorphic adeoma gene 1		1.00	aaGGGGgcagcacggaacgggtt

Clinical significance

A select cases on NCBI's GeoProfiles highlighted some clinically relevant expression data regarding SNED1 expression levels in response to certain conditions. In aldosterone producing adenoma versus control lung tissue, SNED1 expression decreased about 25 fold in the adenoma tissue. In a development study on the transition from oligodendrocyte precursors to mature oligodendrocytes, expression decreased almost 100 fold upon differentiation into mature oligodendrocytes. It may be interesting to explore the expression in clotting disorders or other blood related diseases.
Several studies have shown the role of SNED1 as a facilitator of metastasis.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...