Complement receptor 1
Complement receptor type 1 also known as C3b/C4b receptor or CD35 is a protein that in humans is encoded by the CR1 gene.
This gene is a member of the regulators of complement activation family and is located in the 'cluster RCA' region of chromosome 1. The gene encodes a monomeric single-pass type I membrane glycoprotein found on erythrocytes, leukocytes, glomerular podocytes, hyalocytes, and splenic follicular dendritic cells. The Knops blood group system is a system of antigens located on this protein. The protein mediates cellular binding to particles and immune complexes that have activated complement. Decreases in expression of this protein and/or mutations in its gene have been associated with gallbladder carcinomas, mesangiocapillary glomerulonephritis, systemic lupus erythematosus and sarcoidosis. Mutations in this gene have also been associated with a reduction in Plasmodium falciparum rosetting, conferring protection against severe malaria. Alternate allele-specific splice variants, encoding different isoforms, have been characterized. Additional allele specific isoforms, including a secreted form, have been described but have not been fully characterized.
In primates, CR1 serves as the main system for processing and clearance of complement opsonized immune complexes. It has been shown that CR1 can act as a negative regulator of the complement cascade, mediate immune adherence and phagocytosis and inhibit both the classic and alternative pathways. The number of CR1 molecules decreases with aging of erythrocytes in normal individuals and is also decreased in pathological conditions such as systemic lupus erythematosus, HIV infection, some haemolytic anaemias and other conditions featuring immune complexes. In mice, CR1 is an alternatively spliced variant of the complement receptor 2 gene.
Certain alleles of this gene have been statistically associated with an increased risk of developing late-onset Alzheimer's disease.
Gene region
In humans, the CR1 gene is located on the long arm of chromosome 1 at band 32 and lies within a complex of immunoregulatory genes. In 5’-3’ order the genes in this region are: membrane cofactor protein – CR1 – complement receptor type 2 – decay-accelerating factor – C4-binding protein.- Membrane cofactor protein is a widely distributed C3b/C4b binding regulatory glycoprotein of the complement system;
- decay-accelerating factor protects host cells from complement-mediated damage by regulating the activation of C3 convertases on host cell surfaces;
- complement receptor 2 is the C3d receptor.
Gene structure and isoforms
The canonical Cr2/CD21 gene of subprimate mammals produces two types of complement receptor via alternative mRNA splicing. The murine Cr2 gene contains 25 exons; a common first exon is spliced to exon 2 and to exon 9 in transcripts encoding CR1 and CR2, respectively. A transcript with an open reading frame of 4,224 nucleotides encodes the long isoform, CR1; this is predicted to be a protein of 1,408 amino acids that includes 21 short consensus repeats of ca. 60 amino acids each, plus transmembrane and cytoplasmic regions. Isoform CR2 is encoded by a shorter transcript that lacks exons 2–8 encoding SCR1-6. CR1 and CR2 on murine B cells form complexes with a co-accessory activation complex containing CD19, CD81, and the fragilis/Ifitm proteins.The complement receptor 2 gene of primates produces only the smaller isoform, CR2; primate CR1, which recapitulates many of the structural domains and presumed functions of Cr2-derived CR1 in subprimates, is encoded by a distinct CR1 gene.
Isoforms CR1 and CR2 derived from the Cr2 gene possess the same C-terminal sequence, such that association with and activation through CD19 should be equivalent. CR1 can bind to C4b and C3b complexes, whereas CR2 binds to C3dg-bound complexes. CR1, a surface protein produced primarily by follicular dendritic cells, appears to be critical for generation of appropriately activated B cells of the germinal centre and for mature antibody responses to bacterial infection.
The most common allelic variant of the human CR1 gene is composed of 38 exons spanning 133kb encoding a protein of 2,039 amino acids with a predicted molecular weight of 220 kDa. Large insertions and deletions have given rise to four structurally variant genes and some alleles may extend up to 160 kb and 9 additional exons. The transcription start site has been mapped to 111 bp upstream of the translation initiation codon ATG and there is another possible start site 29 bp further upstream. The promoter region lacks a distinct TATA box sequence. The gene is expressed principally on erythrocytes, monocytes, neutrophils and B cells but is also present on some T lymphocytes, mast cells and glomerular podocytes.
Structure
The encoded protein has a 47 amino acid signal peptide, an extracellular domain of 1930 residues, a 25 residue transmembrane domain and a 43 amino acid C terminal cytoplasmic region. The leader sequence and 5'-untranslated region are contained in one exon. The large extracellular domain of CR1, which has 25 potential N-glycosylation sites, can be divided into 30 short consensus repeats , each having 60 to 70 amino acids. The sequence homology between SCRs ranges between 60 and 99 percent. The transmembrane region is encoded by 2 exons and the cytoplasmic domain and the 3'-untranslated regions are coded for by two separate exons.The 30 or so SCRs are further grouped into four longer regions termed long homologous repeats each encoding approximately 45 kDa of protein and designated LHR-A, -B, -C, and -D. The first three have seven SCRs while LHR-D has 9 or more. Each LHR is composed of 8 exons and within an LHR, SCR 1, 5, and 7 are each encoded by a single exon, SCR 2 and 6 are each encoded by 2 exons, and a single exon codes for SCR 3 and 4. The LHR seem to have arisen as a result of unequal crossing over and the event that gave rise to LHR-B seems to have occurred within the fourth exon of either LHR-A or –C. To date the atomic structure have been solved for SCRs 15–16, 16 & 16–17.
Alleles
Four known human alleles encode proteins with predicted molecular weights of 190 kDa, 220 kDa, 250 kDa and 280 kDa. Multiple size variants are also found among non-human primates and a partial amino-terminal duplication that encodes the short forms expressed on non human erythrocytes. These short CR1 forms, some of which are glycosylphosphatidylinositol anchored, are expressed on erythrocytes and the 220-kDa CR1 form is expressed on monocytes. The gene including the repeats is highly conserved in primates possibly because of the ability of the repeats to bind complement. LHR-A binds preferentially to the complement component C4b: LHR-B and LHR-C bind to C3b and also, albeit with a lower affinity, to C4b. Curiously the human CR1 gene appears to have an unusual protein conformation but the significance of this finding is not clear.The mean number of complement receptor 1 molecules on erythrocytes in normal individuals lies within the range of 100–1000 molecules per cell. Two codominant alleles exist – one controlling high and the other low expression. Homozygotes differ by a factor of 10–20: heterozygotes typically have 500–600 copies per erythrocyte. These two alleles appear to have originated before the divergence of the European and African populations.
Rosetting
interacts with uninfected erythrocytes. This 'stickiness', known as rosetting, is believed to be a strategy used by the parasite to remain sequestered in the microvasculature to avoid destruction in the spleen and liver. Erythrocyte rosetting causes obstruction of the blood flow in microcapillaries. There is a direct interaction between PfEMP1 and a functional site of complement receptor type 1 on uninfected erythrocytes.Role in blood groups
The Knops antigen was the 25th blood group system recognized and consists of the single antigen York a with the following allelic pairs:- Knops a and b
- McCoy a and b
- Swain-Langley 1 and 2