Chitinase domain-containing protein 1
Chitinase domain-containing protein 1 is a highly conserved protein of unknown function located on the short arm of chromosome 11 near the telomere. The protein has 27 introns, which allows for many isoforms of this gene. It has several aliases, the most common of which is Stabilin-1 interacting chitinase-like protein. As indicated by the alias, CHID1 is known to interact with the protein STAB1. CHID1 is expressed ubiquitously at levels nearly 6 times the average gene, and is conserved very far back to organisms such as Caenorhabditis elegans and possibly some prokaryotes. This protein is known to have carbohydrate binding sites, which could be involved in carbohydrate catabolysis.
Gene
CHID1 is located on chromosome 11 at the location p15.5. It is just downstream of TSPAN4 and upstream of AP2A2. CHID1 is ubiquitously expressed at a high levels. Through microarray analysis, it has been shown that CHID1 is generally expressed at 5.7 times the average gene. CHID1 has many known variants, which is attributed to its 37 exons. There are no inherent repeats or hairpin structures to be found within the coding region of CHID1. This gene is a member of the GH18 superfamily, which dictates some of its protein structure. This gene also has several aliases, the most common of which is Stabilin-1 interacting chitinase-like protein, or SI-CLP, which indicates its known interaction with STAB1.Variations
Due to its large size and many exons, CHID1 has many transcript variants that have been identified through mRNA sequencing. CHID1 has 27 exons and 22 known splice forms. These forms indicate that there may be multiple promotor regions and transcription start sites used within the genome. The most commonly found transcripts translate to about 400 amino acids each.Homology
CHID1 shows a very high level of conservation. It has been identified in many model systems including Drosophila melanogaster, Caenorhabditis elegans, and Oryza sativa. When conservation is observed over a large region, it becomes clear which protein domains are most important to CHID1. By far, the two best conserved factors are exon junctions you can add a link to exon junction, as well as several known carbohydrate binding sites. Another important region appears to be the last 15 amino acids, which retain a high level of conservation even through insect sequences.of several CHID1 homologs with highlighted conserved sequences and important features indicated.
Species | Common name | Divergence in millions of years | Accession number | Similarity to human sequence |
Gorilla gorilla gorilla | Gorilla | 8.8 | XP_004050445.1 | 100% |
Macaca mulatta | Rhesus macaque | 29.0 | AFI38071.1 | 94% |
Equus caballus | Horse | 94.2 | XP_003362692.1 | 91% |
Felis catus | House cat | 94.2 | XP_003993852.1 | 90% |
Taeniopygia guttata | Finch | 296 | XP_002199280.2 | 78% |
Ictalurus punctatus | Catfish | 400.1 | NP_001187344.1 | 70% |
Apis florea | Honey bee | 782.7 | XP_003691984.1 | 61% |
Paralogs
There are also several proposed paralogs to CHID1. These genes are di-N-acetylchitobiase, Chitinase-3-like protein 2, and chitotriosidase 1. All of these genes have roughly 40% similarity, and match human transcripts of CHID1 over 40-50% of the gene. Based on the level of conservation, these paralogs all likely split at a time near to the split from bacteria.Name | Accession number | Length | Similarity to CHID1 in humans |
di-N-acetylchitobiase | NP_004379.1 | 385 | 38% |
Chitinase-3-like protein 2 | NP_001020368.1 | 380 | 39% |
Chitotriosidase-1 | NP_001243054.2 | 447 | 40% |