H3K4me3


H3K4me3 is an epigenetic modification to the DNA packaging protein Histone H3. It is a mark that indicates the tri-methylation at the 4th lysine residue of the histone H3 protein and often involved in the regulation of gene expression. The name denotes the addition of three methyl groups to the lysine 4 on the histone H3 protein.
H3 is used to package DNA in eukaryotic cells, and modifications to the histone alter the accessibility of genes for transcription. H3K4me3 is commonly associated with the activation of transcription of nearby genes. H3K4 trimethylation regulates gene expression through chromatin remodeling by the NURF complex. This makes the DNA in the chromatin more accessible for transcription factors, allowing the genes to be transcribed and expressed in the cell. More specifically, H3K4me3 is found to positively regulate transcription by bringing histone acetylases and nucleosome remodelling enzymes. H3K4me3 also plays an important role in the genetic regulation of stem cell potency and lineage. This is because this histone modification is more-so found in areas of the DNA that are associated with development and establishing cell identity.

Nomenclature

H3K4me1 indicates monomethylation of lysine 4 on histone H3 protein subunit:

Lysine methylation

This diagram shows the progressive methylation of a lysine residue. The tri-methylation denotes the methylation present in H3K4me3.
The H3K4me3 modification is created by a lysine-specific histone methyltransferase transferring three methyl groups to histone H3. H3K4me3 is methylated by methyltransferase complexes containing a protein WDR5, which contains the WD40 repeat protein motif. WDR5 associates specifically with dimethylated H3K4 and allows further methylation by methyltransferases, allowing for the creation and readout of the H3K4me3 modification. WDR5 activity has been shown to be required for developmental genes, like the Hox genes, that are regulated by histone methylation.

Epigenetic marker

H3K4me3 is a commonly used histone modification. H3K4me3 is one of the least abundant histone modifications; however, it is highly enriched at active promoters near transcription start sites and positively correlated with transcription. H3K4me3 is used as a histone code or histone mark in epigenetic studies to identify active gene promoters.
H3K4me3 promotes gene activation through the action of the NURF complex, a protein complex that acts through the PHD finger protein motif to remodel chromatin. This makes the DNA in the chromatin accessible for transcription factors, allowing the genes to be transcribed and expressed in the cell.

Understanding histone modifications

The genomic DNA of eukaryotic cells is wrapped around special protein molecules known as Histones. The complexes formed by the looping of the DNA are known as Chromatin. The basic structural unit of chromatin is the Nucleosome: this consists of the core octamer of histones as well as a linker histone and about 180 base pairs of DNA. These core histones are rich in lysine and arginine residues. The carboxyl terminal end of these histones contribute to histone-histone interactions, as well as histone-DNA interactions. The amino terminal charged tails are the site of the post-translational modifications, such as the one seen in H3K4me1.

Role in stem cells and embryogenesis

Regulation of gene expression through H3K4me3 plays a significant role in stem cell fate determination and early embryo development. Pluripotent cells have distinctive patterns of methylation that can be identified through ChIP-seq. This is important in the development of induced pluripotent stem cells. A way of finding indicators of successful pluripotent induction is through comparing the epigenetic pattern to that of embryonic stem cells.
In bivalent chromatin, H3K4me3 is co-localized with the repressive modification H3K27me3 to control gene regulation. H3K4me3 in embryonic cells is part of a bivalent chromatin system, in which regions of DNA are simultaneously marked with activating and repressing histone methylations. This is believed to allow for a flexible system of gene expression, in which genes are primarily repressed, but may be expressed quickly due to H3K4me3 as the cell progresses through development. These regions tend to coincide with transcription factor genes expressed at low levels. Some of these factors, such as the Hox genes, are essential for control development and cellular differentiation during embryogenesis.

DNA repair

H3K4me3 is present at sites of DNA double-strand breaks where it promotes repair by the non-homologous end joining pathway. It has been implicated that the binding of H3K4me3 is necessary for the function of genes such as inhibitor of growth protein 1, which act as a tumor suppressors and enact DNA repair mechanisms.
When DNA damage occurs, DNA damage signalling and repair begins as a result of the modification of histones within the chromatin. Mechanistically, the demethylation of H3K4me3 is used required for specific protein binding and recruitment to DNA damage

Epigenetic implications

The post-translational modification of histone tails by either histone modifying complexes or chromatin remodelling complexes are interpreted by the cell and lead to complex, combinatorial transcriptional output. It is thought that a Histone code dictates the expression of genes by a complex interaction between the histones in a particular region. The current understanding and interpretation of histones comes from two large scale projects: ENCODE and the Epigenomic roadmap. The purpose of the epigenomic study was to investigate epigenetic changes across the entire genome. This led to chromatin states which define genomic regions by grouping the interactions of different proteins and/or histone modifications together.
Chromatin states were investigated in Drosophila cells by looking at the binding location of proteins in the genome. Use of ChIP-sequencing revealed regions in the genome characterised by different banding. Different developmental stages were profiled in Drosophila as well, an emphasis was placed on histone modification relevance. A look in to the data obtained led to the definition of chromatin states based on histone modifications. Certain modifications were mapped and enrichment was seen to localize in certain genomic regions. Five core histone modifications were found with each respective one being linked to various cell functions.
The human genome was annotated with chromatin states. These annotated states can be used as new ways to annotate a genome independently of the underlying genome sequence. This independence from the DNA sequence enforces the epigenetic nature of histone modifications. Chromatin states are also useful in identifying regulatory elements that have no defined sequence, such as enhancers. This additional level of annotation allows for a deeper understanding of cell specific gene regulation.

Methods

The histone mark H3K4me3 can be detected in a variety of ways:
1. Chromatin Immunoprecipitation Sequencing measures the amount of DNA enrichment once bound to a targeted protein and immunoprecipitated. It results in good optimization and is used in vivo to reveal DNA-protein binding occurring in cells. ChIP-Seq can be used to identify and quantify various DNA fragments for different histone modifications along a genomic region.
2. Micrococcal Nuclease sequencing is used to investigate regions that are bound by well positioned nucleosomes. Use of the micrococcal nuclease enzyme is employed to identify nucleosome positioning. Well positioned nucleosomes are seen to have enrichment of sequences.
3. Assay for transposase accessible chromatin sequencing is used to look in to regions that are nucleosome free. It uses hyperactive Tn5 transposon to highlight nucleosome localisation.