Main

During the life cycle of an organism, epigenetic factors provide a heritable, reversible and dynamic manner to modulate gene expression without changing the DNA sequence. In this review, we discuss how the disruption of the principal epigenetic pathways can lead to silencing or inappropriate expression of genes, leading to a new category of pathologies, called epigenetic diseases. The control of eukaryotic gene transcription involves the dynamic positioning of histones, organizing the genome into open or condensed chromatin, governing transcriptional activity and the DNA availability for recombination, replication and repair. Histones N-terminal tails can suffer a variety of covalent, but dynamic post-translational modifications, including acetylation, phosphorylation and methylation, which determine their interactions with other proteins, regulating chromatin structure and remodeling.

Acetylation of conserved lysine residues present in histone tails has long been linked to transcriptional activity. Histone acetylation is a hallmark of transcriptionally active regions, whereas hypoacetylated histones tails are found in transcriptionally inactive euchromatin or heterochromatin regions. The equilibrium of histone acetylation/deacetylation is governed by the opposite actions of histone desacetylases (HDACs) and histone acetyl transferases. Aberrant acetylation or deacetylation leads to such diverse disorders as leukemia, epithelial cancers and Rubinstein–Taybi syndrome.1 Like histone acetylation, the addition of negatively charged phosphate groups to histone tails also neutralize their basic charge and reduce their affinity to DNA. In this context, phosphorylation of serine 10 in histone H3 has been correlated with gene activation in mammalian cells2 and phosphorylation of H2A seemed to occur after activation of DNA-damage signaling pathways, permitting chromatin exposure and, consequently, facilitating the DNA repair.3 Coffin–Lowry syndrome is a chromatin remodeling disorder caused presumably by insufficient H3 phosphorylation and acetylation owing to mutations in the RSK-2 gene (RPS6KA3), which inactivates its protein serine-threonine kinase activity and perturbs cell cycle regulation and cell proliferation.4 Histone methylation can be a mark for both active and inactive chromatin. Lysine residues can be mono-, di- or trimethylated at the ɛ amino group, thereby vastly expanding the possibility of such modification. Usually, methylation of lysine 9 on the N-terminus of histone H3 is associated to silent DNA and is globally distributed throughout heterochromatin regions such as centromeres and telomeres. It appears also at transcriptionally silenced promoters of inactive X chromosome.5, 6 Curiously, evidences indicate that different degrees of methylation could be correlated with distinct levels of gene silencing. For example, while trimethylation of lysine 9 of H3 is associated with constitutive heterochromatin, mono- and dimethylation of the same position were supposed to be associated with facultative heterochromatin.7 On the other hand, methylation of lysine 4 of histone H3 signals for transcriptional activity and is found at promoters of active genes. One explanation for this duality function of histone methylation is that the gene silencing of methylated lysine 9 of histone H3 tails is mediated by the specifically binding of the mammalian heterochromatin protein 1 (HP1) through its conserved chromo domain regions, whereas methylated lysine 4 of histone H3 is apparently not.8 A number of histone methyltransferases with distinct functions have been identified. Recently, Sun et al9proposed that a histone H3 lysine 36-specific HTMase activity could be seen for the huntingtin interacting protein B and could be implicated in the pathogenesis of Huntington disease. The relevance of chromatin remodeling is further emphasized by the action of ATP-dependent SWI/SNF chromatin remodeling complexes. These multisubunit complexes modify the nucleosomes topology in a ATP-dependent manner by disrupting DNA:histone interactions, allowing the access of transcription factors to their targets within the chromatin. Several members of the highly conserved SWI–SNF complex have been implicated in cancer10 and in ATRX syndrome.11 Owing to the multiplicity of these and others potentially histone modifications like ubiquination, poly-ADP ribosylation and sumoylation, we can consider that histones have their own epigenetic code, essential to efficient spatial and temporal gene expression control.

Besides modification of core histones, the genomes of all vertebrates are also modified by the post-synthetic addition of a methyl group in carbon 5 of the cytosine ring, predominantly in 5′-CpG-3′. Methylation of DNA plays a key role in several biological processes, including parental imprinting,12 development,13 X-chromosome inactivation,14 silencing of foreign DNA,15 proper centromeric segregation16 and T-cell function.17 Furthermore, methylation is a prominent factor in aging18 and has been suggested to be involved in the long-term memory process.19 The donation of carbon atoms for maintenance of methylation patterns is mainly realized through the folate cycle, which involves an integrated action of many gene products and other important micronutrients, like vitamin B12, vitamin B6, choline and methionine, obtained by dietary. All these elements are directly or indirectly required for the conversion of homocysteine to methionine, which is the immediate precursor of S-adenosyl methionine (SAM), the major intracellular methyl donor for DNA, protein and lipid methylation maintenance. After donating its methyl group, SAM is converted into S-adenosylhomocysteine (SAH), which then becomes homocysteine after losing its adenosine in a reversible manner. The presence of genetic polymorphisms that alter the functionality of key transport molecules and enzymes required for the folate/homocysteine cycle steps could predispose to genomic instability, altered recombination and abnormal segregation.20 It is clear, however, that the effects of these variants will be closely dependent on the individual folate nutritional status, as a high intake of folate and associated nutrients effectively could neutralize the negative biochemical consequence of the polymorphism(s). In this context, polymorphisms that diminish the bioavailability of folate/homocysteine in the mother around or during pregnancy associated with a low intake of these micronutrients have been proposed to contribute to abnormalities in the fetus, such as failure of neural tube closure,21 cleft palate,22 and non-disjunctions disorders as Down syndrome20 and trisomy 18.23 Furthermore, DNA hypomethylation and high levels of homocysteine associated with low folate intake have been described as important risk factors for cancer,24 including those of the colon, breast, pancreas, cervix, bronchus, as well as leukemia. It is also implicated in the etiology of cardiovascular25 and cerebrovascular diseases,26 diabetes27 and developmental/neurological, such as schizophrenia and depression abnormalities.28 The capacity of homocysteine induces oxidative stress and DNA damage suggests yet that increase of it may predispose the nervous system to aging and age-related neurodegenerative disorders like Parkinson and Alzheimer diseases.29 This aspect may represent one contribution factor to the association of Alzheimer disease (AD) and Down syndrome-related dementia, since by the fourth decade of life, individuals with Down syndrome display many of the same neuropathological features as do individuals with AD. Increasing levels of homocysteine also result in a greater amount of SAH, as well as free adenosine, which are potent inhibitors of transmethylation enzymes and can affect a variety of biochemical processes, including the decrease of hepatocellular detoxification.30 Some reports have yet demonstrated that low folate and vitamin B12 levels associated to demethylation patterns may facilitate the DNA integration of oncogenic viruses and may activate transcription of endogenous retroviruses.15 Corroborating these findings, the treatment of AIDS patients by using folic acid and vitamin B12 supplements has been successful.31 However, despite of the enormous proven benefit of folate nutrition, recent findings alert that the excessive supplemental levels of this vitamin during the periconceptional period could select embryos that carry the folate-associated mutant alleles and it could increase the prevalence of significant human life-threatening diseases associated with the presence of this variant allele along our evolution, like AD.32

Between the DNA methyltransferases, the first described was Dnmt1, which is indispensable for restoring full methylation from the hemi-methylated DNA sites that arises as a result of the semiconservative replication (maintenance activity). It is likely that Dnmt1 requires chromatin to be deacetylated before it can effectively methylate DNA.33 Four other DNA methyltransferases genes were described in mammals (Dnmt2, Dnmt3a, Dnmt3b and Dnmt3-like gene/Dnmt3L). Dnmt2 was recently shown to have a weak DNA methyltransferase activity and to act as a tRNA methyltransferase,34 whereas Dnmt3a and Dnmt3b are high expressed in embryonic cells and have de novo methyltransferase rather than maintenance methyltransferase activity.35 It implies that after a global pre implantation demethylation, Dnmt3a and Dnmt3b are responsible for initiating DNA methylation during early embryonic development. Li et al13 proposed that Dnmt3a and Dnmt3b may cooperate with Dnmt1 in stable maintenance of DNA methylation patterns and that inactivation of Dnmt3a and Dnmt3b in somatic cells leads to chromosomal instability. Recently, it was demonstrated that Dnmt3L physically associates with Dnmt3a and Dnmt3b and stimulates their catalytic activities both in cell culture assays and in vitro.36

Mutations in DNMTs genes, cis-acting sequences or their interacting partners could have profound limitations on specific DNA methylation patterns. All methyltransferases have been found to be overexpressed in human tumors, although to moderate levels only.37, 38 Thus, DNA methylation acts as a double-edged sword in cancer, as whereas oncogenesis is promoted by local hypermethylation of tumor suppressor genes, global genomic hypomethylation, on the other hand, affects oncogene expression and genomic instability.39 So, nowadays epigenetics silencing has been recognized as a third pathway satisfying Knudson's hypothesis. DNMT3B misexpression is also known to be the cause of the ICF syndrome, characterized by the lack of condensation of the pericentromeric heterochromatin regions of chromosomes 1, 9 and 16 owing to a marked hypomethylation of classical satellites II and III associated with centromere instability.40 It is intriguing, however, that although DNMT3B alterations lead to a widespread genomic DNA methylation defect, the ICF phenotype is restricted to facial abnormalities and immune deficiency.

The repression of methylated CpG islands located at promoter sequences showed to be mediated by methyl-CpG binding proteins (MBPs). Upon specifically binding to methylated DNA, the MBPs recruit chromatin remodeling factors, like HDACs, acting as gene expression regulators. The founder member of the MBPs is the methyl-CpG binding protein 2 (MeCP2), which binds to symmetrically methylated cytosine throughout the genome, and is an important transcriptional silencer.41 Loss-of-function mutations in the MECP2 gene are associated to Rett syndrome in females and to a wide spectrum of neurological phenotypes in males.42 Interestingly, a mild overexpression of MeCP2 in neurons is also detrimental and leads to symptoms of severe motor dysfunction and seizures,43 suggesting that a proper dosage of MeCP2 is critical for neuronal function and that levels of MeCP2 in the CNS are tightly regulated. By searching for sequence homology to the conserved methyl-binding domain of MeCP2, other five similar proteins were found. Like MeCP2, the MBD1, MBD2, MBD3 and Kaiso proteins are all involved in transcription repression, whereas MBD4 acts in DNA mismatch repair. Unexpectedly, although MeCP2, MBD2 and MBD4 have been found to be downregulated in human cancers,44 no firm cancer connections have been established for any of the other MBP.45

Other DNA methylation-related epigenetic diseases originate from genetic alterations like tri-nucleotide expansions (as in Fragile X syndrome) and from parental imprinting defects (imprinting disorders). Fragile X syndrome is the best-characterized genetic alteration that modifies the epigenetic coding of a sequence, contributing to a disease phenotype. This syndrome elapses from an expansion of 5′CGG 3′ polymorphic repeats in the 5′UTR of the FMR1 gene and is associated to hypermethylation and changes in the chromatin structure, leading to gene silencing.46 It is conceivable that the expanded number of repeats is recognized as foreign and become methylated de novo like any integrated viral DNA.47 Concerning imprinting-based epigenetic diseases, it is vastly known that maternal and paternal genomes are both required for normal embryonic and postnatal development, although there is a functional asymmetry between them. CpG methylation is also closely associated with imprinting, having an important role in silencing certain genes as well in activating others. Initial parental-origin imprinting signals are established during gametogenesis. In the pre-implantation embryo, the methylation patterns of the gametes are erased with a dramatic decrease in the total level of methylated DNA. After implantation, a wave of de novo methylation establishes a new pattern, in which the majority of CpGs islands are methylated in a tissue-specific manner.48 Therefore, it is unlikely that all epigenetic information is erased in germ cells, as some heritable characteristics in humans may in fact have an inherited epigenetic basis through generations.49 Also, Moore and Haig50 postulated that during fetal growth there is a preferential expression of paternal genes, whereas maternal allele's expression should seek a protective performance of minimizing the absorption of mother resources for future pregnancies. Such details of the epigenetic reprogramming are responsible for the highest source of failure found in cloning approaches. The usual epigenetic alteration leading to imprinting syndromes is uniparental disomy. Angelman (AS) and PraderWilli (PWS) syndromes are the best-known imprinting syndromes, being caused by either deletion or epigenetic faults of the same cluster (15q11–q13). Evidences suggest that the AS cause resides in loss of expression of the E6-associated protein ubiquitin-protein ligase gene (UBE3A), maternal imprinted only in the brain. Otherwise, PWS main cause is attributed to the paternal imprinted SNRPN gene, which encodes the expressed SNURF and SmN spliceosomal (SNRPN – small nuclear ribonucleoprotein-associated polypeptide N) proteins.51 Imprinting alterations are frequently associated with many types of cancer, including acute myoblastic leukemia (paternal chromosome 7), Wilm's tumor (maternal chromosome 11p15.5), neuroblastoma (maternal chromosome 1p36 and paternal chromosome 2), rhabdomyosarcoma (maternal chromosome 11p15.5), glomus tumors (paternal chromosomes 11q13 and 11q22.3–q23.3) and sporadic osteosarcoma (maternal chromosome 13).

Besides modification of core histones and DNA methylation, three other types of epigenetic control have been recently described. The first consists in the activity of post-transcriptional gene silencing transcripts that can be expressed in the form of non-coding RNAs (eg, XIST), non-coding antisense RNAs (eg, TSIX) and RNA interference (RNAi). Non-coding RNAs regulate key processes, such as association of HP1 to chromatin52 and the role of XIST transcript in the initiation of X-chromosome inactivation.53 After fertilization, expression of the gene XIST from the inactive X is required for initiating X inactivation, by the ‘coating’ of a substantial portion of the future inactive X, leading to its DNA methylation and chromatin condensation. Small non-coding RNAs (microRNAs) are also involved in the silencing of repetitive DNA, such as retrotransposons, being important components for genome stability and integrity.54 A survey about the distribution of microRNAs along the genome evidenced that they are frequently located at fragile sites and loci involved in cancer.55 Consistent with this, the microRNA genes miR15 and miR16 are downregulated in two-thirds of chronic lymphocytic leukemia cases.56 Another class of non-coding RNAs, the antisense RNAs, is involved in the silencing of some imprinted genes57 and also in the repression of XIST on the selected active X chromosome by the antisense transcript TSIX and accumulation in cis of Xist.53 A case of α thalassemia was reported, in which a deletion-induced aberrant antisense transcript from a neighboring gene leads to DNA methylation and silencing of the intact HBA2 α-globin gene.58 It lifts primarily suspicion that double-strand RNAs (dsRNAs), formed by the complementary of sense and antisense transcripts, would recruit by their own other epigenetic repressors like those involved in DNA methylation and opened our mind to a novel mechanism of human diseases. By this way, such kind of genomic rearrangement causing epigenetic silencing might elucidate several other unexplained but clinically well-defined disorders. RNAi is a highly conserved silencing pathway triggered by dsRNA, which regulates gene expression via targeted degradation of mRNAs and subsequently transcriptional repression of mRNAs. The hallmark of this pathway involves the cleavage of dsRNAs into 22 nucleotides double and small-interfering RNA molecules (siRNAs) by an Rnase-III-like enzyme called Dicer and the subsequent association of these siRNAs with a multiprotein RNAi effector complex called RISC to provide specificity for the degradation of complementary mRNAs. In some organisms, the RNAi machinery may also require an RNA-dependent RNA polymerase that is believed to amplify dsRNA.59 Recent data, from fission yeast, indicates that the RNAi machinery targets repressive histone modification to specific chromosomal loci destined for silencing.60 RNAi machinery is guided to heterochromatin through the formation of dsRNAs by the bidirectional transcription of repetitive DNA sequences.54, 61 In mammals, this RNAi machinery is conserved and recent evidences indicate that centromeric and pericentromeric heterochromatin repeats are transcribed and dsRNA can silence genes through de novo methylation in such organisms.62 In diseases associated with trinucleotide repeat instability and subsequent DNA methylation-mediated gene silencing, like Fragile X syndrome, one of the models for explaining how the large repeats are targeted for methylation includes the formation of hairpin RNA (similar to dsRNA) from the transcribed large repeats, which might be cleaved by Dicer and subsequently recruit the RNAi silencing machinery.63

The second additional class of epigenetic regulation involves the presence of methylation-dependent sensitive insulators, fundamental for the roles of insulin-like growth factor 2 (IGF2) and H19 genes. These are proximal and oppositely imprinted genes: expression is from the paternal copy of IGF2 and from maternal copy of H19. An insulator element has been found upstream of H19, between this gene and IGF2, and is identified as a differentially methylated domain (DMD), which interacts with enhancer elements downstream of H19 and acts as a chromatin boundary that regulates IGF2 imprinting. On the maternal chromosome, an enhancer-blocking protein binds to the DMD, preventing the enhancers from interacting with the promoter of IGF2, favoring only H19 expression. In contrast, on the paternal chromosome, DMD is methylated and, consequently, the blocking protein cannot bind and IGF2 can be expressed whereas H19 is not.64 Recently, the importance of these genes was thoroughly discussed by Kono et al65 who combined the chromosomal complements of a mature, fully grown mouse egg with a non-growing egg from a new-born mouse, in which the H19 gene and the DMD had been deleted for mimic parental imprint. Surprisingly, this manipulation was sufficient for providing normal results for the expression of 1000 between 11000 analyzed genes, including other apparently unconnected imprinted genes.

The latter form of epigenetic control refers to the action of Polycomb Group (PcG) and Trithorax Group (TrxG), which are an ancient group of chromatin modifiers proteins that constitute a cellular memory system responsible for controlling chromatin accessibility and maintaining transcription in the first stages of embryonic life, throughout development and in adulthood.66 For this purpose, these proteins usually have antagonistic effects on target genes, including developmentally and cell cycle regulated genes: PcG proteins act as stable repressors whereas TrxG proteins promote maintenance of gene activity. Evidences showed that components of the PcG complex and two polypeptides of the TrxG, TRX and ASH1, are histone methyltransferases, suggesting that the memory system uses a similar pathway to regulate the binding of both PcG and TrxG complexes to their sites.67, 68

The emerging role of PcG proteins in X-chromosome inactivation has calling attention for the involvement of PcG proteins with RNA machinery. Thus, X inactivation represents one of the great paradigms of epigenetics, as it involves among other mechanisms, the roles of antisense transcripts69 and other non-coding RNAs,70, 71 H3 K9 di-methylation,5 H3 K27 tri-methylation,6 H4 acetylation,72 DNA methylation,14 possible chromatin insulation73 and Eed and Enx1 PcG proteins.74 PcG EZH2 is overexpressed in many types of cancer, including those of prostate75 and breast.76 Moreover, the homolog of the Drosophila PcG PHO in humans, the YY1 transcriptional repressor, has been found to be part of a repressive complex bound to the D4Z4 3.3 kb repeat that regulates genes involved in the control of the facioscapulohumeral dystrophy.77 Dystrophic patients carry deletions of these repeats and show depression of neighboring and more distant transcription units. As 50% of the genome is composed by repetitive sequences, it will be interesting if and how repetitive sequences could be a major target of PcG proteins in mammalian genome.66

Elucidation of the precise way by which the human epigenome reprogramming occurs during gametogenesis, development and aging is the dream of returning differentiated cells into an undifferentiated stem cell state and it will certainly provide new therapeutic strategies for the treatment of epigenetic-based diseases. The investigation of the epigenome by high-throughput genomic technologies has been allowed the identification of differentially methylated sites/genes, which could be used as potential epigenetic markers for routine diagnosis and prognosis. Concerning cancer, many CpG islands that exhibit abnormal methylation have been identified and have been considered for identification of cancer classes, monitoring of tumor progression and evaluation of distinct drug intolerance between patients. In the future, it is expected that personalized medicine could be provided as a result of epigenotype profiling of critical genes. Although these approaches are not still available, an increasing number of agents that alter DNA methylation or histones positioning patterns are currently being tested in clinical trials.

Inhibitors of DNA methylation are able to reactivate the expression of previously silenced genes owing to an epigenetically pathological situation, like the case of tumor-suppressor genes. For this purpose, the most used hypomethylating agents are pyrimidine nucleosides analogs of cytidine (eg, 5-azacytidine and 5-aza-2′-deoxycytidine) that specifically inhibits DNA methylation by its incorporation into replicating DNA in place of cytosine, trapping DNA methyltransferases and resulting in heritable demethylated DNA. In addition to the fact that non-proliferating cells are relatively insensitive to azacytidine, the great disadvantage of these agents is their cytotoxicity, particularly when used in high doses. Therefore, the proved clinical efficiency of low-dose of 5-azacytidine in treating hemoglobinopathies,78 myeloid dysplastic syndrome and other leukemias79 prompted its regular approval by the US Food and Drug Administration for the treatment of patients with all subtypes of myelodysplastic syndrome as injectable suspension. Moreover, several in vitro/in vivo experiments of inhibitors of DNA methylation have showed positive results for autoimmune anomalies80 and Fragile X syndrome.81 Thus, epigenetic therapy might be also a good strategy to treat females bearing X-linked dominant disorders, like Rett syndrome. As X-inactivation is random, 50% of the females' cells harbor an unused but intact copy of the gene, which can be reactivated, ameliorating the clinical phenotype of the disorder. An alternative for preventing the cytotoxicity of the demethylating agents is the development of antisense oligonucleotides that specific target DNA methyltransferases.82

Pathological epigenetic silencing is almost associated with histone deacetylation. For therapy purpose, HDACs can be globally or specifically inhibited by a structurally diverse group of small molecules, including hydroxamates, cyclic peptides, aliphatic acids, benzamides and electrophilic ketones.83 The inhibitory effects occur owing to acetylated histones accumulation, leading to transcription activation of select genes whose previous expression had been epigenetically turned off. Therefore, as there are many types of HDACs, the success of the inhibition approach as a therapeutic role is directly linked to the selectiveness of each HDACs inhibitor and its cytotoxicity. Such therapy strategy has long being routinely present in the treatment of epilepsy and bipolar disorder with valproic acid.84 Experimentally, HDACs inhibition worked efficiently in cell lines of non-small lung cancer85 and several other types of solid and hematological tumors.86 Positive therapy results have been also found in animals models for cardiac hypertrophy,87 autoimmune encephalomyelitis88 and polyglutamine expansion-associated neurological disorders, such as Dentatorubral–pallidoluysian atrophy,89 Huntington disease90 and spinal and bulbar muscular atrophy.91

The closely link between DNA methylation and histone deacetylation stimulated the development of experimental combined and efficient therapies, in which demethylating agents are used simultaneously with HDACs inhibitors, like 5-aza-2′deoxycytidine and trichostatin or sodium phenylbutyrate.92 Moreover, cancer pre-clinical researchers have demonstrated that HDAC inhibitors and demethylating agents can be potentially used as sensitizing agents for subsequent therapies as chemotherapy,93 radiotherapy94 and immunotherapy.95 However, epigenetic therapy should be taken with care, as it could wake up dormant hypermethylated oncogenes, silenced alleles of imprinted genes, genes on the female inactive X chromosome or other intentionally spatial/temporal silenced genes.

The most intriguing and exciting feature concerning epigenetic diseases is that, except to cancer, neurocognitive disturbs (mainly mental disturbances or neurodegenerative disorders) are present in almost all of them. One possible explanation is that the complexity of brain depends on the precise spatial and temporal action of many genes, by the way that a minimal alteration is sufficient to disturb the balance of the integrated gene expression. Furthermore, complex cytoarchiteture and cellular signaling mechanisms of the nervous system are reflected at the molecular level with more genes being expressed in this tissue than in any other.29 Future eager research will certainly provide more mechanistic understanding into how epigenetics pathways can regulate or disturb the role of genetics code in neuronal differentiation and maturation and may therefore reveal novel approaches for preventing and treating a great number of epigenetic diseases.