Skip to main content

Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia



Monozygotic twins are valuable in assessing the genetic vs environmental contribution to diseases. In the era of complete genome sequences, they allow identification of mutational mechanisms and specific genes and pathways that offer predisposition to the development of complex diseases including schizophrenia.


We sequenced the complete genomes of two pairs of monozygotic twins discordant for schizophrenia (MZD), including one representing a family tetrad. The family specific complete sequences have allowed identification of post zygotic mutations between MZD genomes. It allows identification of affected genes including relevant network and pathways that may account for the diseased state in pair specific patient.


We found multiple twin specific sequence differences between co-twins that included small nucleotides [single nucleotide variants (SNV), small indels and block substitutions], copy number variations (CNVs) and structural variations. The genes affected by these changes belonged to a number of canonical pathways, the most prominent ones are implicated in schizophrenia and related disorders. Although these changes were found in both twins, they were more frequent in the affected twin in both pairs. Two specific pathway defects, glutamate receptor signaling and dopamine feedback in cAMP signaling pathways, were uniquely affected in the two patients representing two unrelated families.


We have identified genome-wide post zygotic mutations in two MZD pairs affected with schizophrenia. It has allowed us to use the threshold model and propose the most likely cause of this disease in the two patients studied. The results support the proposition that each schizophrenia patient may be unique and heterogeneous somatic de novo events may contribute to schizophrenia threshold and discordance of the disease in monozygotic twins.


Monozygotic (MZ) twins originate from a single fertilized zygote and have been used to study the relative contribution of nature and nurture on a variety of phenotypes and disorders for well over a century. Schizophrenia, which is among the most devastating of the major mental health disorders, is thought to have both genetic and environmental causes [1]. Results show that although the frequency of schizophrenia is only 1% in the general population, its concordance in MZ twins approximates ~ 50% and not 100% [2]. Interestingly, recent results have shown that MZ twins may differ for de novo events, that include copy number variations (CNVs) [3, 4] and DNA methylation [5,6,7,8]. The timing, rate, extent and impact of such de novo events however, is difficult to ascertain. They may occur anytime during development as a normal aspect of development of an organ [9] and cell type [10]. Also, depending on the developmental stage at which they arise, will determine their presence in all or almost all cells of an individual or represent mosaicism. The random nature of this variation is expected to differ between MZ twins and contribute to twin differences. Although logical and attractive as a potential hypothesis for the cause of disease discordance in MZ twins, the confirmation of this hypothesis faces two challenges. The first is the identification of all or almost all differences between the genomes of discordant MZ twins and the second is establishing the significance of the observed genomic difference in the disease. Although the former is gradually becoming possible through increased resolution of genomic technologies, the latter remains challenging and will demand diligent efforts [11, 12].

The challenge of analysis and interpretation of complete genome sequences is attributed to a variety of factors [13]. First, the sequence coverage is not always 100% and second, the differences between MZ twins are not always easy to confirm due in part to expected mosaicism of unknown frequency and distribution [14], even though it is possible to differentiate between the pre-zygotic or post-zygotic origin of such twin differences using parental sequences. The next challenge is any involvement of observed differences in the development of schizophrenia. This of course is a much more complicated question. Schizophrenia is highly heterogeneous and affects ~ 1% of the world’s population with a heritability estimate of ~ 80% [1, 15]. Extensive research on this disorder has generated a large number of candidate genes. This includes a report involving 150,000 individuals with over 35,000 schizophrenic patients that has implicated over 108 genetic loci [16]. This considerable body of multifactorial and rather trustworthy data provides a standard for assessing the potential role of any new observation. We note that previously identified loci that are considered particularly trustworthy include DRD2, a common anti-psychotic target, as well as a number of glutamate receptors (GRIA1, GRIN2A, GRM3), members of the voltage gated calcium channels (CACNA1C, CACNA1l and CACNB2) and genes involved in synaptic plasticity [16]. As it stands, there are no common variants that account for a substantial portion of the liability to develop this disease [17]. It argues that the disease is highly heterogeneous and that most patients are genetically distinct. Also, any difference, particularly involving previously identified genes between monozygotic twins discordant for the disease, will have a high probability of being involved in the development of the disease. The results will be patient specific and may or may not apply to all or even most patients.

Finally, the results at hand allow us to appraise the threshold model for the development of schizophrenia [18]. It adds that inherited gene mutations [19,20,21] place an individual on a liability scale for the development of this disease. Additional contributors to this scale may include de novo mutations, environment factors and/or epigenetic [22] events, most occurring during ontogenic development [23]. The model predicts that the liability threshold may be met by inherited factors alone or require additional random genetic and epigenetic events including environmental events during development. We propose that this model can be tested using MZ twins discordant for schizophrenia with the assumption that inherited components will be shared between twins. Further, additional de novo mutations (and epimutations) will be acquired during ontogeny in the affected but not in unaffected twin. Given high heterogeneity of the disease, such results across MZD twin pairs may follow similar pattern but differ in the genes affected. Further, the genes affected may belong to a number of common pathways that appear to be invariably affected in this complex disease. Finally, the genes identified in this research are known to be involved in post-synaptic complexes [24] and synaptic strength [25], processes that are known to be defective in several but not all patients with schizophrenia.



This research received ethics approval from the Committee on Research Involving Human Subjects at The University of Western Ontario, London, Ontario, Canada. The methods were carried out in accordance with all approved guidelines and regulations. Two families with monozygotic twins discordant for schizophrenia were selected based on their long term clinical features assessed in-person by Richard O’Reilly (Psychiatrist), using the Structured Clinical Interview for DSM-IV (SCID I and SCID II) and a review of medical records from multiple decades [26, 27]. All participants provided written informed consent. A second senior psychiatrist independently reviewed videotapes of the structured interviews of the twins and confirmed the diagnoses. It has assured a reliable diagnosis and long-term discordance of the two MZ twin pairs used in this study. All six subjects (Fig. 1) provided blood and cheek swab samples that were used to isolate genomic DNA using the PerfectPure DNA blood kit (Blood samples), and the QIAGEN DNA Micro Kit (Buccal samples), following the manufacturer’s protocols. The experimental workflow of the analysis is summarized in Fig. 2.

Fig. 1

Members of the two discordant monozygotic twin pairs and one set of parents included in the study

Fig. 2

Flow chart of methods employed

The twins from family 1 are Afro-American females and contributed DNA samples at age 53. The affected twin (I-2-1) was diagnosed with schizophrenia at age 22. She was admitted to a psychiatric unit on at least 12 occasions and experienced paranoid delusions, auditory hallucinations and occasionally euphoria during her acute episodes of illness. This twin had significant functional decline and lived with her parents throughout her adult life. The diagnosis of schizophrenia was confirmed by the study psychiatrist. I-2-2 did not experience any significant symptoms of mental illness until she developed a brief episode of depression at age 48, which was treated with an antidepressant. Four years later this twin developed an acute episode of mania, which required hospitalization and she was placed on the mood stabilizer divalproex sodium. She was diagnosed as having a bipolar disorder by the study psychiatrist. This twin pair has been previously described in detail [28]. At the time of sample collection, the twins were discordant for schizophrenia for 31 years. The father (I-1-1) of the twins was aged 82 at assessment and had a mild obsessive–compulsive personality disorder but no other psychiatric problems, while the mother (I-1-2) was aged 74 and had never had any psychiatric problems.

At the time of sample collection, the twins from family 2 (Fig. 1) were 43 years of age. The twins are Caucasian females. Twin II-2-1 became ill at age 27. She experienced paranoid delusions and hallucinations, which usually occurred in the context of a euphoric or depressed mood. She lived with her twin sister and had worked intermittently. She was diagnosed with a schizoaffective disorder. The twin sister (II-2-2) sustained skeletal injuries in a motor vehicle accident at age 18 years. A boyfriend died close to the time of this accident and II-2-2 became depressed for a short period around this time. She has had no further episodes of depression or other emotional or psychiatric problems. The twins were considered discordant for 16 years at the time of sample collection.

Genome sequencing

The genome sequence of the six subjects (Fig. 1) was generated at Complete Genomics Inc. (Mountain View, CA) in the form of ~ 2 billion overlapping 70-base nucleotide sequences. They allowed reconstruction of six individual genomes. Duplicates were removed prior to variant calling. The sequences met the criteria of high accuracy (99.999%) and were considered suitable for identification of rare variants including somatic mutations as described by Drmanac et al. [29]. These variants included single nucleotide variants (SNVs), indels and block substitutions as well as larger variants classified as CNVs and SVs that were called in comparison to reference sequence (NCBI Build 37). CNVs were called based on a read depth or depth-of-coverage algorithm provided by Complete Genomics. Sequence coverage was averaged and then GC bias was corrected for over a fixed window (2 kb) and normalized relative to a set of standard (CG 45 genome reference) genomes sequenced by Complete Genomics. A Hidden Markov Model was used to determine the integer copy number state (0–10). Structural variations were detected by identifying discordant mate pair mappings found during the assembly process. Mate pair mappings where each arm maps to the reference genome but with either an unexpected length between them or an anomalous orientation are subjected to local de novo assembly to refine junction breakpoints at single base pair resolution. Complete genomics then generated a high confidence junctions file that reports junctions that are most likely to be accurate. CGAtools junctions2events was then used to convert high confidence junctions to possible SV events using repeat masker 37 data, hg19 reference file and a refseq gene information file. The junctions2events command in CGAtools identifies likely deletion, inversion and translocation events from the list of high confidence junctions delivered by Complete Genomics. SVs were further refined into five categories: deletions, tandem duplications, distal duplications, inter-chromosomal events and inversions. To call small variants in our dataset, Master Variation Files (MasterVar) were generated using CGAtools and hg19 as reference. VCF files were then generated from each MasterVar file using CGAtools command mkvcf. VCF files generated this way were directly inputted into ingenuity variant analysis (Ingenuity Systems, Redwood City, CA). Given the potential for false positives identified in genome sequencing, a stringent read depth of 50 and a call quality of 100 (calculated by complete genomics and based on a phred scale) were chosen as parameters for initial variant filtering. In the analysis of family 1, variants that were present in the affected twin and not present in the unaffected twin or their mother or father were assessed as de novo. Also, in the twin analyses of family 2, without parents, variants that were present in the affected twin and not present in the unaffected twin were labeled as provisional de novo. Variants were annotated with overlapping genes, cytoband, gene region (Exonic, Intronic, Promoter, 5′UTR, 3′UTR, Splice Site), translational impact (if applicable), SIFT function prediction (if applicable), SIFT score (if applicable), dbSNP ID (if applicable) and frequency in the 1000 Genomes as well as frequency in the Complete Genomics Public Genome dataset. The biological context filter in ingenuity variant analysis was applied to downstream data with “Schizophrenia, Neurological Disease and diseases consistent with those two phenotypes” as the filtering criteria. Genes had to have a direct connection to these phenotypes (no hops allowed).

In the case of CNVs and SVs a 50% reciprocal overlap rule [30,31,32] was applied to determine if two variants were the same or different between twins/parents. All CNVs/SVs had to meet the ≥ 50% rule when compared to the other CNV/SV, otherwise, the two were considered unique. Pairwise comparisons were performed for all CNVs and SVs. To increase the efficiency of this, HD-CNV was utilized [33]. Comma delimited files were prepared using a custom python script and the 50% reciprocal overlap rule was applied to determine the CNVs and SVs that were unique to an individual. Interchromosomal and inversion comparisons were analyzed manually due to the limitations of HD-CNV. Inter-chromosomal events were considered the same if the origin and the destination chromosome numbers matched and the junction positions were less than 500 bp apart. Inversions had to share the same direction and 50% or more identity to be classified as shared.

Confirmation of variants

Quantitative PCR using TaqMan® Assays in an Applied Biosystem StepOne were performed on selected CNVs [4] and SVs [6]. CNVs and SVs that were confirmed via Real Time PCR were then assessed in the Buccal DNA from the same individuals to identify if the variant arose before or after the differentiation of the germ layers. RNAseP was used as the reference gene. Predesigned TaqMan assays were ordered from Life Technologies for regions of interest. CopyCaller 2.0 was used to generate the predicted copy numbers. Circos [34] was used for visualization of post-filtering variation data. Unique variants were separated into files to generate relevant tracks in the genome diagram. Structural variants that involved more than one chromosome were visualized using “links” in Circos [34].

Sanger sequencing was used to assess small variants of interest in our dataset [28]. Regions were sequenced at the DNA sequencing center at The London Regional Genomics Centre. Variants that appeared mosaic in nature were further assessed using PASA (PCR amplification of specific alleles).

Pathway analysis

Any unique variant that overlapped a gene was identified and the gene lists were used in Core pathway analysis (Ingenuity Systems, Redwood City, CA) to identify pathways and networks that were overrepresented in the filtered dataset. Only the genes that were identified to overlap high confidence somatic de novo (family 1) or provisional de novo (family 2) variations were included in the pathway analysis. Variants that were found to be unique to unaffected twins were also assessed in a separate pathway analysis. The top 20 canonical pathways found to be overrepresented in each individual were then compared between all members in the study; pathways that were found in affected and unaffected twins were labeled as GP. They were viewed as “shared genetic predisposition” between the two members of the twin pair. Pathways that were unique to affected twins only were labeled as GPD. They were viewed as additional “genetic predisposition that led to disease” in the affected and not the unaffected member of the twin pair.


Defining genetic variation including de novo events

The complete genome sequences of two MZD twin pairs and one set of parents (Fig. 1) were generated with a high call quality above 100 (Additional file 1: Figure S2) and an average read depth of 47–50 fold coverage (Additional file 1: Figure S3) following our workflow (Fig. 2). They represented over 99% of the reference sequence for each genome (Table 1). We compared individual genome sequences with the Genome Reference Consortium Human Genome Build 37 using Complete Genomics Analysis tools (CGAtools) for identification of individual specific variants and their genomic location at single base pair resolution. The results show that each genome harbors 3.3 to 3.9 million single nucleotide variants (SNVs), 370–430 thousand indels (small insertions and deletions), 71–80 thousand block substitutions, 1 thousand structural variations, and 150 copy number variations as well as a transition/transversion ratio of 2.1. The exception to this pattern was the father in family 1 who carried 592 copy number variations and 1110 structural variations, that may be associated with a diagnosis of chronic leukemia unrelated to this study [35]. We further grouped individual variations as small sequence changes, (SSCs—SNVs, indels, block substitutions), copy number variants (CNVs) or structural variants (SVs). An example to structural variation observed in family 1 involving inter-chromosomal translocation is depicted in the Circos plot (Additional file 1: Figure S1).

Table 1 Identity of sequenced genomes

Distribution of variants across twins

The frequency of variants across chromosomes that is related to their size is different between parents at each chromosome but rare between the monozygotic twins (Fig. 3). In family 1, some sequence differences between twins were viewed as inherited (present in at least one parent) and others were considered as post zygotic de novo given that they were not present in any of the two parents (Table 2). The de novo variants were further assessed by their presence in exons (Tables 3 and 4). In family 1, two unique CNVs were identified as inherited and five were considered de novo in the affected twin (Table 5). In contrast, in the unaffected twin, five CNVs were identified as inherited and four were labelled as de novo. The CNVs unique to the affected member of twin pair 1 and 2 are given in Tables 6 and 7 respectively. Interestingly, the structural variations followed a pattern similar to small sequence variations (Table 8). Most of these variants are shared between twins, the unshared variants are more common in the affected as compared to their unaffected counterpart and the post-zygotic de novo events are not different between MZD twin in family 1. In the absence of parental sequences for family 2, it was not possible to consider the familial vs somatic de novo nature of the twin differences. For the purpose of this discussion variations that were unique to the affected twin in twin pair 2 were considered as “provisional de novo”. We conclude that most human variations are historical, yet rare variations may be acquired through de novo events during genetic transmission as well as extensive mitosis during development and aging. As expected, the post zygotic de novo variants are often reflected as different degrees of mosaicism, that are not always easy to independently confirm using standard technologies.

Fig. 3

Chromosomal distribution of small sequence variations (SNVs, Indels, block substitutions) in all six samples

Table 2 Shared and de novo small sequence variations (SNVs, substitutions, Indels) as compared to NCBI Build 37
Table 3 Exonic de novo variants in the affected twin in family 1 (I-2-1)
Table 4 Exonic de novo variants in the affected twin in family 2 (II-2-1)
Table 5 Shared and unshared copy number variants (CNVs) as compared to NCBI Build 37
Table 6 Copy number variants unique to the affected member of family 1 (I-2-1)
Table 7 Copy number variants unique to the affected member of family 2 (II-2-1) as compared to unaffected MZ twin
Table 8 Shared and unshared structural variation (SV) as compared to NCBI Build 37

De novo mutational features in twins affected with schizophrenia: SSCs

We classified SNVs, small indels and block substitutions as small sequence changes (SSCs). The identity of de novo (family 1) and provisional de novo (family 2) high confidence variations in affected twins shows that majority of variants are inter-genic SNVs and that each individual harbors approximately equal number of block substitutions, insertions and deletions (Figs. 3 and 4). As expected, the exonic variants are relatively rare. The exonic de novo variations identified were annotated with gene information for the two patients (Tables 3 and 4). Of the exonic variants identified, 13 exonic variants in family 1 and 22 exonic variants in family 2 were related to brain function or previously implicated in schizophrenia and related disorders. The patient in family 1 carries four exonic de novo variants in ZNF595, a zinc finger protein, two of which are predicted to have a functional impact: one frameshift and one missense mutation. Further, two missense mutations in MUC3A (that encode for a transmembrane mucin) and a missense in OR4C45 (that encode for olfactory receptors) may also contribute to pathophysiology. Interestingly, these variations have not been reported in 1000 genomes as well as CG public genomes and may not be polymorphic. In comparison, the patient from family 2 appears to carry a much larger number of variants. However, in the lack of parental sequences this only represents the differences between the discordant twins. Most (51/82) of these variants in this patient have been reported in 1000 genomes and/or CG public genomes and are considered polymorphic. It leaves 31 variants that may be viewed as provisional somatic de novo. The genes affected by missense changes include some that are compatible with the pathophysiology of schizophrenia. These include NOTCH2, OR2T2, ZNF717, MUC4, MUC6, OR4C3, UBTFL1, TAS2R46 and KRT6B (Table 4). However, we note that although these results are appealing, additional validation will be needed to establish a precise role for these genes in the etiology of schizophrenia.

Fig. 4

De novo (family 1) and presumed de novo (family 2) variants identified in the affected twin of family 1 (I-2-1) and the affected twin of family 2 (II-2-1)

De novo mutational features in twins affected with schizophrenia: CNVs

The genome sequences of the six individuals were used to identify individual specific copy number variations as compared to NCBI Build 37. Table 5 shows the distribution of observed CNVs in MZ twins representing the two families. This data allowed classification of CNVs representing inherited (present in at least one parent) and de novo (not present in either parent and also not present in co-twin) for family 1 and unshared between twins in family 2. Of the rare CNVs that were not shared between twins, some represented inherited and others non-inherited and somatic de novo events that must have occurred during independent mitosis following their separation from each other (Tables 6 and 7). In family 1, five of the seven unique CNVs were also not found in either parent. Discordant CNVs between twin pairs were annotated and are presented in Tables 6 and 7, for the affected twin from family 1 and 2, respectively. Three genes overlapped unique CNVs in family 1. The PDSS1, and LOC642131 genes represented somatic de novo events, as they were not present in either parent while CES1 that was present in one parent was viewed as inherited. Although the affected twin of family 2 also carried five CNVs that were not present in her unaffected twin, these regions did not directly overlap any genes.

De novo mutational features in twins affected with schizophrenia: SVs

The complete genome sequences of the six individuals were also used to identify individual specific deletions, tandem duplications, distal duplications, inter-chromosomal variations and inversions in comparison to NCBI Build 37. They were grouped as structural variants (SVs). The SVs ranged from 50 bp to 218 Mb in length. Most of the SVs in family 1 (total 138) and 2 (total 141) represented deletions (78 and 86%), followed by tandem duplications (6 and 10%) and distal duplications (6.5 and 3.5%). Also, we found one inter-chromosomal move involving a translocation of CDC27 from chromosome 2 to 17 in the patient in family 1 (Additional file 1: Figure S1).

Further, we characterized the genomic details of individual SVs that were unique to the affected twins of the two families (Additional file 1: Table S1 and S2). For family 1 where parental sequences are available we were able to eliminate familial events and concentrate on somatic de novo events. We note that these changes have affected two genes (ANKS18 and CLCN5) via deletion, four genes (LOC285768, NTM, SNORD115-29 and GZMM) due to tandem repeats and a 5 Mb tandem duplication on chromosome 1 (825765) affecting 101 genes. Also, it was possible to identify SV differences between twins in family 2, but in the absence of parental sequences they could not be characterized as familial or somatic de novo as in family 1. Consequently, the SV differences between twins in family 2 are viewed as provisional somatic de novo. As in family 1 they affected 47 genes by deletion, seven genes at seven sites by tandem duplication along with a 3 Mb tandem duplication on X (52886720) overlapping 39 genes and 2 genes by distal duplication. Also, the genomes carried rare (two or three) inversions, both twin pairs shared them with their co-twin and all inversions were inherited in family 1. Finally, the sharing of CNVs and SVs between any two individuals is directly correlated with their genetic relatedness; high but not 100% between monozygotic twins, low across unrelated individuals and intermediate between a parent and an offspring, as expected.

Independent confirmation of NGS results

We sought to confirm a random sample of CNVs, SVs and SSCs by independent experiments involving Real Time PCR for confirmation of CNVs and SVs (Table 9) and sanger sequencing for confirmation of SSCs (Fig. 5a, b). They represented 10 randomly chosen structural rearrangements and 30 randomly chosen SSCs, specifically 4 CNVs, 6 SVs and 30 SSCs (two of which failed sanger sequencing). Table 9 outlines the CNVs and SVs that were assessed by Real Time PCR and includes the predicted identity of the variants from sequencing. Each Real Time PCR assay identified the number of copies of each segment of DNA interrogated, which are reported in Table 9. Further, the Real Time PCR for CNVs and SVs using blood DNA confirmed 5/10 variants, three of which were also confirmed on the DNA from buccal swabs from the same individuals. Many of the SSC findings represented shared peaks that may have resulted from expected mosaicism resulting from somatic de novo events during different stages of ontogeny. Given the early differentiation of the germ layers during development, we would expect tissue specific somatic variation to exist, which is compatible with only some of the variation between twins being identified in both blood and buccal tissues in the same individual.

Table 9 Results of real time quantitative PCR analysis on 10 CNV/SV regions predicted by NGS to be unshared between twins
Fig. 5

a Confirmed variant between monozygotic twins in family 1 in the 4th exon of the ZNF595 gene. This variant changes isoleucine to serine. b Confirmed variant between monozygotic twins in family 2 in the 13th exon of the PDE4DIP gene. This variant changes Tryptophan to a premature stop codon

Genetic differences between MZ twins and their discordance for schizophrenia

The results also allow us to assess any involvement of genetic differences between MZ twins in the development of schizophrenia using a threshold model [18]. It assumes that both members of the monozygotic discordant MZD twins will carry family specific genetic predisposition for the disease and rare ontogenetic additional changes in the affected twin will add to the predisposition that will be sufficient for the onset of the disease. This model is testable given near complete coverage of genetic changes in the affected as well as unaffected members of the two MZD twin pairs. It begins with development of a list of all genes that are affected by a variety of mutational mechanisms in four members of the two MZD twin pairs. This list is rather comprehensive and includes all variants, some validated and others not validated. We have used this list to assess the effect of these differences using ingenuity pathway analysis (IPA). It moves the analysis from a focus on affected genes to affected pathways.

Table 10 shows the top 20 individual specific canonical pathways in two affected individuals belonging to the two families. Interestingly, these pathways are not random. Figure 6 incorporates our strategy in assessing the nature of the threshold model in the two patients [36]. Of the top 20 pathways in patient 1 and patient 2, 10 are shared between them (Additional file 1: Figure S4). Also, 8 of the 10 pathways shared by the two patients are present in either of the two unaffected members of the two twin pairs. It leaves two pathways that may be viewed as highly specific to the two patients (Fig. 6). They represent dopamine-DARPP32 feedback in cAMP signaling (Fig. 7a, family 1, p = 2.28E−04; family 2, p = 5.84E−03) and glutamate receptor signaling (Fig. 7b, family 1, p = 1.46E−03; family 2, p = 3.88E−05). We note that 13 and 29 genes of the two pathways respectively are affected in the patient of family 1. The corresponding numbers for the patient in family 2 are 15 and 23 genes for the two pathways, respectively (Table 11). Although few of the mutated genes are patient specific, the two patients share a large number of affected molecules in glutamate receptor signaling as well as dopamine-DARPP32 feedback in cAMP signaling pathways. Further, we note (Table 10) that four of the top five pathways in patient of family 1 and three of the top five canonical pathways in patient of family 2 are relevant to neural functions and have the potential to contribute to the disease in a threshold model discussed in the next section.

Table 10 Top 20 canonical pathways identified by ingenuity pathway analysis (IPA) in the affected twin of each family
Fig. 6

Analysis of ingenuity pathway results

Fig. 7

a Dopamine feedback in cAMP signaling pathway identified using ingenuity pathway analysis (IPA). Genes that were unique to each affected twin were used to generate canonical pathways of interest. This pathway emerged independently in both affected twins. This pathway was also not found to be enriched in unaffected twins in the study. Purple represents genes harboring a unique high confidence variant in each respective patient (family 1, p = 2.28E−04; family 2, p = 5.84E−03). b Glutamate pathway identified using ingenuity pathway analysis (IPA). Genes that were unique to each affected twin were used to generate canonical pathways of interest. This pathway emerged independently in both affected twins. This pathway was also not found to be enriched in unaffected twins in the study. Purple represents genes harboring a unique high confidence variant in each respective patient (family 1, p = 1.46E−03; family 2, p = 3.88E−05)

Table 11 Genes affected in two pathways (glutamate receptor signaling and dopamine feedback in cAMP signaling) unique to the two unrelated schizophrenia patients representing family 1 and family 2


Individual human genomes contain extensive variability

In general, the nature and distribution of variations observed in this study follow the results on 1000 genomes data and expected distributions based on chromosome size (Figs. 3 and 4). It follows that each individual harbors hundreds of rare variants [37]. The individual genomes have on average, 3.6 million SNVs, 344 thousand indels and 717 large deletions [37] that includes somatic as well as germline de novo variations. However, the rate of somatic de novo mutation appears to vary and may be family specific [38]. Unfortunately, this variability particularly caused by somatic mutations is not always easy to discern. Not surprisingly, almost all de novo mutations reported in the literature are based on germline events. These estimates are much lower than our results that focus on somatic de novo events representing individuals of different ages. Interestingly, our results follow a recent report that found that the rate of de novo somatic mutations in humans and mice is almost two orders of magnitude higher than the germline mutation rate and that both mutation rates are significantly higher in mice than humans [39]. It also adds that somatic de novo mutations are highly variable, may depend on tissue and age of the individual, and could add to predisposition for disease.

Studies of twin sequencing differences in the literature have noted validation rates of variants as low as 1/15 [40]. Often it is attributed to mosaicism resulting from de novo somatic events. In addition, validation rates for data from the 1000 genomes project averaged a validation rate of 1.8%, and similar to our study, large SVs were easier to validate than other types of variation. Our analysis also focused on the use of higher quality variant calls as was done in the 1000 genomes pilot study to evaluate the likelihood of a candidate variant call being a real event [37]. The results show that individual genomes harbor extensive variability and that this variability can be measured both within and between generations.

MZ twins are genomically different

Results included in this report show that the genome sequences of pairs of MZ twins although very similar, are not identical. It follows a number of recent reports on identical twins [41,42,43,44,45]. Although likely to be a conservative estimate, Krawczak et al. has predicted that there exists at minimum an average of > 1.3 SNVs discriminating MZ twins in each tissue type [46]. The results of one forensic study found DNA sequence differences between identical twins that appear to reflect mosaicism in that the newly arisen allele was generally only found in a small fraction (approximately 20% of the cells assayed) as estimated by sanger sequencing results [41]. Many reports have now identified genetic and epigenetic differences between identical twins [6,7,8]. For example, Bruder et al. [4] reported that all of 19 MZ twins studied differed in CNVs [47] and Forsberg et al. confirmed 10 post-twinning CNV mutations in 159 MZ twin pairs [48]. In addition, many post-twinning single nucleotide mutations have also been reported [42,43,44,45]. It is likely that older twin pairs will have accumulated more somatic de novo mutations over time [45] and this will be expected to affect the degree of somatic mosaicism across tissues [49]. A recent report that identified somatic mutations at the base-pair level in monozygotic twins found two de novo somatic mutations that appear to have occurred early in embryonic development, suggesting that early development may be enriched for de novo change [12].

De novo changes may add to genetic liability in schizophrenia

Schizophrenia is a multifactorial and highly heterogeneous disorder and may involve both genomic and environmental contributions [50]. Population and family studies have implicated a number of mutational mechanisms (SSCs, CNVs and SVs) and many genes of small effect in the causation of this disease [51,52,53] with no single gene affected in all cases. Conceptually, an accumulation of mutations affecting a number of genes affecting disease specific pathways may establish a genetic liability gradient for schizophrenia where a critical threshold is required for the manifestation of the disease state [54]. Further, the underlying genetic liability is expected to be higher in families with schizophrenia patients. At the individual level, this liability will be much higher in the related family members. In the case of MZ twins where one is affected with the disease, the affected member is expected to have acquired additions to the liability scale via gene mutations and/or environmental factors that contribute to threshold crossing and thus development of disease. It remains an attractive hypothesis but has remained untested, primarily because of the lack of appropriate data. Also, threshold models are often assessed at the level of the population. This study is unique in that it applies this model to cases representing individual patients representing MZD twins along with a genetically matched control.

The current study, with complete genome sequences of pairs of MZD twins is novel and offers a number of advantages. First, it focuses on individual patients and matches the patient with her unaffected monozygotic twin. The consequence is that it drastically reduces genetic heterogeneity between a case and a control. Second, it focuses on the cause of the disease in highly selected and clinically defined individual patients rather than grouping together heterogeneous cases of schizophrenia. Third, the availability of complete genome sequences allows for identification of almost all forms of genetic mutations per individual. Fourth, the identification of all affected genes in a patient allows for further identification of canonical pathways that may be affected by individual specific gene mutations. The threshold model argues that a pathway may be affected by mutations involving different sets of genes in different individuals. More importantly, the two members of a MZD twin pair would share a common genetic liability for the disease with the affected twin acquiring yet additional defects causing it to cross the disease threshold on the liability continuum and develop schizophrenia.

We have assessed the genomic data obtained in our experiments to evaluate the threshold model of schizophrenia (Fig. 6) [36]. This analysis has yielded a set of canonical pathways for four individuals representing two twin pairs. The top 20 pathways identified in four members of the two twin pairs are given in Table 10. We focused on affected pathways that were shared or not shared in the two MZD twins. Within twin pair sharing of pathways was viewed to represent “genetic predisposition” and were considered to not be sufficient to cause the disease; we labeled these predisposing pathways as GP1 in family 1 and GP2 in family 2. Naturally, the composition of GP1 and GP2 (GPN across n pairs of twins) is expected to differ between unrelated pairs. Further, we selected pathways that were unique to the two patients in this study. We argue that these additional defects were needed to cross the threshold and manifest the disease; we labeled these disease-causing pathways GPD. Once again, the composition of GPDs may vary across families and includes all variants, some validated and others not validated. In this analysis, we have identified the same GPDs in both families which involved two pathways; dopamine feedback in cAMP signaling pathway (Fig. 7a) and glutamate receptor signaling (Fig. 7b). Interestingly, a number of pathways identified in the genetic predisposition lists (GP1 and GP2) have been previously implicated in this heterogeneous disease. These include the dopamine feedback in cAMP signaling pathway [55,56,57], glutamate receptor signaling [58, 59], CREB signalling in neurons [60], axonal guidance signalling, Netrin signaling [61] and nNOS signalling in neurons [62]. Independent identification of known schizophrenia related defects from the affected twin supports the argument that the observed differences between MZD twins are not random sequencing artifacts. More important, they offer a logical explanation for the discordance of the two MZ twins with long term discordance for schizophrenia.

Dopamine feedback in cAMP signaling in schizophrenia

The dopamine feedback in cAMP signalling pathway is one of the pathways that has emerged independently in both schizophrenia-affected twins in this study (Fig. 7a). This pathway has been shown to be associated with psychiatric disorders due to the fact that it has critical function in integrating dopaminergic and glutamatergic signalling, and in turn affecting striatal function and plasticity [56, 57]. The leading theory to account for the pathophysiology of schizophrenia involves an excess of the neurotransmitter dopamine, either through excess production or postsynaptic dopamine over-activity possibly mediated by increased receptor density [55]. Recent reports reinforce the importance of dopamine in schizophrenia, including strong associations with DRD2 (a target of many antipsychotic drugs) [16]. The results support the contention that the defects associated with this pathway have the potential to contribute to schizophrenia in the two patients studied.

Glutamate receptor signalling in schizophrenia

A glutamate receptor-signalling pathway is the second pathway that is found in the two patients studied (Fig. 7b). The significance of this pathway in this disease is backed by the underlying neurochemical basis of schizophrenia that includes a hypofunctional glutamate system [63]. Many genes associated with glutamateric neurotransmission have been previously implicated in schizophrenia, including GRM3, GRIN2A, SRR and GRIA1 [16]. Further, morphological alteration of dendrites of glutamatergic neurons in the cerebral cortex of schizophrenia-affected individuals have been reported [64] suggesting a role for glutamate signalling in the etiology of schizophrenia. Interestingly, schizophrenia patients show enriched de novo mutations in genes regulating the postsynaptic density at glutamatergic synapses [58]. Furthermore, small de novo mutations were found to be overrepresented among glutamatergic postsynaptic proteins [59] and genes harbouring detrimental de novo mutations were reported to be enriched in networks affecting protein interactions [65]. In addition, a recent study investigating rare mutations in exonic regions of genes implicated in schizophrenia and autism spectrum disorder revealed that post-synaptic glutamate receptor complexes are key molecular mechanisms associated with schizophrenia and ASD [66]. We propose that the two pathways described above have the potential to at least partially contribute to the disease in these two patients. Also, given that the two pathways were identified from two patients from two unrelated families, they may represent recurring defects in this disease for a subset of patients.

Disease model: somatic de novo mutations interject in the threshold model of schizophrenia

The results included in this report also show that somatic de novo mutations may play a significant role in the development of schizophrenia. These observations are backed by extensive literature that argue for the existence of somatic mutation in our genomes [12, 41, 67, 68]. In this model, MZD twins with the disease are argued to have acquired additional de novo errors onto an already existing background of genetic predisposition and thus contributing to disease liability. These de novo errors must involve somatic mutations during ontogeny [3, 69]. Rare post-zygotic mutations are likely to have been underestimated in the past. Mutations that occur later in development are likely to be seen in very low frequency and simply look like sequencing errors in any analysis [70]. The high-quality results, publicly accessible for further analysis, support the proposition that de novo alterations may play a role in the manifestation of schizophrenia. The mutations identified are predominantly seen in genes already known to be implicated in schizophrenia and related disorders. They help highlight major processes affected in specific patients. They argue for extensive heterogeneity in disease processes. For the two patients studied here, they identify aberrations in dopamine feedback and glutamate receptor signalling pathways.

It has not escaped our mind that this model will account for a variety of results and observations reported in the extensive literature. For example, it is possible to override the liability scale established by small effects of a number of genes with large effects on the disease phenotype [24, 71]. Some of these may result from de novo mutation in MZD or singleton cases [3, 72, 73]. Also, these changes must be present in the brain. In fact, de novo mutations are more likely to occur in schizophrenia patients than unaffected siblings [74] and damaging de novo mutations in schizophrenia patients were found to disrupt genes regulating neurogenesis in the postmortem human brain [21]. Interestingly, de novo mutations are now known to be rather pervasive in the brain [75, 76]. They may represent a unique source of neuronal diversity and individual differences. We also note that by its very nature, the effect of a de novo mutation may or may not be always apparent depending on the organ system affected and the degree of mosaicism. The impact of such mutations on disease causation is being recognized and deserves intensive investigation [77]. Finally, individual specific exceptional comprehensive genome-wide results included in this report provide a promising approach to the understanding of schizophrenia and may apply to other related disorders.


The complete genome sequences of monozygotic twins discordant for schizophrenia have allowed for direct identification of patient specific somatic de novo mutations that may augment the threshold of liability and cause this neurodevelopmental disease. The results also identify variants in the unaffected twin as fragment of genetic predisposition that are not sufficient to manifest the disease. Together they support two propositions. First, somatic de novo mutations, that are pervasive in the brain, may play a role in the development of schizophrenia and second, discordance of monozygotic twins for schizophrenia and related neurodevelopmental diseases in some families may be influenced more heavily by genetic, rather than environmental effects.



copy number variant


genetic predisposition


genetic predisposition leading to disease


ingenuity pathway analysis




monozygotic discordant


single nucleotide variant


small sequence changes (SNVs, indels, block substitutions)


structural variant


  1. 1.

    Gottesman II, Shields J (1967) A polygenic theory of schizophrenia. Proc Natl Acad Sci USA 58(1):199–205

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  2. 2.

    Cardno AG, Gottesman II (2000) Twin studies of schizophrenia: from bow-and-arrow concordances to star wars Mx and functional genomics. Am J Med Genet 97(1):12–17

    CAS  Article  PubMed  Google Scholar 

  3. 3.

    Maiti S, Kumar KHBG, Castellani CA, O’Reilly R, Singh SM (2011) Ontogenetic de novo copy number variations (CNVs) as a source of genetic individuality: studies on two families with MZD twins for schizophrenia. PLoS ONE 6(3):e17125

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  4. 4.

    Bruder C, Piotrowski A et al (2008) Phenotypically concordant and discordant monozygotic twins display different DNA copy number variation profiles. Am J Hum Genet 82:1–9

    Article  Google Scholar 

  5. 5.

    Fraga MF, Ballestar E, Paz MF, Ropero S, Setien F, Ballestar ML et al (2005) Epigenetic differences arise during the lifetime of monozygotic twins. Proc Natl Acad Sci USA 102(30):10604–10609

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  6. 6.

    Ehli EA, Abdellaoui A, Hu Y, Hottenga JJ, Kattenberg M, van Beijsterveldt T et al (2012) De novo and inherited CNVs in MZ twin pairs selected for discordance and concordance on attention problems. Eur J Hum Genet 20(10):1037–1043

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  7. 7.

    Dempster EL, Pidsley R, Schalkwyk LC, Owens S, Georgiades A, Kane F et al (2011) Disease-associated epigenetic changes in monozygotic twins discordant for schizophrenia and bipolar disorder. Hum Mol Genet 20(24):4786–4796

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  8. 8.

    Dempster EL, Wong CCY, Lester KJ, Burrage J, Gregory AM, Mill J et al (2014) Genome-wide methylomic analysis of monozygotic twins discordant for adolescent depression. Biol Psychiatry 76(12):977–983

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  9. 9.

    Muotri AR, Gage FH (2006) Generation of neuronal variability and complexity. Nature 441(7097):1087–1093

    CAS  Article  PubMed  Google Scholar 

  10. 10.

    Muotri AR, Marchetto MCN, Coufal NG, Oefner R, Yeo G, Nakashima K et al (2010) L1 retrotransposition in neurons is modulated by MeCP2. Nature 468(7322):7443–7446

    Article  Google Scholar 

  11. 11.

    Rall K, Eisenbeis S, Barresi G, Rückner D, Walter M, Poths S et al (2015) Mayer-Rokitansky-Küster-Hauser syndrome discordance in monozygotic twins: matrix metalloproteinase 14, low-density lipoprotein receptor-related protein 10, extracellular matrix, and neoangiogenesis genes identified as candidate genes in a tissue-specific. Fertil Steril 103(2):494–502.e3

    CAS  Article  PubMed  Google Scholar 

  12. 12.

    Li R, Montpetit A, Rousseau M, Wu SYM, Greenwood CMT, Spector TD et al (2014) Somatic point mutations occurring early in development: a monozygotic twin study. J Med Genet 51(1):28–34

    CAS  Article  PubMed  Google Scholar 

  13. 13.

    Baranzini SE, Mudge J, van Velkinburgh JC, Khankhanian P, Khrebtukova I, Miller NA et al (2010) Genome, epigenome and RNA sequences of monozygotic twins discordant for multiple sclerosis. Nature 464(7293):1351–1356

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  14. 14.

    Petersen B-S, Spehlmann ME, Raedler A, Stade B, Thomsen I, Rabionet R et al (2014) Whole genome and exome sequencing of monozygotic twins discordant for Crohn’s disease. BMC Genom 15:564

    Article  Google Scholar 

  15. 15.

    Gottesman II (1991) Schizophrenia genesis: the origin of madness. W.H. Freeman and Co., New York

    Google Scholar 

  16. 16.

    Ripke S, Neale BM, Corvin A, Walters JTR, Farh K-H, Holmans PA et al (2014) Biological insights from 108 schizophrenia-associated genetic loci. Nature 511(7510):421–427

    CAS  Article  PubMed Central  Google Scholar 

  17. 17.

    Kavanagh DH, Tansey KE, O’Donovan MC, Owen MJ (2014) Schizophrenia genetics: emerging themes for a complex disorder. Mol Psychiatry 20(1):72–76

    Article  PubMed  Google Scholar 

  18. 18.

    McGue M, Gottesman II, Rao DC (1983) The transmission of schizophrenia under a multifactorial threshold model. Am J Hum Genet 35(6):1161–1178

    CAS  PubMed  PubMed Central  Google Scholar 

  19. 19.

    Xu B, Roos JL, Dexheimer P, Boone B, Plummer B, Levy S et al (2011) Exome sequencing supports a de novo mutational paradigm for schizophrenia. Nat Genet 43(9):864–868

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  20. 20.

    Girard SL, Gauthier J, Noreau A, Xiong L, Zhou S, Jouan L et al (2011) Increased exonic de novo mutation rate in individuals with schizophrenia. Nat Genet 43(9):860–863

    CAS  Article  PubMed  Google Scholar 

  21. 21.

    Gulsuner S, Walsh T, Watts AC, Lee MK, Thornton AM, Casadei S et al (2013) Spatial and temporal mapping of de novo mutations in schizophrenia to a fetal prefrontal cortical network. Cell 154(3):518–529

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  22. 22.

    McCarthy SE, Gillis J, Kramer M, Lihm J, Yoon S, Berstein Y et al (2014) De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol Psychiatry 19(6):652–658

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  23. 23.

    Singh SM, O’Reilly R (2009) (Epi)genomics and neurodevelopment in schizophrenia: monozygotic twins discordant for schizophrenia augment the search for disease-related (epi)genomic alterations. Genome 52(1):8–19

    CAS  Article  PubMed  Google Scholar 

  24. 24.

    Kirov G, Pocklington AJ, Holmans P, Ivanov D, Ikeda M, Ruderfer D et al (2012) De novo CNV analysis implicates specific abnormalities of postsynaptic signalling complexes in the pathogenesis of schizophrenia. Mol Psychiatry 17(2):142–153

    CAS  Article  PubMed  Google Scholar 

  25. 25.

    Szatkiewicz JP, O’Dushlaine C, Chen G, Chambert K, Moran JL, Neale BM et al (2014) Copy number variation in schizophrenia in Sweden. Mol Psychiatry 19(7):762–773

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  26. 26.

    First MB, Spitzer RL, Gibbon M, Williams JBW (1996) Clinical interview for DSM-IV Axis I disorders, clinician version (SCID-CV). American Psychiatric Press, Inc, Washington, DC

    Google Scholar 

  27. 27.

    First MB, Gibbon M, Spitzer RL, Williams JBW, Benjamin LS (1997) Structured clinical interview for DSM-IV Axis II personality disorders, (SCID-II). American Psychiatric Press, Inc., Washington, DC

    Google Scholar 

  28. 28.

    O’Reilly R, Torrey EF, Rao J, Singh S (2013) Monozygotic twins with early-onset schizophrenia and late-onset bipolar disorder: a case report. J Med Case Rep 7:134

    Article  PubMed  PubMed Central  Google Scholar 

  29. 29.

    Drmanac R, Sparks AB, Callow MJ, Halpern AL, Burns NL, Kermani BG et al (2010) Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science 327(5961):78–81

    CAS  Article  PubMed  Google Scholar 

  30. 30.

    Wain LV, Pedroso I, Landers JE, Breen G, Shaw CE, Leigh PN et al (2009) The role of copy number variation in susceptibility to amyotrophic lateral sclerosis: genome-wide association study and comparison with published loci. PLoS ONE 4(12):e8175

    Article  PubMed  PubMed Central  Google Scholar 

  31. 31.

    Yavas G, Koyutürk M, Özsoyo M, Gould MP, Koyuturk M, Ozsoyoglu M et al (2009) An optimization framework for unsupervised identification of rare copy number variation from SNP array data. Genome Biol. 10(10):R119 (Epub 2009 Oct 23)

    Article  PubMed  PubMed Central  Google Scholar 

  32. 32.

    Pang AW, MacDonald JR, Pinto D, Wei J, Rafiq MA, Conrad DF et al (2010) Towards a comprehensive structural variation map of an individual human genome. Genome Biol 11:R52

    Article  PubMed  PubMed Central  Google Scholar 

  33. 33.

    Butler JL, Osborne Locke ME, Hill KA, Daley M (2013) HD-CNV: hotspot detector for copy number variants. Bioinformatics 29(2):262–263

    CAS  Article  PubMed  Google Scholar 

  34. 34.

    Krzywinski M, Schein J, Birol I, Connors J, Gascoyne R, Horsman D et al (2009) Circos: an information aesthetic for comparative genomics. Genome Res 19(9):1639–1645

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  35. 35.

    Kumar K, Maiti S, Castellani CA, O’Reilly R, Singh SM (2013) A novel deletion cluster at 13q14.2-q21.33 in an 80-year man with late onset leukemia: clinical and molecular findings. Indian J Hum Genet 19(1):96–100

    Article  PubMed  PubMed Central  Google Scholar 

  36. 36.

    Castellani CA, Melka MG, Gui JL, O’Reilly RL, Singh SM (2015) Integration of DNA sequence and DNA methylation changes in monozygotic twin pairs discordant for schizophrenia. Schizophr Res 169(1–3):433–440

    CAS  Article  PubMed  Google Scholar 

  37. 37.

    Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM, Handsaker RE et al (2012) An integrated map of genetic variation from 1,092 human genomes. Nature 491(7422):56–65

    Article  PubMed  Google Scholar 

  38. 38.

    Project the 1000 G, Conrad DF, Keebler JE, Depristo MA, Lindsay SJ, Zhang Y et al (2011) Variation in genome-wide mutation rates within and between human families. Nat Genet 43:712

    Article  Google Scholar 

  39. 39.

    Milholland B, Dong X, Zhang L, Hao X, Suh Y, Vijg J (2017) Differences between germline and somatic mutation rates in humans and mice. Nat Commun 9(8):15183

    Article  Google Scholar 

  40. 40.

    Dal GM, Ergüner B, Sağıroğlu MS, Yüksel B, Onat OE, Alkan C et al (2014) Early postzygotic mutations contribute to de novo variation in a healthy monozygotic twin pair. J Med Genet 51(7):455–459

    CAS  Article  PubMed  Google Scholar 

  41. 41.

    Weber-Lehmann J, Schilling E, Gradl G, Richter DC, Wiehler J, Rolf B (2014) Finding the needle in the haystack: differentiating “identical” twins in paternity testing and forensics by ultra-deep next generation sequencing. Forensic Sci Int Genet 9:42–46

    CAS  Article  PubMed  Google Scholar 

  42. 42.

    Kondo S, Schutte BC, Richardson RJ, Bjork BC, Knight AS, Watanabe Y et al (2002) Mutations in IRF6 cause Van der Woude and popliteal pterygium syndromes. Nat Genet 32(2):285–289

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  43. 43.

    Reumers J, De Rijk P, Zhao H, Liekens A, Smeets D, Cleary J et al (2012) Optimized filtering reduces the error rate in detecting genomic variants by short-read sequencing. Nat Biotechnol 30(1):61–68

    CAS  Article  Google Scholar 

  44. 44.

    Vadlamudi L, Dibbens LM, Lawrence KM, Iona X, McMahon JM, Murrell W et al (2010) Timing of de novo mutagenesis—a twin study of sodium-channel mutations. N Engl J Med 363(14):1335–1340

    CAS  Article  PubMed  Google Scholar 

  45. 45.

    Ye K, Beekman M, Lameijer E-W, Zhang Y, Moed MH, van den Akker EB et al (2013) Aging as accelerated accumulation of somatic variants: whole-genome sequencing of centenarian and middle-aged monozygotic twin pairs. Twin Res Hum Genet 16(6):1026–1032

    Article  PubMed  Google Scholar 

  46. 46.

    Krawczak M, Cooper DN, Fändrich F, Engel W, Schmidtke J (2012) How to distinguish genetically between an alleged father and his monozygotic twin: a thought experiment. Forensic Sci Int Genet 6(5):e129–e130

    CAS  Article  PubMed  Google Scholar 

  47. 47.

    Bruder CEG, Piotrowski A, Gijsbers AACJ, Andersson R, Erickson S, Diaz de Ståhl T et al (2008) Phenotypically concordant and discordant monozygotic twins display different DNA copy-number-variation profiles. Am J Hum Genet 82(3):763–771

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  48. 48.

    Forsberg LA, Rasi C, Razzaghian HR, Pakalapati G, Waite L, Thilbeault KS et al (2012) Age-related somatic structural changes in the nuclear genome of human blood cells. Am J Hum Genet 90(2):217–228

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  49. 49.

    Piotrowski A, Bruder CEG, Andersson R, Diaz de Ståhl T, Menzel U, Sandgren J et al (2008) Somatic mosaicism for copy number variation in differentiated human tissues. Hum Mutat 29(9):1118–1124

    Article  PubMed  Google Scholar 

  50. 50.

    Modinos G, Iyegbe C, Prata D, Rivera M, Kempton MJ, Valmaggia LR et al (2013) Molecular genetic gene-environment studies using candidate genes in schizophrenia: a systematic review. Schizophr Res 150(2–3):356–365

    Article  PubMed  Google Scholar 

  51. 51.

    Picchioni MM, Murray RM (2007) Schizophrenia. BMJ 335(7610):91–95

    Article  PubMed  PubMed Central  Google Scholar 

  52. 52.

    Kendler KS (2015) A joint history of the nature of genetic variation and the nature of schizophrenia. Mol Psychiatry 20(1):77–83

    CAS  Article  PubMed  Google Scholar 

  53. 53.

    Schwab SG, Wildenauer DB (2013) Genetics of psychiatric disorders in the GWAS era: an update on schizophrenia. Eur Arch Psychiatry Clin Neurosci 263(Suppl):S147–S154

    Article  PubMed  Google Scholar 

  54. 54.

    Bouchard TJ, McGue M (1981) Familial studies of intelligence: a review. Science 212(4498):1055–1059

    Article  PubMed  Google Scholar 

  55. 55.

    Abi-Dargham A, Laruelle M (2005) Mechanisms of action of second generation antipsychotic drugs in schizophrenia: insights from brain imaging studies. Eur Psychiatry 20(1):15–27

    Article  PubMed  Google Scholar 

  56. 56.

    Kunii Y, Hyde TM, Ye T, Li C, Kolachana B, Dickinson D et al (2014) Revisiting DARPP-32 in postmortem human brain: changes in schizophrenia and bipolar disorder and genetic associations with t-DARPP-32 expression. Mol Psychiatry 19(2):192–199

    CAS  Article  PubMed  Google Scholar 

  57. 57.

    Meyer-Lindenberg A, Straub RE, Lipska BK, Verchinski BA, Goldberg T, Callicott JH et al (2007) Genetic evidence implicating DARPP-32 in human frontostriatal structure, function, and cognition. J Clin Investig 117(3):672–682

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  58. 58.

    Hall J, Trent S, Thomas KL, O’Donovan MC, Owen MJ (2015) Genetic risk for schizophrenia: convergence on synaptic pathways involved in plasticity. Biol Psychiatry 77(1):52–58

    CAS  Article  PubMed  Google Scholar 

  59. 59.

    Fromer M, Pocklington AJ, Kavanagh DH, Williams HJ, Dwyer S, Gormley P et al (2014) De novo mutations in schizophrenia implicate synaptic networks. Nature 506(7487):179–184

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  60. 60.

    Katanuma Y, Numakawa T, Adachi N, Yamamoto N, Ooshima Y, Odaka H et al (2014) Phencyclidine rapidly decreases neuronal mRNA of brain-derived neurotrophic factor. Synapse 68(6):257–265

    CAS  Article  PubMed  Google Scholar 

  61. 61.

    Gregório SP, Sallet PC, Do K-A, Lin E, Gattaz WF, Dias-Neto E (2009) Polymorphisms in genes involved in neurodevelopment may be associated with altered brain morphology in schizophrenia: preliminary evidence. Psychiatry Res 165(1–2):1–9

    Article  PubMed  Google Scholar 

  62. 62.

    Oliveira RMW, Guimarães FS, Deakin JFW (2008) Expression of neuronal nitric oxide synthase in the hippocampal formation in affective disorders. Braz J Med Biol Res 41(4):333–341

    CAS  Article  PubMed  Google Scholar 

  63. 63.

    Labrie V, Roder JC (2010) The involvement of the NMDA receptor d-serine/glycine site in the pathophysiology and treatment of schizophrenia. Neurosci Biobehav Rev 34(3):351–372

    CAS  Article  PubMed  Google Scholar 

  64. 64.

    Hu W, MacDonald ML, Elswick DE, Sweet RA (2015) The glutamate hypothesis of schizophrenia: evidence from human brain tissue studies. Ann N Y Acad Sci 1338(1):38–57

    CAS  Article  PubMed  Google Scholar 

  65. 65.

    Mostafavi S, Ray D, Warde-Farley D, Grouios C, Morris Q (2008) GeneMANIA: a real-time multiple association network integration algorithm for predicting gene function. Genome Biol 9(Suppl 1):S4

    Article  PubMed  PubMed Central  Google Scholar 

  66. 66.

    Kenny EM, Cormican P, Furlong S, Heron E, Kenny G, Fahey C et al (2014) Excess of rare novel loss-of-function variants in synaptic genes in schizophrenia and autism spectrum disorders. Mol Psychiatry 19(8):872–879

    CAS  Article  PubMed  Google Scholar 

  67. 67.

    Breckpot J, Thienpont B, Gewillig M, Allegaert K, Vermeesch JR, Devriendt K (2012) Differences in copy number variation between discordant monozygotic twins as a model for exploring chromosomal mosaicism in congenital heart defects. Mol Syndromol 2(2):81–87

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  68. 68.

    O’Huallachain M, Karczewski KJ, Weissman SM, Urban AE, Snyder MP (2012) Extensive genetic variation in somatic human tissues. Proc Natl Acad Sci USA 109(44):18018–18023

    Article  PubMed  PubMed Central  Google Scholar 

  69. 69.

    Singh SM, Castellani CA, O’Reilly RL (2009) Copy number variation showers in schizophrenia: an emerging hypothesis. Mol Psychiatry 14(4):356–358

    CAS  Article  PubMed  Google Scholar 

  70. 70.

    Insel TR (2014) Brain somatic mutations: the dark matter of psychiatric genetics? Mol Psychiatry 19(2):156–158

    CAS  Article  PubMed  Google Scholar 

  71. 71.

    Georgieva L, Rees E, Moran JL, Chambert KD, Milanova V, Craddock N et al (2014) De novo CNVs in bipolar affective disorder and schizophrenia. Hum Mol Genet 23(24):6677–6683

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  72. 72.

    Bloom RJ, Kahler AK, Collins AL, Chen G, Cannon TD, Hultman C et al (2013) Comprehensive analysis of copy number variation in monozygotic twins discordant for bipolar disorder or schizophrenia. Schizophr Res 146(1–3):289–290

    Article  PubMed  PubMed Central  Google Scholar 

  73. 73.

    Castellani CA, Awamleh Z, Melka MG, O’Reilly RL, Singh SM (2014) Copy number variation distribution in six monozygotic twin pairs discordant for schizophrenia. Twin Res Hum Genet 17(2):108–120

    Article  PubMed  Google Scholar 

  74. 74.

    Gulsuner S, McClellan JM (2014) De novo mutations in schizophrenia disrupt genes co-expressed in fetal prefrontal cortex. Neuropsychopharmacology 39(1):238–239

    CAS  Article  PubMed  Google Scholar 

  75. 75.

    Coufal NG, Garcia-Perez JL, Peng GE, Yeo GW, Mu Y, Lovci MT et al (2009) L1 retrotransposition in human neural progenitor cells. Nature 460(7259):1127–1131

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  76. 76.

    Baillie JK, Barnett MW, Upton KR, Gerhardt DJ, Richmond TA, De Sapio F et al (2011) Somatic retrotransposition alters the genetic landscape of the human brain. Nature 479(7374):534–537

    CAS  Article  PubMed  PubMed Central  Google Scholar 

  77. 77.

    Forsberg LA, Gisselsson D, Dumanski JP (2016) Mosaicism in health and disease—clones picking up speed. Nat Rev Genet 18(2):128–142

    Article  PubMed  Google Scholar 

Download references

Authors’ contributions

SMS, RLO and CAC designed the experiment. RLO identified the patients and conducted the clinical assessments. CAC undertook all data analysis under the supervision of SMS. JLG and AJG assisted with data organization and performing of wet lab experiments. CAC and SMS wrote the manuscript. MGM assisted with the writing, organization and review of the manuscript. All authors read and approved the final manuscript.


First, we thank the participating twins and their families in this research. This research was conceived and planned by SMS, ROR and CAC and funded by The Schizophrenia Society of Ontario, The Ontario Mental Health Foundation and The Canadian Institutes of Health Research. A number of individuals provided valuable help during the course of investigation. We wish to thank Dr. Jay Rao (Psychiatrist) who provided follow up diagnostic reviews. We are also thankful to Ali Pensamiento for her assistance in the laboratory.

Competing interests

The authors declare that they have no competing interests.

Consent for publication

All subjects provided written informed consent to participate in this study. Further, they have agreed to the sharing of data (genetic and clinical) in any publication.

Data availability statement

Please contact author for data requests.

Ethics approval and consent to participate

All subjects gave informed consent to participate in this study. This research received ethics approval from the Committee on Research Involving Human Subjects at The University of Western Ontario, London, Ontario, Canada.


This work was funded by The Canadian Institutes of Health Research, The Schizophrenia Society of Ontario and The Ontario Mental Health Foundation. The funders had no role in the design, collection, analysis, interpretation or writing of this manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information



Corresponding author

Correspondence to C. A. Castellani.

Additional file

Additional file 1.

Additional Figures S1–S4 and Tables S1, S2.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Castellani, C.A., Melka, M.G., Gui, J.L. et al. Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia. Clin Trans Med 6, 43 (2017).

Download citation


  • Schizophrenia
  • Whole genome sequencing
  • Monozygotic twins
  • Genome variation
  • Somatic mutation
  • Threshold model