- Open Access
CRISPR/Cas9-mediated heterozygous knockout of the autism gene CHD8 and characterization of its transcriptional networks in neurodevelopment
Molecular Autismvolume 6, Article number: 55 (2015)
Disruptive mutation in the CHD8 gene is one of the top genetic risk factors in autism spectrum disorders (ASDs). Previous analyses of genome-wide CHD8 occupancy and reduced expression of CHD8 by shRNA knockdown in committed neural cells showed that CHD8 regulates multiple cell processes critical for neural functions, and its targets are enriched with ASD-associated genes.
To further understand the molecular links between CHD8 functions and ASD, we have applied the CRISPR/Cas9 technology to knockout one copy of CHD8 in induced pluripotent stem cells (iPSCs) to better mimic the loss-of-function status that would exist in the developing human embryo prior to neuronal differentiation. We then carried out transcriptomic and bioinformatic analyses of neural progenitors and neurons derived from the CHD8 mutant iPSCs.
Transcriptome profiling revealed that CHD8 hemizygosity (CHD8 +/−) affected the expression of several thousands of genes in neural progenitors and early differentiating neurons. The differentially expressed genes were enriched for functions of neural development, β-catenin/Wnt signaling, extracellular matrix, and skeletal system development. They also exhibited significant overlap with genes previously associated with autism and schizophrenia, as well as the downstream transcriptional targets of multiple genes implicated in autism. Providing important insight into how CHD8 mutations might give rise to macrocephaly, we found that seven of the twelve genes associated with human brain volume or head size by genome-wide association studies (e.g., HGMA2) were dysregulated in CHD8 +/− neural progenitors or neurons.
We have established a renewable source of CHD8 +/− iPSC lines that would be valuable for investigating the molecular and cellular functions of CHD8. Transcriptomic profiling showed that CHD8 regulates multiple genes implicated in ASD pathogenesis and genes associated with brain volume.
ASDs are a class of neurodevelopmental disorders characterized by persistent deficits in social communication/interaction and restricted, repetitive patterns of behaviors, interests, or activities (DSM-5) . The genetic risk factors for ASD are heterogeneous, and up to 1 thousand genes are estimated to be involved . Recent whole exome and genome-sequencing projects, focused on the identification of rare de novo mutation in the probands of ASD family “trios” or “quads”, have discovered hundreds of genes with functionally disruptive mutations [3–12], some of which also map to ASD-associated de novo copy number variations (CNVs) . Many of the implicated genes encode proteins involved in synapse formation, transcriptional regulation, and chromatin remodeling [10, 14], indicating a likely convergence of functional pathways, despite genetic heterogeneity.
Chromodomain helicase DNA binding protein 8 (CHD8) emerged as a top ASD-candidate gene from multiple exome-sequencing studies [6, 8], which altogether have analyzed thousands of ASD probands and in some cases their families too [15, 16]. More importantly, disrupting chd8 in zebrafish development has recapitulated multiple features of ASD, including macrocephaly observed in some ASD cases carrying CHD8 mutations [15, 16]. In addition, macrocephaly is often found in ASD caused by disruption of other candidate genes .
CHD8 is a ubiquitously expressed member of the CHD family of ATP-dependent chromatin-remodeling factors that play important roles in chromatin dynamics, transcription, and cell survival . Previous studies [19–21] have shown that the CHD8 protein interacts with β-catenin and negatively regulates Wnt signaling, while both β-catenin and Wnt signaling play critical roles in normal brain development and neuropsychiatric disorders , including ASD . CHD8 also functionally interacts with p53  and recruits MLL histone methyltransferase complexes to regulate cell cycle genes .
To elucidate the roles of CHD8 in neurodevelopment and address the contribution of its disruption to ASD pathogenesis, three groups have recently reported genome-wide CHD8 binding sites and transcriptomic changes upon shRNA-mediated knockdown of CHD8 expression in neural progenitor cells (NPCs) , neural stem cells (NSCs) , and SK-N-SH neuroblastoma cells . The results showed that CHD8 binds to thousands of genes, largely biased to promoters, in NPCs, NSCs, and developing mammalian brains. Reduced expression of CHD8 resulted in the potential disruption of gene networks involved in neurodevelopment, which contained many ASD-risk genes [15, 25–27]. The transcriptional targets of CHD8 have also been studied in non-neural systems, as it is also involved in cell cycle regulation, Wnt signaling, and several forms of cancers [19, 21, 24, 28, 29].
As ASD is a developmental disorder with symptoms emerging in early childhood and the CHD8 causal mutations in patients are germline, it is important to establish cell models that can mimic the persistent loss of CHD8 function in the developing embryos prior to and during neuronal differentiation and brain development. Therefore, we applied CRISPR/Cas9 technology  to generate CHD8 +/− iPSCs by knocking out one copy of the gene in an iPSC line derived from a healthy male subject. We then differentiated both the wild-type (WT) and the CHD8 +/− iPSCs to NPCs and subsequently neurons and performed comparative transcriptomic analysis (RNA-seq). Our approach has several advantages: precisely targeted changes at the DNA level, persistent reduction of CHD8 expression, no introduction of extra genetic materials (e.g., virus vector) to the cells, and greater flexibility in the types of differentiated cells that can be generated.
We found that heterozygous CHD8 knockout (KO) disrupted the expression of many genes involved in extracellular matrix formation, neuronal differentiation, and skeletal system development. Interestingly, CHD8-regulated genes were enriched with ASD-risk genes, schizophrenia-risk genes, and genes implicated in regulating head size or brain volume. Furthermore, we found that CHD8-regulated genes significantly overlapped with the downstream targets of several critical genes (TCF4, EHMT1, SATB2, and NRXN1) that have been associated with ASD or other neuropsychiatric disorders. Taken together, our results not only shed light on the molecular roles of CHD8 in neurodevelopment, but also provide evidence of potential convergence of cellular pathways that could be disrupted by mutations of distinct ASD-risk genes.
Development of iPSCs from skin fibroblasts
We have been developing iPSCs from controls and patients with 22q11.2 deletion and diagnosis of a psychotic disorder . The study and consent forms were approved by the Institutional Review Board of Albert Einstein College of Medicine . Consent was obtained by a skilled member of the research team who had received prior human subject training. One of the healthy male control (without 22q11.2 del too) was used in the current study. Exome sequencing was performed on DNA extracted from white blood cells of this subject to detect coding variants prior to generating the CHD8 KO. We used GATK  for variant calling and ANNOVAR  for variant annotation.
The iPSC line used in this study was generated from fibroblasts obtained from a skin biopsy performed by a board-certified dermatologist. The procedures for growing fibroblasts and iPSC reprogramming are detailed in Additional file 1. Pluripotency was confirmed by immunocytochemistry using antibodies (Ab) against Tra-1-60, Tra-1-81, SSEA3, and SSEA4, which are expressed in pluripotent stem cells. In addition, the capacity to differentiate into all three germ layers was established by in vitro assays, as previously described (see Additional file 1 for details) [34, 35].
Design of the CHD8 single guide RNA sequences
Single guide RNA (sgRNA) sequences targeting the region adjacent to the Ser62 codon of CHD8 were picked using the online CRISPR design tool  from the Zhang lab at the Broad Institute, and the two selected sgRNAs were predicted to have very low probability off-target sites. The sgRNA sequences were then cloned into the pSpCas9 (BB)-2A-Puro (PX459) vector (a gift from Dr. Feng Zhang, Addgene plasmid # 48139) .
CRISPR/Cas9-mediated CHD8 knockout
Human iPSCs were cultured and fed daily in mTeSR1 (Stem Cell technologies) on Matrigel (BD)-coated plates at 37 °C/5 % CO2/85 % in a humidified incubator. Cells were maintained in log phase growth, and differentiated cells were manually removed. iPSCs were exposed to 10-μM ROCK Inhibitor for ~4 h to improve cell survival during nucleofection. After 4 h, growth medium was aspirated, and the cells were rinsed with DMEM/F12. iPSCs were dissociated into single cells using accutase and harvested. Nucleofection was performed using the Amaxa-4D Nucleofector Basic Protocol for Human Stem Cells (Lonza) according to the manufacturer’s instructions. Briefly, 8 × 105 cells and 5 μg of the CRISPR/Cas9 plasmids with either sgRNA1 or sgRNA2 were nucleofected using the P3 Primary Cell 4D-Nucleofector X Kit L with program CA-137. Cells were resuspended in mTeSR1 + 10-μM ROCK Inhibitor and placed in one well of a 6-well Matrigel-coated plate. The following day, cells were fed with fresh mTeSR1, and were subsequently fed with fresh medium every day. On days 4–14, cells were exposed to 0.5 μg/ml puromycin for 6 h. Puromycin-resistant colonies were picked and expanded in mTeSR1 without further puromycin treatment.
Characterizing CHD8 knockout lines
“TA” cloning was used to identify the knockout alleles. A 479-bp PCR amplicon flanking the CRSPR/Cas9-targeted sites was generated using the primers 5’-CTGTAAGACAGGTTGGGCTG-3’ and 5’-CTTGTTTCTTGCCTCTATACTTGA-3’. The PCR product was purified and ligated into pCR™2.1 using a TA Cloning Kit developed by Life Technologies following the manufacturer’s protocol. Recombinant plasmids were introduced into competent E. coli and selected in ampicillin. Plasmid DNA was extracted and sequenced across the insert using one of the PCR primers.
Western blotting confirmed that the CHD8 +/− lines expressed lower levels of CHD8 protein. Specifically, cell lysates from NPCs differentiated from WT, and KO iPSCs were separated by SDS PAGE, transferred to PVDF membranes, and then blotted with anti-CHD8 antibody (Bethyl Cat #A301-224A). Anti-actin antibody (BD Biosciences, Cat # 612656) was used for loading control.
Neurons were generated from iPSC-derived NPCs as described by Marchetto et al. with slight modifications [34, 37]. A detailed description of the protocol can be found in Additional file 1. Essentially, the protocol leads to a mixed population of glutamatergic and GABAergic neurons, from which RNA was extracted and sent for sequencing.
We obtained 101-bp paired-end RNA-seq reads from Illumina HiSeq 2500. RNA-seq reads were aligned to the human genome (hg19) using Tophat (version 2.0.8) . The number of RNA-seq fragments mapped to each gene was determined for genes in the GENCODE database (v18) . Exonic/intronic/intergenic rates were calculated by CollectRnaSeqMetrics in Picard . Cufflink (version 2.2.1)  was used to generate the gene-expression values as FPKMs (fragments per kilobase of exon per million fragments mapped). We restricted our analysis to 12,843 protein-coding genes with average FPKM >1 across all four samples. DESeq2  was used to determine differentially expressed genes (DEGs) in NPCs and neurons. The list of significantly DEGs was defined at false discovery rate (FDR) < 0.05. DAVID [43, 44] was used for Gene Ontology (GO) analysis with 12,843 expressed protein-coding genes as background. Ingenuity pathway analysis (IPA)  was used for canonical pathway analysis and disease association, with the ingenuity knowledge base (genes only) as background. Toppgene  was used for human phenotype ontology analysis, and the whole genome was used as background. The RNA-seq data have been deposited in the Gene Expression Omnibus (GEO; accession # GSE71594).
To find neurodevelopment genes specifically affected by CHD8 +/−, we added an interaction design in DESeq2 (option: ~celltype + genotype + celltype:genotype) to specifically model the interactions between development status (NPCs or neurons) and CHD8 status (WT or KO).
Validating targeted deletions and assessing off-targets using RNA-seq reads
First, we examined if CHD8 was targeted and edited precisely according to our CRISPR sgRNA design. A 2-bp deletion in chr14:21899785 (hg19) and a 10-bp deletion in chr14:21899722 (hg19) were identified in a proportion of RNA-seq reads that mapped to targeted regions of the two CRISPR sgRNAs in CHD8 +/− samples (KO1 and KO2, respectively). This was confirmed by DNA sequencing. The two short indels were not found in any of the WT samples. We also called short indels (supported by at least five RNA-seq reads) from RNA-seq reads by samtools , but we did not detect any additional indels that were present in CHD8 +/− samples but not in the WT controls.
Quantitative real-time PCR (qPCR)
qPCR was carried out on reverse transcribed PCR using the 2−ΔΔCt method as previously described [48, 49]. A detailed description and the primers used for this analysis can be found in Additional file 1.
Definition of CHD8-binding genes
ChIP-seq peaks of CHD8-binding sites in NPCs were from Sugathan’s report . Only peaks replicated by all three antibodies were used. Genes with at least one peak from 2 kb upstream of the transcription start sites to the transcription terminus were defined as CHD8-binding genes.
Interaction network analysis
DEGs in NPCs with CHD8-binding were imported into the STRING database v10  to construct protein-protein interaction networks. We retained any interaction (i.e., edge) from experiments and databases that had a median confidence ≥0.4.
For detecting converged networks of multiple ASD-risk genes, we first collected DEG lists from our CHD8 +/− NPCs, TCF4 knockdown, EHMT1 knockdown, MBD5 knockdown, and SATB2 knockdown, respectively [51, 52], and then imported genes shared by at least two lists into the STRING database to construct gene networks. GO enrichment was calculated by “Enrichment” function in STRING. Networks were visualized using Cytoscape . The same approach was also applied to DEGs from CHD8 KO, ZNF804A KD, and NRXN1 KD neurons.
Upstream regulator prediction
IPA was used to predict upstream regulators for 841 DEGs without CHD8 binding in NPCs. In this analysis, the p value from IPA measures the significance in overlap between query genes and pre-defined sets of genes that are regulated by a specific regulator, using the Fisher test. At the end, we used p < 0.05 to select upstream regulators that (a) regulated at least five non CHD8-bound DEGs and, themselves, were (b) in our NPC DEG list.
ASD/schizophrenia-risk gene sets
The first ASD gene set was obtained from the SFARI gene-scoring module , using genes scored as high confidence, to minimal evidence and syndromic. The second ASD gene set was from the AutismKB  core dataset, which includes syndromic autism related genes and non-syndromic-related genes, designated as high confidence. High-confidence and probable ASD genes in Willsey’s paper  were used as the third set (“Willsey_ASD”). Genes predicted from whole exome-sequencing and co-expression network  were used as another set (“Liu_ASD”). The other two lists were derived from massive whole exome sequencing: one (“Iossifov_ASD”) focused on de novo mutations  and the other (“DeRubeis_ASD”) combined de novo and inherited mutations to develop a high-confidence list (FDR < 0.1) . Two schizophrenia gene lists were from the SZgene database  and a recent GWAS report .
Identification of common GO terms for DEG lists from different CHD8 studies
DEGs were determined by the following criteria from data in four previous studies. For the study by Cotney et al. , we selected genes with logFC > 0.1 and log counts per million (logCPM) between 2 and 10 to meet the Poisson assumption, as described by the original authors. However, we repeated differential expression analysis with a less stringent FDR cutoff, using the Benjamini-Hotchberg method instead of Bonferroni to adjust p values for significance. For the study by Sugathan et al. , we used ComBat in the sva package  to adjust batch effects, followed by differential expression analysis with DESeq2 . DEGs were selected by p value < 0.05; 96 % of DEGs in Sugathan’s list were in our reanalysis list. For Wilkinson et al’s study , we used the DEG list provided by the authors.
Enriched GO terms for each of the five DEG lists were first determined by ClueGO  (p value < 0.05). Subsequently, GO terms shared by ≥3 DEG lists were considered as common and used for subsequent analysis of function overlap between the five CHD8 +/− and knockdown studies. The relationships among the selected GO terms were based on their shared genes, which was measured using kappa statistics . Two GO terms were connected by an edge if they had a kappa score >0.4. ClueGO relies on term similarity to define functional groups of multiple terms. In our analysis, we set initial group size to 5 (default value, 2) and percentage for group merge (default values, 50 %) to 80 % to obtain a summary of less redundant functional clusters of common GO terms. Since a GO term can be included in several functional groups, we assigned each term only to the functional groups in which it had the most significant group p values, meaning that this term had the most similar genetic component in this group. Subsequently, terms of the same groups formed a closed circle. In addition to edges connecting terms to show their relationships, we also added edges between terms and studies to reveal enrichment of specific GO terms among individual DEG lists.
To test if DEGs were significantly overlapped/enriched with a specific gene set, 12,843 expressed genes in our samples were used as background (of all genes) for the Fisher exact test. Statistics tests were conducted in R  and multiple test correction was applied unless specified otherwise.
Characterization of CHD8 knockout iPSC lines
Several functional disruptive mutations, including premature stop codons and frameshift mutations, have been detected in CHD8 in ASD probands [15, 16]. We designed two separate CRISPR sgRNA sequences to target the N-terminal of CHD8 protein to generate truncated mutations (Fig. 1a). iPSCs derived from a healthy male subject were transduced with CRISPR/Cas9 vectors containing each of the two target sequences. After screening, two clones, one with a 2-bp (KO1) and the other with a 10-bp (KO2) heterozygous deletion were found; the other allele was intact in both (Fig. 1b). Both genomic DNA sequencing and Western blotting analysis (Fig. 1b, c) confirmed the heterozygous knockout status for both clones. The CHD8 +/− iPSC lines were used to generate NPCs and early differentiating neurons for RNA-seq analysis, together with samples prepared from the parental WT clones, for a total of eight samples (two biological replicates of WT and CHD8 +/− at two neurodevelopmental stages).
Genes with altered expression in CHD8 +/− are involved in neurodevelopment
During RNA-seq analysis, we obtained on average ~28 million read pairs per sample (Additional file 2: Table S1). Examination of the RNA-seq reads mapped to the CHD8 exons confirmed the 2-bp and 10-bp deletion (Fig. 1d), indicating that both the WT and KO CHD8 copies were expressed in NPCs and neurons, with fewer reads from the KO copy than the WT. Importantly, our indel analysis of the RNA-seq reads detected no off-target sites at coding regions (see Methods).
Differential expression analysis identified 1248 and 3289 genes that changed expression in NPCs and neurons, respectively (FDR < 0.05, Fig. 2a, Additional file 3: Table S2). Note that CHD8 showed no significant expression change at the transcript level (p = 0.85 in NPCs, p = 0.27 in neurons, Additional file 3: Table S2). The fact that many more DEGs were detected in neurons than NPCs indicates that persistent CHD8 hemizygosity could have continuous and amplified effects in neurodevelopment, i.e., genes with altered expression in NPCs would directly affect the expression of additional sets of genes in the differentiating neurons. Seven DEGs, including SMARCA2, WNT7A, HMGA2, TESC, TCF4, TGFβ3, and DDR2, which are involved in transcription regulation, cell division, and WNT-β-catenin signal pathways, or related to head size or brain volume (see below), were selected for qRT-PCR analysis, and their differential expression in CHD8 +/− neurons was confirmed (Fig. 2b). Of them, SMARCA2, HMGA2, and TCF4 were potentially direct targets as they were bound by CHD8 .
Previous studies have reported that CHD8 bound to thousands of genomic regions (i.e., peaks), especially to gene promoters in NPCs or NSCs [25, 26]. To explore the relationship between DNA binding and gene regulation, we integrated CHD8 binding in control NPCs  with our expression data. Based on the average gene expression of the two WT NPC samples, we separated genes into 10 bins of equal size (1284 genes per bin). Consistent with the results from Sugathan et al. , CHD8 binding was observed more frequently at more highly expressed genes (Fig. 2c). To our surprise, we found that the percentages of CHD8 binding genes among the DEGs were significantly lower than the expectation from genome-wide binding (Fig. 2c). Nevertheless, this finding is consistent with previous reports that only a small percentage of CHD8-bound genes displayed differential expression upon CHD8 knockdown . This result indicates that CHD8 directly regulates a limited number of genes and that the majority of gene-expression changes due to CHD8 knockout are indirectly regulated targets.
GO analysis using DAVID revealed that upregulated genes in CHD8 +/− NPCs and neurons were enriched with similar GO terms, including “extracellular region,” “skeletal system development,” and “cell adhesion.” On the contrary, for downregulated genes, except for the GO term “neuron differentiation” that was enriched in both NPCs and neurons, “neuron projection,” “synaptic transmission,” and “transcription factor activity” were only enriched in neurons (Fig 2d, see full list in Additional file 4: Table S3). In addition, using IPA, we found that DEGs were highly enriched in “axonal guidance signaling,” “WNT-β-catenin signal,” and “PTEN signaling” (Fig. 2d, full list in Additional file 5: Table S4). These results indicate that heterozygous CHD8 mutations may disrupt multiple processes of neurodevelopment and neural functions in ASD. It should be noted that, in addition to nervous system development, the DEGs were also enriched for cancers, gastrointestinal disease, and cardiovascular system development, based on the IPA analysis (Additional file 5: Table S4).
As genes downregulated in both CHD8 +/− NPCs and neurons were enriched for neuronal differentiation, we wondered how heterozygous CHD8 KO might affect gene regulation during the transition from NPCs to differentiating neurons. To address this, we utilized a two-factor linear model implemented in DESeq2 to search for genes whose transitional expression from NPCs to neurons were affected by CHD8 reduction (i.e., interaction between neuronal differentiation and CHD8 status; see Methods). This analysis identified 1098 genes. Among them, 360 also showed a significant differential expression between NPCs and neurons in WT but not in the comparison of CHD8 +/− samples. Of these 360 genes, 207 genes were expressed at a higher level in WT neurons (vs. WT NPCs). GO analysis revealed that these genes were enriched in “neurological system process,” especially in “transmission of nerve impulse” and “synaptic transmission” (Additional file 2: Figure S1A). The 153 genes with higher expression in WT NPCs (vs. WT neurons) were enriched for “cell junction” and “cell adhesion” (Additional file 2: Figure S1B). These results indicate that normal synapse formation and function could be disrupted during CHD8 +/− neuron differentiation.
CHD8 direct targets vs. indirect targets
To better understand the regulatory roles of CHD8, we further characterized the 407 DEGs in NPCs with CHD8 binding (i.e., direct targets) and used STRING to define their function interactions, which include direct protein-protein interactions and indirect functional associations. Of the 407 genes, 140 were included in the resulted STRING network, and interestingly, those genes could be grouped into several function clusters: cell cycle, cytoskeleton, cell adhesion, chromatin factor, ribonucleoprotein complex, and GTPase genes (Fig. 3a), revealing major pathways that could be directly regulated by CHD8 in NPCs. We did not perform the same analysis for neuron DEGs because no CHD8-binding data was available for human neurons.
As DEGs were statistically depleted of CHD8 binding, we wondered how then those 841 NPC DEGs without CHD8 binding (i.e., indirect targets) could be affected by reduced CHD8 expression. To address this, we applied IPA software to predict the upstream regulators of these CHD8 indirect targets. The results showed that 12 CHD8 direct targets could serve as upstream regulators of 95 of the 841 CHD8 indirect targets (Fig. 3b), including ASD-risk genes MEF2  and ARNT2 . This analysis also revealed that some DEGs could be regulated by 35 DEGs that were, themselves, indirect targets. Together, these 47 upstream regulators formed a complex regulatory network (Fig. 3b), mediating key cellular pathways (e.g., BMP and TGFβ) that eventually may lead to expression changes for thousands of genes when CHD8 expression is disrupted, an interesting finding to be further investigated.
CHD8-regulated genes are overall longer than non-DEGs
We noticed that some extremely long genes were dysregulated in either NPCs or neurons, like LSAMP (2187 kb), PCDH15 (1825 kb), RBFOX1 (1694 kb), and NRXN3 (1622 kb). In addition, many high-confidence ASD-candidate genes are exceptionally long . At the genome-wide level, we found that DEGs in both NPCs and neurons were significantly longer than the non-DEGs (Fig. 4). The length difference between DEGs and non-DEGs was not detected in a separate study in which we compared the iPSC-derived neurons from schizophrenia patients with controls (manuscript submitted; data available in the GEO: GSE46562); the mean lengths were 77 kb for DEGs and 73 kb for non-DEGs (p value = 0.51, Student’s t test). This indicates that our observation is not simply a result of analyzing expression data in the neural induction system we used. This difference was detected for genes with CHD8-binding (Fig. 4a) or without CHD8-binding (Fig. 4b), suggesting that other transcription regulators may cooperate with CHD8 to regulate the expression of long genes.
DEGs are enriched for genes associated with human head size/brain volume
Macrocephaly is overrepresented in ASD patients and a defined feature of some syndromic forms of ASD, including those carrying loss-of-function CHD8 mutations [15, 16]. In addition, chd8 disruption in zebrafish resulted in increased embryonic head size [15, 25]. We thus used Toppgene to predict the potential phenotypes that could result from the dysregulation of CHD8 targets. For downregulated genes in CHD8 +/− neurons, “abnormality of skull size” was one of the most significant human phenotypes (Fig. 5a), so were neurodevelopmental abnormality, intellectual disability, and abnormality of brain morphology, all consistent with clinical phenotypes seen in patients with CHD8-disruptive mutations . For the other gene groups, i.e., up- or downregulated genes in CHD8 +/− NPCs or upregulated genes in CHD8 +/− neurons, either no human phenotypes were significantly enriched, or enriched phenotypes showed no obvious relation to brain or head development (Additional file 6: Table S5). It would be interesting to further study how this difference detected for the DEGs of CHD8 +/− NPCs and neurons may be related to changing CHD8 roles at different stages of brain development.
Next, we compared the lists of DEGs in CHD8 +/− NPCs and neurons with genes previously associated with brain size or volumes of specific brain regions. Recent GWASs revealed a small number of common variants associated with the volumes of different human brain regions or head sizes [67–69]. Remarkably, of the only 12 genes linked to the statistically significant GWAS variants, 7 were differentially expressed in CHD8 +/− neurons, which was significantly higher than expected (odds ratio (OR) = 4.07, p = 0.016, Fisher’s exact test, one-tailed; Table 1). HMGA2, a high-confident candidate linked to adult intracranial volume  and infant head circumference , was a CHD8 direct target (with CHD8 binding to its promoter [25, 26]), and it was upregulated in both CHD8 +/− NPCs and neurons. Interestingly, HMGA2 encodes a chromatin-associated regulator important for stem cell renewal. How CHD8 and HMGA2 interact and co-regulate gene-expression and cell cycles warrants further study. Another DEG associated with caudate volume is FAT3, which codes for an atypical cadherin previously shown to affect dendritic pruning , a known defect in ASD. Interestingly, by analyzing the spatiotemporal transcriptional data across developing brains available in the Brainspan project , we found that the expression of CHD8 was significantly and positively correlated with FAT3 during brain development (Additional file 2: Figure S2), suggesting that CHD8 is a positive regulator of FAT3, consistent with the downregulation of FAT3 in CHD8 +/−. MAPT, another top candidate associated with infant head circumference , was downregulated in CHD8 +/− neuron (nominal p value = 0.015).
DEGs are enriched with ASD and schizophrenia-risk genes
Functionally disruptive mutations in CHD8 have been reported in multiple ASD patients as well as one sporadic schizophrenia patient . Comparing our list of DEGs with ASD-risk gene sets from multiple sources (see Methods), we found that upregulated genes in CHD8 +/− NPCs and downregulated genes in CHD8 +/− neurons were significantly enriched with ASD-risk genes (Fig. 5b, Additional file 7: Table S6). In Table 2, we provided a list of high-confident ASD-risk genes that were dysregulated in CHD8 +/−. For the 163 ASD-risk genes that were in our DEG lists, they were significantly enriched for “transmission of nerve impulse,” “synaptic transmission,” and “neuron differentiation,” further supporting the importance of CHD8 in regulating synaptic functions. In a comparison of our DEGs with the genes associated with schizophrenia, we found that our DEGs were enriched with schizophrenia-related genes from the SCZgene database  but not enriched in the high-confident gene list from a recent schizophrenia GWAS  (Fig. 5b, Additional file 7: Table S6).
CHD8-regulated genes significantly overlap with the targets of other autism-risk genes
Since ASD is caused by mutations in a diverse array of genes involved in different cellular functions, we next set out to address whether the dysregulated targets by different ASD-risk genes indeed converge on molecular and cellular pathways. We thus searched for published expression data reporting downstream targets of ASD genes, especially transcriptional regulators. Recent studies reported that in human progenitor cells, reduced expression of transcription factor 4 (TCF4) and histone-lysine N-methyltransferase 1 (EHMT1) converged at several levels with respect to their downstream targets . Also, the downstream targets of methyl-CpG-binding domain 5 (MBD5) and special AT-rich binding protein (SATB2) were converged in neural stem cells . We therefore decided to include these genes to our analysis because their association with ASD has been described previously [8, 73, 74], and their regulatory targets were identified in NPCs using an experimental scheme similar to ours, comparing expression changes by RNA-seq after reducing expression of ASD-candidate genes that function as transcription regulators.
First, we compared the DEGs from these studies with ours. Note that in CHD8 +/− NPCs, TCF4 was upregulated (3.4-fold increase, q value = 1.58e-19), but no expression changes were observed for the other three. We found that our list of DEGs in NPCs showed significant overlap with the DEGs found in the TCF4, EHMT1, and MBD5 knockdown studies (Fig. 6a, Additional file 8: Table S7). To search for functional commonality, we analyzed the 439 genes that were affected by at least two of the five ASD genes. Again, we used STRING to define function interactions among these 439 genes. The results indicated that the common genes were mainly distributed in two highly interconnected clusters (Fig. 6b). One was significantly enriched for genes involved in forming extracellular matrix (p = 5.83e-12, Additional file 8: Table S7), including multiple collagen genes. The other was highly enriched with cell cycle-related genes (p = 6.78e-9, Additional file 8: Table S7), though the enrichment was mainly derived from the common genes between EHMT1 and MBD5 knockdown. Genes critical for neurogenesis, like PLP1 and GFAP, were also significantly enriched (p = 1.14e-8, Additional file 8: Table S7), but they showed sparse connection, perhaps because of incomplete information in the interaction database.
Next, we carried out the same analysis for DEGs in neurons, including data from our current study and a previous study in which neurexin 1 (NRXN1) expression was reduced . NXRN1 was downregulated in CHD8 knockout neurons (3.2-fold reduction, q value = 2.84e-4). Again, we found a significant overlap of the DEGs from CHD8 knockout neurons with those in NRXN1 knockdown neurons (Fig. 6a). The overlapping genes were, again, enriched for “extracellular matrix.”
The transcriptional targets of zinc finger protein 804A (ZNF804A), a top schizophrenia candidate, were recently reported by our group from RNA-seq analysis in which the gene was knocked down in neurons derived from iPSCs . While the overlap of DEGs from differentiating neurons with CHD8 knockout and ZNF804A knockdown was not significant (Fig. 6a), the overlap of DEGs from ZNF804A knockdown and NRXN1 knockdown was highly significant (OR = 2.63, p = 0.023, Fisher’s exact test, one-tailed). These results suggest that both common and unique transcription targets exist for neuropsychiatric disorder candidate genes that code for gene-expression regulators. It should also be noted that previous studies have also reported a convergence of genes affected by SATB2 knockdown and MBD5 knockdown .
In summary, our comparison of genes that show expression changes upon knockout or knockdown of key genes implicated in major neuropsychiatric disorders found that many downstream target genes were commonly affected by several transcription factors and epigenetic modifiers, and the common genes were often enriched for those that code for “cell cycle,” “extracellular matrix,” and “cell proliferation.”
Comparison with previous results from CHD8 knockdown studies
We compared our lists of DEGs with other four lists of DEGs that were obtained by knocking down CHD8 expression in neural cells [25–27]. Except for the two lists from two independent shRNA knockdowns in the same study, DEG lists from different analyses showed modest overlap; at most, 30–40 % of DEGs in one list were detected in other studies (Additional file 2: Figure S3).
Whereas the overlap from different studies was modest, we considered the possibility that the DEGs from different studies might converge on common functional pathways. We thus performed GO enrichment analysis for each of the five lists of DEGs and then focused on the 327 GO terms that were enriched in at least three DEG lists (Additional file 9: Table S8). Among them, only four terms were present in all five DEG sets (referred to as “5 L” terms) and 31 terms (“4 L” terms) were shared by four studies. When the functions of these 327 GO terms were grouped by the software GlueGO , however, 13 major functional clusters emerged (Fig. 7). The largest cluster, harboring 72 individual GO terms, was involved in cell communication, to which all of the 5 L terms belonged. This suggests that one common effect of reduced CHD8 expression across all studies is cell communication and signal transduction. Some of the DEGs from our CHD8 +/− analysis are regulators of major signaling pathways, for example, NFATC1 and BMP2/4, and WNT7A (Fig. 3b). This finding was further supported by the observation that 62 % of the 4 L terms (n = 22) were located within similar function clusters (indicated by high density of grey lines between them), such as cellular protein metabolic process, RNA metabolic process, and cellular response to stimulus. The second largest cluster was “neuron development,” including 66 specific GO terms related to neuron differentiation, axon guidance, and synapse organization, etc. This indicates that CHD8 disruption has a profound effect on multiple aspects of neurodevelopment. Note that the two DEG lists from shRNA knockdown in the study by Cotney et al. were not enriched for this large functional cluster. However, cell cycle pathways were particularly enriched in those two lists, suggesting that the samples in that study might be in a more proliferative and less “neural-like” state. This integrated analysis also identified a number of GO terms clustered into programmed cell death and cytoskeleton organization, consistent with the previously reported interactions between CHD8 and p53-mediated apoptosis  and the Wnt-β-catenin signaling pathway . In addition to well-known functional clusters involving CHD8, including cell adhesion and extracellular matrix organization, our bioinformatics analysis suggests that cell migration and skeletal development could also be a common theme affected by reduced CHD8 expression.
Finally, we compared the five lists of DEGs to the results in a recent analysis of telencephalic organoids generated from idiopathic autism patient-specific iPSCs . Of the differentially expressed genes in two developmental stages (TD11 and TD31) of the organoids between ASD and control samples , 516 and 1060 were expressed in our samples. When the DEGs in CHD8 +/− or CHD8 knockdown studies were compared to the organoid DEGs, our CHD8 +/− showed the largest overlap with the organoid data (Fig. 8, Additional file 2: Figure S3). GO analysis indicated that the dysregulated genes common in CHD8 +/− samples and ASD organoids were significantly enriched for genes involved in skeleton system development, cell adhesion, and neuron differentiation (Fig. 8), indicating that these could be functional pathways commonly disrupted in ASD. Interestingly, the brain volume-associated gene FAT3 was upregulated in TD31, further suggesting the potential importance of FAT3 in ASD.
To better understand the effect of genetic disruption of CHD8 in ASD subjects, we have applied CRISPR/Cas9 technology to knockout one copy of CHD8 in a control iPSC line and studied its effect on gene expression during early neurodevelopment. In comparison to previous studies [25–27] that used a gene knockdown approach to reduce CHD8 expression, our approach generated heterozygous disruptions that better mimic the germline mutations in ASD patients and allows for the study of long-term effects of CHD8 disruption in neurogenesis in vitro. In addition, creating CHD8 +/− iPSCs provides a truly renewable resource for investigators, as opposed to NPCs, which have a finite replicative capacity. Furthermore, iPSCs can differentiate into any cell type, including cerebral organoids ; NPCs are restricted in their differentiation potential.
Perhaps one of the most important findings emerging from our transcriptomic analysis is that several genes known to be related to head size or brain volume, either from the analysis of human phenotype ontology  or identified through GWAS [67–69], displayed significant changes in their expression in CHD8 +/− neurons. Studies examining CHD8 function in zebrafish during embryonic development revealed that the macrocephaly phenotype observed in ASD probands with CHD8 mutations is likely caused by disturbed neuronal proliferation at early developmental stages [15, 25]. By uncovering genes like HMGA2 and FAT3, which have been associated with head size, our finding thus provides new molecular insights that may eventually link CHD8 mutation to abnormal neuronal proliferation and macrocephaly. This association between macrocephaly and ASD is not the first to be reported in genetically defined subgroups. Mutations in PTEN are also associated with severe macrocephaly and ASD . Although PTEN itself was not differentially expressed in our CHD8 +/− samples, FOXO1 and FOXO3, two critical transcription factors in PTEN signaling, were. Interestingly, IPA analysis of the differentially expressed genes reported an enrichment of genes in PTEN signaling, suggesting that there may be a link between the PTEN and CHD8 pathways, and the molecular link underlies the observed macrocephaly and ASD. In this regard, we should mention that PTEN was differentially expressed upon CHD8 knockdown in NPCs in a previous study . We should also point out there were additional genes in our DEG lists that have been previously suggested to be associated with hippocampal volumes, such as BDNF, DISC1, and NRG1.
Dysregulated genes in CHD8 +/− cells exhibited significant overlap with previously defined ASD-risk genes, including some high confident candidates like ANK2, SCN2A, and SUV420H1  that also showed significant differential expression (Table 2). We further demonstrated, interestingly, that CHD8-regulated genes significantly overlapped with the downstream targets of other ASD-risk genes like TCF4 and NRXN1, providing transcriptomic evidence that ASD-risk genes have overlapping function and converge on downstream regulatory pathways. This is extremely important in a genetically heterogeneous condition, such as ASD, in which any individual candidate gene carrying deleterious mutation may only contribute to 1~2 % ASD cases . TCF4 is associated with Pitt-Hopkins syndrome (MIM: 610954), which is defined by severe psychomotor delay, epilepsy, daily bouts of diurnal hyperventilation, mild postnatal growth retardation, postnatal microcephaly, and distinctive facial features . It is also a top schizophrenia candidate gene. In NPCs and NSCs, CHD8 binds to the TCF4 gene body in a region that is also enriched with H3K27ac [25, 26], a histone modification associated with active enhancers. TCF4 is significantly upregulated in both CHD8 +/− NPCs and neurons. Moreover, TCF4-regulated genes significantly overlapped with CHD8-regulated genes. Taken together, these results suggest a direct connection between TCF4 and CHD8 regulatory networks. The common genes in the two networks, especially those regulated oppositely by CHD8 and TCF4, such as HMGA2, which was upregulated in CHD8 +/− (Table 1) and downregulated in TCF4 knockdown , are strong candidates for regulating the development of brain size.
It was intriguing to find that DEG lists from different CHD8 +/− or knockdown studies had only limited overlaps. A recent study of knockout vs. knockdown in zebrafish egfl7 proposed that compensatory networks would be activated to buffer against deleterious mutations from knockout, which was absent in knockdown , providing a potential explanation for the lack of good overlap among CHD8 KO and KD findings. However, we found that at the function and pathway levels, genes involved in similar functions were affected by reduced CHD8 expression in different contexts. It is conceivable that a limited number of upstream regulators are directly regulated by CHD8, and the subsequent response of the downstream targets is mostly dependent on genetic background, cell culture conditions, and other experimental factors. In this regard, we should mention that five (GDPD4, VPS13B, KMT2C, SETBP1, and CLTCL1) of the ~1000 ASD-risk genes were predicted to be functionally disrupted by premature stop or frameshift variants located to the coding exons of the subject used to prepare our WT iPSC line (Additional file 10: Table S9 and S10). However, none of these five genes exhibited differential expression in the WT neurons when compared to samples from other control iPSC lines derived from six unrelated subjects (data not shown). As neuronal differentiation is a complex process, affected by both environmental cues and intrinsic cellular signaling, our analysis suggests that it is important to study the effects of the same genes under different experimental conditions. Nevertheless, comparison of our results with the transcriptomic data in ASD-derived organoids indicates that reduction of CHD8 expression in our CHD8 +/− samples is probably more consistent with the gene regulation in ASD-developing brains, although it should be noted that the ASD-derived organoids were from patients with unknown genetic mutations.
As most of the functionally disruptive mutations uncovered in the CHD8-coding regions in the ASD probands introduce premature stop or frameshift mutation [15, 16], we used CRISPR/Cas9 technology to make small deletions in the first coding exon of CHD8 in this study. While the 2- or 10-bp deletion is predicted to cause frameshift, and no functional protein is expected from the mutants, RNA transcripts from the knockout CHD8 copy were observed in our RNA-seq data, indicating nonsense mediated mRNA decay is incomplete if it occurs to the mutated CHD8 transcripts (Fig. 1). CHD8 encodes a multi-domain protein, and deleterious mutations in CHD8 have been found in almost every important functional domain [15, 16]. While our data indicate that CHD8 regulates multiple pathways related to neural development, the different mutations in ASD individuals may impair distinct and specific aspects of CHD8 functions. In the future, it will be valuable to carry out gene-expression profiling using ASD-specific iPSCs to see how our current findings can be recapitulated in additional iPSC-derived NPCs and neurons. As genetic background likely plays an important role in modulating CHD8 function during brain development, it is important to both perform our current knockout analysis in additional control iPSC lines and to derive patient-specific lines from multiple ASD individuals with CHD8 mutations. In addition, it will be extremely valuable to apply CRISPR technology to correct the CHD8 mutations and perform transcriptomic analysis and other molecular assays once the patient-specific lines are established.
CHD8 regulates multiple genes involved in cell communication, extracellular matrix and neurogenesis that are critical for brain development. In addition, CHD8 hemizygosity causes expression changes in several genes that are associated with brain volume or head size, suggesting a molecular link between CHD8 mutation and macrocephaly. By cross-analysis of several studies in which the expression of several ASD-candidate genes was reduced, we provide evidence that the transcription targets of ASD genes converge on a set of genes and pathways.
- CHD8 :
Chromodomain helicase DNA binding protein 8
Copy number variation
Differentially expressed gene
Fragments per kilobase of exon per million fragments mapped
Genome-wide association study
Ingenuity pathway analysis
Induced pluripotent stem cell
Neural progenitor cell
Neural stem cell
Single guide RNA
American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders, 5th Edition (DSM-5) (American Psychiatric Association, 2013).
De Rubeis S, Buxbaum JD. Genetics and genomics of autism spectrum disorder: embracing complexity. Hum Mol Genet. 2015;24:R24–31.
O'Roak BJ, Deriziotis P, Lee C, Vives L, Schwartz JJ, Girirajan S, et al. Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nat Genet. 2011;43:585–9.
Iossifov I, Ronemus M, Levy D, Wang Z, Hakker I, Rosenbaum J, et al. De novo gene disruptions in children on the autistic spectrum. Neuron. 2012;74:285–99.
Michaelson JJ, Shi Y, Gujral M, Zheng H, Malhotra D, Jin X, et al. Whole-genome sequencing in autism identifies hot spots for de novo germline mutation. Cell. 2012;151:1431–42.
Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A, et al. Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature. 2012;485:242–5.
Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, Willsey AJ, et al. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature. 2012;485:237–41.
O'Roak BJ, Vives L, Girirajan S, Karakoc E, Krumm N, Coe BP, et al. Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature. 2012;485:246–50.
Jiang YH, Yuen RK, Jin X, Wang M, Chen N, Wu X, et al. Detection of clinically relevant genetic variants in autism spectrum disorder by whole-genome sequencing. Am J Hum Genet. 2013;93:249–63.
De Rubeis S, He X, Goldberg AP, Poultney CS, Samocha K, Cicek AE, et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature. 2014;515:209–15.
Iossifov I, O'Roak BJ, Sanders SJ, Ronemus M, Krumm N, Levy D, et al. The contribution of de novo coding mutations to autism spectrum disorder. Nature. 2014;515:216–21.
Krumm N, Turner TN, Baker C, Vives L, Mohajeri K, Witherspoon K, et al. Excess of rare, inherited truncating mutations in autism. Nat Genet. 2015;47:582–8.
Sanders SJ, He X, Willsey AJ, Ercan-Sencicek AG, Samocha KE, Cicek AE, et al. Insights into Autism Spectrum Disorder Genomic Architecture and Biology from 71 Risk Loci. Neuron. 2015;87:1215–33.
Krumm N, O'Roak BJ, Shendure J, Eichler EE. A de novo convergence of autism genetics and molecular neuroscience. Trends Neurosci. 2014;37:95–105.
Bernier R, Golzio C, Xiong B, Stessman HA, Coe BP, Penn O, et al. Disruptive CHD8 mutations define a subtype of autism early in development. Cell. 2014;158:263–76.
O'Roak BJ, Vives L, Fu W, Egertson JD, Stanaway IB, Phelps IG, et al. Multiplex targeted sequencing identifies recurrently mutated genes in autism spectrum disorders. Science. 2012;338:1619–22.
Chaste P, Klei L, Sanders SJ, Murtha MT, Hus V, Lowe JK, et al. Adjusting head circumference for covariates in autism: clinical correlates of a highly heritable continuous trait. Biol Psychiatry. 2013;74:576–84.
Marfella CG, Imbalzano AN. The Chd family of chromatin remodelers. Mutat Res. 2007;618:30–40.
Nishiyama M, Skoultchi AI, Nakayama KI. Histone H1 recruitment by CHD8 is essential for suppression of the Wnt-beta-catenin signaling pathway. Mol Cell Biol. 2012;32:501–12.
Thompson BA, Tremblay V, Lin G, Bochar DA. CHD8 is an ATP-dependent chromatin remodeling factor that regulates beta-catenin target genes. Mol Cell Biol. 2008;28:3894–904.
Sawada G, Ueo H, Matsumura T, Uchi R, Ishibashi M, Mima K, et al. CHD8 is an independent prognostic indicator that regulates Wnt/beta-catenin signaling and the cell cycle in gastric cancer. Oncol Rep. 2013;30:1137–42.
Okerlund ND, Cheyette BN. Synaptic Wnt signaling-a contributor to major psychiatric disorders? J Neurodev Disord. 2011;3:162–74.
Nishiyama M, Oshikawa K, Tsukada Y, Nakagawa T, Iemura S, Natsume T, et al. CHD8 suppresses p53-mediated apoptosis through histone H1 recruitment during early embryogenesis. Nat Cell Biol. 2009;11:172–82.
Subtil-Rodriguez A, Vazquez-Chavez E, Ceballos-Chavez M, Rodriguez-Paredes M, Martin-Subero JI, Esteller M, et al. The chromatin remodeller CHD8 is required for E2F-dependent transcription activation of S-phase genes. Nucleic Acids Res. 2014;42:2185–96.
Sugathan A, Biagioli M, Golzio C, Erdin S, Blumenthal I, Manavalan P, et al. CHD8 regulates neurodevelopmental pathways associated with autism spectrum disorder in neural progenitors. Proc Natl Acad Sci U S A. 2014;111:E4468–4477.
Cotney J, Muhle RA, Sanders SJ, Liu L, Willsey AJ, Niu W, et al. The autism-associated chromatin modifier CHD8 regulates other autism risk genes during human neurodevelopment. Nat Commun. 2015;6:6404.
Wilkinson B, Grepo N, Thompson BL, Kim J, Wang K, Evgrafov OV, et al. The autism-associated gene chromodomain helicase DNA-binding protein 8 (CHD8) regulates noncoding RNAs and autism-related genes. Transl Psychiatry. 2015;5:e568.
Ceballos-Chavez M, Subtil-Rodriguez A, Giannopoulou EG, Soronellas D, Vazquez-Chavez E, Vicent GP, et al. The chromatin Remodeler CHD8 is required for activation of progesterone receptor-dependent enhancers. PLoS Genet. 2015;11:e1005174.
Rodriguez-Paredes M, Ceballos-Chavez M, Esteller M, Garcia-Dominguez M, Reyes JC. The chromatin remodeling factor CHD8 interacts with elongating RNA polymerase II and controls expression of the cyclin E2 gene. Nucleic Acids Res. 2009;37:2449–60.
Ran FA, Hsu PD, Wright J, Agarwala V, Scott DA, Zhang F. Genome engineering using the CRISPR-Cas9 system. Nat Protoc. 2013;8:2281–308.
Zhao D, Lin M, Chen J, Pedrosa E, Hrabovsky A, Fourcade HM, et al. MicroRNA Profiling of Neurons Generated Using Induced Pluripotent Stem Cells Derived from Patients with Schizophrenia and Schizoaffective Disorder, and 22q11.2 Del. PLoS One. 2015;10:e0132387.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38:e164.
Lin M, Hrabovsky A, Pedrosa E, Wang T, Zheng D, Lachman HM. Allele-biased expression in differentiating human neurons: implications for neuropsychiatric disorders. PLoS One. 2012;7:e44017.
Lin M, Pedrosa E, Shah A, Hrabovsky A, Maqbool S, Zheng D, et al. RNA-Seq of human neurons derived from iPS cells reveals candidate long non-coding RNAs involved in neurogenesis and neuropsychiatric disorders. PLoS One. 2011;6:e23356.
CRISPR design [http://crispr.mit.edu/]
Marchetto MC, Carromeu C, Acab A, Yu D, Yeo GW, Mu Y, et al. A model for neural development and treatment of Rett syndrome using human induced pluripotent stem cells. Cell. 2010;143:527–39.
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36.
Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, et al. GENCODE: the reference human genome annotation for The ENCODE Project. Genome Res. 2012;22:1760–74.
Trapnell C, Roberts A, Goff L, Pertea G, Kim D, Kelley DR, et al. Differential gene and transcript expression analysis of RNA-seq experiments with TopHat and Cufflinks. Nat Protoc. 2012;7:562–78.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550.
da Huang W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc. 2009;4:44–57.
da Huang W, Sherman BT, Lempicki RA. Bioinformatics enrichment tools: paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009;37:1–13.
Chen J, Bardes EE, Aronow BJ, Jegga AG. ToppGene Suite for gene list enrichment analysis and candidate gene prioritization. Nucleic Acids Res. 2009;37:W305–311.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2079;25:2078.
Chen J, Lin M, Foxe JJ, Pedrosa E, Hrabovsky A, Carroll R, et al. Transcriptome comparison of human neurons generated using induced pluripotent stem cells derived from dental pulp and skin fibroblasts. PLoS One. 2013;8:e75682.
Chen J, Lin M, Hrabovsky A, Pedrosa E, Dean J, Jain S, et al. ZNF804A Transcriptional Networks in Differentiating Neurons Derived from Induced Pluripotent Stem Cells of Human Origin. PLoS One. 2015;10:e0124597.
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43:D447–452.
Chen ES, Gigek CO, Rosenfeld JA, Diallo AB, Maussion G, Chen GG, et al. Molecular convergence of neurodevelopmental disorders. Am J Hum Genet. 2014;95:490–508.
Gigek CO, Chen ES, Ota VK, Maussion G, Peng H, Vaillancourt K, et al. A molecular model for neurodevelopmental disorders. Transl Psychiatry. 2015;5:e565.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–504.
Xu LM, Li JR, Huang Y, Zhao M, Tang X, Wei L. AutismKB: an evidence-based knowledgebase of autism genetics. Nucleic Acids Res. 2012;40:D1016–1022.
Willsey AJ, Sanders SJ, Li M, Dong S, Tebbenkamp AT, Muhle RA, et al. Coexpression networks implicate human midfetal deep cortical projection neurons in the pathogenesis of autism. Cell. 2013;155:997–1007.
Liu L, Lei J, Sanders SJ, Willsey AJ, Kou Y, Cicek AE, et al. DAWN: a framework to identify autism genes and subnetworks using gene expression and genetics. Mol Autism. 2014;5:22.
Allen NC, Bagade S, McQueen MB, Ioannidis JP, Kavvoura FK, Khoury MJ, et al. Systematic meta-analyses and field synopsis of genetic association studies in schizophrenia: the SzGene database. Nat Genet. 2008;40:827–34.
Schizophrenia Working Group of the Psychiatric Genomics C. Biological insights from 108 schizophrenia-associated genetic loci. Nature. 2014;511:421–7.
Johnson WE, Li C, Rabinovic A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics. 2007;8:118–27.
Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009;25:1091–3.
Huang DW, Sherman BT, Tan Q, Collins JR, Alvord WG, Roayaei J, et al. The DAVID Gene Functional Classification Tool: a novel biological module-centric algorithm to functionally analyze large gene lists. Genome Biol. 2007;8:R183.
Novara F, Beri S, Giorda R, Ortibus E, Nageshappa S, Darra F, et al. Refining the phenotype associated with MEF2C haploinsufficiency. Clin Genet. 2010;78:471–7.
Chakrabarti B, Dudbridge F, Kent L, Wheelwright S, Hill-Cawthorne G, Allison C, et al. Genes related to sex steroids, neural growth, and social-emotional behavior are associated with autistic traits, empathy, and Asperger syndrome. Autism Res. 2009;2:157–77.
King IF, Yandava CN, Mabb AM, Hsiao JS, Huang HS, Pearson BL, et al. Topoisomerases facilitate transcription of long genes linked to autism. Nature. 2013;501:58–62.
Hibar DP, Stein JL, Renteria ME, Arias-Vasquez A, Desrivieres S, Jahanshad N, et al. Common genetic variants influence human subcortical brain structures. Nature. 2015;520:224–9.
Stein JL, Medland SE, Vasquez AA, Hibar DP, Senstad RE, Winkler AM, et al. Identification of common variants associated with human hippocampal and intracranial volumes. Nat Genet. 2012;44:552–61.
Taal HR, St Pourcain B, Thiering E, Das S, Mook-Kanamori DO, Warrington NM, et al. Common variants at 12q15 and 12q24 are associated with infant head circumference. Nat Genet. 2012;44:532–8.
Deans MR, Krol A, Abraira VE, Copley CO, Tucker AF, Goodrich LV. Control of neuronal morphology by the atypical cadherin Fat3. Neuron. 2011;71:820–32.
Kang HJ, Kawasawa YI, Cheng F, Zhu Y, Xu X, Li M, et al. Spatio-temporal transcriptome of the human brain. Nature. 2011;478:483–9.
McCarthy SE, Gillis J, Kramer M, Lihm J, Yoon S, Berstein Y, et al. De novo mutations in schizophrenia implicate chromatin remodeling and support a genetic overlap with autism and intellectual disability. Mol Psychiatry. 2014;19:652–8.
Van Balkom ID, Vuijk PJ, Franssens M, Hoek HW, Hennekam RC. Development, cognition, and behaviour in Pitt-Hopkins syndrome. Dev Med Child Neurol. 2012;54:925–31.
Talkowski ME, Rosenfeld JA, Blumenthal I, Pillalamarri V, Chiang C, Heilbut A, et al. Sequencing chromosomal abnormalities reveals neurodevelopmental loci that confer risk across diagnostic boundaries. Cell. 2012;149:525–37.
Zeng L, Zhang P, Shi L, Yamamoto V, Lu W, Wang K. Functional impacts of NRXN1 knockdown on neurodevelopment in stem cell models. PLoS One. 2013;8:e59685.
Mariani J, Coppola G, Zhang P, Abyzov A, Provini L, Tomasini L, et al. FOXG1-Dependent Dysregulation of GABA/Glutamate Neuron Differentiation in Autism Spectrum Disorders. Cell. 2015;162:375–90.
Lin M, Zhao D, Hrabovsky A, Pedrosa E, Zheng D, Lachman HM. Heat shock alters the expression of schizophrenia and autism candidate genes in an induced pluripotent stem cell model of the human telencephalon. PLoS One. 2014;9:e94968.
Butler MG, Dasouki MJ, Zhou XP, Talebizadeh Z, Brown M, Takahashi TN, et al. Subset of individuals with autism spectrum disorders and extreme macrocephaly associated with germline PTEN tumour suppressor gene mutations. J Med Genet. 2005;42:318–21.
Abrahams BS, Geschwind DH. Connecting genes to brain in the autism spectrum disorders. Arch Neurol. 2010;67:395–9.
Amiel J, Rio M, de Pontual L, Redon R, Malan V, Boddaert N, et al. Mutations in TCF4, encoding a class I basic helix-loop-helix transcription factor, are responsible for Pitt-Hopkins syndrome, a severe epileptic encephalopathy associated with autonomic dysfunction. Am J Hum Genet. 2007;80:988–93.
Forrest MP, Waite AJ, Martin-Rendon E, Blake DJ. Knockdown of human TCF4 affects multiple signaling pathways involved in cell survival, epithelial to mesenchymal transition and neuronal differentiation. PLoS One. 2013;8:e73169.
Rossi A, Kontarakis Z, Gerri C, Nolte H, Holper S, Kruger M, et al. Genetic compensation induced by deleterious mutations but not gene knockdowns. Nature. 2015;524:230–3.
We would like to thank Dr. Brett Abrahams for his critical comments on the manuscript. We also would like to thank Dr. Carl Ernst and Dr. Derek Blake for sharing DEG lists of TCF4, EHMT1, MBD5, and SATB2 knockdown and the Einstein High Performance Computing for computing support. This work was supported by grants from the National Institute of Mental Health (NIMH; MH099452 to DZ, and MH099427 and MH087840 to HML). WG would like to acknowledge the support from NYSTEM (C028109 and C029571). This work was also supported in part by a grant to The Rose F. Kennedy Intellectual and Developmental Disabilities Research Center (RFK-IDDRC) from the Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD) at the NIH (1P30HD071593-01).
The authors declare no competing financial interests and no non-financial conflicts of interest for any of the authors.
HML and DZ conceived of the project. HML performed experimental data analysis. PW, ML, and DZ performed the bioinformatics data analysis. EP and AH prepared the iPS lines, neuronal differentiation, RNA-seq samples, and qPCR validation. WG designed the CRISPR guide sequences. ZZ and WG performed the Western blot analysis. PW, ML, HML, and DZ wrote the manuscript. All authors edited and approved the final manuscript.
Ping Wang and Mingyan Lin shared first authorship.
Supplemental methods. (PDF 101 kb)
Figure S1–S3 and Table S1. (PDF 367 kb)
Table S2. Full list of gene-expression values, fold changes, and statistics. (XLSX 562 kb)
Table S3. GO analysis (DAVID) of DEGs in NPCs and neurons (only terms with Benjamini corrected p < 0.05 were included). (XLSX 155 kb)
Table S4. Pathway analysis (IPA) of DEGs in NPCs and neurons (only pathways with Benjamini corrected p < 0.05 were included). (XLSX 177 kb)
Table S5. Human phenotype analysis (Toppgene) of DEGs. (XLSX 153 KB)
Table S6. DEGs associated with ASD or schizophrenia. (XLSX 49 kb)
Table S7. DEGs from CHD8 +/− or knockdown of TCF4, EHMT1, MBD5, SATB2, NRXN1, or ZNF804A expression (“1” represents differential expression, “0” represents no change) and GO analysis from the STRING (only term with FDR < 0.05 were included). (XLSX 292 kb)
Table S8. GO analysis (ClueGO) of DEGs in each of five DEG lists from CHD8 +/− or knockdown analysis. (XLSX 670 kb)
Table S9–S10. Numbers and putative functionally disrupted coding variants in the WT iPSC line. (PDF 34 kb)