- Open Access
Systems biology approaches toward autosomal dominant polycystic kidney disease (ADPKD)
Clinical and Translational Medicine volume 9, Article number: 1 (2020)
Autosomal dominant polycystic kidney disease (ADPKD), a common of monogenetic disorder caused by the polycystic kidney disease-1 (PKD1) or PKD2 genes deficiency. In this study, we have re-analyzed a microarray dataset to generate a holistic view of this disease.
GSE7869, an expression profiling dataset was downloaded from the Gene Expression Omnibus (GEO) database. After quality control assessment, using GEO2R tool of GEO, genes with adjusted p-value ≤ 0.05 were determined as differentially expressed (DE). The expression profiles from ADPKD samples in different sizes were compared. Using CluePedia plugin of Cytoscape software, the protein–protein interaction (PPI) networks were constructed and analyzed by Cytoscape NetworkAnalyzer tool and MCODE application. Pathway enrichment analysis of clustered genes by MCODE with the high centrality parameters in PPI networks was performed using Cytoscape ClueGO plugin. Moreover, by Enrichr database, microRNAs (miRNAs) and transcription factors (TFs) targeted DE genes were identified.
In this study to explore the molecular pathogenesis of kidney in ADPKD, mRNA expression profiles of cysts from patients in different sizes were re-analyzed. The comparisons were performed between normal with minimally cystic tissue (MCT) samples, MCTs with small cysts, and small cysts with large cysts. 512, 7024, and 655 DE genes were determined, respectively. The top central genes, e.g. END1, EGFR, and FOXO1 were identified with topology and clustering analysis. DE genes that were significantly enriched in PPI networks are critical genes and their roles in ADPKD remain to be assessed in future experimental studies beside miRNAs and TFs predicted. Furthermore, the functional analysis resulted in which most of them are expected to be associated with ADPKD pathogenesis, such as signal pathways that involved in cell growth, inflammation, and cell polarity.
We have here explored systematic approaches for molecular mechanisms assay of ADPKD as a monogenic disease, which may also be used for other monogenetic diseases beside complex diseases to provide suitable therapeutic targets.
The hereditary autosomal dominant polycystic kidney disease (ADPKD) is the most common monogenic disorder. ADPKD is a multi-systematic disease diagnosed by growing multiple cysts on kidneys. liver cysts and cerebral aneurysms are also the main clinical findings of disease . ADPKD is genetically heterogeneous and results from mutations in at least two genes, Polycystic Kidney Disease-1 (PKD1) or PKD-2 . These genes encode transmembrane proteins, Polycystin-1 (PC-1) and Polycystin-2 (PC-2) which form a functional complex . This protein complex, similar to other proteins are affected in polycystic kidney diseases locate in primary cilia of epithelial and endothelial cells . PC-1 known as a cell surface receptor and PC-2 is a cation channel and both of them play a critical role in controlling of signaling pathways related to proliferation, apoptosis, and cell polarities through Ca2+ homeostasis regulation . In spite of numerous studies related to polycystins functions, their roles are poorly understood. Regarding this major limitation being sensible to recognize the underlying mechanisms, systems biology approaches with a holistic view of the molecular mechanisms of disorders, have the potential to overcome these limitations. These approaches with comprehensive interpretation, using high throughput data extracted from omics data, provide the opportunity to represent the behavior of networks and emerge new therapeutic strategies. Therefore, we re-analyzed the array dataset deposited by Song X et al. which was compared transcription profiling of all samples from PKD1 patients with normal tissue, and gene set enrichment analysis (GSEA) was performed . But here, we have shown large-scale protein interaction networks. For deeply understanding of central genes that related with phenotypes of disease in each step, network and clustering analysis were carried out. These revealed some of the key genes, such as EDN1, EGFR, ARF6, FOXO1, and ITGB5 involved during disease. Pathways were identified with enrichment analysis with the notice on cysts size, from early to late steps. Moreover, for the purpose of assay the regulatory mechanisms of DE genes, microRNAs (miRNAs) and transcription factors (TFs) enriched with DE genes were predicted.
Microarray data and DE genes screening
Microarray dataset with accession number “GSE7869” from the Gene Expression Omnibus (GEO) database was extracted. The quality of transcriptomics dataset was measured by principal component analysis (PCA) through the ggplot2 package and prcomp function of R . Using GEO2R a web tool of GEO, groups were compared to detect genes that are differentially expressed with cysts growth. Samples of normal tissues (n = 3), minimally cystic tissues (n = 5), small cysts (n = 5), and large cysts (n = 3) were compared based on during the time of disease progression, using Student’s t-test, respectively. Benjamini–Hochberg false discovery rate (FDR) was used for p-value correction. Genes were declared as differentially expressed, had an adjusted p-value less than 0.05.
Protein-protein interaction networks construction
The protein–protein interaction (PPI) networks were built with DE genes. For networks construction, CluePedia plugin version 1.5.2  of Cytoscape software version 3.7.1  was used. STRING database with confidence cutoff 0.80 was provided, for retrieving interactions . Networks topology was investigated using the NetworkAnalyzer tool of Cytoscape . “Molecular Complex Detection” (MCOD) plugin of Cytoscape detected modules, highly connected sub-networks, based on default settings .
Pathway enrichment analysis
Functional analysis of genes clustered with MCODE was done by Cytoscape ClueGO plugin version 2.5.2 . Reactome  and KEGG (Kyoto Encyclopedia of Genes and Genomes)  databases were chosen for retrieving pathways. Bonferroni step down was applied for p-value correction, and signaling pathways with adjusted p-value ≤ 0.05 were determined.
miRNA and TF enrichment analysis
The microRNAs (miRNAs) and transcription factors (TFs), key regulators of genes, were predicted by Enrichr web server . TargetScan microRNA 2017 and ChEA 2016 libraries were used for miRNA and TF enrichment analysis, respectively. Adjusted p-value less than 0.05 was considered as the significant threshold. The miRNAs with more targeted genes were selected.
By microarray data analysis, differentially expressed genes were identified
The microarray dataset “GSE7869” which includes renal cysts in different sizes; small cysts (SC) less than 1 mm, medium cysts between 10 and 25 mm, and large cysts (LC) greater than 50 mm have been analyzed. Minimally cystic tissues (MCT) obtained from healthy parts of the renal cortex of PKD1 patients were considered as heterozygote samples. In quality assay step except medium cysts, the samples were segregated based on their states (normal tissue, minimally cystic tissue, small cyst, and large cyst), indicate the acceptable quality of this dataset (Fig. 1). Using GEO2R tool, we obtained 512, 7024, and 655 genes which are significantly variably expressed between normal vs. MCT samples, MCT vs. SC samples, and SC vs. LC samples, respectively (Additional file 1). Interestingly, these sets of DE genes have few overlapping genes (Fig. 2a).
Protein–protein interaction networks were constructed
The PPI networks with DE genes were constructed. Links between genes were selected based on activation, binding, post-translational modification, and inhibition interactions. PPI networks are shown small cyst growth phase is an important and complex step during the progression of the disease. 81, 2737, and 155 nodes (genes) are in PPI networks (normal vs. MCT, MCT vs. SC, and SC vs. LC), respectively (Fig. 2b–d). The MCODE application identified protein clusters in networks. These protein complexes and modules are highly interconnected subnetworks with the most effective genes. Network topology were measured based on the graph theory concepts such as degree, betweenness, and closeness centrality. The seed gene with the highest centrality is EDN1 in the early stage, normal vs. MCT comparison. Seed genes such as EGFR, ARF6, WWTR, SMURF2, TGFB2, and HSD17B8 are critical genes in the comparison of MCT with SC. FOXO1, EDN1, and ITGB5 are introduced as central genes in the late stage, SC vs. LC comparison. Some of these genes including EGFR and EDN1 have been recognized related to ADPKD in previous experimental studies [17, 18] and other genes are candidates for future studies. The genes are represented in Table 1.
Pathway enrichment analysis was performed
Functional analysis was carried out based on genes detected by MCODE. Using pathway enrichment analysis from 18, 1318, and 66 genes, we reached to 7, 113, and 39 pathways, respectively (Fig. 3). Interestingly, the GoTerms are informative and related to the phenotype of each step, such as collecting duct acid secretion in early step. An interesting finding in this study was the detection of critical pathways and functions such as EGF, Wnt, MAPK, HIF, P53, CFTR, AMPK, PDGF, NFκB, IGF1, MET signaling, oxidative phosphorylation, energy metabolism, cell–cell and cell–matrix interaction, and signaling by interleukins which were previously shown to be associated with ADPKD in experimental studies [19,20,21], and other pathways could consider for more studies and validation.
miRNAs and TFs enriched with DE gene were determined
The miRNAs and TFs as important regulators of DE genes were predicted. HNF4A, ESR1, and RXR were defined as TFs in the initial step, in normal vs. MCT comparison. TFs were significant in the small and large cyst growth steps are shown in Table 2. The top miRNAs enriched with DE genes in each phase are shown in Fig. 4. Previous studies reported the association of ADPKD with some of TFs e.g. HNF4A, STAT3, VDR, TP53, and HIF1A [6, 20, 22, 23]. Also, the role of miR-17 family and miR-192 in cyst enlargement were identified [24, 25]. It is valuable to investigate other miRNAs and TFs in experimental studies.
ADPKD caused by mutations in PKD1 or PKD2 genes  and the protein products of these genes, polycystin-1 and polycystin-2 act as a mechanosensor on the surface of epithelial and endothelial cells . The loss and gain of function of these proteins, leading to dysregulation of pathways related to proliferation, apoptosis, and polarity of cells . Despite many studies indicated the functions of the polycystins, the numerous ambiguities remain about the molecular mechanisms of the disease progression. For the importance of time series analysis of diseases , the purpose of this study was the computational analysis of the expression profile of renal cysts that were compared based on different sizes of cysts. Bioinformatics methods were performed in this study showed that 512, 7024, and 655 DE genes, respectively dysregulated in each step. The PPI networks were shown nodes and their interactions became more complex with the progression of disease in small cyst growth. The topology and clustering analysis of networks were employed for revealing candidate genes with high centrality as therapeutic targets. Nodes (genes) with high degree, they have many connections and are important for the networks. Betweenness centrality is based on the number of shortest paths going through a node and are shortcuts of the networks. Also, closeness centrality calculated physically nearest genes to all nodes . Modules are high density regions in the network and identify functional genes . The role of some of these genes has been well documented in ADPKD such as EDN1 as a vasoconstrictor may promote tumorigenesis and recent studies have documented that an increase in serum endothelin levels is associated with renal pathogenesis of ADPKD. Also, polymorphisms of EDN1 can influence the age of onset of end-stage renal disease in ADPKD [18, 28]. EGFR promotes cell growth, proliferation, and cell survival and has important functions in the progression of ADPKD . Other genes introduced as applicant genes for future studies are ARF6, SMURF2, WWTR1, CACNB2, and FOXO1. ARF6 is a member of the RAS superfamily that regulates signaling pathways related to actin remodeling such as wnt path, the central pathway in ADPKD . SMURF2 controls cell migration with BMP and TGFβ signaling pathways . WWTR1 acts as a transcriptional coactivator downstream of the Hippo signaling pathway that plays a major function in the control of organ size . Ablation of CACNB2 leads to calcium homeostasis derivation and could have a critical role in the initiation and progression of the disease. Previous studies showed that mutation in the PKD1 leads to higher glycolysis in ADPKD kidneys. FOXO1 through insulin signaling plays a main role in glucose metabolism and consequently involved in ADPKD pathogenesis [31, 32]. Also, ITGB5 contributes to cell adhesion and known as a biomarker in kidney disease . The mechanisms of the newly introduced crucial genes such as PPIE remain to be identified with experimental studies. We pointed out TFs such as HNF4A, STAT3, VDR, TP53, and HIF1A associated with ADPKD [22, 23]. In addition, other TFs as CLOCK in ADPKD pathogenesis firstly are described in this study. Since CLOCK involved in kidney function, confirmation its role in ADPKD can get interesting results . Functional analysis was shown that the pathways are correlated with the phenotype of disease in each step including pathways involved in cell proliferation, apoptosis, and inflammation. The roles of some of the pathways have determined in ADPKD pathogenesis [19, 20].
Here by computational tools we generate a systematic view of the ADPKD to explore the comprehensive molecular mechanisms of a monogenic disease. Methods employed in this study may also be used for each monogenic disorder to reach novel therapeutic targets. Also, the necessity of holistic maps assay of monogenetic disease besides complex disease is desired.
Availability of data and materials
All data analyzed during this study are included in article and its additional files.
autosomal dominant polycystic kidney disease
polycystic kidney disease-1
polycystic kidney disease-2
gene set enrichment analysis
Gene Expression Omnibus
principal component analysis
false discovery rate
Molecular Complex Detection
minimally cystic tissue
kyoto encyclopedia of genes and genomes
Masoumi A et al (2008) Developments in the management of autosomal dominant polycystic kidney disease. Ther Clin Risk Manag 4(2):393–407
Peters DJ, Sandkuijl LA (1992) Genetic heterogeneity of polycystic kidney disease in Europe. Contrib Nephrol 97:128–139
Yoder BK, Hou X, Guay-Woodford LM (2002) The polycystic kidney disease proteins, polycystin-1, polycystin-2, polaris, and cystin, are co-localized in renal cilia. J Am Soc Nephrol 13(10):2508–2516
Spasic M, Jacobs CR (2017) Primary cilia: cell and molecular mechanosensors directing whole tissue function. Semin Cell Dev Biol 71:42–52
Nauli SM et al (2003) Polycystins 1 and 2 mediate mechanosensation in the primary cilium of kidney cells. Nat Genet 33(2):129–137
Song X et al (2009) Systems biology of autosomal dominant polycystic kidney disease (ADPKD): computational identification of gene expression pathways and integrated regulatory networks. Hum Mol Genet 18(13):2328–2343
R Core Team (2014) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/.
Bindea G, Galon J, Mlecnik B (2013) CluePedia Cytoscape plugin: pathway insights using integrated experimental and in silico data. Bioinformatics 29(5):661–663
Shannon P et al (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13(11):2498–2504
Szklarczyk D et al (2017) The STRING database in 2017: quality-controlled protein-protein association networks, made broadly accessible. Nucleic Acids Res 45(DI):D362–D368
He X, Zhang J (2006) Why do hubs tend to be essential in protein networks? PLoS Genet 2(6):e88–e88
Bader GD, Hogue CW (2003) An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinformatics 4:2
Bindea G et al (2009) ClueGO: a Cytoscape plug-into decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25(8):1091–1093
Fabregat A et al (2016) The reactome pathway knowledgebase. Nucleic Acids Res 44(DI):D481–D487
Kanehisa M, Goto S (2000) KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res 28(1):27–30
Kuleshov MV et al (2016) Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res 44(W1):W90–W97
Torres VE et al (2003) EGF receptor tyrosine kinase inhibition attenuates the development of PKD in Han:sPRD rats. Kidney Int 64(5):1573–1579
Chang MY, Ong AC (2011) Endothelin in polycystic kidney disease. Contrib Nephrol 172:200–209
Chapin HC, Caplan MJ (2010) The cell biology of polycystic kidney disease. J Cell Biol 191(4):701–710
Harris PC, Torres VE (2014) Genetic mechanisms and signaling pathways in autosomal dominant polycystic kidney disease. J Clin Invest 124(6):2315–2324
Torres VE, Harris PC (2006) Mechanisms of disease: autosomal dominant and recessive polycystic kidney diseases. Nat Clin Pract Nephrol 2(1):40–55
Vendramini LC et al (2019) Association of vitamin D levels with kidney volume in autosomal dominant polycystic kidney disease (ADPKD). Front Med (Lausanne) 6:112
de Stephanis L et al (2018) MicroRNA501-5p induces p53 proteasome degradation through the activation of the mTOR/MDM2 pathway in ADPKD cells. J Cell Physiol 233(9):6911–6924
Kim DY et al (2019) Impact of miR-192 and miR-194 on cyst enlargement through EMT in autosomal dominant polycystic kidney disease. Faseb j 33(2):2870–2884
Hajarnis S et al (2017) microRNA-17 family promotes polycystic kidney disease progression through modulation of mitochondrial metabolism. Nat Commun 8:14395
Abedi M et al (2019) Big data to knowledge: common pitfalls in transcriptomics data analysis and representation. RNA Biol 16(11):1531–1533
Koschutzki D, Schreiber F (2008) Centrality analysis methods for biological networks and their application to gene regulatory networks. Gene Regul Syst Bio 2:193–201
Kocyigit I et al (2019) The association of endothelin-1 levels with renal survival in polycystic kidney disease patients. J Nephrol 32(1):83–91
Lan HY (2011) Diverse roles of TGF-beta/Smads in renal fibrosis and inflammation. Int J Biol Sci 7(7):1056–1067
Kang HS et al (2009) Glis3 is associated with primary cilia and Wwtr1/TAZ and implicated in polycystic kidney disease. Mol Cell Biol 29(10):2556–2569
Nakae J et al (2003) The forkhead transcription factor Foxo1 regulates adipocyte differentiation. Dev Cell 4(1):119–129
Rowe I et al (2013) Defective glucose metabolism in polycystic kidney disease identifies a new therapeutic strategy. Nat Med 19(4):488–493
Ju W, Smith S, Kretzler M (2012) Genomic biomarkers for chronic kidney disease. Transl Res 159(4):290–302
Firsov D, Bonny O (2018) Circadian rhythms and the kidney. Nat Rev Nephrol 14(10):626–635
This study was supported by the Isfahan University of Medical Science.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Rahimmanesh, I., Fatehi, R. Systems biology approaches toward autosomal dominant polycystic kidney disease (ADPKD). Clin Trans Med 9, 1 (2020) doi:10.1186/s40169-019-0254-5
- Autosomal dominant polycystic kidney disease
- Protein interaction network
- Signal pathway