Integrated network analysis platform for protein-protein interactions

Wu, Jianmin; Vallenius, Tea; Ovaska, Kristian; Westermarck, Jukka; Mäkelä, Tomi P; Hautaniemi, Sampsa

doi:10.1038/nmeth.1282

Brief Communication
Published: 14 December 2008

Nature Methods volume 6, pages 75–77 (2009)

Abstract

There is an increasing demand for network analysis of protein-protein interactions (PPIs). We introduce a web-based protein interaction network analysis platform (PINA), which integrates PPI data from six databases and provides network construction, filtering, analysis and visualization tools. We demonstrated the advantages of PINA by analyzing two human PPI networks; our results suggested a link between LKB1 and TGFβ signaling, and revealed possible competitive interactors of p53 and c-Jun.

Main

Protein-protein interactions (PPIs) are critical for most cellular processes. High-throughput PPI detection technologies have already resulted in identification of large sets of PPIs for several organisms¹ and are being applied to other organisms. Given the rapidly growing amount of public PPI data, analysis of PPI networks has become a major thrust in systems biology research. Analysis efforts to date can be roughly divided into two types, based on the size of the analyzed networks. The first type comprises large-scale studies of an entire species that involve hundreds of thousands of PPIs¹, in which the main objective has been to identify network topological characteristics that have biological implications². The second type comprises small-scale targeted studies of a single protein that identify tens of interactions to extend knowledge of that protein.

There is, however, a paucity of medium-size PPI network studies in which protein interactors for tens or hundreds of proteins are analyzed simultaneously. This scale is interesting because protein lists of this size are being generated or inferred from functional genomics, proteomics and metabolomics studies. Analysis of such networks could reveal interesting hypotheses to be investigated in biological settings, but this potential has not been fully exploited because of the lack of a platform to systematically query, filter, analyze, visualize and manage PPI networks of this scale. Current databases and approaches provide these features in a limited way. The majority of public PPI databases, such as IntAct³, MINT⁴ and BioGRID⁵, provide PPI records from literature curation or direct user submissions. Some databases including APID⁶, STRING⁷, MiMI⁸ and UniHI⁹ integrate information from these curated PPI databases to provide a more comprehensive set of public PPIs. In addition, the STRING database contains predicted PPIs based on functional associations including genomic context, co-expression data and text mining.

The protein interaction network analysis platform (PINA) is an integrated platform for protein interaction network construction, filtering, analysis, visualization and management (Fig. 1). It includes a quarterly updated, nonredundant database based on integration of data from six public PPI databases: IntAct³, MINT⁴, BioGRID⁵, DIP¹⁰, HPRD¹¹ and MIPS MPact¹². PINA has versatile querying capabilities to construct PPI networks, such as queries for single proteins, a list of proteins, a list of protein pairs or two lists of proteins (Supplementary Methods online). Additionally, the query functions for either a list of proteins or protein pairs have been provided as web services to facilitate automatic use of PINA by other programs. Together, these querying features enable more advanced PPI searches than any current PPI database. In addition, PINA provides protein annotations including protein domains and Gene Ontology (GO) terms, and interaction annotations including the experiment information and links to the primary PubMed articles. Generated networks can be downloaded in GraphML, MITAB¹³ or customized tab delimited (TSV) formats. The latter two types of files can be loaded into Cytoscape¹⁴ to integrate PPI interactions, for example, with gene expression profiles.
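As a rough illustration of what a nonredundant integration of several sources involves, the sketch below merges MITAB 2.5 rows (15 tab-separated columns) into one undirected interaction map and records which source databases reported each pair. It is a minimal sketch, not PINA's implementation: the function name `merge_mitab`, the helper `_row`, the placeholder identifiers and the simplified source labels are all assumptions made for this example, and real integration also requires mapping identifiers between databases.

```python
import csv
import io
from collections import defaultdict


def merge_mitab(text):
    """Merge MITAB 2.5 rows into a nonredundant, undirected interaction map.

    Returns {frozenset({id_a, id_b}): set of source-database labels}.
    """
    merged = defaultdict(set)
    reader = csv.reader(io.StringIO(text), delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row or row[0].startswith("#") or len(row) < 15:
            continue  # skip header, blank and truncated lines
        id_a, id_b = row[0], row[1]  # columns 1-2: interactor identifiers
        source_db = row[12]          # column 13: source database
        # A frozenset key makes A-B and B-A the same interaction.
        merged[frozenset((id_a, id_b))].add(source_db)
    return dict(merged)


def _row(id_a, id_b, source_db):
    """Build a 15-column MITAB-like line; unused columns hold '-' (hypothetical helper)."""
    cols = ["-"] * 15
    cols[0], cols[1], cols[12] = id_a, id_b, source_db
    return "\t".join(cols)


# Placeholder identifiers and simplified source labels, for illustration only.
example = "\n".join([
    _row("uniprotkb:A", "uniprotkb:B", "intact"),
    _row("uniprotkb:B", "uniprotkb:A", "mint"),   # same pair, reversed order
    _row("uniprotkb:A", "uniprotkb:C", "biogrid"),
])

merged = merge_mitab(example)
for pair, sources in merged.items():
    print(sorted(pair), sorted(sources))
```

Keying on an unordered pair is the design choice that removes redundancy here; the set of source labels kept per pair is what later allows per-database coverage to be compared.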

**Figure 1: Schematic of the PINA platform.**

PPI networks from query results can be rapidly analyzed using PINA's built-in GO and graph theoretical tools. For instance, PINA includes a GO tool that uses semantic similarities between the annotated GO terms of an interacting protein pair to generate a confidence score for the interaction (Supplementary Methods). Additionally, a GO enrichment analysis tool can be used to identify significantly enriched GO terms of a PPI network. Graph theoretical tools can be used to either discover basic topology properties of a PPI network or identify topologically important proteins, such as hubs or bottlenecks, based on several centrality measures (Supplementary Methods). All built-in tools can also be executed in an interactive applet (Fig. 2a), which supports visualization and manipulation of a PPI network.
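The statistic behind PINA's GO enrichment tool is described in the Supplementary Methods and is not reproduced here. A common choice for this kind of analysis is a one-sided hypergeometric test, sketched below under that assumption; the function name and the example counts are hypothetical.

```python
from math import comb


def hypergeom_enrichment_p(population, annotated, sample, hits):
    """Probability of observing at least `hits` annotated proteins when
    `sample` proteins are drawn without replacement from `population`
    proteins, `annotated` of which carry the GO term."""
    upper = min(annotated, sample)
    total = comb(population, sample)
    tail = sum(
        comb(annotated, k) * comb(population - annotated, sample - k)
        for k in range(hits, upper + 1)
    )
    return tail / total


# Hypothetical counts: 20,000 background proteins, 400 annotated with a term,
# a 131-protein network in which 12 proteins carry that term.
p_value = hypergeom_enrichment_p(20000, 400, 131, 12)
print(f"{p_value:.3g}")
```

In practice the p-values for all tested terms would also need a multiple-testing correction, which this sketch leaves out.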

**Figure 2: PINA-generated visualization of the PPI networks for LKB1 and its 14 substrate kinases.**

A central and unique feature in PINA is a 'user space' that allows users with a free registered account to save networks from query results, manually upload interaction data and comment on specific interactions. With these features, users can modify networks by manually adding or removing identified PPIs, or use filtering tools to accept or reject interactions to obtain more reliable networks. Saved networks can be used as input for analysis tools and published in PINA for open or restricted access.

We demonstrated the utility of PINA by analyzing two human PPI networks. LKB1 is a tumor suppressor kinase underlying hereditary Peutz-Jeghers polyposis/cancer syndrome and is frequently inactivated in sporadic non-small cell lung adenocarcinomas¹⁵. LKB1 also phosphorylates and activates 14 kinases, but it remains unresolved which if any of these are important in tumorigenesis. We investigated whether generating and analyzing a PPI network of LKB1 and its 14 substrate kinases could provide hypotheses on links and pathways critical for tumorigenesis following LKB1 deficiency. Using these 15 proteins, in one PINA query we generated an initial PPI network consisting of 131 proteins and 203 interactions (Fig. 2a). Based on the substantial number of unexpected interactions, we recurated all interactions from the original publications with notes added to the 'comment' field in PINA records. This resulted in the exclusion of 64 PPIs for several reasons, including a lack of any evidence for an interaction other than co-occurrence of gene/protein names or duplication owing to different UniProt accession numbers (Supplementary Table 1 and Supplementary Methods online). After removing these interactions using a filtering tool based on user comments in PINA, we generated a network consisting of 139 interactions. This network, however, lacked several established interactions, especially kinase-substrate interactions, which implies that these typically transient but very important interactions are underrepresented in PPI databases. After we manually added 31 interactions, the curated LKB1 PPI network contained 170 interactions (Fig. 2b), and is available online in 'shared networks' of PINA.
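The curation steps above amount to set operations on the interaction list: remove the rejected interactions, then add the manually curated ones. The sketch below checks the reported counts (203 − 64 = 139, then 139 + 31 = 170) using integer placeholders instead of real interactions; the function name `curate` is an assumption, not a PINA tool name.

```python
def curate(network, rejected, added):
    """Remove rejected interactions, then add manually curated ones."""
    return (set(network) - set(rejected)) | set(added)


# Integer placeholders stand in for interactions; only the counts come from the text.
initial = set(range(203))          # 203 interactions from the initial query
rejected = set(range(64))          # 64 interactions excluded after recuration
manual = set(range(203, 234))      # 31 interactions added by hand

filtered = curate(initial, rejected, [])
curated = curate(initial, rejected, manual)
print(len(filtered), len(curated))
```

The arithmetic only works out this way because every rejected interaction was in the initial network and none of the added ones already was, which is what the reported counts imply.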

The LKB1 PPI network generated in PINA highlights interactors that are easily overlooked, for example because the PPIs are reported only in supplementary tables of large-scale experiments or are incorporated in only one of the source databases of PINA. For instance, the interactions of CDC25C with both MARK3 and BRSK1 suggest that BRSK kinases may also function through stable kinase-substrate complexes involving 14-3-3 (refs. 16, 17). In addition, the generated network linked LKB1 to members of the TGFβ signaling pathway (SMAD2, SMAD4 and TGFβR1) through NUAK2 (Fig. 2b). Recently, decreased SMAD2 and TGFβ pathway activity has been suggested to underlie LKB1 deficiency–mediated tumorigenesis¹⁸. Thus, the PPIs suggest that decreased LKB1 signaling could attenuate TGFβ signaling through NUAK2.

The antagonistic biological functions of the transcription factors p53 and c-Jun¹⁹ could be due to competitive binding of limiting amounts of interacting proteins. Using PINA, we rapidly identified, out of 447 interactors of either p53 or c-Jun (Supplementary Fig. 1a online), 39 proteins that interact with both. After recuration (Supplementary Table 1), 36 proteins were retained as common interacting proteins, and we then used these proteins in a single query to identify interactions among them. This revealed 123 additional PPIs, of which 105 were retained after recuration; we combined these with the p53 and c-Jun interactions using the 'network operation' tool in PINA for further analysis (Supplementary Fig. 1b).
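The two query steps in this case study, finding proteins that interact with both baits and then finding interactions among those proteins, can be sketched with plain set operations. This is an illustration only: the function names are assumptions, and the partners "A" to "D" are placeholders rather than real interactors of p53 or c-Jun.

```python
def neighbors(edges, protein):
    """Return all interaction partners of `protein` in an undirected edge list."""
    partners = set()
    for a, b in edges:
        if a == protein:
            partners.add(b)
        if b == protein:
            partners.add(a)
    partners.discard(protein)  # ignore self-interactions
    return partners


def common_interactor_network(edges, bait_1, bait_2):
    """Return (common interactors of both baits, interactions among them)."""
    common = neighbors(edges, bait_1) & neighbors(edges, bait_2)
    among = {
        frozenset((a, b))
        for a, b in edges
        if a in common and b in common and a != b
    }
    return common, among


# Toy network with placeholder partners A-D.
toy_edges = [
    ("TP53", "A"), ("JUN", "A"),
    ("TP53", "B"), ("JUN", "B"),
    ("TP53", "C"),
    ("JUN", "D"),
    ("A", "B"), ("B", "C"),
]

common, among = common_interactor_network(toy_edges, "TP53", "JUN")
print(sorted(common), [sorted(pair) for pair in among])
```

Here "C" and "D" each bind only one bait, so they drop out of the common set, and the B–C interaction is excluded because only one of its endpoints is a common interactor.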

The PINA GO enrichment tool revealed significantly enriched terms in the transcription category (Supplementary Fig. 2 online), which were directly related to the regulation of either the DNA-binding or the transactivation capacity of p53 and c-Jun. Moreover, a graph theoretical analysis tool in PINA ranked SMAD2, SP1, SMAD3 and ESR1 as the top four candidate proteins based on the eigenvector centrality measure (Supplementary Fig. 3 online). High centrality scores for SP1 and SMAD3 may be due to their requirement for c-Jun– and p53–mediated activation of transcription, at least in certain conditions²⁰. Taken together, PINA analysis of common interactors of p53 and c-Jun revealed several candidates that could represent a critical limiting factor for the transcriptional regulation of gene expression by either p53 or c-Jun. This analysis provides directions for additional experiments to characterize mechanisms by which these transcription factors antagonize each other's functions in the regulation of cell proliferation.
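Eigenvector centrality scores a protein highly when its interaction partners are themselves highly scored. PINA's own implementation is described in the Supplementary Methods; the following is a generic power-iteration sketch under the assumption of an undirected, unweighted, connected network, with a hypothetical function name and a toy star-shaped graph.

```python
from math import sqrt


def eigenvector_centrality(edges, iterations=200, tol=1e-10):
    """Approximate eigenvector centrality of an undirected graph by power iteration."""
    adj = {}
    for a, b in edges:
        adj.setdefault(a, set()).add(b)
        adj.setdefault(b, set()).add(a)
    if not adj:
        return {}
    score = {node: 1.0 for node in adj}
    for _ in range(iterations):
        # Multiply by (A + I): same eigenvectors as A, but the added identity
        # prevents the iteration from oscillating on bipartite graphs.
        new = {n: score[n] + sum(score[m] for m in adj[n]) for n in adj}
        norm = sqrt(sum(v * v for v in new.values()))
        new = {n: v / norm for n, v in new.items()}
        if max(abs(new[n] - score[n]) for n in adj) < tol:
            return new
        score = new
    return score


# Toy network: a hub bound to three partners that do not bind each other.
toy = [("hub", "a"), ("hub", "b"), ("hub", "c")]
centrality = eigenvector_centrality(toy)
ranking = sorted(centrality, key=centrality.get, reverse=True)
print(ranking[0])
```

Sorting proteins by this score gives the kind of ranked candidate list described above; on the toy graph the hub ranks first and the three partners tie.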

PINA analysis of these two human PPI networks revealed several issues to be addressed in network analysis of PPIs from public databases. First, many such databases lack a substantial portion of PPIs, emphasizing the need to integrate multiple PPI databases. For example, in the LKB1 analysis, the 139 PPIs identified from the PPI databases originated from HPRD (118 PPIs), IntAct (92), BioGRID (63) and MINT (78), demonstrating that at least 15% of PPIs would have been missed using any single database. Also, all databases contained PPIs not found in any other database: HPRD (48 PPIs), IntAct (14), BioGRID (11) and MINT (1). HPRD has better coverage in this case because it is dedicated to human PPIs, whereas MIPS MPact is not listed here as it only includes yeast data.
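The "at least 15%" figure follows directly from the per-database counts given above. The short calculation below reproduces it; only the counts come from the text, and the variable and function names are chosen for this example.

```python
TOTAL_PPIS = 139  # PPIs in the LKB1 network that came from the source databases

# Number of those PPIs present in each database, as reported in the text.
found = {"HPRD": 118, "IntAct": 92, "BioGRID": 63, "MINT": 78}


def missed_fraction(found_in_db, total):
    """Fraction of the integrated network absent from a single database."""
    return 1 - found_in_db / total


missed = {db: missed_fraction(n, TOTAL_PPIS) for db, n in found.items()}
for db, fraction in sorted(missed.items(), key=lambda item: item[1]):
    print(f"{db}: {fraction:.1%} missed")
```

Even the best-covered source, HPRD, lacks 21 of the 139 interactions, which is where the lower bound comes from.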

Our analysis also revealed that curated PPI databases contain some mistakes, for instance, through mistaking genes with similar alias names (Supplementary Table 1), demonstrating the importance of being able to manually remove interactions from a network. Similarly the ability to add interactions is useful for inclusion of new or unrecorded PPIs (Fig. 2b). Manual improvement of the networks may considerably enhance the relevance of results generated by analysis tools in PINA.

To facilitate integrative analysis of PPI data, we encourage curators of current PPI databases to fully adopt the Proteomics Standards Initiative–Molecular Interaction (PSI-MI) standard for exchanging protein interactions¹³. For example, after network construction, it would be interesting to rank PPIs based on the interaction-detection methods by giving additional weights to PPIs identified by more than one method. Unfortunately some of the PPI databases have not fully adopted the PSI-MI standard in curating interaction detection methods, which limits the usefulness of this ranking strategy. In PINA, users can add and share such information as comments.
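The ranking strategy suggested above, giving more weight to PPIs supported by more than one detection method, could look like the sketch below once detection methods are recorded consistently. It is a hypothetical illustration, not a PINA feature: the function name, the placeholder proteins and the simple count-based score are assumptions.

```python
from collections import defaultdict


def rank_by_method_support(records):
    """Rank interactions by the number of distinct detection methods.

    `records` is an iterable of (protein_a, protein_b, detection_method).
    Returns a list of (pair, methods) sorted from most to least supported.
    """
    methods = defaultdict(set)
    for a, b, method in records:
        methods[frozenset((a, b))].add(method)
    return sorted(
        methods.items(),
        key=lambda item: (-len(item[1]), sorted(item[0])),
    )


# Placeholder proteins; method labels are illustrative free-text names.
records = [
    ("A", "B", "two hybrid"),
    ("B", "A", "affinity chromatography technology"),
    ("A", "C", "two hybrid"),
    ("A", "C", "two hybrid"),  # repeated report, same method
]

ranked = rank_by_method_support(records)
for pair, supporting in ranked:
    print(sorted(pair), len(supporting))
```

Counting distinct methods rather than reports is deliberate: two records using the same method add no independent support. This only works when sources use a shared vocabulary for methods, which is the motivation for the PSI-MI recommendation above.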

Our analysis also demonstrated the utility of PINA in translating fragmented knowledge in PPI databases to testable predictions. The application of PINA will accelerate analysis of PPI networks from the rapidly growing amount of public PPI data, and formulation of hypotheses of protein functions and cellular processes. PINA is freely available at http://csbi.ltdk.helsinki.fi/pina/.

Note: Supplementary information is available on the Nature Methods website.

References

1. Cusick, M.E., Klitgord, N., Vidal, M. & Hill, D.E. Hum. Mol. Genet. 14 (special issue 2), R171–R181 (2005).
2. Jeong, H., Mason, S.P., Barabasi, A.L. & Oltvai, Z.N. Nature 411, 41–42 (2001).
3. Kerrien, S. et al. Nucleic Acids Res. 35, D561–D565 (2007).
4. Chatr-aryamontri, A. et al. Nucleic Acids Res. 35, D572–D574 (2007).
5. Breitkreutz, B.J. et al. Nucleic Acids Res. 36, D637–D640 (2008).
6. Prieto, C. & De Las Rivas, J. Nucleic Acids Res. 34, W298–W302 (2006).
7. von Mering, C. et al. Nucleic Acids Res. 35, D358–D362 (2007).
8. Jayapandian, M. et al. Nucleic Acids Res. 35, D566–D571 (2007).
9. Chaurasia, G. et al. Nucleic Acids Res. 35, D590–D594 (2007).
10. Salwinski, L. et al. Nucleic Acids Res. 32, D449–D451 (2004).
11. Peri, S. et al. Genome Res. 13, 2363–2371 (2003).
12. Guldener, U. et al. Nucleic Acids Res. 34, D436–D441 (2006).
13. Hermjakob, H. et al. Nat. Biotechnol. 22, 177–183 (2004).
14. Cline, M.S. et al. Nat. Protocols 2, 2366–2382 (2007).
15. Katajisto, P. et al. Biochim. Biophys. Acta 1775, 63–75 (2007).
16. Peng, C.Y. et al. Cell Growth Differ. 9, 197–208 (1998).
17. Lu, R., Niida, H. & Nakanishi, M. J. Biol. Chem. 279, 31164–31170 (2004).
18. Katajisto, P. et al. Nat. Genet. 40, 455–459 (2008).
19. Eferl, R. et al. Cell 112, 181–192 (2003).
20. Moustakas, A. & Kardassis, D. Proc. Natl. Acad. Sci. USA 95, 6733–6738 (1998).

Acknowledgements

This work was supported by the European Union Sixth Framework Programme grant European Network for Functional Integration (LSHG-CT-2005-518254), Sigrid Jusélius Foundation, University of Helsinki's Research Funds, Academy of Finland (projects 125826, 213345 and 1121413), Finnish Cancer Association, Emil Aaltonen Foundation and competitive research funding of the Pirkanmaa Hospital District.

Author information

Tea Vallenius and Kristian Ovaska: These authors contributed equally to this work.

Authors and Affiliations

Genome-Scale Biology Program, Institute of Biomedicine, University of Helsinki, Haartmaninkatu 8, Helsinki, 00014, Finland
Jianmin Wu, Tea Vallenius, Kristian Ovaska, Tomi P Mäkelä & Sampsa Hautaniemi
Institute of Medical Technology, University of Tampere and Tampere University Hospital, Biokatu 8, Tampere, 33520, Finland
Jukka Westermarck
Centre for Biotechnology, University of Turku and Åbo Akademi University, Tykistökatu 6A, Turku, 23520, Finland
Jukka Westermarck

Contributions

Ji.W. designed and implemented the PINA platform; T.V. and T.P.M. designed the LKB1 case study and recurated corresponding interactions; K.O. implemented graph and GO tools; Ju.W. designed the p53–c-Jun case study and recurated corresponding interactions; T.P.M. initiated the project; S.H. did overall design and supervised the project. All authors participated in manuscript writing.

Corresponding author

Correspondence to Jianmin Wu.

Supplementary information

Supplementary Text and Figures

Supplementary Figures 1–3, Supplementary Methods (PDF 2861 kb)

Supplementary Table 1

Classification of discarded interactions in the two case studies. (XLS 53 kb)

About this article

Cite this article

Wu, J., Vallenius, T., Ovaska, K. et al. Integrated network analysis platform for protein-protein interactions. Nat Methods 6, 75–77 (2009). https://doi.org/10.1038/nmeth.1282

Received: 20 June 2008
Accepted: 18 November 2008
Published: 14 December 2008
Issue Date: January 2009
DOI: https://doi.org/10.1038/nmeth.1282
