Skip to content

XGMML files

Salvador Martinez-Bartolome edited this page Jul 26, 2017 · 14 revisions

XGMML files (the eXtensible Graph Markup and Modeling Language) are files that can be opened in Cytoscape. ProteinclusterQuant exports the data in that format in order to visualize the protein-peptide network and the quantitative analysis.

Each XGMML file contain a protein-peptide network, that can contain more than one protein cluster or connected component.

In order to open the network in Cytoscape:
1. Open Cytoscape.
2. Click on Import Network From File.
Import Network File button in Cytoscape
or go to menu: File -> Import -> Network -> File...
or press CTRL+L
3. Select the XGMML file you want to open. Then, the file will be loaded.
4. Click on Apply Preferred Layout in order to see the network properly in the screen.
Apply preferred layout button in Cytoscape

Once you open the file, you can zoom in and zoom out to look for particular clusters and nodes. Protein cluster example

You can also use the Search text box on the top right in order to search for particular protein, peptides or any associated feature.
Search input box in Cytoscape

The following table shows which are the features that are encoded in the network and therefore are searchable through the Cytoscape Search feature:

Feature name Peptide Node Protein Node Description
ACC UniProt protein Accessions UniProt accession number, i.e. A2BC19, P12345, A0A022YWF9. If the protein node contains more than one protein, they will be listed separated by spaces.
ID Uniprot entry name UniProt entry name (formerly called ID) associated with the protein, i.e. INS_HUMAN, INS1_MOUSE and INS2_MOUSE
Classification1Case 0-7 See [[Classification Schemas
Classification2Case 1-7 See [[Classification Schemas 
conclusiveProtein Boolean Whether the protein is linked to a unique peptide node
conclusiveProteinNode Boolean Whether the protein node is linked to a unique peptide node
FDR Decimal Quantification False Discovery Rate associated with the peptide node ratio.
finalRatio Decimal Final ratio of the peptide node, log2 transformed
GeneName Gene ID Gene ID associated with the protein. If the protein node contains more than one gene ID, they will be listed separated by scaces.
lightIons Integer Number of Light ions associated with the peptide node (just in case of isobaric-isotopologue quantitation).
heavyIons Integer Number of heavy ions associated with the peptide node (just in case of isobaric-isotopologue quantitation).
ionCount Integer Number of ions associated with the peptide node (just in case of isobaric-isotopologue quantitation). It is equals to the sum of heavyIons + lightIons.
isProtein 0 1 0 for peptide nodes, and 1 for protein nodes
name Peptide sequences Protein accessions Node name
numConnectedPeptideNodes Integer Number of connected peptide nodes associated with the protein node.
numConnectedProteinNodes Integer Number of connected protein nodes associated with the peptide node.
numMSRuns Integer Number of different MS runs in which the peptide node has been detected
numPeptideSequences Integer Number of different peptide sequences included in the peptide node.
numPeptideSequencesInProteins Integer Number of different peptide sequences connected to the protein nodes.
numProteins Integer Integer Number of proteins in the protein node, or number of different proteins connected to the peptide node
numPsms Integer Number of PSMs in the peptide node.
numPsmsInProtein Integer Number of PSMs associated with the protein node
PCQ_ID Peptide sequences Unique identifier for the peptide nodes containing the peptide sequences of the peptide node.
PeptideSequences Peptide sequences Peptide sequence of the peptide node. It is equals to PCQ_ID.
ProteinDescription Protein description Description of the proteins of the protein node. If the protein node contains more than one protein, the descriptions will be separated by '####'
shared name Peptide sequences UniProt protein Accessions Identifier of the node.
significant Boolean 0 for False, 1 for True
Species Species Taxonomy species of the protein in the protein node.
UniprotKB UniProt protein Accessions UniProt accession number. If the protein node contains more than one protein, they will be listed separated by spaces.
uniquePeptide Boolean Whether the peptide is unique (is connected only to one protein) or not
uniquePeptideNode Boolean Whether the peptide node is unique (is connected only to one protein node) or not
Variance Decimal Calculated variance associated with the peptide node ratio. A lower variance means a more reliable ratio.
Weight Decimal Calculated weight associated with the peptide node ratio. A higher weight means a more reliable ratio.

Additionally, you can include new columns for annotations from UniprotKB with the option 'uniprot_xpath' in the input parameters file. Those annotations will be then searchable in Cytoscape. See input parameters file for more information.