TopoGSA Tutorial
This tutorial will walk through an example analysis to explain the usage of TopoGSA and illustrate its features. As an alternative, you can watch the video tutorial (if your browser supports the Flash Video format). If you only require information on specific parts of the analysis procedure, you can jump to the corresponding section using the links below:

TopoGSA - default analysis: Special analysis types: Troubleshooting:


1. Preparing and submitting your gene/protein sets


TopoGSA accepts lists of genes, proteins or microarray probe-identifiers as input. These lists of identifiers are mapped onto a molecular interaction network by first converting the identifiers into the gene/protein name format of the network (if necessary) and then identifying which of the molecules are present in the network. To guarantee a successful conversion process, the identifiers should have one of the following formats (depending on the species under consideration): Please make sure to choose the correct format for your dataset from the drop-down menu "Species and identifier format" on the start page. If you would like to analyse an example dataset first, you can select one of the options from the "Insert example data" menu. In this tutorial we will use the first option, the comprehensive set of human cancer genes (Futreal et al., 2004), as example. Fig. 1 shows the correct settings on the web-interface for this dataset (for all examples the settings are configured automatically).

example settings
Fig. 1: web-interface settings for the human cancer genes example dataset

To upload your own dataset, please insert only one gene/protein identifier per line and use the same identifier format for each entry (automatic disambiguation of homonyms is not supported). The maximum allowed dataset size is 1000 genes or proteins (larger datasets would cover major parts of the interaction network and would not be suitable for comparison against much smaller reference datasets).
Optionally, you can assign class labels to the identifiers in order to group subsets of genes together (data points with the same class label will receive the same colour when plotting the results). For this purpose, arbitrary class labels (e.g. "tumour"/"normal", "class1"/"class2"/"class3") can be written behind the gene/protein names, separated by a comma. For example,

ENSG00000002016,tumour
ENSG00000005339,tumour
ENSG00000005339,normal
ENSG00000134058,normal
ENSG00000010704,tumour
...


would be a valid gene set with class labels to be mapped on a human interaction network (to view a real-world example with class labels, you can select the 'Microarray, two classes' dataset from the example data menu).

Before submitting a dataset and also during a gene/protein set analysis the user can always switch between two analysis types: If you wish to skip the individual dataset analysis (the default first step), you can directly submit your dataset to the comparative analysis, by choosing an option from the "Comparative analysis" menu on the main interface (see Fig. 1). For the human species, the available reference datasets include gene/protein sets from KEGG and BioCarta signalling pathways, Gene Ontology and InterPro (for non-human species, the reference datasets are extracted from the Gene Ontology and/or the MetaCyc database)). Human gene or protein lists can be mapped onto two alternative human protein interaction networks: To use the second network above, simply activate the check-box "Use only proteome-scale datasets" on the main interface (see Fig. 1).
To submit your dataset, click on the "Start analysis" button. Instructions on how to use alternative analysis types can be found in section 5 (Using microarray data as input) and 6 (Using your own interaction network as input).




2. Checking gene/protein name conversion and network mapping

After submitting a dataset, the user will first be presented with conversion and mapping summary, revealing which genes or proteins from the uploaded list could successfully converted into a standard identifier format and were present in the interaction network for the chosen species (human, yeast, fly, plant or worm). For the human cancer genes example data set, the first entries of the resulting conversion and mapping table are shown below:

original identifiers converted identifiers present in network
ABL1 ENSG00000097007 yes
ABL2 ENSG00000143322 yes
ACSL3 ENSG00000123983 yes
AF15Q14 not found no
... ... ...

By clicking on the column titles, you can sort each column alphabetically and quickly identify whether a specific gene/protein in the uploaded list could not be converted to the standard format in the network (in the example above: from HGNC SYMBOL format to ENSEMBL format, e.g. the human cancer gene AF15Q14 could not be converted), or whether the corresponding molecule is not present in the network (see last column). If none of the input identifiers could be converted successfully, the system will attempt to detect the identifier format automatically. If this automatic identification process fails, the user will be notified. The submitted gene list and the selected identifier type will be shown to help the user in solving problems related to the input format.




3. Analysing topological properties of genes/proteins in your dataset

In order to obtain a first general overview on the network topological properties of the uploaded gene/protein set, the first outcome is a statistics table, providing the user with the averaged values for five topological properties (shortest path length centrality, node betweenness, node degree, local clustering coefficient and eigenvector centrality) computed for the uploaded gene set, 10 matched-size random gene sets in the network (as a random model) and for the entire network. As example, the output for the human cancer genes example dataset is shown below (see Tab. 1).

Shortest path lengthNode betweennessDegreeClustering coefficientEigenvector centrality
Uploaded gene/protein set3.745446021.370.10.05
10 random simulations (mean)4.84 (±0.04)5052 (±764)3.97 (±0.26)0.02 (±0)0.03 (±0)
Static mean over entire network4.12 (±0.94)14669 (±68893)8.27 (±16.2)0.11 (±0.21)0.02 (±0.04)
Tab. 1: general topological statistics for the human cancer genes example dataset, a random model and the entire network

In this table, the user can for example see that the average node betweenness and degree of the uploaded gene set exceed the corresponding values for the matched-size random gene sets by several standard deviations. Please note that the topological properties are not necessarily normally distributed, and care must be taken when interpreting these averaged results: The user should additionally check the 2D and 3D plots of the topological properties (see below), to determine whether the dataset contains outliers.
The different network topological properties of genes/proteins from the uploaded dataset can be plotted and visualized. To investigate other pairs or triples of topological properties in the 2D- and 3D-plots, the user can select a property of interest for each coordinate (X- Y- and optionally Z-axis) and click on the "Regenerate gene set plot" button.
Moreover, before regenerating the plots, a filtering can be applied to each property, in order to remove data points below a given percentile threshold. As an alternative to the filtering option, the user might prefer to use the in-built zooming feature in the 2D and 3D plots. All plots are interactive and allow the user to zoom into the plot by using the mouse wheel (or left-click on empty areas in the 2D plot and Shift + left-click in the 3D plot) and obtain functional annotations for the plotted genes/proteins, by clicking on the corresponding data points. An example 2D plot for the human cancer genes dataset by Futreal et al. (2004) is shown below (see Fig. 2) and also provides the interactive zoom- and annotation-features mentioned above). In this 2D example plot, one can clearly see that the tumour suppressor gene P53 (symbol: TP53) has outstanding network properties (a high node betweeness and a small shortest path length centrality) in comparison to the other genes in the dataset. The 2D and 3D visualizations of the topological features for each single molecule in the gene/protein set also enable the user to compare coloured sub-groups of identifiers, if the user has provided class labels for the uploaded dataset (see Section 1).
zoom out
1.0
2.0
3.0
4.0
5.0
6.0
7.0
0
250000
500000
750000
1000000
1250000
1500000
1750000
2000000
2250000
Uploaded genes

Fig. 2: Interactive 2D topological properties plot for the human cancer genes example dataset. Here, the node betweenness is plotted against the shortest path length (use left mouse-click or scroll-wheel to zoom into the plot, hover mouse over data points to see annotation data) - below the corresponding 3D plot is shown (Java-enabled browser required)



The 3D-plots are available in two different formats to ensure compatibility with the user's system (for Java-enabled browsers, the LiveGraphics3D plot can be used; otherwise, free VRML-browser plug-ins can be used to explore the data in a VRML-environment).





4. Comparing your dataset to reference datasets

By clicking on the "Compare to existing datasets" button at the bottom of the individual dataset analysis page or by selecting a reference database in the "Comparative analysis" menu on the start page, the user can compare an uploaded gene/protein set against a collection of reference datasets. These reference gene/protein sets represent cellular pathways and processes, molecular functions or sub-cellular localisations and have been collected from public annotation databases including KEGG, Gene Ontology, BioCarta, InterPro and MetaCyc. For the human species reference gene sets are available from each of these databases, except the MetaCyc database, and for other species, the reference datasets were obtained from the Gene Ontology database and/or MetaCyc.
TopoGSA provides two possibilities to compare the topological properties of these reference datasets against the uploaded dataset: In analogy to the previous individual dataset analysis, interactive 2D and 3D plots are also available for the comparative analysis. However, since there are many reference gene sets with large sizes, only the median values of their topological properties are plotted to prevent overcrowded plots.
Fig 3. below shows a 2D comparative plot for the human cancer genes example dataset in comparison with datasets from the KEGG database. The KEGG datasets are coloured based on the top-level hierarchy of the KEGG-BRITE categorization of cellular pathways to facilitate the distinction between different types of pathways. Just as for the plots in the individual dataset analysis, the user can zoom into the plot with the mouse (using the mouse-wheel or left-click into empty regions) and see labels by hovering the mouse over data points. Moreover, the user can highlight data points by clicking on them and the corresponding entries in the similarity ranking table below will be highlighted in red text colour.
zoom out
0.0
1.0
2.0
3.0
4.0
5.0
0
250000
500000
750000
1000000
1250000
1500000
1750000
2000000
2250000
Metabolism
Genetic Information Processing
Cellular Processes
Environmental Information Processing
Human Diseases
uploaded gene set (median)
uploaded gene set (all)

Fig. 3: Comparative plot for the human cancer genes example dataset in combination with KEGG gene sets (this is an interactive plot, which has been "zoomed" to an interesting region) - below the corresponding 3D plot is shown (Java-enabled browser required)


In addition to this qualitative comparison of topological properties, the similarity ranking table (see Tab. 2 below) provides a quantitative comparison of the uploaded dataset with the reference datasets, based on a similarity score (given in the last column of the ranking table). The similarity score is obtained by computing 5 ranks for each pathway/process set according to the absolute diffences between each of its 5 median topological properties and the corresponding value for the uploaded gene/protein set. The sum of ranks across all topological properties is then computed and normalized to a range between 0 and 1. As with previous data tables, each column can be sorted alpha-numerically by clicking on the corresponding column-header. Moreover, the user can identify whether datasets are topologically similar due to a large overlap between their genes or proteins, in the column displaying the number of genes in the gene sets (2nd column from the right), the size of the overlap between the reference gene sets and the user-defined gene set is shown in brackets.


[Legend: CC = clustering coefficient; SPL = shortest path length; BW = betweenness; EVC = eigenvector centrality]

IdentifierMedian
degree
Median
CC
Median
SPL
Median
BW
Median
EVC
No. of
genes
uploaded gene set100.073.6798950.02311

Identifier  Median degree  Median CC  Median SPL  Median BW  Median EVC  No. of genes  Score  
hsa04120:Ubiquitin mediated proteolysis90.053.68119490.02105 (11)0.09
hsa04530:Tight junction10.50.043.6796860.02106 (11)0.1
hsa04010:MAPK signaling pathway120.053.6493900.03225 (22)0.11
hsa04810:Regulation of actin cytoskeleton13.50.053.65112770.02184 (19)0.11
hsa04650:Natural killer cell mediated cytotoxicity110.043.6671690.02117 (11)0.12
hsa04150:mTOR signaling pathway14.50.073.64148830.0248 (8)0.12
hsa04310:Wnt signaling pathway100.043.7696070.01123 (9)0.13
hsa04540:Gap junction120.033.67126960.0279 (10)0.14
hsa04512:ECM-receptor interaction100.063.8364070.0173 (1)0.15
hsa04912:GnRH signaling pathway140.043.65118840.0282 (8)0.15
hsa04350:TGF-beta signaling pathway140.053.63155940.0282 (4)0.16
hsa05030:Amyotrophic lateral sclerosis (ALS)70.053.722440.0219 (2)0.16
hsa04910:Insulin signaling pathway140.073.58144850.03123 (16)0.17
hsa05020:Parkinson's disease90.033.74135200.0120 (0)0.17
hsa04330:Notch signaling pathway150.083.59115340.0340 (5)0.18
hsa04916:Melanogenesis100.033.82115000.0180 (13)0.19
hsa04630:Jak-STAT signaling pathway70.123.778170.03132 (23)0.19
hsa04510:Focal adhesion150.063.55142370.03181 (23)0.2
hsa04730:Long-term depression140.043.74146650.0168 (7)0.21
hsa04670:Leukocyte transendothelial migration160.053.57133950.0399 (7)0.21
hsa04360:Axon guidance70.033.8268920.01111 (7)0.21
hsa04950:Maturity onset diabetes of the young7.50.023.8165350.0118 (0)0.22
hsa04720:Long-term potentiation150.043.58159450.0363 (8)0.24
hsa04620:Toll-like receptor signaling pathway150.093.56142370.0389 (6)0.24
hsa04640:Hematopoietic cell lineage80.13.951200.0182 (3)0.25
hsa01430:Cell Communication60.073.9234830.0184 (1)0.26
hsa04514:Cell adhesion molecules (CAMs)60.063.9853300.01109 (1)0.27
hsa04610:Complement and coagulation cascades80.063.994951067 (0)0.27
hsa04070:Phosphatidylinositol signaling system50.013.7621170.0155 (3)0.28
hsa04020:Calcium signaling pathway60.043.929520.01151 (6)0.31
hsa04940:Type I diabetes mellitus70.064.27169043 (2)0.31
hsa05010:Alzheimer's disease160.053.63248710.0228 (0)0.32
hsa03022:Basal transcription factors13.50.153.844670.0128 (0)0.33
hsa04060:Cytokine-cytokine receptor interaction60.094.0135070.01216 (14)0.33
hsa04115:p53 signaling pathway210.073.57188550.0259 (14)0.34
hsa04740:Olfactory transduction4.503.9563290.0120 (0)0.35
hsa00240:Pyrimidine metabolism50.043.881523053 (0)0.35
hsa04710:Circadian rhythm90.173.8757870.0112 (1)0.35
hsa05217:Basal cell carcinoma60.054.115875043 (5)0.35
hsa05211:Renal cell carcinoma16.50.073.51162920.0466 (17)0.37
hsa00860:Porphyrin and chlorophyll metabolism303.89111620.0113 (0)0.38
hsa04140:Regulation of autophagy40.023.9327420.0117 (0)0.38
hsa04920:Adipocytokine signaling pathway18.50.083.54178460.0362 (8)0.38
hsa00790:Folate biosynthesis403.9693900.0125 (2)0.39
hsa03020:RNA polymerase70.183.886255015 (0)0.39
hsa05120:Epithelial cell signaling in Helicobacter pylori infection17.50.053.53190590.0354 (5)0.4
hsa04742:Taste transduction50.013.9927110.0122 (1)0.4
hsa04930:Type II diabetes mellitus150.043.61212230.0343 (3)0.4
hsa04612:Antigen processing and presentation60.044.196661070 (2)0.41
hsa03320:PPAR signaling pathway3.503.9112380.0138 (2)0.41
hsa04130:SNARE interactions in vesicular transport100.14.174764028 (0)0.42
hsa04110:Cell cycle20.50.083.54198180.03106 (15)0.44
hsa04370:VEGF signaling pathway160.043.5157640.0463 (8)0.44
hsa00562:Inositol phosphate metabolism403.9111190.0138 (2)0.44
hsa03030:DNA polymerase50.143.8712950.0121 (0)0.45
hsa05060:Prion disease180.053.51268010.0313 (1)0.45
hsa04664:Fc epsilon RI signaling pathway180.063.45158990.0767 (10)0.47
hsa05218:Melanoma170.043.51185680.0464 (25)0.47
hsa00500:Starch and sucrose metabolism304.0118630.0138 (2)0.5
hsa04340:Hedgehog signaling pathway40.014.162586043 (2)0.5
hsa05222:Small cell lung cancer210.083.51262560.0385 (14)0.51
hsa03050:Proteasome90.44.082083021 (0)0.51
hsa05110:Cholera - Infection504.051120029 (1)0.51
hsa00230:Purine metabolism304.031155087 (1)0.52
hsa04662:B cell receptor signaling pathway170.073.48207790.0661 (13)0.53
hsa04660:T cell receptor signaling pathway180.13.49179600.0689 (18)0.53
hsa04080:Neuroactive ligand-receptor interaction404.1318910198 (1)0.54
hsa00632:Benzoate degradation via CoA ligation303.97136011 (1)0.55
hsa00350:Tyrosine metabolism304.061016025 (1)0.55
hsa00010:Glycolysis / Gluconeogenesis3.504.11984038 (0)0.56
hsa00710:Carbon fixation5.504.13963016 (0)0.56
hsa00360:Phenylalanine metabolism303.9820013 (1)0.57
hsa00051:Fructose and mannose metabolism2.504.041138022 (0)0.57
hsa00251:Glutamate metabolism2.504.09956020 (1)0.57
hsa00100:Biosynthesis of steroids304.04886011 (0)0.58
hsa00252:Alanine and aspartate metabolism304.142633019 (0)0.59
hsa00450:Selenoamino acid metabolism304.06191013 (0)0.6
hsa05216:Thyroid cancer200.073.45262560.0529 (17)0.6
hsa00030:Pentose phosphate pathway304.11599019 (0)0.61
hsa04614:Renin-angiotensin system404.262046016 (0)0.61
hsa00310:Lysine degradation204.03508025 (1)0.61
hsa04210:Apoptosis210.093.51215030.0481 (11)0.61
hsa05210:Colorectal cancer22.50.073.44319630.0584 (22)0.62
hsa00052:Galactose metabolism304.14472017 (0)0.62
hsa00565:Ether lipid metabolism204.1614016 (1)0.63
hsa03010:Ribosome304.19417043 (0)0.64
hsa00600:Sphingolipid metabolism2.504.08184020 (0)0.64
hsa05219:Bladder cancer24.50.063.5390070.0442 (16)0.64
hsa00150:Androgen and estrogen metabolism2.50.014.27193014 (0)0.65
hsa05213:Endometrial cancer240.073.38330660.0751 (19)0.65
hsa04012:ErbB signaling pathway240.073.39285890.0786 (18)0.65
hsa00561:Glycerolipid metabolism204.056019 (0)0.66
hsa00903:Limonene and pinene degradation204.12125013 (1)0.67
hsa01032:Glycan structures - degradation204.15116016 (0)0.68
hsa05215:Prostate cancer280.063.37356600.0784 (28)0.69
hsa00910:Nitrogen metabolism304.2147013 (0)0.69
hsa05223:Non-small cell lung cancer290.073.35382530.0855 (17)0.69
hsa02010:ABC transporters - General304.41606021 (0)0.69
hsa00340:Histidine metabolism204.1314016 (0)0.7
hsa05212:Pancreatic cancer300.073.35457880.0775 (19)0.7
hsa00970:Aminoacyl-tRNA biosynthesis204.2221018 (1)0.7
hsa05130:Pathogenic Escherichia coli infection - EHEC23.50.043.41336090.0540 (4)0.7
hsa05131:Pathogenic Escherichia coli infection - EPEC23.50.043.41336090.0540 (4)0.7
hsa00410:beta-Alanine metabolism104.140011 (0)0.7
hsa05221:Acute myeloid leukemia270.093.39325210.0757 (18)0.7
hsa00380:Tryptophan metabolism204.144030 (0)0.71
hsa00361:gamma-Hexachlorocyclohexane degradation204.23315011 (0)0.71
hsa00530:Aminosugars metabolism204.24444014 (0)0.71
hsa04520:Adherens junction260.053.38335140.0771 (10)0.71
hsa00564:Glycerophospholipid metabolism204.25163036 (1)0.71
hsa04320:Dorso-ventral axis formation300.063.32328140.122 (10)0.71
hsa01031:Glycan structures - biosynthesis 2304.54543019 (0)0.72
hsa00510:N-Glycan biosynthesis204.19111018 (1)0.72
hsa05214:Glioma280.053.35374250.0761 (21)0.73
hsa00020:Citrate cycle (TCA cycle)204.36263020 (4)0.74
hsa00260:Glycine, serine and threonine metabolism204.28155020 (0)0.74
hsa00620:Pyruvate metabolism204.39319021 (0)0.74
hsa00330:Arginine and proline metabolism204.41198023 (0)0.75
hsa00590:Arachidonic acid metabolism204.34113026 (0)0.75
hsa05220:Chronic myeloid leukemia360.083.3492180.176 (25)0.75
hsa00740:Riboflavin metabolism204.42315011 (0)0.76
hsa01510:Neurodegenerative Diseases330.043.33676120.0538 (4)0.76
hsa00480:Glutathione metabolism204.51140018 (2)0.77
hsa00512:O-Glycan biosynthesis104.320013 (0)0.77
hsa01030:Glycan structures - biosynthesis 11.504.30038 (3)0.77
hsa05040:Huntington's disease310.033.35596580.0528 (6)0.77
hsa00071:Fatty acid metabolism104.230018 (1)0.78
hsa00220:Urea cycle and metabolism of amino groups1.504.31014 (0)0.78
hsa00190:Oxidative phosphorylation1.504.3925056 (2)0.79
hsa05050:Dentatorubropallidoluysian atrophy (DRPLA)320.033.31636520.0513 (0)0.79
hsa00280:Valine, leucine and isoleucine degradation104.280014 (0)0.8
hsa00980:Metabolism of xenobiotics by cytochrome P450104.390017 (0)0.81
hsa00650:Butanoate metabolism104.340018 (0)0.81
hsa00640:Propanoate metabolism104.430015 (0)0.83
hsa00591:Linoleic acid metabolism104.570011 (0)0.84

Tab. 2: Ranking of KEGG gene sets based on topological similarity to uploaded gene set


In order to regenerate the results for another reference data set, change the filtering settings or plot other topological properties, the user only needs to adjust the corresponding settings at the bottom of the page and click on the "Regenerate comparative plot and statistics" button.




5. Using microarray data as input

Gene sets of interest are often derived from experimental data, e.g. from high-throughput measurement devices like DNA microarrays. Extracting differentially expressed genes from a microarray study, using reliable significance scores and annotating the genes with current data from public databases, can be a time-consuming and error-prone process, especially for researchers with limited previous experience in this field. For this reason, we have combined TopoGSA with a "Gene Selection Analysis" module from our web-application for automatic analysis of microarray data, ArrayMining.net.
By clicking on the "Use microarray data" button on the TopoGSA start page, the user is re-directed to a new web-interface, which supports the upload of microarray data instead of only gene or protein sets (the array data has to contain gene and sample class labels, multiple replicates per sample class and needs to be uploaded either in a flat-file format (see help and example datasets on the corresponding interface) or as zip-compressed Affymetrix CEL-files with an additional text-file containing the class labels). This web-application can automatically extract a ranked list of differentially expressed genes from a microarray data set, compute q-value significance scores (using the Benjamini-Hochberg method), box plots and a heat map (see Fig. 4, and forward the extracted list of genes to TopoGSA. The user can choose between 6 different feature selection methods (eBayes, SAM, PLS-CV, CFS, RF-MDA, and a majority-vote ensemble method combining all algorithms together), and specify the desired maximum size of extracted differentially expressed genes.
When forwarding the resulting gene list to TopoGSA, class labels specifying whether a gene's expression values had a positive or negative correlation with ordinal class labels, will automatically be added to the list. These genes will receive different colours in TopoGSA's topological properties plot, enabling the user for example to distinguish between up- and down-regulated genes in two different biological conditions and compare their topological characteristics.

microarray heat map example
Fig. 4: Heat map for differentially expressed genes extracted from microarray data and forwarded to TopoGSA





6. Using your own interaction network as input

Although TopoGSA provides protein-protein interaction network for 5 different species (human, yeast, fly, plant, worm), we realise that some users might want to use their own network, for instance for other species or other types of molecules and relations between them (e.g. gene co-expression networks. genetic interactions). We therefore provide a second web-interface, enabling the user to upload a network in combination with a molecule list (e.g. gene/protein list) of interest. The data submission process consists of a simple two-step procedure: The network file uploaded in the first step can contain molecules and relations of any type. The file format is a two-column flat-file format (also known as edge list-format), where column 1 specifies the first interactor and column 2 the second interactor (separated by a space or a tab). For example

ENSG00000000938 ENSG00000080824
ENSG00000051180 ENSG00000002016
ENSG00000179348 ENSG00000005339
ENSG00000134058 ENSG00000008128
ENSG00000072274 ENSG00000010704
...

could be the beginning of a valid network file, where all genes/proteins are in the same format. Optional comment lines in the network-file can be added by using the #-symbol as the first character in a line.
In the second step, the user has to upload the molecule list of interest (this corresponds to the gene or protein list that a user would upload on the main TopoGSA interface, when using pre-defined networks). The molecule identifiers have to match the format of the node labels in the network file to guarantee a successful mapping process. Finally, by pressing "Start Analysis", the processing of the uploaded network will begin, and the user will be notified, when the topological properties of the network and molecule list have been computed.
The user should note that the automatic extraction, conversion and consolidation of molecule identifiers from functional annotation databases is currently not supported for arbitrary types of species and molecules. Thus, on the web-interface for user-defined networks we can only provide a comparison of subsets within the set of identifiers provided by the user. For the same reason, the automatic conversion of molecule identifiers into the format used by the network is not supported on this interface.
In order to compare subsets of identifiers, class labels have to be specified for each molecule in the uploaded identifier list (see the first section on "Preparing and submitting your gene/protein set").
An analysis with a user-defined network will generally require more time than an analysis using the default networks, because not only the topological properties of a small subset of nodes have to be computed, but also the entire network has to be pre-processed to compute the statistics for the global network and the matched-size random subsets for the random model. For this reason, after having uploaded a network and a molecule list, the user can bookmark the webpage on which the results will be displayed after data processing, or alternatively, provide an e-mail address to obtain a notification message, when the results are available.




7. Recommended settings to solve browser compatibility issues

Depending on the user's browser and operating system, some of the dynamic or interactive features on TopoGSA (e.g. the video tutorial or the 3D plots) might not be available without installing a browser-plugin or switching to a more common or recent version of the web-browser or operating system.
We have tested TopoGSA using the web-browsers Firefox 3.6, Internet Explorer 8.0.6 and Opera 10.50 on the operating systems Microsoft Windows XP Professional, Ubuntu Linux 9.10, MacOS X Leopard (version 10.5). The recommend screen resolution is 1280x1024. To access the 3D-plots, the user's browser either needs to be Java-enabled (for the LiveGraphics3D-plot) or contain a VRML 2.0 plugin (for the VRML-plot).
Should you experience any layout or compatibility problems while visiting TopoGSA, we recommend that you use one of the above configurations. Also, please notify us on the contact page about any difficulties/problems you might experience with the preparation and analysis of your data.




Go to tutorial menuGo to start page