Skip to content

Pathway Enrichment

Gene Set Enrichment Analysis (GSEA) helps you to identify enriched reactions and pathways in your metabolic model using a ranked list file that you upload.

Analysis is done by the gseapy Python package (via the gp.prerank function).

Input

You need to provide a ranked list file containing reaction IDs and associated scores.

Format (comma-separated, .csv):

Reaction      Rank
PMANM         5.881484159
SELCYSLY      5.667135956
SELCYSLY2     5.667135956
FBP           5.548850782
FBP26         5.548850782
PI4PLC        5.376448076
PI45PLC       5.376448076
PIPLC         5.376448076
  • Reaction → identifier of a metabolic reaction in the model
  • Rank → score indicating importance (e.g., fold change, z-score, differential expression statistic)

Parameters

Parameters

In the modal, you can configure the following analysis options:

Parameter Description Default Notes
Permutations Number of random permutations to compute statistical significance 1000 Higher values = more stable p-values, but slower
Min Size Minimum pathway size (number of reactions) to be tested 15 Avoids very small pathways
Max Size Maximum pathway size (number of reactions) to be tested 500 Avoids overly broad pathways
Metrics Statistical measures reported FDR q-val = False Discovery Rate
FWER p-val = Family-wise Error Rate
p-value cutoff Significance threshold to filter enriched pathways 0.05 or 0.01 Applied to FDR/FWER values

Results

After running GSEA, a ranked table of enriched pathways is displayed:

GSEA result table

Each row corresponds to a significantly enriched pathway.

  • Clicking on a row will:
    1. Open a reaction–reaction network displaying the subset of reactions responsible for the enrichment.
    2. Highlight these reactions as green nodes in the main canvas, linking enrichment back to the global network view.
Column Description
pathway Name of the pathway (from your gene_sets database, e.g. KEGG, MetaCyc, or custom).
es Enrichment Score → raw measure of how strongly the pathway’s reactions are skewed toward the top or bottom of the ranked list.
nes Normalized Enrichment Score → ES adjusted for pathway size and permutation background, allowing fair comparison across pathways.
fdr False Discovery Rate (q-value) → estimated probability that a pathway identified as enriched is a false positive.
fwer Family-Wise Error Rate p-value → conservative correction accounting for multiple hypothesis testing.
nom Nominal p-value → unadjusted significance of enrichment for that pathway.
gene % Proportion of reactions in the pathway that appear in the ranked list.
tag % Proportion of ranked list contributing to the enrichment score (leading-edge reactions).
lead_genes The genes / reactions that caused the pathway to be enriched

Show the enriched reactions

Important

Only pathways meeting the p-value cutoff are shown in the table.