Proteogenomics
Where the genome meets the proteome — using protein evidence to refine annotation and discover novel and variant peptides.
What this area is.
Proteogenomics closes the loop between DNA and protein. By searching mass spectra against sample-specific databases built from genomic variants, we confirm that predicted changes are actually translated.
This validates annotation, uncovers novel ORFs and variant peptides, and identifies candidates such as tumour neoantigens that matter for immunotherapy.
Tools & technologies
What we do.
Core methods we apply in proteogenomics.
Variant protein databases
Custom search spaces built from a sample's genome/transcriptome.
Peptide–spectrum matching
Confident identification against expanded databases.
Novel peptide discovery
Finding ORFs and peptides missing from references.
Variant-peptide validation
Confirming that genomic variants are translated.
Neoantigen candidates
Tumour-specific peptides for immunotherapy.
Annotation refinement
Improving gene models with protein evidence.
From data to insight.
How a proteogenomics project flows end to end.
Genome/RNA
variant calls
Custom DB
variant proteins
MS search
spectra matching
Validate
variant peptides
Discover
novel ORFs
Apply
neoantigens · annotation
Publication-grade figures.
Interactive, live-rendered visualisations used in proteogenomics.
Where we go deep.
Cancer neoantigens
Tumour-specific variant peptides for immunotherapy design.
Annotation refinement
Protein-level evidence correcting gene models.
Variant validation
Showing that genomic variants reach the proteome.
Questions we answer.
A few of the things people ask about proteogenomics — and our short answers. Ask CGB-AI for more.
What does proteogenomics add?
It provides protein-level proof that a genomic variant is expressed — crucial for annotation and for neoantigen discovery.
Why custom databases?
Reference protein databases miss sample-specific variants; building the search space from the genome lets those peptides be found.
Publications in Proteogenomics.
Drawn from our full record of 173 papers, filtered to this area.
Start a proteogenomics project.
Tell us the biological question and the data you have — we will map out an approach.