Sohrab Shah

Chief of Computational Oncology at MSKCC

Sohrab Shah received a PhD in computer science from UBC in 2008 and did a Postdoctoral Fellowship under Dr. Aparicio and Dr. Huntsman at BC Cancer. He was soon appointed Principal Investigator in 2010 and was Senior Scientist and Associated Professor at BC Cancer and the University of British Columvia, respectively. As well, Dr. Shah was appointed to MSK in April 2018 as the inaugural Chief of the Computational Oncology Service and is the incumbent of the Nicholls-Biondi Chair. He held the Canada Research Chair in Computational Cancer Genomics, and was the recipient of both a Michael Smith Foundation for Health Research Career Investigator Award and a Terry Fox Research Institute New Investigator Award. His research focuses on understanding how tumours evolve over time through integrative approaches involving genomics and computational modeling. Dr. Shah has pioneered computational methods and software for inference of mutations in cancer genomes as well as deciphering patterns of cancer evolution which have been widely disseminated internationally. He has a track record of developing novel, innovative Bayesian statistical models, algorithms, and computational approaches to analyze large, high dimensional genomics and transcriptomic data sets, from both patient tumours and model systems. This includes advancing molecular profiling of cancer cells at single cell resolution. Dr. Shah has been at the forefront of studying tumor evolution in breast, ovary and lymphoid malignancies. His work has been published in Nature, Nature Genetics, Nature Methods, NEJM, Genome Research, Genome Biology, amongst others. Dr. Shah oversees an annual budget of >$1M in competitively awarded funding from philanthropic, government and international bodies.


Clonal fitness inferred from time-series modelling of single-cell cancer genomes.

Epiclomal: Probabilistic clustering of sparse single-cell DNA methylation data.

Chemogenomic profiling of breast cancer patient-derived xenografts reveals targetable vulnerabilities for difficult-to-treat tumors.

Eleven grand challenges in single-cell data science.

Clonal Decomposition and DNA Replication States Defined by Scaled Single-Cell Genome Sequencing.

Dissociation of solid tumor tissues with cold active protease for single-cell RNA-seq minimizes conserved collagenase-associated stress responses.

Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling.

Pharmacological systems analysis defines EIF4A3 functions in cell-cycle and RNA stress granule formation.

clonealign: statistical integration of independent single-cell RNA and DNA sequencing data from human cancers.

Interfaces of Malignant and Immunologic Clonal Dynamics in Ovarian Cancer.

Engineered in-vitro cell line mixtures and robust evaluation of computational methods for clonal decomposition and longitudinal dynamics in cancer.

Genomic consequences of aberrant DNA repair mechanisms stratify ovarian cancer histotypes.

CDK12 regulates alternative last exon mRNA splicing and promotes breast cancer cell invasion.

ddClone: joint statistical inference of clonal populations from single cell and bulk tumour sequencing data.

CLK-dependent exon recognition and conjoined gene formation revealed with a novel small molecule inhibitor.

CX-5461 is a DNA G-quadruplex stabilizer with selective lethality in BRCA1/2 deficient tumours.

Scalable whole-genome single-cell library preparation without preamplification.

Robust high-performance nanoliter-volume single-cell multiple displacement amplification on planar substrates.

Clonal genotype and population structure inference from single-cell tumor sequencing.

Divergent modes of clonal spread and intraperitoneal mixing in high-grade serous ovarian cancer.

The somatic mutation profiles of 2,433 breast cancers refines their genomic and transcriptomic landscapes.

Systematic analysis of somatic mutations impacting gene expression in 12 tumour types.

DNA barcoding reveals diverse growth kinetics of human breast tumour subclones in serially passaged xenografts.

Dynamics of genomic clones in breast cancer patient xenografts at single-cell resolution.

A tumor DNA complex aberration index is an independent predictor of survival in breast and ovarian cancer.

TITAN: inference of copy number architectures in clonal cell populations from tumor whole-genome sequence data.

PyClone: statistical inference of clonal population structure in cancer.

DriverNet: uncovering the impact of somatic driver mutations on transcriptional networks in cancer.

Integrative analysis of genome-wide loss of heterozygosity and monoallelic expression at nucleotide resolution reveals disrupted pathways in triple-negative breast cancer.

The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.

The clonal and mutational evolution spectrum of primary triple-negative breast cancers.

JointSNVMix: a probabilistic model for accurate detection of somatic mutations in normal/tumour paired next-generation sequencing data.

Recurrent somatic DICER1 mutations in nonepithelial ovarian cancers.

Feature-based classifiers for somatic mutation detection in tumour-normal paired sequencing data.

ARID1A mutations in endometriosis-associated ovarian carcinomas.

Somatic mutations altering EZH2 (Tyr641) in follicular and diffuse large B-cell lymphomas of germinal-center origin.

Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution.

Mutation of FOXL2 in granulosa-cell tumors of the ovary.