my research focuses on building scientifically-grounded models for multimodal biological data, with a primary focus on oncology. in particular, developing structured multi-omics foundation models that respect the causal and statistical relationships between genomic modalities. a central theme is the incorporation of domain knowledge as structured inductive biases, rather than treating biological data as generic sequences to be modelled with off-the-shelf architectures, aligning models with the underlying biology.
in particular, i focus on three central research questions: