Xinming Tu

Univerisity of Washington. Seattle.

prof_pic_PKU.jpg

Weiming Lake, Bo Ya Ta, Peking University

I am a fifth year PhD student at the Paul Allen School of Computer Science and Engineering at University of Washington supervised by Sara Mostafavi. Before my PhD, I earned BSs in Biology and Computer Science from Peking University. During my undergraduate, I did some bioinformatics research at Gao Lab. Please see my CV for more information.

My research interest spans two interconnected directions. First, I work on AI for Regulatory Genomics, including sequence-to-function prediction, multi-omics integration, and perturbation modeling—with a growing focus on developing agentic systems that can autonomously design and execute experiments. Second, I am expanding toward AI for Open-ended Science Discovery, creating LLM-based agents capable of autonomous reasoning and discovery across diverse scientific domains. My goal is to accelerate the arrival of the AI Scientist itself.

news

Feb 7, 2023 I posted my third blog - AI for General Science - Large language models for scientific hypothesis/research ideas generation
Jan 5, 2023 This summer, I will join in the Aviv Regev’s Group at Genentech as a Research Intern, work with Romain Lopez.
Sep 14, 2022 CLUE is accepted to NeurIPS 2022 as Oral presentation! See you in New Orleans!
Jun 30, 2022 I posted my second blog - Collection of some resources related to the PhD journey
Feb 16, 2022 I gave a talk about our CLUE method on the 9th Winter q-bio conference!

latest posts

selected publications

2025

  1. A scalable approach to investigating sequence-to-expression prediction from personal genomes
    Anna E Spiro*, Xinming Tu*, Yilun Sheng, Alexander Sasse, Rezwan Hosseini, Maria Chikina, and Sara Mostafavi
    bioRxiv, 2025
    Under review at Nature Methods
  2. Deep genomic models of allele-specific measurements
    Xinming Tu, Alexander Sasse, Kaitavjeet Chowdhary, Anna E Spiro, Liang Yang, Maria Chikina, Christophe Benoist, and Sara Mostafavi
    bioRxiv, 2025

2024

  1. MLCB
    A Supervised Contrastive Framework for Learning Disentangled Representations of Cell Perturbation Data
    Xinming Tu, Jan-Christian Hutter, Zitong Jerry Wang, Takamasa Kudo, Aviv Regev, and Romain Lopez
    Proceedings of the 18th Machine Learning in Computational Biology meeting, 2024

2023

  1. HDSR
    What Should Data Science Education Do with Large Language Models?
    Xinming Tu, James Zou, Weijie J. Su†, and Linjun Zhang†
    Harvard Data Science Review, 2023

2022

  1. Cross-Linked Unified Embedding for cross-modality representation learning
    Xinming Tu*, Zhi-Jie Cao*, Chen-Rui Xia, Sara Mostafavi†, and Ge Gao†
    NeurIPS(Oral), 2022

2019

  1. Expectation pooling: an effective and interpretable pooling method for predicting DNA–protein binding
    Xiao Luo*, Xinming Tu*, Yang Ding, Ge Gao†, and Minghua Deng†
    Bioinformatics, 2019