Computational biologist and bioinformatician at UCSF with 4+ years of hands-on work in single-cell,
single-nuclei, and spatial transcriptomics. I build production-grade analysis pipelines and research software
for liver disease and tumor microenvironment discovery. I'm also developing a foundation model for the liver.
Production tools and modeling systems for transcriptomics and computational pathology.
Foundation Model
LiverTransformer
Liver-specific transformer (23.6M parameters) pre-trained using masked gene prediction on 1.04M human liver cells (36K genes, 7 diseases, 125 cell types), with strong transfer to spatial transcriptomics tasks including hepatocyte zonation.
Framework: PyTorch
Scale: 1.04M cells from CellxGene Census
Status: Active research platform (public repo pending)
Repository not published yet. Release planned after internal validation.
Spatial Genomics
spatialzones
Python package for inside/interface/outside tumor region assignment in spatial transcriptomics using nearest-neighbor context, with built-in visual diagnostics and downstream expression profiling.
Toolkit for bridging spatial transcriptomics outputs with CellxGene-style exploratory workflows, including utilities for region-aware expression analysis and dataset handoff for interactive atlas inspection.
Open-source Julia package for transcriptomics analysis from preprocessing through clustering, marker discovery, and UMAP, built as a high-performance alternative to typical R/Python workflows.
End-to-end Python framework that combines single-cell tumor profiles with LLM reasoning to classify hepatoblastoma subtypes and generate biologically interpretable hypotheses with supporting literature context.
Hepatoblastoma Tumor Atlas: Building single-cell and single-nuclei atlases to map transcriptional heterogeneity and subtype-specific programs in pediatric liver cancer.
Autoimmune Hepatitis: Integrating bulk, single-nuclei, and spatial transcriptomics from biopsies to identify candidate autoantigens and actionable immune pathways.