Publications

A complete list of my published papers and preprints.

All Publications

GPC: Deep generative model of genetic variation data improves imputation accuracy in private populations Prateek Anand, Anji Liu, Meihua Dang, Boyang Fu, Xinzhu Wei, Guy Van den Broeck, Sriram Sankararaman ICLR Gen2 2026 (Oral 10%) 2026
Probabilistic Generative Models Genomics Imputation
STRAND: Sequence-Conditioned Transport for Single-Cell Perturbations Boyang Fu, George Dasoulas, Sameer Gabbita, Xiang Lin, Shanghua Gao, Xiaorui Su, Soumya Ghosh, Marinka Zitnik arXiv 2026
Foundation Model Genomics Single-cell Perturbation
A biobank-scale test of marginal epistasis reveals genome-wide signals of polygenic interaction effects Boyang Fu*, Ali Pazokitoroudi*, Zhuozheng Shi, Asha Kar, Albert Xue, Aakarsh Anand, Prateek Anand, Zhengtong Liu, Richard Border, Päivi Pajukanta, Noah Zaitlen & Sriram Sankararaman Nature Genetics 2025
Scalable ML Statistical Genetics Epistasis
Comprehensive gene heritability estimation reveals the genetic architecture of rare coding variants underlying complex traits Zhengtong Liu, Boyang Fu, Moonseong Jeong, Prateek Anand, Aakarsh Anand, Seon-Kyeong Jang, Aditya Gorla, Jiazheng Zhu, Päivi Pajukanta, Pier Francesco Palamara, Noah Zaitlen, Richard Border, Sriram Sankararaman bioRxiv 2025
Scalable ML Statistical Genetics Whole Genome Sequencing
Investigating the sources of variable impact of pathogenic variants in monogenic metabolic conditions Angela Wei, Richard Border, Boyang Fu, Sinéad Cullina, Nadav Brandes, Seon-Kyeong Jang, Sriram Sankararaman, Eimear E. Kenny, Miriam S. Udler, Vasilis Ntranos, Noah Zaitlen & Valerie A. Arboleda Nature Communications 2025
Statistical Genetics Protein Language Model Epistasis
A Scalable Adaptive Quadratic Kernel Method for Interpretable Epistasis Analysis in Complex Traits Boyang Fu*, Prateek Anand*, Aakarsh Anand*, Joel Mefford, Sriram Sankararaman Genome Research (preliminary version at RECOMB), 2024 2024
Scalable ML Statistical Genetics Epistasis
Fast Kernel-based Association Testing of non-linear genetic effects for Biobank-scale data Boyang Fu, Ali Pazokitoroudi, Mukund Sudarshan, Lakshminarayanan Subramanian, Sriram Sankararaman Nature Communications, 2023 2023
Scalable ML Statistical Genetics Epistasis
Leveraging family data to design Mendelian Randomization that is provably robust to population stratification Nathan LaPierre*, Boyang Fu*, Steven Turnbull, Eleazar Eskin, Sriram Sankararaman Genome Research (preliminary version at RECOMB), 2023 2023
Causal Inference Mendelian Randomization Statistical Genetics
Marginal Contribution Feature Importance-an Axiomatic Approach for Explaining Data Amnon Catav, Boyang Fu, Yazeed Zoabi, Ahuva Libi Weiss Meilik, Noam Shomron, Jason Ernst, Sriram Sankararaman, Ran Gilad-Bachrach International Conference on Machine Learning (ICML), 2021 2021
Explainable AI
PrivateBus: Privacy Identification and Protection in Large-Scale Bus WiFi Systems Zhihan Fang, Boyang Fu, Zhou Qin, Fan Zhang, Desheng Zhang Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2020 2020
Urban Computing Data Privacy
A Statistical Model for Quantifying the Needed Duration of Social Distancing for the COVID-19 Pandemic Nadav Rakocz, Boyang Fu, Eran Halperin, Sriram Sankararaman KDD 2020 - AI For COVID-19, 2020 2020
Hierarchical Bayesian Modeling Pandemic Modeling
MAC: Measuring the impacts of anomalies on travel time of multiple transportation systems Zhihan Fang, Yu Yang, Shuai Wang, Boyang Fu, Zixing Song, Fan Zhang, Desheng Zhang Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (UbiComp), 2019 2019
Urban Computing