Publications
* denotes equal contribution first author † denotes equal contribution corresponding author
Full list also available on Google Scholar.
Preprints
Scaling Medical Reasoning Verification via Tool-Integrated Reinforcement Learning
arXiv preprint arXiv:2601.20221, 2026.
MEDVISTAGYM: A Scalable Training Environment for Thinking with Medical Images via Tool-Integrated Reinforcement Learning
arXiv preprint arXiv:2601.07107, 2026.
RAG in the Wild: On the (In)effectiveness of LLMs with Mixture-of-Knowledge Retrieval Augmentation
arXiv preprint arXiv:2507.20059, 2025.
Multi-Agent LLMs Ensemble for Efficient Atrial Fibrillation Annotation of ECG Reports
arXiv preprint arXiv:2410.16543, 2025.
MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning
arXiv preprint arXiv:2503.07459, 2025.
MENDR: Manifold Explainable Neural Data Representations
arXiv preprint arXiv:2508.04956, 2025.
2026
Scaling Agentic Reinforcement Learning for Tool-Integrated Reasoning in VLMs
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2026.
MedAgentGym: A Scalable Agentic Training Environment for Code-Centric Reasoning in Biomedical Data Science
Proceedings of the Fourteenth International Conference on Learning Representations (ICLR), 2026. Oral, top 1.18%
Multi-level Asymmetric Contrastive Learning for Volumetric Medical Image Segmentation Pre-training
IEEE Journal of Biomedical and Health Informatics (JBHI), 2026.
Human Experiences and Reflections (HEARs) Data Connector Protocol: Development and Validation of an AI-Based Tool to Improve Discovery and Reuse of Archived Qualitative Data
JMIR Research Protocols (JRP), 2026.
Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI
IEEE Transactions on Education (ToE), 2026.
PKAttn: Pharmacokinetics-Inspired Fusion of DCE-MRI and Clinical Data for Accurate Breast Cancer Recurrence Prediction
IEEE International Symposium on Biomedical Imaging (ISBI), 2026.
WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
Findings in Proceedings of the European Chapter of the Association for Computational Linguistics (EACL), 2026.
Causal Machine Learning for Surgical Interventions
Big Data Mining and Analytics, 2026.
CellForge: Agentic Design of Virtual Cell Models
AAAI 2026 Workshop on AI for Scientific Research, 2026. Oral, Best Paper Award
2025
AceSearcher: Bootstrapping Reasoning and Search for LLMs via Self-Play Reinforcement Fine-Tuning
Proceedings of the Conference on Neural Information Processing Systems (NeurIPS), 2025. Spotlight, top 3.2%
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration
Proceedings of the Conference on Language Modeling (COLM), 2025.
MedAssist: LLM-Empowered Medical Assistant for Assisting the Scrutinization and Comprehension of Electronic Health Records
Proceedings of the ACM on Web Conference (WWW), 2025.
SBDH-Reader: a Large Language Model-Powered Method for Extracting Social and Behavioral Determinants of Health from Clinical Notes
Journal of the American Medical Informatics Association (JAMIA), 2025.
Novel extraction of discriminative fine-grained feature to improve retinal vessel segmentation
Image and Vision Computing, Volume 163, 2025.
Advancing Sleep Disorder Diagnostics: A Transformer-based EEG Model for Sleep Stage Classification and OSA Prediction
IEEE Journal of Biomedical and Health Informatics (JBHI), vol. 29, no. 2, 2025.
Fairness Artificial Intelligence in Clinical Decision Support: Mitigating Effect of Health Disparity
IEEE Journal of Biomedical and Health Informatics (JBHI), vol. 29, no. 2, 2025.
Predicting Pediatric Patient Rehabilitation Outcomes After Spinal Deformity Surgery with Artificial Intelligence
Nature Communications Medicine, vol. 5, no. 1, 2025.
Sex Dimorphism Influences Cortical Microglial Morphological and Phenotypic Marker Profile After Closed Head Mild Traumatic Brain Injury in Rats
Neurotrauma Reports, vol. 6, no. 1, 2025.
ResSwinUnet3D: Developing A New Residual-Based SwinUnet3D Model for Enhanced 3D Medical Image Segmentation
IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), 2025.
KindSleep: Knowledge-Informed Diagnosis of Obstructive Sleep Apnea from Oximetry
Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2025.
Multi-Agent LLM Reasoning for Clinical Procedure Sequencing from High-Granularity EHR Data
Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2025. Best Paper Honorable Mention, 3/170
From Association to Mechanism: Causal Graph-Based Fine-Mapping of Regulatory SNPs for Functional Validation
Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2025.
Developing Fairness-Aware Task Decomposition to Improve Equity in Post-Spinal Fusion Complication Prediction
Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2025.
MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale
GenAI4Health Workshop at NeurIPS, 2025.
2024
EHRAgent: Code Empowers Large Language Models for Few-Shot Complex Tabular Reasoning on Electronic Health Records
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
MedAdapter: Efficient Test-Time Adaptation of Large Language Models towards Medical Reasoning
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
MIMIR: A Customizable Agent Tuning Platform for Enhanced Scientific Applications
Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024.
RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
Knowledge-infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL), 2024.
Clinical Decision Making under Uncertainty: A Bootstrapped Counterfactual Inference Approach
BMC Medical Informatics and Decision Making, vol. 24, no. 1, 2024.
Developing a Novel Causal Inference Algorithm for Personalized Biomedical Causal Graph Learning Using Meta Machine Learning
BMC Medical Informatics and Decision Making, vol. 24, no. 1, 2024.
Optimized Clinical Feature Analysis for Improved Cardiovascular Disease Risk Screening
IEEE Open Journal of Engineering in Medicine and Biology (OJEMB), vol 5, 2024.
Heterogeneous Treatment Effects of Spinal Fusion Surgery for Adolescent Idiopathic Scoliosis Patients
Proceedings of the 15th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2024. Best Student Paper Award, 2/201
2023
Improving Explainable AI with Patch Perturbation-based Evaluation Pipeline: a COVID-19 X-ray Image Analysis Case Study
Scientific Reports, vol 13, issue 1, 2023.
Integrating Multi-omics Data with EHR for Precision Medicine Using Advanced Artificial Intelligence
IEEE Reviews in Biomedical Engineering (RBME), vol 17, 2023.
Retrieval-augmented Large Language Models for Adolescent Idiopathic Scoliosis Patients in Shared Decision-Making
Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2023. Best SIGBio Paper Award, 1/191
Choice Over Effort: Mapping and Diagnosing Augmented Whole Slide Image Datasets with Training Dynamics
Proceedings of the 14th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2023.
Explainable Synthetic Image Generation to Improve Risk Assessment of Rare Pediatric Heart Transplant Rejection
Journal of Biomedical Informatics (JBI), vol 139, 2023.
Early and fair COVID-19 outcome risk assessment using robust feature selection
Scientific Reports, vol 13, issue 1, 2023.
2022
Explainable Artificial Intelligence Methods in Combating Pandemics: A Systematic Review
IEEE Reviews in Biomedical Engineering (RBME), vol 16, 2022. Annual Featured Article
A FHIR-compliant Application for Multi-site and Multi-modality Pediatric Scoliosis Patient Rehabilitation
IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2022.
Learning from Heterogeneous Data via Contrastive Learning: An Application in Multi-source Covid-19 Radiography
IEEE EMBS International Conference on Biomedical and Health Informatics (BHI), 2022.
Development of a Generalizable Multi-site and Multi-modality Clinical Data Cloud Infrastructure for Pediatric Patient Care
Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2022.
2021 & Earlier
COVID-19 Automatic Diagnosis with Radiographic Imaging: Explainable Attention Transfer Deep Neural Networks
IEEE Journal of Biomedical and Health Informatics (JBHI), vol 25, issue 7, 2021.
EXAM: an Explainable Attention-based Model for Covid-19 Automatic Diagnosis
Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics (BCB), 2020.
Integrating Sparse Reconstruction Saliency and Target-Aware Active Contour Model for Airport Extraction
25th IEEE International Conference on Image Processing (ICIP), 2018.