I bridge physics-based simulations with machine learning architectures to accelerate molecular design and discovery.
I’m open to collaboration across AI for Science, molecular modeling, and generative chemistry. Let’s connect and build the next generation of molecular design together.
🔗 LinkedIn · Google Scholar · arXiv · ORCID
💊 Senior Machine Learning Scientist @ Merck & Co.
Applying predictive deep learning models and generative AI to accelerate drug discovery by designing, predicting, and optimizing molecular therapeutics for real-world impact.
🎓 Ph.D. in Chemical Engineering @ Caltech (Wang & Brady Groups)
Developed molecular simulations of polyelectrolytes and multivalent ions to uncover mechanisms of mineralization, adsorption, and polymer–surface interactions using state-of-the-art enhanced sampling techniques.
- Binding Modes and Water-Mediation of Polyelectrolyte Adsorption to a Neutral Calcium Carbonate Surface — Langmuir, 2025
DOI - Multivalent Ion-Mediated Polyelectrolyte Association and Structure — Macromolecules, 2024
DOI, arXiv - Adsorption Isotherm and Mechanism of Calcium-Ion Binding to Polyelectrolyte — Langmuir, 2024
DOI, arXiv - Swimming in Potential Flow — J. Fluid Mech., 2022
DOI - Geometry and Dynamics of Lipid Membranes: The Scriven–Love Number — Phys. Rev. E, 2020
DOI, arXiv
- Molecular formats:
SMILES, molecular graphs, 3D conformers, fingerprints - Augmentation: SMILES permutation, conformer sampling, noise injection
- Pre-trained embeddings: ChemBERTa, CheMeleon
- Graph Neural Networks: GCNs, MPNNs, GATs
- Generative Models: VAEs, Normalizing Flows, GFlowNets, Diffusion Models, Transformers
- Convolutional Networks: ResNet, DenseNet, U-Net, ViT
- Recurrent Networks: GRU, BiLSTM
- Classical ML: Random Forests, Gradient Boosting, SVMs, kNN, Gaussian Processes
- Reinforcement Learning: Policy gradient, reward shaping, exploration and exploitation
- Transfer Learning: Pre-training, domain adaptation
- Unsupervised Learning: Clustering (k-means, hierarchical), dimensionality reduction (PCA, t-SNE, UMAP)
- Uncertainty Quantification: Ensembles, mean–variance estimation, evidential deep learning, MC dropout
GROMACSfor all-atom and coarse-grained MD simulationsPLUMEDfor enhanced sampling (metadynamics, umbrella sampling, OPES)
ORCAfor energy calculations and geometry optimizations
Schrödinger Suite,GLIDE, andAutoDock Vinafor ligand–protein docking and screening
Python · C++ · CUDA · SQL · Shell · LaTeX · FORTRAN
- HPC Scheduling:
Slurm,PBS - Provisioning:
Ansible,Spack - Containerization:
Dockerwith persistent volumes - Version Control:
Git,GitHub,GitHub Actions(CI/CD) - Editors:
VS Code,Vim,Jupyter


