I'm an Assistant Professor of Computer Science at Harvard SEAS where I lead the Data-Centric Machine Learning (DCML) group. I'm also an Associate Faculty at the Kempner Institute, and have affiliations with the Center for Research on Computation and Society and the Harvard Data Science Initiative. I am also a researcher at Microsoft Research New England.
My research seeks to make machine learning more broadly applicable (especially to data-poor applications) and trustworthy (e.g., robust and interpretable). I am particularly interested in the implications of these two directions for applications in the natural and medical sciences. My approach to the first of these goals draws on ideas from statistics, optimization, and applied mathematics, especially optimal transport, which I have used to develop methods to mitigate data scarcity by various types of geometric dataset manipulations: alignment, comparison, generation, and transformation. This talk provides a high-level overview of this part of my work. As for trustworthy machine learning, I have worked on methods for explaining predictions of black-box models, shown their lack of robustness, proposed methods to robustify them, and sought inspiration in the social sciences to make them human-centered. In the past, I worked on various aspects of learning with highly structured data such as text or graphs, ranging from learning representations of structured objects, to generating them, to interpreting models that operate on them.
Prospective lab members: If you are interested in joining my group at Harvard, please read this.
I obtained a PhD in computer science from MIT, where I worked at CSAIL on various topics in machine learning and natural language processing. I also hold BSc (Licenciatura) and MS degrees in mathematics from ITAM and the Courant Institute (NYU), respectively. During the latter, I worked on semidefinite programming for domain adaptation under the supervision of Mehryar Mohri. Between my Master's and PhD, I spent a year at IBM's T.J. Watson Research Center, working with Ken Church and others in the Speech Recognition Group.
DDEQs: Distributional Deep Equilibrium Models through Wasserstein Gradient Flows
Jonathan Geuter, Clément Bonet, Anna Korba, David Alvarez-Melis.
AISTATS'25: International Conference on Artificial Intelligence and Statistics. 2025.
Mixture of Parrots: Experts improve memorization more than reasoning
Samy Jelassi, Clara Mohri, David Brandfonbrener, Alex Gu, Nikhil Vyas, Nikhil Anand, David Alvarez-Melis, Yuanzhi Li, Sham M. Kakade, Eran Malach
ICLR'25: International Conference on Learning Representations. 2025.
A Label is Worth A Thousand Images in Dataset Distillation
Tian Qin, Zhiwei Deng, David Alvarez-Melis
NeurIPS'24: Neural Information Processing Systems. 2024.
Tag-LLM: Repurposing General-Purpose LLMs for Specialized Domains
Junhong Shen, Neil Tenenholtz, James Brian Hall, David Alvarez-Melis, Nicolo Fusi
ICML'24: International Conference on Machine Learning. 2024.
Generating Synthetic Datasets by Interpolating along Generalized Geodesics
Jiaojiao Fan, David Alvarez-Melis
UAI'23: Uncertainty in Artificial Intelligence. 2023.
InfoOT: Information Maximizing Optimal Transport
Ching-Yao Chuang, Stefanie Jegelka, David Alvarez-Melis
ICML'23: International Conference on Machine Learning. 2023.
Optimizing Functionals on the Space of Probabilities with Input Convex Neural Networks
David Alvarez-Melis, Yair Schiff, Youssef Mroueh
Transactions on Machine Learning Research (TMLR). 2022.
Earlier version at OTML: NeurIPS'21 Workshop on Optimal Transport in Machine Learning.
From Human Explanation to Model Interpretability: A Framework Based on Weight of Evidence
David Alvarez-Melis, Harmanpreet Kaur, Hal Daumé III, Hanna Wallach, Jennifer Wortman Vaughan
HCOMP '21: The 9th AAAI Conference on Human Computation and Crowdsourcing. 2021.
Dataset Dynamics via Gradient Flows in Probability Space
David Alvarez-Melis, Nicolò Fusi
ICML'21: International Conference on Machine Learning. 2021.
Geometric Dataset Distances via Optimal Transport
David Alvarez-Melis, Nicolò Fusi
NeurIPS'20: Neural Information Processing Systems. 2020.
Earlier version at AutoML @ ICML 2020.
Unsupervised Hierarchy Matching with Optimal Transport over Hyperbolic Spaces
David Alvarez-Melis, Youssef Mroueh, Tommi S. Jaakkola
AISTATS'20: Artificial Intelligence and Statistics. 2020.
Earlier version at OTML: NeurIPS'18 Workshop on Optimal Transport for Machine Learning. Spotlight.
Optimal Transport in Structured Domains: Algorithms and Applications
David Alvarez-Melis (advisor: Tommi S. Jaakkola)
PhD Thesis, MIT. 2019.
Learning Generative Models across Incomparable Spaces
Charlotte Bunne, David Alvarez-Melis, Andreas Krause, Stefanie Jegelka
ICML'19: International Conference on Machine Learning. 2019.
Earlier version at R2L: NeurIPS'18 Workshop on Relational Representation Learning. Best Paper Award.
Towards Optimal Transport with Global Invariances
David Alvarez-Melis, Stefanie Jegelka, Tommi S. Jaakkola
AISTATS'19: Artificial Intelligence and Statistics. 2019.
@InProceedings{pmlr-v89-alvarez-melis19a,
  title = {Towards Optimal Transport with Global Invariances},
  author = {Alvarez-Melis, David and Jegelka, Stefanie and Jaakkola, Tommi S.},
  booktitle = {Proceedings of Machine Learning Research},
  pages = {1870--1879},
  year = {2019},
  editor = {Chaudhuri, Kamalika and Sugiyama, Masashi},
  volume = {89},
  series = {Proceedings of Machine Learning Research},
  address = {},
  month = {16--18 Apr},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v89/alvarez-melis19a/alvarez-melis19a.pdf},
  url = {http://proceedings.mlr.press/v89/alvarez-melis19a.html},
  abstract = {Many problems in machine learning involve calculating correspondences between sets of objects, such as point clouds or images. Discrete optimal transport provides a natural and successful approach to such tasks whenever the two sets of objects can be represented in the same space, or at least distances between them can be directly evaluated. Unfortunately neither requirement is likely to hold when object representations are learned from data. Indeed, automatically derived representations such as word embeddings are typically fixed only up to some global transformations, for example, reflection or rotation. As a result, pairwise distances across two such instances are ill-defined without specifying their relative transformation. In this work, we propose a general framework for optimal transport in the presence of latent global transformations. We cast the problem as a joint optimization over transport couplings and transformations chosen from a flexible class of invariances, propose algorithms to solve it, and show promising results in various tasks, including a popular unsupervised word translation benchmark.}
}
Towards Robust Interpretability with Self-Explaining Neural Networks
David Alvarez-Melis, Tommi S. Jaakkola
NeurIPS'18: Neural Information Processing Systems. 2018.
Gromov-Wasserstein Alignment of Word Embedding Spaces
David Alvarez-Melis, Tommi S. Jaakkola
EMNLP'18: Empirical Methods in Natural Language Processing. 2018. Oral Presentation.
Structured Optimal Transport
David Alvarez-Melis, Tommi S. Jaakkola, Stefanie Jegelka
AISTATS'18: Artificial Intelligence and Statistics. 2018. Oral Presentation.
Earlier version at NIPS Workshop on Optimal Transport for Machine Learning, 2017, as Extended Oral.
A Causal Framework for Explaining the Predictions of Black-Box Sequence-to-Sequence Models
David Alvarez-Melis, Tommi S. Jaakkola
EMNLP'17: Empirical Methods in Natural Language Processing. 2017.
Tree-structured Decoding with Doubly-recurrent Neural Networks
David Alvarez-Melis, Tommi S. Jaakkola
ICLR'17: International Conference on Learning Representations. 2017.
Word Embeddings as Metric Recovery in Semantic Spaces
Tatsunori B. Hashimoto, David Alvarez-Melis, Tommi S. Jaakkola
TACL: Transactions of the Association for Computational Linguistics. 2016. (presented at ACL'16).
Current and past courses I have taught or TA'd:
"Feynman was a truly great teacher. He prided himself on being able to devise ways to explain even the most profound ideas to beginning students. Once, I said to him, 'Dick, explain to me, so that I can understand it, why spin one-half particles obey Fermi-Dirac statistics.' Sizing up his audience perfectly, Feynman said, 'I'll prepare a freshman lecture on it.' But he came back a few days later to say, 'I couldn't do it. I couldn't reduce it to the freshman level. That means we don't really understand it.'"
Full CV in PDF (or a shorter Résumé).
I am always looking for motivated students and postdocs to join my group. Unfortunately, I am not able to respond to all emails. So, depending on your situation, please follow one of the following routes:
If your email is not formatted as above, my filters won't catch it, so I will almost certainly not see it.
Outside of research, I enjoy running, brewing beer, and playing guitar. I also like quotes. Here are a few more:
"We cannot solve our problems with the same thinking we used when we created them." - A. Einstein
"The real danger is not that computers will begin to think like men, but that men will begin to think like computers" - Sydney J. Harris