Regina Barzilay | Institute for Medical Engineering & Science

Regina Barzilay

Affiliate Faculty

Delta Electronics Professor, EECS

Faculty Co-Lead, J-Clinic

MacArthur Fellow

Affiliate Faculty at the MIT Institute of Medical Engineering & Science (IMES)

Regina Barzilay

Affiliate Faculty

Delta Electronics Professor, EECS

Faculty Co-Lead, J-Clinic

MacArthur Fellow

Affiliate Faculty at the MIT Institute of Medical Engineering & Science (IMES)

Degrees

PhD in Computer Science, Columbia University

Bio

Regina Barzilay is a Delta Electronics professor in the Department of Electrical Engineering and Computer Science and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. Her research interests are in natural language processing, applications of deep learning to chemistry and oncology. She is a recipient of various awards, including the NSF Career Award, the MIT Technology Review TR-35 Award, Microsoft Faculty Fellowship and several Best Paper Awards at NAACL and ACL. In 2017, she received a MacArthur fellowship, an ACL fellowship and an AAAI fellowship. She received her PhD in Computer Science from Columbia University, and spent a year as a postdoc at Cornell University. She was elected to the National Academy of Medicine in 2023.

More about Professor Barzilay can be found here.

Research Interests

Natural Language Processing

I develop machine learning models that aim to understand and generate natural languages. We are currently witnessing the first generation of NLP tools that have been deployed at scale and are used by millions of people. However, the major component of this success is access to large amounts of training data which machines use to learn mappings between input and output. In many applications and languages, such annotations are not readily available, and are expensive and slow to collect. I am interested in designing algorithms that do not suffer from this annotation dependence. Specifically, we are developing deep learning models that can transfer annotations across domains and languages, that can learn from a few annotated examples by utilizing supplementary data sources, and that can take advantage of human-provided rationales to constrain model structure.

Oncology

Learning to Cure Data collected about millions of cancer patients — their pathology slides, imaging, and other tests — contain answers to many open questions in oncology. Jointly with the MGH collaborators, we are developing algorithms that can learn from this data to improve models of disease progression, prevent over-treatment, and narrow down to the cure. On the NLP side, we are creating databases which record pertinent cancer features extracted from raw documents. On the computer vision side, we are working on deep learning models that compute personalized assessment from mammogram data focusing on early cancer detection.

Chemistry

ML Drug Discovery Today, drug discovery involves practitioners with years of advanced training and is carried out in a trial-and-error, labor-intensive fashion. Our goal is to change a traditional pipeline. In a joint work with chemical engineers and chemists at MIT, we are working on deep learning methods for modeling chemical processes.

Selected Publications

Bao, Yujia, Shiyu Chang, Mo Yu, and Regina Barzilay, “Deriving Machine Attention from

Human Rationales,” Proceedings of Empirical Methods in Natural Language Processing

(EMNLP), 2018.
Chen, Benson, Regina Barzilay, and Tommi Jaakkola, “Path-Augmented Graph

Transformer Network,” ICML Workshop on Learning and Reasoning with Graph-

Structured Representations, 2019.
Yan, Susu, Enrico Santus, Adam Yala, Leonard Wee, Jason Kim, Santiago A. Lozano-

Calderon, Joseph H. Schwab, Regina Barzilay, Kevin S. Hughes, and Karen De Amorim

Bernstein, “Using Deep Learning Algorithm to Extract Tumour Information from

Pathology and Surgical Notes of Chondrosarcoma Patients,” Proceedings of the

International Conference on the Use of Computers in Radiation Therapy and the

International Conference on Monte Carlo Techniques for Medical Applications (ICCRMCMA),

2019.
Lehman, Eric, Jay DeYoung, Regina Barzilay and Byron C. Wallace, “Inferring Which

Medical Treatments Work from Reports of Clinical Trials,” Proceedings of the North

American Chapter of the Association for Computational Linguistics (NAACL), 2019.
Qian, Yujie, Enrico Santus, Zhijing Jin, Jiang Guo, and Regina Barzilay, “GraphIE: A

Graph-Based Framework for Information Extraction,” Proceedings of the North American

Chapter of the Association for Computational Linguistics (NAACL), 2019.
Schuster, Tal, Ori Ram, Regina Barzilay, and Amir Globerson, “Cross-Lingual Alignment

of Contextual Word Embeddings, with Applications to Zero-Shot Dependency Parsing,”

Proceedings of the North American Chapter of the Association for Computational

Linguistics (NAACL), 2019.
Luo, Jiaming, Yuan Cao, and Regina Barzilay, “Neural Decipherment via Minimum-Cost

Flow: from Ugaritic to Linear B,” Proceedings of the 57th Annual Meeting of the

Association for Computational Linguistics (ACL), 2019.
Jin, Wengong, Kevin Yang, Regina Barzilay, and Tommi Jaakkola, “Learning Multimodal

Graph-to-Graph Translation for Molecule Optimization,” Proceedings of International

Conference on Learning Representations (ICLR), 2019.
Ingraham, John, Vikas K. Garg, Regina Barzilay, and Tommi Jaakkola, “Generative

Models for Graph-Based Protein Design,” ICLR Workshop on Deep Generative Models for

Highly Structured Data, 2019.
Schuster, Tal, Darsh J. Shah, Yun Jie Serene Yeo, Daniel Filizzola, Enrico Santus, and

Regina Barzilay, “Towards Debiasing Fact Verification Models,” Proceedings of Empirical

Methods in Natural Language Processing (EMNLP), 2019.
Fisch, Adam, Jiang Guo, and Regina Barzilay, “Working Hard or Hardly Working:

Challenges of Integrating Typology into Neural Dependency Parsers, “Proceedings of

Empirical Methods in Natural Language Processing (EMNLP), 2019.
Shah, Darsh J., Tal Schuster, and Regina Barzilay, “Automatic Fact Guided Sentence

Modification,” Thirty-Fourth Conference of the Association for the Advancement of

Artificial Intelligence (AAAI), 2020.
Tuan, Luu Anh, Darsh J. Shah, and Regina Barzilay, “Capturing Greater Context for

Question Generation,” Thirty-Fourth Conference of the Association for the Advancement of

Artificial Intelligence (AAAI), 2020.

People: Regina Barzilay