Regina Barzilay
Affiliate Faculty
Title
Delta Electronics Professor, EECS
Faculty Co-Lead, J-Clinic
MacArthur Fellow
Affiliate Faculty at the MIT Institute of Medical Engineering & Science (IMES)
Photo: Regina Barzilay
Email
reginab [at] mit.edu
Phone
(617) 258-5706
Address

77 Massachusetts Ave.

Cambridge, MA 02139

Administrative Assistant(s)
Ciarra Brodie
617-324-6742
cbrodie [at] csail.mit.edu
Regina Barzilay
Affiliate Faculty
Title
Delta Electronics Professor, EECS
Faculty Co-Lead, J-Clinic
MacArthur Fellow
Affiliate Faculty at the MIT Institute of Medical Engineering & Science (IMES)

Degrees

  • PhD in Computer Science, Columbia University

Bio

Regina Barzilay is a Delta Electronics professor in the Department of Electrical Engineering and Computer Science and a member of the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT. Her research interests are in natural language processing, applications of deep learning to chemistry and oncology. She is a recipient of various awards, including the NSF Career Award, the MIT Technology Review TR-35 Award, Microsoft Faculty Fellowship and several Best Paper Awards at NAACL and ACL. In 2017, she received a MacArthur fellowship, an ACL fellowship and an AAAI fellowship. She received her PhD in Computer Science from Columbia University, and spent a year as a postdoc at Cornell University. She was elected to the National Academy of Medicine in 2023.

More about Professor Barzilay can be found here.

Research Interests

Natural Language Processing

I develop machine learning models that aim to understand and generate natural languages. We are currently witnessing the first generation of NLP tools that have been deployed at scale and are used by millions of people. However, the major component of this success is access to large amounts of training data which machines use to learn mappings between input and output. In many applications and languages, such annotations are not readily available, and are expensive and slow to collect. I am interested in designing algorithms that do not suffer from this annotation dependence. Specifically, we are developing deep learning models that can transfer annotations across domains and languages, that can learn from a few annotated examples by utilizing supplementary data sources, and that can take advantage of human-provided rationales to constrain model structure.

Oncology

Learning to Cure Data collected about millions of cancer patients — their pathology slides, imaging, and other tests — contain answers to many open questions in oncology. Jointly with the MGH collaborators, we are developing algorithms that can learn from this data to improve models of disease progression, prevent over-treatment, and narrow down to the cure. On the NLP side, we are creating databases which record pertinent cancer features extracted from raw documents. On the computer vision side, we are working on deep learning models that compute personalized assessment from mammogram data focusing on early cancer detection.

Chemistry

ML Drug Discovery Today, drug discovery involves practitioners with years of advanced training and is carried out in a trial-and-error, labor-intensive fashion. Our goal is to change a traditional pipeline. In a joint work with chemical engineers and chemists at MIT, we are working on deep learning methods for modeling chemical processes.

Selected Publications

  • Bao, Yujia, Shiyu Chang, Mo Yu, and Regina Barzilay, “Deriving Machine Attention from

    Human Rationales,” Proceedings of Empirical Methods in Natural Language Processing

    (EMNLP), 2018.
  • Chen, Benson, Regina Barzilay, and Tommi Jaakkola, “Path-Augmented Graph

    Transformer Network,” ICML Workshop on Learning and Reasoning with Graph-

    Structured Representations, 2019.
  • Yan, Susu, Enrico Santus, Adam Yala, Leonard Wee, Jason Kim, Santiago A. Lozano-

    Calderon, Joseph H. Schwab, Regina Barzilay, Kevin S. Hughes, and Karen De Amorim

    Bernstein, “Using Deep Learning Algorithm to Extract Tumour Information from

    Pathology and Surgical Notes of Chondrosarcoma Patients,” Proceedings of the

    International Conference on the Use of Computers in Radiation Therapy and the

    International Conference on Monte Carlo Techniques for Medical Applications (ICCRMCMA),

    2019.
  • Lehman, Eric, Jay DeYoung, Regina Barzilay and Byron C. Wallace, “Inferring Which

    Medical Treatments Work from Reports of Clinical Trials,” Proceedings of the North

    American Chapter of the Association for Computational Linguistics (NAACL), 2019.
  • Qian, Yujie, Enrico Santus, Zhijing Jin, Jiang Guo, and Regina Barzilay, “GraphIE: A

    Graph-Based Framework for Information Extraction,” Proceedings of the North American

    Chapter of the Association for Computational Linguistics (NAACL), 2019.
  • Schuster, Tal, Ori Ram, Regina Barzilay, and Amir Globerson, “Cross-Lingual Alignment

    of Contextual Word Embeddings, with Applications to Zero-Shot Dependency Parsing,”

    Proceedings of the North American Chapter of the Association for Computational

    Linguistics (NAACL), 2019.
  • Luo, Jiaming, Yuan Cao, and Regina Barzilay, “Neural Decipherment via Minimum-Cost

    Flow: from Ugaritic to Linear B,” Proceedings of the 57th Annual Meeting of the

    Association for Computational Linguistics (ACL), 2019.
  • Jin, Wengong, Kevin Yang, Regina Barzilay, and Tommi Jaakkola, “Learning Multimodal

    Graph-to-Graph Translation for Molecule Optimization,” Proceedings of International

    Conference on Learning Representations (ICLR), 2019.
  • Ingraham, John, Vikas K. Garg, Regina Barzilay, and Tommi Jaakkola, “Generative

    Models for Graph-Based Protein Design,” ICLR Workshop on Deep Generative Models for

    Highly Structured Data, 2019.
  • Schuster, Tal, Darsh J. Shah, Yun Jie Serene Yeo, Daniel Filizzola, Enrico Santus, and

    Regina Barzilay, “Towards Debiasing Fact Verification Models,” Proceedings of Empirical

    Methods in Natural Language Processing (EMNLP), 2019.
  • Fisch, Adam, Jiang Guo, and Regina Barzilay, “Working Hard or Hardly Working:

    Challenges of Integrating Typology into Neural Dependency Parsers, “Proceedings of

    Empirical Methods in Natural Language Processing (EMNLP), 2019.
  • Shah, Darsh J., Tal Schuster, and Regina Barzilay, “Automatic Fact Guided Sentence

    Modification,” Thirty-Fourth Conference of the Association for the Advancement of

    Artificial Intelligence (AAAI), 2020.
  • Tuan, Luu Anh, Darsh J. Shah, and Regina Barzilay, “Capturing Greater Context for

    Question Generation,” Thirty-Fourth Conference of the Association for the Advancement of

    Artificial Intelligence (AAAI), 2020.