Foundation Models for Conversation

Reducing hallucination in conversation systems and developing models that can perform a variety of specialized tasks

Overview

The project explores strategies to anchor dialogue response generation on dependable enterprise content, utilizing structures like decision trees, flow charts, and structured data for grounding responses. It also investigates using large language models (LLMs) to harness approved unstructured documents for generating responses. A core aspect of this exploration is to ensure that generated responses remain faithful to the content of documents fed into the model.

By building foundation models for digital interaction data, the project team aims to overcome research challenges around data modeling, identifying the right pre-training objectives, and ensuring cost-effectiveness and efficiency.

The ambitious goal is to support a myriad of downstream tasks, often new unseen tasks, with the same foundational model requiring minimal tuning. Examples of such tasks include predicting a user's following action, analyzing a user group's activity footprint to match a product with its target user group, or initiating a personalized dialog flow with a customer.

Publications

Variational Learning for Unsupervised Knowledge Grounded Dialogs
- - Mayank Mishra
  - Dhiraj Madan
  - et al.
- 2022
- IJCAI 2022
End-to-End Learning of Flowchart Grounded Task-Oriented Dialogs
- - Dinesh Raghu
  - Shantanu Agarwal
  - et al.
- 2021
- EMNLP 2021
MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents
- - Song Feng
  - Siva Sankalp Patel
  - et al.
- 2021
- EMNLP 2021
Simulated Chats for Building Dialog Systems: Learning to Generate Conversations from Instructions
- - Bishwesh Mohapatra
  - Gaurav Pandey
  - et al.
- 2021
- EMNLP 2021
Unsupervised Learning of KB Queries in Task-Oriented Dialogs
- - Dinesh Raghu
  - Nikhil Gupta
  - et al.
- 2021
- ACL-IJCNLP 2021
Constraint based Knowledge Base Distillation in End-to-End Task Oriented Dialogs
- - Dinesh Raghu
  - Atishya Jain
  - et al.
- 2021
- ACL-IJCNLP 2021
doc2dial: A goal-oriented document-grounded dialogue dataset
- - Song Feng
  - Hui Wan
  - et al.
- 2020
- EMNLP 2020
Unsupervised Learning of Interpretable Dialog Models
- - Dhiraj Madan
  - Dinesh Raghu
  - et al.
- 2020
- ECAI 2020