Controllable Text Generation And Ethical Implications

Prabhumoye, Shrimai

doi:10.1184/R1/24944502.v1

Controllable Text Generation And Ethical Implications

thesis

posted on 2024-01-12, 21:34 authored by Shrimai PrabhumoyeShrimai Prabhumoye

The 21st century is witnessing a major shift in the way people interact with technology and Natural Language Generation (NLG) is playing a central role. Users of smartphones and smart home devices now expect their gadgets to be aware of their situation, and to produce natural language outputs in interactions. This thesis identifies three aspects of human communication to make machines sound human-like - style, content and structure. This thesis provides deep learning solutions to controlling these variables in neural text generation. I first outline the various modules which could be manipulated to perform effective controllable text generation. I provide two novel solutions for style transfer – using back-translation technique, and tag and generate approach. I also introduce two new tasks for style transfer and provide datasets for further exploration – political slant transfer and politeness transfer. I establish the task of document grounded generation which leverages information from unstructured documents for the generation process. I introduce two new tasks for document grounded generation – Wikipedia Update generation and Document Grounded Dialogue Response generation. Furthermore, I build two new extensions to pre-trained encoder-decoder models to solve this task. I also design a new elegant solution for the sentence ordering task to learn effective document structures. For all three tasks of style transfer, document grounded generation and sentence order, I add importance to the human evaluation of the models. I introduce new human evaluation measures for understanding the notion of grounding and for understanding the quality of predictions in sentence ordering. At the end, I provide a discussion on the ethical considerations of the applications of controllable text generation. Specifically, I use deontological ethics to evaluate NLP systems and discuss how controllable text generation techniques can be used to make these systems ethical

History

Date

2021-05-09

Degree Type

Dissertation

Department

Language Technologies Institute

Degree Name

Doctor of Philosophy (PhD)

Advisor(s)

Alan W Black Ruslan Salakhutdinov

Usage metrics

Keywords

controllable text ethics controllable text generation deontological ethics Artificial Intelligence and Image Processing not elsewhere classified

Licence

CC BY 4.0

Controllable Text Generation And Ethical Implications

History

Date

Degree Type

Department

Degree Name

Advisor(s)

Usage metrics

Categories

Keywords

Licence

Exports