Student Training for Distractible Teachers
Discussed on the blog: The Literal Listener
Abstract
My project seeks to improve open-domain question generation. I introduce a student-teacher game played in the presence of distractors (i.e. a distractible teacher), in which a pragmatic speaker (the student) has to reason about a literal listener (the teacher). To score this game, the metrics of topicality and teacher patience are introduced, which are calculated from the teacher’s perspective. These metrics are used as a reward to improve pretrained QG models, and to evaluate different experimental approaches. I test the hypothesis that a topic embedding of the teacher’s knowledge better enables the student to generate questions that recover hidden knowledge in the game. I also test the hypothesis that reinforcement learning can be used to further increase the student’s ability to interrogate the teacher through additional training on a large, question-free corpus. Results are positive but limited; with hypotheses proving true by the numbers but falling short in qualitative assessment.
@misc{hollows2022studentt,
author = {Hollows, Peter},
title = {{Student Training for Distractible Teachers}},
year = {2022},
month = mar,
note = {Stanford XCS224U},
url = {https://dojo7.com/papers/xcs224u/}
}