Software Workshop Talk Biography
25 June 2019 at 14:30 - 15:15 | PS-Aquarium

Using Wikipedia Edits in Low Resource Grammatical Error Correction

ORGANIZERS
Thumb ticker sm jean claude foto kleiner 0721 final
Software Workshop
Team Leader

We develop a grammatical error correction system for German using a small gold corpus augmented with edits extracted from Wikipedia revision history. We extend the automatic error annotation tool ERRANT (Bryant et al., 2017) for German and use it to analyze both gold corrections and Wikipedia edits (Grundkiewicz and Junczys-Dowmunt, 2014) in order to select as additional training data Wikipedia edits containing grammatical corrections similar to those in the gold corpus. Using a neural machine translation approach (Chollampatt and Ng, 2018), we evaluate the contribution of Wikipedia edits and find that carefully selected Wikipedia edits increase performance by over 5%.

Speaker Biography

Adriane Boyd (University of Tübingen)