Manual Post-editing of Automatically Transcribed Speeches from the Icelandic Parliament - Althingi
The design objectives for an automatic transcription system are to produce text readable by humans and to minimize the impact on manual post-editing. This study reports on a recognition system used for transcribing speeches in the Icelandic parliament - Althingi. It evaluates the system performance and its effect on manual post-editing. The results are compared against the original manual transcription process. 239 total speeches, consisting of 11 hours and 33 minutes, were processed, both manually and automatically, and the editing process was analysed. The dependence of word edit distance on edit time and the editing real-time factor has been estimated and compared to user evaluations of the transcription system. The main findings show that the word edit distance is positively correlated with edit time and a system achieving a 12.6 distance would match the performance of manual transcribers. Producing perfect transcriptions would result in a real-time factor of 2.56. The study also shows that 99 subjective evaluations. On the contrary, 21 received a bad grade.
READ FULL TEXT