Talk by Melissa Terras, given as part of the colloquium
"Crossing borders: Three talks on Text Analysis and
Digital Humanities" organized by the LATTICE laboratory.
Melissa Terras: Linking Crowdsourced Transcription to Automated Handwriting Recognition: Lessons from Transcribe Bentham
For nearly seven years, the Transcribe Bentham project has been generating high-quality crowdsourced transcripts of the writings of the philosopher and jurist Jeremy Bentham (1748-1832), held at University College London and, latterly, the British Library. With nearly 6 million words now transcribed by volunteers, little did we know at the outset that this project would yield an ideal, quality-controlled dataset to serve as "ground truth" for the development of Handwritten Text Recognition (HTR) technology. This paper will look at the past, present and future of automated handwriting analysis for documents, showing how our research on the EU Framework 7 Transcriptorium project, and now the H2020 READ project, is working towards a service to improve the searching and analysis of digitised manuscript collections across Europe, reusing the data created by crowdsourced volunteer labour for machine learning purposes.
See the programme of the "Crossing borders" colloquium.
Biography:
Melissa Terras is Director of the UCL Centre for Digital Humanities, Professor of Digital Humanities in the UCL Department of Information Studies, and Vice Dean of Research for the UCL Faculty of Arts and Humanities.
Last updated: 11/07/2017