Programme



Download recording of the morning sessions

Unfortunately, due to a technical problem, there is no recording of the afternoon sessions.

9:45 Opening
10:00 Session 1
  • Marco Baroni (University of Trento), My love affair with the Web, and why it ended (download slides)
  • Adam Kilgarriff (Lexical Computing Ltd), WebBootCaT usage 2010-13 (download slides)
  • Serge Sharoff (University of Leeds), Automatic classification of BootCat'ed corpora (download slides)
11:30 Coffee break
11:45 Session 2
  • Marco Brunello (University of Leeds), BiTextCaT: a complete pipeline to collect parallel corpora from the web (download slides)
  • Erika Dalan (University of Bologna), Genre-driven vs. topic-driven BootCaT corpora: building and evaluating a corpus of academic course descriptions (download slides)
  • Maja Miličević (University of Belgrade), Genre-based BootCaT corpora for morphologically rich languages (download slides)
13:15 Lunch break
14:45 Session 3
  • Egon W. Stemle and Verena Lyding (EURAC, Bolzano), The future of BootCaT: A Creative Commons License filter (download slides)
  • Tomaž Erjavec (Jožef Stefan Institute, Ljubljana), Polishing BootCat corpora: XML validation and tagset unification (download slides)
  • Nikola Ljubešić (University of Zagreb), Helping BootCaT to catch the Babel fish: Getting encoding, content and language right (download slides)
16:30 Coffee break
16:45 Round table
17:45 Closing

 
© Copyright 2013 - Alma Mater Studiorum - Università di Bologna - Campus di Forlì
Updated 6/05/2013