Unfortunately, due to a technical problem, there is no recording of the afternoon sessions.
9:45 | Opening
10:00 | Session 1
- Marco Baroni (University of Trento), My love affair with the Web, and why it ended (download slides)
- Adam Kilgarriff (Lexical Computing Ltd), WebBootCaT usage 2010-13 (download slides)
- Serge Sharoff (University of Leeds), Automatic classification of BootCat'ed corpora (download slides)
11:30 | Coffee break
11:45 | Session 2
- Marco Brunello (University of Leeds), BiTextCaT: a complete pipeline to collect parallel corpora from the web (download slides)
- Erika Dalan (University of Bologna), Genre-driven vs. topic-driven BootCaT corpora: building and evaluating a corpus of academic course descriptions (download slides)
- Maja Miličević (University of Belgrade), Genre-based BootCaT corpora for morphologically rich languages (download slides)
13:15 | Lunch break
14:45 | Session 3
- Egon W. Stemle and Verena Lyding (EURAC, Bolzano), The future of BootCaT: A Creative Commons License filter (download slides)
- Tomaž Erjavec (Jožef Stefan Institute, Ljubljana), Polishing BootCat corpora: XML validation and tagset unification (download slides)
- Nikola Ljubešić (University of Zagreb), Helping BootCaT to catch the Babel fish: Getting encoding, content and language right (download slides)
16:30 | Coffee break
16:45 | Round table
17:45 | Closing