IDS-Logo
Startseite : : Organisationsstruktur : : Pragmatik : : Personal : : 
Thomas SchmidtThomas SchmidtThomas SchmidtThomas SchmidtThomas SchmidtThomas Schmidt

Pragmatik

Dr. Thomas Schmidt

Institut für Deutsche Sprache
R 5, 6-13, Büro 3.13
D-68161 Mannheim

E-Mail:
thomas.schmidt (at) ids-mannheim.de

Tel.: +49 621 / 1581 – 313
Fax: +49 621 / 1581 – 200

Weitere Websites:

Dr. Thomas Schmidt

Functions:

CV:

  • 1992 - 1998 Studies in General Linguistics / English Linguistics / Romance philology and Mathematics / Computer Science at the Universities of Kaiserslautern and Mainz
  • 1996 - 1997 Studies in Linguistics and Artificial INTELLIGENCE at the University of Edinburgh, Scotland
  • 1998 - 1999 Language Resource Engineer at Philips Speech Processing, Aachen
  • 1999 - 2000 Studies "European Master of Linguistics" at the Free University Berlin and the Université Paris VIII
  • 2000 - 2011 Project Assistant / Principal investigator in the project "Computer-assisted creation and analysis of multilingual data", SFB 538 Multilingualism, University of Hamburg
  • 2004 PhD in German linguistics (text technology) from the University of Dortmund on Thesis title: Computergestützte Transkription – Modellierung und Visualisierung gesprochener Sprache mit texttechnologischen Mitteln
  • 2005 - 2006 DAAD Post-Doc-Researcher at the International Computer Science Institute, Berkeley
  • 2007 - 2008 Project assistant in the Digital Dictionary of the German Language, Berlin-Brandenburg Academy of Science
  • since 2012 Researcher at the Institute for the German Language

Developer of the EXMARaLDA system and FOLKER, author of Kicktionary, founding Managing Director of the HSZK (Hamburg Centre for Speech Corpora), Member of the Academic Network of Internet Lexicography

Areas of Research:

Oral corpora, corpus linguistics, text technology, computational lexicography

Recent Publications:

  • Batinić, Dolores / Schmidt, Thomas (2018): Reconstruction of separable particle verbs in a corpus of spoken German. In: Rehm, Georg / Declerck, Thierry (eds.): Language technologies for the challenges of the digital age. 27th International Conference, GSCL 2017 Berlin, Germany, September 13–14, 2017. Proceedings. Cham, Switzerland: Springer, 2018. pp. 3-10. PDF
  • Cassidy, Steve / Schmidt, Thomas (2017): Tools for Multimodal Annotation. In: Ide, Nancy / Pustejovsky, James (eds.): Handbook of Linguistic Annotation. Springer, Dordrecht, pp. 209-227.
  • Schmidt, Thomas / Hedeland, Hanna / Jettka, Daniel (2017): Conversion and Annotation Web Services for Spoken Language Data in CLARIN. In: Selected papers from the CLARIN Annual Conference 2016, Aix-en-Provence, 26–28 October 2016, CLARIN Common Language Resources and Technology Infrastructure, by Borin, Lars (ed.), Linköping Electronic Conference Proceedings, pp. 113-130. PDF

  • Schmidt, Thomas (2016): Construction and Dissemination of a Corpus of Spoken Interaction - Tools and Workflows in the FOLK project. In: Corpus Linguistic Software Tools, Journal for Language Technology and Computational Linguistics (JLCL 31/1), by Kupietz, Marc & Geyken, Alexander (eds.), pp. 127-154. PDF

  • Schmidt, Thomas (2016): Good practices in the compilation of FOLK, the Research and Teaching Corpus of Spoken German. In: Compilation, transcription, markup and annotation of spoken corpora, by Kirk, John M. and Gisle Andersen (eds.), Special Issue of the International Journal of Corpus Linguistics [IJCL 21:3], pp. 396-418.
  • Westpfahl, Swantje / Schmidt, Thomas (2016): FOLK-Gold – A GOLD standard for Part-of-Speech-Tagging of Spoken German. In: Proceedings of the Tenth Conference on International Language Resources and Evaluation (LREC’16), Portorož, Slovenia. Paris: European Language Resources Association (ELRA), pp. 1493-1499. PDF
  • Fandrych, Christian / Frick, Elena / Hedeland, Hanna / Iliash, Anna / Jettka, Daniel / Meißner, Cordula / Schmidt, Thomas / Wallner, Franziska / Weigert, Kathrin / Westpfahl, Swantje (2016): User, who art thou? User Profiling for Oral Corpus Platforms. In: Proceedings of the Tenth Conference on International Language Resources and Evaluation (LREC’16), Portorož, Slovenia. Paris: European Language Resources Association (ELRA), pp. 280-287. PDF
  • Reimer, Eva / Trevisan, Bianka / Eraßme, Denise / Schmidt, Thomas / Jakobs, Eva-Maria (2015): Annotating Modality Interdependencies. In: Proceedings of the Int. Conference of the German Society for Computational Linguistics and Language Technology, University of Duisburg-Essen, Germany, Sep 30–Oct 2 2015, pp. 110-11. PDF
  • Herzog, Gottfried / Heid, Ulrich / Trippel, Thorsten / Bański, Piotr / Romary, Laurent / Schmidt, Thomas / Witt, Andreas / Eckart, Kerstin (2015): Recent Initiatives towards New Standards for Language Resources. In: Proceedings of the Int. Conference of the German Society for Computational Linguistics and Language Technology, University of Duisburg-Essen, Germany, Sep 30–Oct 2 2015, pp. 154–156. PDF
  • Ruhi, Şükriye / Haugh, Michael / Schmidt, Thomas / Wörner, Kai (eds.) (2014): Best Practices for Spoken Corpora in Linguistic Research. Newcastle: Cambridge Scholars Publishing.
  • Thomas Schmidt (2014): Gesprächskorpora und Gesprächsdatenbanken am Beispiel von FOLK und DGD. In: Gesprächsforschung - Online-Zeitschrift zur verbalen Interaktion 15, pp. 196-233. PDF
  • Schmidt, Thomas (2014): The Database for Spoken German - DGD2. In: Proceedings of the Ninth International conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA). PDF
  • Schmidt, Thomas (2014): The Research and Teaching Corpus of Spoken German - FOLK. In: Proceedings of the Ninth International conference on Language Resources and Evaluation (LREC'14), Reykjavik, Iceland: European Language Resources Association (ELRA). PDF
  • Deppermann, Arnulf / Schmidt, Thomas (2014): Gesprächsdatenbanken als methodisches Instrument der Interaktionalen Linguistik - Eine exemplarische Untersuchung auf Basis des Korpus FOLK in der Datenbank für Gesprochenes Deutsch (DGD2). In: Domke, Christine & Gansel, Christa (eds.): Korpora in der Linguistik - Perspektiven und Positionen zu Daten und Datenerhebung [= Mitteilungen des Deutschen Germanistenverbandes 1/2014], pp. 4-17. PDF
  • Stift, Ulf-Michael / Schmidt, Thomas (2014): Mündliche Korpora am IDS: Vom Deutschen Spracharchiv zur Datenbank für Gesprochenes Deutsch. In: Institut für Deutsche Sprache (ed.): Ansichten und Einsichten. 50 Jahre Institut für Deutsche Sprache. Redaktion: Melanie Steine, Franz Josef Berens. pp. 360-375 - Mannheim: Institut für Deutsche Sprache, 2014.
  • Westpfahl, Swantje / Schmidt, Thomas (2013): POS für(s) FOLK – Part of Speech Tagging des Forschungs- und Lehrkorpus Gesprochenes Deutsch. In: Journal for Language Technology and Computational Linguistics, iss. 1, pp. 139-156. PDF
  • Schmidt, Thomas / Dickgießer, Sylvia / Gasch, Joachim (2013): Die Datenbank für Gesprochenes Deutsch - DGD2. Mannheim: Institut für Deutsche Sprache. PDF

Publications: