
Dr. Roman Schneider

Institut für Deutsche Sprache
R 5, 6-13
D-68161 Mannheim

schneider (at)

Tel.: +49 621 / 1581 – 217
Fax: +49 621 / 1581 – 200

Senior Researcher for Computational Linguistics within the Grammar Department of the Institute for the German Language (IDS), lead for the programme sector Language Technology and Information Systems.

Education and work experience:

  • 1999-2000 and since 2002 Research Associate at the IDS Mannheim.
  • 2002 Dr. phil. in Media Sciences and Computational Linguistics. PhD thesis: User-adaptive Web Information Systems (Supervisors: Prof. Dr. Hans-Jürgen Bucher / Dr. Heinz J. Weber).
  • 2001 DAAD Visiting Researcher within the Text Retrieval Group at Oracle (Worldwide Headquarters), Redwood Shores, CA, USA. Design and implementation of the Oracle Text Thesaurus Administration Tool.
  • 2001 CTO of WebMemex (E-Book distribution startup).
  • 1997-1998 IT Consultant for database technology and text retrieval at Oracle, Bonn.
  • 1998 Multimedia Transfer Award.
  • Project work at the University of Trier, the University of Augsburg, the Institute for the German Language (IDS) in Mannheim, and with Triumph Adler Electronic Publishing in Nuremberg.
  • 1997 Master's degree (M.A.) in Computational Linguistics and Phonetics from the University of Trier. Master's thesis: Navigation Concepts and Database Integration for a Hypermedia Information System on German Verb Valency (Supervisors: Prof. Dr. Reinhard Köhler / Dr. Heinz J. Weber).

Research interests:

  • Digital Humanities, management and linkage of digital data repositories
  • Text technology, multidimensional markup technologies and standards
  • Corpus Linguistics and Text Mining
  • Database Management Systems and Information Retrieval for semi-structured content (Text/Language Databases: Architecture, data modelling and retrieval)
  • Web Information Systems, Hypertext, User-Adaptive Systems, Web Usability, and Human-Computer-Interaction
  • Semantic concepts for knowledge representation: Thesauri, Word Nets, Topic Maps, and Ontologies
  • Digital Lexicography

Recent publications:

Selected talks, conference and workshop activities:

  • 19.06.2017: Shifting Complexity from Text to Data Model. Adding Machine-Oriented Features to a Human-Oriented Terminology Resource (with Christian Lang, Horst Schwinn, and Karolina Suchowolec). Language Data and Knowledge (LDK 2017), Galway, Ireland.
  • 10.11.2016: Extracting Linguistic Terminology from Scientific Corpora (with Christian Lang and Karolina Suchowolec). Grammar and Corpora 2016, Mannheim (Poster Session).
  • 09.09.2016: Re-designing Online Terminology Resources for German Grammar (with Christian Lang and Karolina Suchowolec). 5th European Networked Knowledge Organization Systems Workshop, Hannover.
  • 25.07.2016: Improved Scientific Search through Controlled Vocabularies (with Christian Lang and Karolina Suchowolec). 5th Workshop on Controlled Natural Language (CNL 2016), University of Aberdeen (Poster Session).
  • 29.08.2015: Empirische Analysen zur Genitivvariation mit GenitivDB 2.0. XIII. Weltkongress der Internationalen Vereinigung für Germanistik (IVG), Shanghai.
  • 29.04.2015: Korpusbasierte Forschung zur deutschen Grammatik: Die Genitivmarkierung zwischen Standard und Variation (with Sandra Hansen-Morath). Ringvorlesung Korpuslinguistik, University of Bremen.
  • 03.03.2015: KoGra-R: Standardisierte statistische Auswertungen von Korpusrecherchen (with Hans-Christian Schmitz, Sandra Hansen-Morath and Sascha Wolfer). DH Summit Berlin (poster session).
  • 02.10.2014: Mehrfachadressierung in Online-Grammatiken: Designfragen und Implementierungsstrategien. Kontrastive Grammatikschreibung im europäischen Vergleich: Theorie, Methoden und Anwendungen, Santiago de Compostela.
  • 28.05.2014: GenitivDB - a Corpus-Generated Database for German Genitive Classification. Language Resources and Evaluation Conference (LREC-2014), Reykjavik.
  • 26.09.2013: Decision Tree-Based Evaluation of Genitive Classification - An Empirical Study on CMC and Text Corpora. GSCL Conference Language Processing and Knowledge in the Web, Darmstadt.
  • 17.09.2013: KoGra-DB: Using MapReduce for Language Corpora. 43rd Annual Conference of the German Informatics Society (Gesellschaft für Informatik, GI) (poster session), Koblenz.
  • 25.04.2013: Machine learning algorithms for the analysis of large language corpora. 5th workshop of the DFG network Empirikom, Hamburg.
  • 13.03.2013: The Corpus Grammar Database (KoGra-DB). Annual IDS conference, poster session on corpus technology.
  • 28.09.2012: Webkorpus-Analyse zum semi-automatischen Aufbau einer Domänen-Ontologie. Workshop "Webkorpora in Computerlinguistik und Sprachforschung". 27.-28.09.2012, IDS Mannheim, poster session.
  • 22.05.2012: Evaluating DBMS-based Access Strategies to Very Large Multi-layer Annotated Corpora. LREC-2012 Workshop on Challenges in the management of large corpora. Istanbul.
  • 10.05.2012: Design und Implementierung grammatischer Datenbanken und Informationssysteme am Beispiel von GRAMMIS. IDS Mannheim.
  • 03.05.2012: Exploration of Quantitative Phenomena for Internet Dictionaries, Using the Example of E-VALBU and KoGra-DB. 3rd Workshop of the DFG network on internet lexicography, European Academy of Bozen.
  • 13.02.2012: Die Grenzen des Standards - Modellierung der Grenzen standardsprachlicher Grammatik mithilfe quantitativer Korpusanalysen. Coaching-Workshop Section A of the Leibniz-Gemeinschaft, Berlin.
  • 02.02.2012: GRAMMIS: A Comprehensive User-adaptive Web Information System on German Grammar. Instituut voor Nederlandse Lexicologie (INL) Leiden / NL.


Supervised student work:

Computational Linguistics and Digital Humanities, University of Trier:

  • SS 2015: Development of an ML-Based Algorithm for Typing of Complement Phrases (Monica Fürbacher).

Institute of Computer Science, NLP Group, University of Leipzig:

  • WS 2012/2013: XML Transformations Within a Grammatical Web Information System (Carsten Englert, Manuel Konrad, Sebastian Schüller).
  • WS 2011/2012: XML2Office: Conversion of a Database-driven Dictionary on Verb Valency (Mike Bretschneider).
  • WS 2011/2012: Linearisation of XML Tree Structures (Michael Gassner, Martin Georgi).

Administrative services: