IDS-Logo
Startseite : : Organisationsstruktur : : Direktion : : Korpuslinguistik Corpus Linguistics : : Projects : : Corpus Development
KorpusausbauCorpus DevelopmentKorpusausbauKorpusausbauKorpusausbauKorpusausbau
Direktion und zentrale Forschung

Kontakt:
    <korpuslinguistik@ids-...>
 
Leitung:
    Dr. Marc Kupietz <kupietz@ids-...>
 
Wissenschaftliche Mitarbeiter:
    Cyril Belica <belica@ids-...>
    Dr. Harald Lüngen <luengen@ids-...>
    Rainer Perkuhn <perkuhn@ids-...>
 
Kooperationen:
    siehe hier
 
Ehemalige am Korpusaufbau beteiligte Mitarbeiter des IDS:
    siehe hier
 
Studentische Hilfskräfte:

  • Anna Konovalova
  • Theresa Sick

 

 

Development and Maintenance of Contemporary Written Corpora

The Mannheim German Reference Corpus (DeReKo)

The world's largest collection of German-language corpora as an empirical basis for linguistic research

The Corpora of Contemporary Written German at the IDS

  • constitute the world's largest linguistically motivated collection (over 32.85 billion words as of October, 2017) of electronic corpora with written German texts from today and the recent past
  • can be accessed via COSMAS II free of charge
  • contain belletristic, scientific and popular scientific texts, a large number of newspaper texts as well as a wide range of additional text types. They are being developed continuously
  • are being acquired with a view to size, variability, quality and topicality, and allow the creation of virtual corpora while using COSMAS II. These can be either representative corpora or corpora designed for particular research questions
  • contain only copyrighted material

Recent publications on DeReKo

  • Kupietz, Marc/Lüngen, Harald (2014): Recent Developments in DeReKo. In: Calzolari, Nicoletta et al. (eds.): Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14). Reykjavik: ELRA, 2378-2385.  www.lrec-conf.org/proceedings/lrec2014/pdf/842_Paper.pdf
  • Kupietz, Marc / Belica, Cyril / Keibel, Holger / Witt, Andreas (2010): The German Reference Corpus DeReKo: A primordial sample for linguistic research. In: Calzolari, Nicoletta et al. (eds.): Proceedings of the 7th conference on International Language Resources and Evaluation (LREC 2010). Valletta, Malta: European Language Resources Association (ELRA), 1848-1854.   www.lrec-conf.org/proceedings/lrec2010/pdf/414_Paper.pdf
  • Kupietz, Marc / Keibel, Holger (2009): The Mannheim German Reference Corpus (DeReKo) as a basis for empirical linguistic research. In Minegishi, Makoto / Kawaguchi, Yuji (Eds.): Working Papers in Corpus-based Linguistics and Language Education, No. 3. Tokyo: Tokyo University of Foreign Studies (TUFS), 53-59.   cblle.tufs.ac.jp/assets/files/publications/working_papers_03/section/053-059.pdf

Topic Overview

Contact us:

<korpuslinguistik@ids-...>