perldoc Stefan::Evert - Corpus Linguistics - FAU Erlangen-Nürnberg

Computational Corpus Linguistics

Research Interests

My computational corpus linguistics group at FAU Erlangen-Nürnberg carries out foundational methodological research on the quantitative analysis of large text corpora. The algorithms and software tools developed by the group support innovative studies in the digital humanities and social sciences as well as practical applications in language technology. A particular focus lies on understanding cooccurrence phenomena and their application in corpus-based discourse analysis.

Methodological foundations of corpus research and digital humanities

Corpus research in linguistics as well as in the digital humanities and social sciences relies on a wide range of statistical techniques and visualizations. A central goal of my research is to develop sound methodological foundations for corpus linguistics that address key problems and ensure that quantitative analyses are both reliable and meaningful.

Projects

Software

Key publications

Corpus tools and language technology

My group develops algorithms and software tools for the automatic linguistic annotation, efficient indexing, flexible querying and quantitative analysis of large text corpora. These tools form the basis of innovative research in the digital humanities as well as practical and commercial applications in language technology.

Software

Key publications

Collocations, multiword expressions and corpus-based discourse analysis

Cooccurrence patterns – such as collocations, multiword expressions, valency and distributional semantics – play a central role not only in corpus linguistics but also in the study of public discourses and political propaganda. My research in this area focuses on improving and refining the underlying analytical techniques and on developing new interactive methods for multi-modal corpus-based discourse analysis.

Projects

Software

Key publications

© by Stefan Evert (09 Jul 2017)