Bibliography on Corpus and Computational Linguistics

Literature on computational linguistics can be found in the library under the shelf marks [61 ES 900 ...] and [61 ER 765 ...].
An accessible general introduction to the field, with some theoretical and psychological-philosophical reflections on the fundamental possibilities and limits of using computers in philology, together with an extensive bibliography, is given by:
Schmitz, Ulrich. Computerlinguistik. Eine Einführung. Westdeutscher Verlag, 1992.


The leading journals are:

  • CL. Computational Linguistics. [61 EQ 1120- ] (since 1984 the journal of the Association for Computational Linguistics, ACL, the world's largest professional association in this field)
  • Computers and the Humanities. [61 ES 900 M 681] (since 1966/67 the journal of the Association for Computers and the Humanities, ACH)
  • Literary and Linguistic Computing. [61 EQ 3387- ] (since 1986 the journal of the Association for Literary and Linguistic Computing, ALLC, based in Cambridge at the Literary and Linguistic Computing Centre; markedly philological and text-oriented in focus)
  • IJCL. International Journal of Corpus Linguistics. [61 EQ 2553-] (since 1996)


The following list of titles contains mainly introductory literature on the basic concepts and methods of computational linguistics, as well as practical examples of the application of corpus analysis in the various areas of linguistics, literary studies, and language teaching. [The shelf mark of the Augsburg University Library is given in square brackets.]


Adolphs, Svenja. Introducing Electronic Text Analysis. A practical guide to language and literary studies. London/NY: Routledge, 2006. - Especially of use to readers who are new to corpus-based analysis. The author covers a wide range of corpus techniques and concepts (keywords, concordance, frequency data, collocation, semantic prosody) without overwhelming the reader with too much information. Includes a useful glossary. - [.....]


Aijmer, K. and Altenberg, B. (eds.) English Corpus Linguistics. London: Longman, 1991.


Aarts, J. and Meijs, W. (eds.). Theory and Practice in Corpus Linguistics. Amsterdam: Rodopi, 1990. [65 HF 180 A114]


Aarts, Jan/Pieter de Haan/Nelleke Oostdijk (eds.). English Language Corpora. Design, analysis and exploitation. Papers from the Thirteenth Int. Conference on English Language Research on Computerized Corpora (ICAME 13), 1993. [65 HF 370 A114]


Armstrong, Susan (ed.). Using large Corpora. MIT Press, 1994. [61 ER 765 A738]


Barnbrook, Geoff. Language and Computers. A Practical Introduction to the Computer Analysis of Language. Edinburgh Textbooks in Empirical Linguistics, Edinburgh UP, 1996. [61 ES 900 B259+2]


Biber, Douglas/Conrad, Susan/Reppen, Randi. Corpus Linguistics. Investigating language structure and use. CUP 1998. [61 ER 765 B581]


Black, E./Garside, R./Leech, G. (eds.). Statistically Driven Computer Grammars of English: The IBM/Lancaster Approach, Amsterdam: Rodopi, 1993.
- Record of a five-year research project on the development of a probabilistic grammar/parser, showing how to build and implement a statistically-driven broad-coverage grammar of English, including the necessary statistical algorithms and guidelines for grammar-training using a treebank.


Botley, Simon Philip. Multilingual Corpora in Teaching and Research. Amsterdam: Rodopi, 2000. [61 ER 765 B749]


Church, Kenneth/William Gale/Patrick Hanks/Donald Hindle, "Using Statistics in Lexical Analysis", in: Zernik (1991), 115-164.


Deignan, Alice. Metaphor and Corpus Linguistics. Amsterdam: Benjamins, 2005. - Corpus linguistic studies of different aspects of metaphor, which investigate its linguistic and semantic properties and relate them to current theoretical views. This book demonstrates the need for naturally-occurring language data to be used in the development of metaphor theory, and shows the value of corpus data and techniques in this work. - [65 HF 435 D 324M5]


Fries, Udo/Viviane Müller/Peter Schneider (eds.). From Aelfric to the New York Times. Studies in English Corpus Linguistics. Amsterdam: Rodopi (Language and Computers: Studies in Practical Linguistics, No. 19), 1997. - 20 studies based on the analysis of computerized corpora, in honour of Gunnel Tottie.


Fries, Udo/Gunnel Tottie/Peter Schneider (eds.). Creating and Using English Language Corpora. Papers from the Fourteenth Int. Conference on English Language Research on Computerized Corpora (ICAME 14), Zürich 1993. Amsterdam: Rodopi, 1994. - Contains papers about the creation of the ARCHER corpus, the Hong Kong part of the ICE, and the bilingual English-Norwegian corpus, as well as corpus-based case studies on a range of syntactic, morphological, stylistic, and historical aspects of English. - [65 HF 370 F912]


Garside, R., G. Leech and G. Sampson (eds.) The Computational Analysis of English. London: Longman, 1987.


Garside, Roger (ed.). Corpus annotation. Linguistic information from computer text corpora. London: Longman, 1997. [61 ER 765 G243]


Hasselgard, Hilde. Out of Corpora. Studies in honour of Stig Johansson. 1999. [65 HF 450 J65 H3]

Hoey, Michael. Lexical Priming. A new theory of words and language. London: Routledge, 2005. - Using concrete statistical evidence from a corpus of newspaper English, but also referring to travel writing and literary texts, the author argues that words are 'primed' for use through our experience with them. ...a must for anyone involved in corpus linguistics. - [65 HF 370 H693]


Hunston, Susan. Corpora in Applied Linguistics. CUP, 2002.


Janssen, S. "Automatic sense-disambiguation with LDOCE: enriching syntactically analyzed corpora with semantic data", in: Aarts and Meijs (1990), 105-136.


Johansson, Stig. Corpora and cross-linguistic research. Theory, method, and case studies. 1998. [61 ER 765 J65]


Johansson, Stig/Anna-Brita Stenström (eds.). English Computer Corpora. Selected papers and research guide. 1991. [65 HF 450 J65]


Kennedy, Graeme D. An introduction to corpus linguistics. London: Longman, 1998. [61 ER 765 K35]


Kenny, Anthony. The Computation of Style. An Introduction to Statistics for Students of Literature and Humanities. Oxford: Pergamon Press, 1982.
 - A brief history of statistical stylistics: authorship attribution and determination of chronology by word and sentence length in the 19th c.; explanation of basic statistical terms and methods such as frequency distribution and its graphic representation in polygons and histograms, measures of central/general tendency = mode, average/arithmetic mean, measures of variability/dispersion = range, standard deviation, interquartile range. - [65 HF 331 K36]


Ljung, Magnus. Corpus-based studies in English. Papers from the Seventeenth Int. Conference on English Language Research on Computerized Corpora (ICAME 17). Amsterdam: Rodopi, 1997. [65 HF 450 L789]


McEnery, Tony & Andrew Wilson, Corpus Linguistics, Edinburgh Textbooks in Empirical Linguistics. Edinburgh UP, 1996. [61 ER 765 M141(2)]


Miller, G. et al. Five Papers on WordNet, CSL Report 43, Cognitive Science Laboratory, Princeton University, 1990/93.

Oakes, Michael P. Statistics for corpus linguistics. Edinburgh Textbooks in Empirical Linguistics. Edinburgh UP, 1998. - Designed for the use of non-mathematicians, with all technical terms (e.g. z-score, t-test, chi-square, degrees of freedom, mutual information, log likelihood coefficient, hierarchic clustering etc.) explained; with a wide variety of examples showing the use of the various statistical techniques in corpus linguistics; the most relevant mathematical tables are included. - [61 ER 765 O11]


Partington, Alan. Patterns and meaning. 1998. [65 HF 450 P273]


Percy, Carol E./Charles F. Meyer/Ian Lancashire (eds.). Synchronic Corpus Linguistics. Papers from the Sixteenth Int. Conference on English Language Research on Computerized Corpora (ICAME 16). Amsterdam: Rodopi, 1996. [65 HF 450 L869]


Renouf, Antoinette. Explorations in Corpus Linguistics. Amsterdam: Rodopi, 1998. [65 HF 450 R419]


Sinclair, John McH. Corpus, Concordance, Collocation. Oxford: OUP, 1991. [65 HF 370 S 616]


Stubbs, Michael. Text and Corpus Analysis. Computer-assisted Studies of Language and Culture. Blackwell, 1996.


Svartvik, Jan (ed.). Directions in Corpus Linguistics. Berlin: Mouton de Gruyter, 1992. [61 ER 760 S968]


Thomas, Jenny/Mick Short (eds.). Using corpora for language research. Studies in honour of Geoffrey Leech. London: Longman, 1996. [61 ER 765 L483]


Wichmann, Anne (ed.). Teaching and language corpora. London: Longman, 1997. [61 ES 885 W635]


Zernik, Uri (ed.). Lexical Acquisition: Exploiting on-Line Resources to Build a Lexicon. Hillsdale, New Jersey: Lawrence Erlbaum Associates, 1991.