Lexikografiska aspekter på Internet som källa till informellt språkbruk

Authors

  • Håkan Jansson

Keywords:

Internet-baserad korpus, korpusbaserad lexikografi, informellt språk, balanserad korpus, SketchEngine

Abstract

This paper treats Internet-based corpora in a lexicographical perspective, which first of all requires a quick look at the terms normative and descriptive lexicography. However the main part of the paper presents research on the compilation of Internet-based corpora, and compares that type of corpora with the traditional kind. This includes a discussion of the notion of the representativeness of the corpus, with reference to comparison between Internet-based corpora and traditional corpora such as the BNC. It is noted that Internet-based corpora offers the possibility to capture language from registers that hitherto has been unrepresented or at least underrepresented in traditional corpora. Reference is being made to some experiences of compilation of Swedish web corpora, notably how to evaluate the differences in word frequency, when different corpora are compared.

Downloads

Published

2010-01-01

How to Cite

Jansson, H. (2010). Lexikografiska aspekter på Internet som källa till informellt språkbruk. Nordiske Studier I Leksikografi, (10). Retrieved from https://tidsskrift.dk/nsil/article/view/19233

Issue

Section

Artikler