Parallellkorpusen som resurs i lexikografiskt arbete

Forfattere

  • Håkan Jansson

Nøgleord:

korpusbaserad lexikografi, parallellkorpus, tvåspråkig lexikografi, n-gram, flerordsuttryck

Resumé

This paper discusses the use of parallel corpora as a tool in bilingual lexicography. After a brief background sketch with reference to literature and (lack of) reported ongoing projects, the text briefly considers the notion of a parallel corpus, before it reaches the main subject: a hands-on study of the opportunities connected to the use of parallel corpora in lexicography. The corpus used for the study is a relatively small, about 115,000-token, ad-hoc corpus compiled of texts from the Nordic Council. The languages investigated are Icelandic on the one hand, and Danish, Norwegian and Swedish on the other. One advantage of parallel corpora seems to be the possibility to compare concordances of n-grams between the studied languages. The two Icelandic verbs probed as examples were shown to be part of hitherto unnoticed equivalent pairs, where one or both sides turned out to be MWEs. Co-occurring patterns between active or passive voice and certain translation equivalents have also been noticed.

Downloads

Publiceret

2012-01-01

Citation/Eksport

Jansson, H. (2012). Parallellkorpusen som resurs i lexikografiskt arbete. Nordiske Studier I Leksikografi, (11). Hentet fra https://tidsskrift.dk/nsil/article/view/19349

Nummer

Sektion

Artikler