Data for lexicography The central role of the corpus
This paper looks at the nature of data for lexicography and in particular on the central role that electronic corpora can play in providing it. Data has traditionally come from existing dictionaries, citations, and from the lexicographer’s own knowledge of words, through introspection. Each of these...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Indonesia
2010-10-01
|
Series: | Wacana: Journal of the Humanities of Indonesia |
Subjects: | |
Online Access: | http://wacana.ui.ac.id/index.php/wjhi/article/view/116 |
id |
doaj-478bc19d5db044c482272588e5572946 |
---|---|
record_format |
Article |
spelling |
doaj-478bc19d5db044c482272588e55729462021-07-08T04:08:06ZengUniversity of IndonesiaWacana: Journal of the Humanities of Indonesia1411-22722407-68992010-10-0112221924210.17510/wjhi.v12i2.116116Data for lexicography The central role of the corpusAllan F. Lauder0a guest lecturer in Postgraduate Linguistics at the Faculty of Humanities at Universitas IndonesiaThis paper looks at the nature of data for lexicography and in particular on the central role that electronic corpora can play in providing it. Data has traditionally come from existing dictionaries, citations, and from the lexicographer’s own knowledge of words, through introspection. Each of these is examined and evaluated. Then the electronic corpus is considered. Different kinds of corpora are described and key design criteria are explained, in particular the size of corpus needed for lexicography as well as the issue of representativeness and sampling. The advantages and disadvantages of corpora are weighed and compared against the other types of data. While each of these has benefits, it is argued that corpora are a requirement, not an option, as data for dictionary making.http://wacana.ui.ac.id/index.php/wjhi/article/view/116corpus linguistics, lexicography, data, linguistic intuition, citations, |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Allan F. Lauder |
spellingShingle |
Allan F. Lauder Data for lexicography The central role of the corpus Wacana: Journal of the Humanities of Indonesia corpus linguistics, lexicography, data, linguistic intuition, citations, |
author_facet |
Allan F. Lauder |
author_sort |
Allan F. Lauder |
title |
Data for lexicography The central role of the corpus |
title_short |
Data for lexicography The central role of the corpus |
title_full |
Data for lexicography The central role of the corpus |
title_fullStr |
Data for lexicography The central role of the corpus |
title_full_unstemmed |
Data for lexicography The central role of the corpus |
title_sort |
data for lexicography the central role of the corpus |
publisher |
University of Indonesia |
series |
Wacana: Journal of the Humanities of Indonesia |
issn |
1411-2272 2407-6899 |
publishDate |
2010-10-01 |
description |
This paper looks at the nature of data for lexicography and in particular on the central role that electronic corpora can play in providing it. Data has traditionally come from existing dictionaries, citations, and from the lexicographer’s own knowledge of words, through introspection. Each of these is examined and evaluated. Then the electronic corpus is considered. Different kinds of corpora are described and key design criteria are explained, in particular the size of corpus needed for lexicography as well as the issue of representativeness and sampling. The advantages and disadvantages of corpora are weighed and compared against the other types of data. While each of these has benefits, it is argued that corpora are a requirement, not an option, as data for dictionary making. |
topic |
corpus linguistics, lexicography, data, linguistic intuition, citations, |
url |
http://wacana.ui.ac.id/index.php/wjhi/article/view/116 |
work_keys_str_mv |
AT allanflauder dataforlexicographythecentralroleofthecorpus |
_version_ |
1721314282936729600 |