A corpus platform of Indonesian academic language

Bibliographic Details
Main Author: Deny A. Kwary
Format: Article
Language: English
Published: Elsevier 2019-01-01
Series: SoftwareX
Online Access: http://www.sciencedirect.com/science/article/pii/S2352711018302413
Description
Summary: Indonesian is the language of education and the language that unifies the 701 ethnic languages in Indonesia. Documenting this language and determining how it is used in academic texts requires both a corpus database and a corpus platform for exploring it. This paper presents the features and usage of the first corpus platform of Indonesian academic language, which is freely available. The corpus comprises over five million word tokens drawn from articles in nationally accredited journals and theses from reputable universities. The main features of the software are context, collocate, and frequency. The corpus platform will be an essential resource for linguists, lexicographers, and teachers.
Keywords: Academic text, Digital corpus platform, Indonesian corpus, Indonesian language
ISSN: 2352-7110
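
The three features named in the summary (context, collocate, and frequency) are standard corpus-query operations. The following is a minimal Python sketch of what each one typically computes. It is not the platform's implementation: the function names, the whitespace-and-letters tokenizer, the window sizes, and the sample sentence are all assumptions made for illustration.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens (simplified)."""
    return re.findall(r"[a-z]+", text.lower())


def frequency(tokens):
    """Frequency: count how often each word token occurs."""
    return Counter(tokens)


def context(tokens, keyword, width=3):
    """Context: return (left, keyword, right) tuples, i.e. keyword-in-context lines."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == keyword:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines


def collocates(tokens, keyword, window=2):
    """Collocate: count words occurring within `window` tokens of the keyword."""
    counts = Counter()
    for i, tok in enumerate(tokens):
        if tok == keyword:
            neighbours = tokens[max(0, i - window):i] + tokens[i + 1:i + 1 + window]
            counts.update(neighbours)
    return counts


# Invented sample sentence, not taken from the corpus.
sample = ("penelitian ini menggunakan metode kualitatif dan "
          "penelitian ini menggunakan data primer")
tokens = tokenize(sample)

print(frequency(tokens).most_common(3))
print(context(tokens, "metode", width=2))
print(collocates(tokens, "penelitian", window=2).most_common())
```

A production corpus tool would normally add proper tokenization for Indonesian, an index over the full corpus, and a statistical association measure for ranking collocates; the raw co-occurrence counts here only show the idea.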