A method of constructing syllable level Tibetan text classification corpus

Corpora are an indispensable ingredient of statistical NLP research and real-world applications, so the corpus construction method has a direct impact on various downstream tasks. This paper proposes a method to construct a Tibetan text classification corpus based on a syllable-level processi...

Bibliographic Details
Main Authors: Dao Jizhaxi, Cai Zhijie, Cai Rangzhuoma, San Maocuo, Ban Mabao
Format: Article
Language: English
Published: EDP Sciences 2021-01-01
Series: MATEC Web of Conferences
Online Access: https://www.matec-conferences.org/articles/matecconf/pdf/2021/05/matecconf_cscns20_06013.pdf