A Statistical Language Model for Pre-Trained Sequence Labeling: A Case Study on Vietnamese
By defining the computable word segmentation unit and studying its probability characteristics, we establish an unsupervised statistical language model (SLM) for a new pre-Trained sequence labeling framework in this article. The proposed SLM is an optimization model, and its objective is to maximize...
Main Authors: | Chen, L. (Author), Huang, Y. (Author), Liao, X. (Author), Yang, P. (Author) |
---|---|
Format: | Article |
Language: | English |
Published: |
Association for Computing Machinery
2022
|
Subjects: | |
Online Access: | View Fulltext in Publisher |
Similar Items
-
Infants generalize representations of statistically segmented words
by: Katharine eGraf Estes
Published: (2012-10-01) -
Understanding Patterns in Infant-Directed Speech in Context: An Investigation of Statistical Cues to Word Boundaries
by: Hartman, Rose
Published: (2017) -
Hierarchical sequence labeling for extracting BEL statements from biomedical literature
by: Suwen Liu, et al.
Published: (2019-04-01) -
Cross-Linguistic Analysis of Vietnamese and English with Implications for Vietnamese Language Acquisition and Maintenance in the United States
by: Giang Tang
Published: (2007-01-01) -
The Meta-Science of Adult Statistical Word Segmentation: Part 1
by: Joshua K. Hartshorne, et al.
Published: (2019-01-01)