Semi-Supervised Chinese Word Segmentation in Geological Domain Using Pseudo-Lexicon and Self-Training Strategy
Chinese word segmentation (CWS), which involves splitting a sequence of Chinese characters into words, is a key task in natural language processing (NLP) for Chinese. However, the complexity and flexibility of geologic terms require that domain-specific knowledge be utilized in CWS for geoscience...
| Published in: | Applied Sciences |
|---|---|
| Main Authors: | |
| Format: | Article |
| Language: | English |
| Published: | MDPI AG, 2025-01-01 |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/15/3/1404 |
