Artificial intelligence in linguistics: a GBRT model approach to forecast Cantonese levels among Chinese Malaysians

Abstract: This study leverages a Gradient Boosted Regression Trees (GBRT) machine learning model to explore how Cantonese media exposure and cultural identity affect Cantonese language proficiency among Chinese Malaysians. By integrating sociolinguistic insights with predictive modeling, we address t...

Bibliographic Details
Published in: Humanities & Social Sciences Communications
Main Authors: Yuqing Peng, Junxian Xie, Lin Zhang, Yuwen Lyu
Format: Article
Language: English
Published: Springer Nature 2025-09-01
Online Access: https://doi.org/10.1057/s41599-025-05520-5