Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model

The advancement of the Internet has changed people’s ways of expressing and sharing their views with the world. Moreover, user-generated content has become a primary guide for customer purchasing decisions. Therefore, motivated by commercial interest, some sellers have started manipulating Internet...

Full description

Bibliographic Details
Main Authors: Lin, K.-C (Author), Weng, C.-H (Author), Ying, J.-C (Author)
Format: Article
Language:English
Published: MDPI 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02550nam a2200217Ia 4500
001 0.3390-app12073338
008 220421s2022 CNT 000 0 und d
020 |a 20763417 (ISSN) 
245 1 0 |a Detection of Chinese Deceptive Reviews Based on Pre-Trained Language Model 
260 0 |b MDPI  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3390/app12073338 
520 3 |a The advancement of the Internet has changed people’s ways of expressing and sharing their views with the world. Moreover, user-generated content has become a primary guide for customer purchasing decisions. Therefore, motivated by commercial interest, some sellers have started manipulating Internet ratings by writing false positive reviews to encourage the sale of their goods and writing false negative reviews to discredit competitors. These reviews are generally referred to as deceptive reviews. Deceptive reviews mislead customers in purchasing goods that are inconsistent with online information and thus obstruct fair competition among businesses. To protect the right of consumers and sellers, an effective method is required to automate the detection of misleading reviews. Previously developed methods of translating text into feature vectors usually fail to interpret polysemous words, which leads to such functions being obstructed. By using dynamic feature vectors, the present study developed several misleading review-detection models for the Chinese language. The developed models were then compared with the standard detection-efficiency models. The deceptive reviews collected from various online forums in Taiwan by previous studies were used to test the models. The results showed that the models proposed in this study can achieve 0.92 in terms of precision, 0.91 in terms of recall, and 0.91 in terms of F1-score. The improvement rate of our proposal is higher than 20%. Accordingly, we prove that our proposal demonstrated improved performance in detecting misleading reviews, and the models based on dynamic feature vectors were capable of more accurately capturing semantic terms than the conventional models based on the static feature vectors, thereby enhancing effectiveness. © 2022 by the authors. Licensee MDPI, Basel, Switzerland. 
650 0 4 |a BERT 
650 0 4 |a deep learning 
650 0 4 |a detection of deceptive reviews 
650 0 4 |a language model 
650 0 4 |a natural language processing 
700 1 0 |a Lin, K.-C.  |e author 
700 1 0 |a Weng, C.-H.  |e author 
700 1 0 |a Ying, J.-C.  |e author 
773 |t Applied Sciences (Switzerland)