From a Smoking Gun to Spent Fuel: Principled Subsampling Methods for Building Big Language Data Corpora from Monitor Corpora

With the influence of Big Data culture on qualitative data collection, acquisition, and processing, it is becoming increasingly important that social scientists understand the complexity underlying data collection and the resulting models and analyses. Systematic approaches for creating computationa...

Full description

Bibliographic Details
Main Author: Jacqueline Hettel Tidwell
Format: Article
Language:English
Published: MDPI AG 2019-04-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/4/2/48