Text this: Combining instance selection and self-training to improve data stream quantification