Text this: Data reduction techniques for highly imbalanced medicare Big Data