Text this: Understanding deep architectures and the effect of unsupervised pre-training