Text this: Leveraging High Performance Computing for Managing Large and Evolving Data Collections