Text this: Data integration by fuzzy similarity-based hierarchical clustering