A simulation study of rater agreement measures with 2x2 contingency tables