ConvWin-UNet: UNet-like hierarchical vision Transformer combined with convolution for medical image segmentation