May-28-2021, 06:58 PM
Something still seems off, as that regex won't match the string.
>>> import re >>> test = 'ChrX 74226540 T t 50 .' >>> test 'ChrX\t74226540\tT\tt\t50\t.' >>> print(test) ChrX 74226540 T t 50 . >>> raw_regex = r"^[Cc]hr(?:0?[1-9]|[1-9][0-9]|[MXY])\t0*[1-9][0-9]*\t[^\t]*\t[ATGC]{2}" >>> regex = re.compile(raw_regex) >>> regex.match(test) >>> regex re.compile('^[Cc]hr(?:0?[1-9]|[1-9][0-9]|[MXY])\\t0*[1-9][0-9]*\\t[^\\t]*\\t[ATGC]{2}')