I'll take a look at speeding this up.
First I need some info:
First I need some info:
- You have a huge bottleneck at the beginning (lines 13 through 19)
Curious, why change from DNA to 0 & 1? are you trying to count base 'G'?
- Could you please explain what the goal is here, I'm having difficulty understanding.
- Do you know the ftp site for downloading fasta file (CM000665.fasta)? something like ftp://ftp.ncbi.nih.gov/snp/organisms/hum.../rs_fasta/