I took a look at: https://gist.github.com/mfoll/04d751165416a4466001
which is where I assume you got the first script.
He states that the code is very slow, and it's not well formatted to include as an import file.
You need it to be a class , or at least a function.
so, a couple of questions.
The sequence doesn't have to be something that your working on, so long as the header format is the same.
which is where I assume you got the first script.
He states that the code is very slow, and it's not well formatted to include as an import file.
You need it to be a class , or at least a function.
so, a couple of questions.
- the fasta format is pretty simple, but the header is free form. I guessing that the Bio.SeqIo.Parse is reading in sequences one by one, and then printing the output.
- what does the header look like?
- what do you want the output to look like?
- could you show a sample of the fasta data?
The sequence doesn't have to be something that your working on, so long as the header format is the same.