Fasta Files - Printable Version +- Python Forum (https://python-forum.io) +-- Forum: Python Coding (https://python-forum.io/forum-7.html) +--- Forum: General Coding Help (https://python-forum.io/forum-8.html) +--- Thread: Fasta Files (/thread-28602.html) |
Fasta Files - johnny_sav1992 - Jul-25-2020 Hello everybody, i'm new in programming and its the first time i use python. I'm working on a code that should read a fasta file and delete the header of each sequence. My code to read the file: def read_fasta(inputfile): with open(inputfile,'r') as f: file=f.readlines() f.close return file fasta_file=read_fasta('SELEX_100_reads.txt') print(fasta_file)The output of fasta file looks like that: it's a list with header for each sequence. therefore i just want the DNA sequences (CTAAAA or AAGTAAAGCA) of each line as a list. Can anyone help me with that ? Thanks a lot Cheers, John RE: Fasta Files - ndc85430 - Jul-25-2020 So how do you think you can approach the problem? RE: Fasta Files - johnny_sav1992 - Jul-25-2020 i would do it with a loop, and ignore every line if it's not a sequence. Like that: for i in file: list_=[] if ... ='A' or 'T' : new_list=append.list_ .... ( if the line start with an A,T,C, or G then append to my list)i dont know how to write that as a code. RE: Fasta Files - Yoriz - Jul-25-2020 You could use the following: https://docs.python.org/3/library/stdtypes.html#str.startswith Wrote:str.startswith(prefix[, start[, end]]) RE: Fasta Files - Larz60+ - Jul-25-2020 Links to some Fasta code I wrote, can't remember exactly what's here, but expect that I have dealt with headers somewhere in the following:
|