Python Forum
Need help analyzing data inside a list
#1
I have a list of strings that together contain a large number of characters (letters only), and I am trying to count how many times each letter occurs across the whole list.
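For illustration, here is a minimal sketch of the kind of count I am after, using `collections.Counter` from the standard library. The name `sequences` and the two short strings are hypothetical stand-ins for the real data; I use `sequences` rather than `list` because `list` shadows the built-in type.

```python
from collections import Counter

# Hypothetical stand-in for the real data: a short list of letter-only strings.
sequences = ['MTIKEM', 'MEPFVV']

# Accumulate letter counts across every string in the list.
totals = Counter()
for seq in sequences:
    totals.update(seq)  # update() with a string counts each character

# Print the letters from most to least frequent.
for letter, count in totals.most_common():
    print(letter, count)
```

With the real data, the same loop should work unchanged once the long list is assigned to `sequences`. Per-string counts could be obtained by calling `Counter(seq)` on each element instead of accumulating into one total.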
list=['MTIKEMPQPKTFGELKNLPLLNTDKPVQALMKIADELGEIFKFEAPGRVTRYLSSQRLIKEACDESRFDKNLSQALKFVRDFAGDGLFTSWTHEKNWKKAHNILLPSFSQQAMKGYHAMMVDIAVQLVQKWERLNADEHIEVPEDMTRLTLDTIGLCGFNYRFNSFYRDQPHPFITSMVRALDEAMNKLQRANPDDPAYDENKRQFQEDIKVMNDLVDKIIADRKASGEQSDDLLTHMLNGKDPETGEPLDDENIRYQIITFLIAGHETTSGLLSFALYFLVKNPHVLQKAAEEAARVLVDPVPSYKQVKQLKYVGMVLNEALRLWPTAPAFSLYAKEDTVLGGEYPLEKGDELMVLIPQLHRDKTIWGDDVEEFRPERFENPSAIPQHAFKPFGNGQRACIGQQFALHEATLVLGMMLKHFDFEDHTNYELDIKETLTLKPEGFVVKAKSKKIPLGGIPSPSTEQSAKKVRKKAENAHNTPLLVLYGSNMGTAEGTARDLADIAMSKGFAPQVATLDSHAGNLPREGAVLIVTASYNGHPPDNAKQFVDWLDQASADEVKGVRYSVFGCGDKNWATTYQKVPAFIDETLAAKGAENIADRGEADASDDFEGTYEEWREHMWSDVAAYFNLDIENSEDNKSTLSLQFVDSAADMPLAKMHGAFSTNVVASKELQQPGSARSTRHLEIELPKEASYQEGDHLGVIPRNYEGIVNRVTARFGLDASQQIRLEAEEEKLAHLPLAKTVSVEELLQYVELQDPVTRTQLRAMAAKTVCPPHKVELEALLEKQAYKEQVLAKRLTMLELLEKYPACEMKFSEFIALLPSIRPRYYSISSSPRVDEKQASITVSVVSGEAWSGYGEYKGIASNYLAELQEGDTITCFISTPQSEFTLPKDPETPLIMVGPGTGVAPFRGFVQARKQLKEQGQSLGEAHLYFGCRSPHEDYLYQEELENAQSEGIITLHTAFSRMPNQPKTYVQHVMEQDGKKLIELLDQGAHFYICGDGSQMAPAVEATLMKSYADVHQVSEADARLWLQQLEEKGRYAKDVWAG', 'MEPFVVLVLCLSFMLLFSLWRQSCRRRKLPPGPTPLPIIGNMLQIDVKDICKSFTNFSKVYGPVFTVYFGMNPIVVFHGYEAVKEALIDNGEEFSGRGNSPISQRITKGLGIISSNGKRWKEIRRFSLTTLRNFGMGKRSIEDRVQEEAHCLVEELRKTKASPCDPTFILGCAPCNVICSVVFQKRFDYKDQNFLTLMKRFNENFRILNSPWIQVCNNFPLLIDCFPGTHNKVLKNVALTRSYIREKVKEHQASLDVNNPRDFIDCFLIKMEQEKDNQKSEFNIENLVGTVADLFVAGTETTSTTLRYGLLLLLKHPEVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTDAVVHEIQRYSDLVPTGVPHAVTTDTKFRNYLIPKGTTIMALLTSVLHDDKEFPNPNIFDPGHFLDKNGNFKKSDYFMPFSAGKRICAGEGLARMELFLFLTTILQNFNLKSVDDLKNLNTTAVTKGIVSLPPSYQICFIPV', 
'MRMPTGSELWPIAIFTIIFLLLVDLMHRRQRWTSRYPPGPVPWPVLGNLLQIDFQNMPAGFQKLRCRFGDLFSLQLAFESVVVLNGLPALREALVKYSEDTADRPPLHFNDQSGFGPRSQGVVLARYGPAWRQQRRFSVSTFRHFGLGKKSLEQWVTEEARCLCAAFADHSGFPFSPNTLLDKAVCNVIASLLFACRFEYNDPRFIRLLDLLKDTLEEESGFLPMLLNVFPMLLHIPGLLGKVFSGKKAFVAMLDELLTEHKVTWDPAQPPRDLTDAFLAEVEKAKGNPESSFNDENLRVVVADLFMAGMVTTSTTLTWALLFMILRPDVQCRVQQEIDEVIGQVRRPEMADQARMPFTNAVIHEVQRFADILPLGVPHKTSRDIEVQGFLIPKGTTLIINLSSVLKDETVWEKPLRFHPEHFLDAQGNFVKHEAFMPFSAGRRACLGEPLARMELFLFFTCLLQRFSFSVPAGQPRPSNYGVFGALTTPRPYQLCASPR', 'MALIPDLAMETWLLLAVSLVLLYLYGTHSHGLFKKLGIPGPTPLPFLGNILSYHKGFCMFDMECHKKYGKVWGFYDGQQPVLAITDPDMIKTVLVKECYSVFTNRRPFGPVGFMKSAISIAEDEEWKRLRSLLSPTFTSGKLKEMVPIIAQYGDVLVRNLRREAETGKPVTLKDVFGAYSMDVITSTSFGVNIDSLNNPQDPFVENTKKLLRFDFLDPFFLSITVFPFLIPILEVLNICVFPREVTNFLRKSVKRMKESRLEDTQKHRVDFLQLMIDSQNSKETESHKALSDLELVAQSIIFIFAGYETTSSVLSFIMYELATHPDVQQKLQEEIDAVLPNKAPPTYDTVLQMEYLDMVVNETLRLFPIAMRLERVCKKDVEINGMFIPKGVVVMIPSYALHRDPKYWTEPEKFLPERFSKKNKDNIDPYIYTPFGSGPRNCIGMRFALMNMKLALIRVLQNFSFKPCKETQIPLKLSLGGLLQPEKPVVLKVESRDGTVSGA', 'MDSLVVLVLCLSCLLLLSLWRQSSGRGKLPPGPTPLPVIGNILQIGIKDISKSLTNLSKVYGPVFTLYFGLKPIVVLHGYEAVKEALIDLGEEFSGRGIFPLAERANRGFGIVFSNGKKWKEIRRFSLMTLRNFGMGKRSIEDRVQEEARCLVEELRKTKASPCDPTFILGCAPCNVICSIIFHKRFDYKDQQFLNLMEKLNENIKILSSPWIQICNNFSPIIDYFPGTHNKLLKNVAFMKSYILEKVKEHQESMDMNNPQDFIDCFLMKMEKEKHNQPSEFTIESLENTAVDLFGAGTETTSTTLRYALLLLLKHPEVTAKVQEEIERVIGRNRSPCMQDRSHMPYTDAVVHEVQRYIDLLPTSLPHAVTCDIKFRNYLIPKGTTILISLTSVLHDNKEFPNPEMFDPHHFLDEGGNFKKSKYFMPFSAGKRICVGEALAGMELFLFLTSILQNFNLKSLVDPKNLDTTPVVNGFASVPPFYQLCFIPV', 
'MAESVPIPEPPGYPLIGNLGEFTSNPLSDLNRLADTYGPIFRLRLGAKAPIFVSSNSLINEVCDEKRFKKTLKSVLSQVREGVHDGLFTAFEDEPNWGKAHRILVPAFGPLSIRGMFPEMHDIATQLCMKFARHGPRTPIDTSDNFTRLALDTLALCAMDFRFYSYYKEELHPFIEAMGDFLTESGNRNRRPPFAPNFLYRAANEKFYGDIALMKSVADEVVAARKASPSDRKDLLAAMLNGVDPQTGEKLSDENITNQLITFLIAGHETTSGTLSFAMYQLLKNPEAYSKVQKEVDEVVGRGPVLVEHLTKLPYISAVLRETLRLNSPITAFGLEAIDDTFLGGKYLVKKGEIVTALLSRGHVDPVVYGNDADKFIPERMLDDEFARLNKEYPNCWKPFGNGKRACIGRPFAWQESLLAMVVLFQNFNFTMTDPNYALEIKQTLTIKPDHFYINATLRHGMTPTELEHVLAGNGATSSSTHNIKAAANLDAKAGSGKPMAIFYGSNSGTCEALANRLASDAPSHGFSATTVGPLDQAKQNLPEDRPVVIVTASYEGQPPSNAAHFIKWMEDLDGNDMEKVSYAVFACGHHDWVETFHRIPKLVDSTLEKRGGTRLVPMGSADAATSDMFSDFEAWEDIVLWPGLKEKYKISDEESGGQKGLLVEVSTPRKTSLRQDVEEALVVAEKTLTKSGPAKKHIEIQLPSAMTYKAGDYLAILPLNPKSTVARVFRRFSLAWDSFLKIQSEGPTTLPTNVAISAFDVFSAYVELSQPATKRNILALAEATEDKDTIQELERLAGDAYQAEISPKRVSVLDLLEKFPAVALPISSYLAMLPPMRVRQYSISSSPFADPSKLTLTYSLLDAPSLSGQGRHVGVATNFLSHLTAGDKLHVSVRASSEAFHLPSDAEKTPIICVAAGTGLAPLRGFIQERAAMLAAGRTLAPALLFFGCRNPEIDDLYAEEFERWEKMGAVDVRRAYSRATDKSEGCKYVQDRVYHDRADVFKVWDQGAKVFICGSREIGKAVEDVCVRLAIEKAQQNGRDVTEEMARAWFERSRNERFATDVFD', 
'MKQASAIPQPKTYGPLKNLPHLEKEQLSQSLWRIADELGPIFRFDFPGVSSVFVSGHNLVAEVCDEKRFDKNLGKGLQKVREFGGDGLFTSWTHEPNWQKAHRILLPSFSQKAMKGYHSMMLDIATQLIQKWSRLNPNEEIDVADDMTRLTLDTIGLCGFNYRFNSFYRDSQHPFITSMLRALKEAMNQSKRLGLQDKMMVKTKLQFQKDIEVMNSLVDRMIAERKANPDENIKDLLSLMLYAKDPVTGETLDDENIRYQIITFLIAGHETTSGLLSFAIYCLLTHPEKLKKAQEEADRVLTDDTPEYKQIQQLKYIRMVLNETLRLYPTAPAFSLYAKEDTVLGGEYPISKGQPVTVLIPKLHRDQNAWGPDAEDFRPERFEDPSSIPHHAYKPFGNGQRACIGMQFALQEATMVLGLVLKHFELINHTGYELKIKEALTIKPDDFKITVKPRKTAAINVQRKEQADIKAETKPKETKPKHGTPLLVLFGSNLGTAEGIAGELAAQGRQMGFTAETAPLDDYIGKLPEEGAVVIVTASYNGAPPDNAAGFVEWLKELEEGQLKGVSYAVFGCGNRSWASTYQRIPRLIDDMMKAKGASRLTAIGEGDAADDFESHRESWENRFWKETMDAFDINEIAQKEDRPSLSITFLSEATETPVAKAYGAFEGIVLENRELQTAASTRSTRHIELEIPAGKTYKEGDHIGILPKNSRELVQRVLSRFGLQSNHVIKVSGSAHMAHLPMDRPIKVVDLLSSYVELQEPASRLQLRELASYTVCPPHQKELEQLVSDDGIYKEQVLAKRLTMLDFLEDYPACEMPFERFLALLPSLKPRYYSISSSPKVHANIVSMTVGVVKASAWSGRGEYRGVASNYLAELNTGDAAACFIRTPQSGFQMPNDPETPMIMVGPGTGIAPFRGFIQARSVLKKEGSTLGEALLYFGCRRPDHDDLYREELDQAEQDGLVTIRRCYSRVENEPKGYVQHLLKQDTQKLMTLIEKGAHIYVCGDGSQMAPDVERTLRLAYEAEKAASQEESAVWLQKLQDQRRYVKDVWTGM', 
'MKETSPIPQPKTFGPLGNLPLIDKDKPTLSLIKLAEEQGPIFQIHTPAGTTIVVSGHELVKEVCDEERFDKSIEGALEKVRAFSGDGLFTSWTHEPNWRKAHNILMPTFSQRAMKDYHEKMVDIAVQLIQKWARLNPNEAVDVPGDMTRLTLDTIGLCGFNYRFNSYYRETPHPFINSMVRALDEAMHQMQRLDVQDKLMVRTKRQFRYDIQTMFSLVDSIIAERRANGDQDEKDLLARMLNVEDPETGEKLDDENIRFQIITFLIAGHETTSGLLSFATYFLLKHPDKLKKAYEEVDRVLTDAAPTYKQVLELTYIRMILNESLRLWPTAPAFSLYPKEDTVIGGKFPITTNDRISVLIPQLHRDRDAWGKDAEEFRPERFEHQDQVPHHAYKPFGNGQRACIGMQFALHEATLVLGMILKYFTLIDHENYELDIKQTLTLKPGDFHISVQSRHQEAIHADVQAAEKAAPDEQKEKTEAKGASVIGLNNRPLLVLYGSDTGTAEGVARELADTASLHGVRTKTAPLNDRIGKLPKEGAVVIVTSSYNGKPPSNAGQFVQWLQEIKPGELEGVHYAVFGCGDHNWASTYQYVPRFIDEQLAEKGATRFSARGEGDVSGDFEGQLDEWKKSMWADAIKAFGLELNENADKERSTLSLQFVRGLGESPLARSYEASHASIAENRELQSADSDRSTRHIEIALPPDVEYQEGDHLGVLPKNSQTNVSRILHRFGLKGTDQVTLSASGRSAGHLPLGRPVSLHDLLSYSVEVQEAATRAQIRELASFTVCPPHRRELEELSAEGVYQEQILKKRISMLDLLEKYEACDMPFERFLELLRPLKPRYYSISSSPRVNPRQASITVGVVRGPAWSGRGEYRGVASNDLAERQAGDDVVMFIRTPESRFQLPKDPETPIIMVGPGTGVAPFRGFLQARDVLKREGKTLGEAHLYFGCRNDRDFIYRDELERFEKDGIVTVHTAFSRKEGMPKTYVQHLMADQADTLISILDRGGRLYVCGDGSKMAPDVEAALQKAYQAVHGTGEQEAQNWLRHLQDTGMYAKDVWAGI', 'MLFEGLDLVSALATLAACLVSVTLLLAVSQQLWQLRWAATRDKSCKLPIPKGSMGFPLIGETGHWLLQGSGFQSSRREKYGNVFKTHLLGRPLIRVTGAENVRKILMGEHHLVSTEWPRSTRMLLGPNTVSNSIGDIHRNKRKVFSKIFSHEALESYLPKIQLVIQDTLRAWSSHPEAINVYQEAQKLTFRMAIRVLLGFSIPEEDLGHLFEVYQQFVDNVFSLPVDLPFSGYRRGIQARQILQKGLEKAIREKLQCTQGKDYLDALDLLIESSKEHGKEMTMQELKDGTLELIFAAYATTASASTSLIMQLLKHPTVLEKLRDELRAHGILHSGGCPCEGTLRLDTLSGLRYLDCVIKEVMRLFTPISGGYRTVLQTFELDGFQIPKGWSVMYSIRDTHDTAPVFKDVNVFDPDRFSQARSEDKDGRFHYLPFGGGVRTCLGKHLAKLFLKVLAVELASTSRFELATRTFPRITLVPVLHPVDGLSVKFFGLDSNQNEILPETEAMLSATV', 
'MEPTILLLLALLVGFLLLLVRGHPKSRGNFPPGPRPLPLLGNLLQLDRGGLLNSFMQLREKYGDVFTVHLGPRPVVMLCGTDTIKEALVGQAEDFSGRGTIAVIEPIFKEYGVIFANGERWKALRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQGAPLDPTFLFQCITANIICSIVFGERFDYTDRQFLRLLELFYRTFSLLSSFSSQVFEFFSGFLKYFPGAHRQISKNLQEILDYIGHIVEKHRATLDPSAPRDFIDTYLLRMEKEKSNHHTEFHHENLMISLLSLFFAGTETSSTTLRYGFLLMLKYPHVAEKVQKEIDQVIGSHRLPTLDDRSKMPYTDAVIHEIQRFSDLVPIGVPHRVTKDTMFRGYLLPKNTEVYPILSSALHDPQYFDHPDSFNPEHFLDANGALKKSEAFMPFSTGKRICLGEGIARNELFLFFTTILQNFSVSSHLAPKDIDLTPKESGIGKIPPTYQICFSAR', 'MAMSPAAPLSVTELLLVSAVFCLVFWAVRASRPKVPKGLKRLPGPWGWPLLGHLLTLGKNPHVALARLSRRYGDVFQIRLGSTPVVVLSGLDTIKQALVRQGDDFKGRPDLYSSSFITEGQSMTFSPDSGPVWAARRRLAQDSLKSFSIASNPASSSSCYLEEHVSQEAENLIGRFQELMAAVGRFDPYSQLVVSAARVIGAMCFGRRFPQGSEEMLDVVRNSSKFVETASSGSPVDFFPILRYLPNRPLQRFKDFNQRFLRFLQKTVREHYEDFDRNSIQDITGALFKHSEKNSKANSGLIPQEKIVNLVNDIFGAGFDTITTALSWSLMYLVTNPRRQRKIQEELDAVVGRARQPRLSDRPQLPYLEAFILELFRHTSFVPFTIPHSTTRDTTLNGFHIPKECCIFINQWQINHDPQLWGDPEEFRPERFLTADGAAINKPLSEKVTLFGLGKRRCIGETLARWEVFLFLAILLQRLEFSVPPGVPVDLTPIYGLTMKHPRCEHVQARPRFSDQ', 'MLLLGLLLLPLLAGARLLWNWWKLRSLHLPPLAPGFLHLLQPDLPIYLLGLTQKFGPIYRLHLGLQDVVVLNSKRTIEEAMVKKWADFAGRPEPLTYKLVSKNYPDLSLGDYSLLWKAHKKLTRSALLLGIRDSMEPVVEQLTQEFCERMRAQPGTPVAIEEEFSLLTCSIICYLTFGDKIKDDNLMPAYYKCIQEVLKTWSHWSIQIVDVIPFLRFFPNPGLRRLKQAIEKRDHIVEMQLRQHKESLVAGQWRDMMDYMLQGVAQPSMEEGSGQLLEGHVHMAAVDLLIGGTETTANTLSWAVVFLLHHPEIQQRLQEELDHELGPGASSSRVPYKDRARLPLLNATIAEVLRLRPVVPLALPHRTTRPSSISGYDIPEGTVIIPNLQGAHLDETVWERPHEFWPDRFLEPGKNSRALAFGCGARVCLGEPLARLELFVVLTRLLQAFTLLPSGDALPSLQPLPHCSVILKMQPFQVRLQPRGMGAHSPGQNQ', 
'MEPSVLLLLALLVGFLLLLARGHPKSRGNFPPGPRPLPLLGNLLQMDRGGLLKSLIQLREKYGDVFTVHLGPRPVVMLCGTDTIREALVGQAEAFSGRGTVAVVEPTFKEYGVIFANGERWKTLRRFSLATMRDFGMGKRSVEERIQEEAQCLVEELRKSQGAPLDPTFLFQCITANVICSIVFGERFEYTDRQFLRLLELFYQTFSLISSFSSQMFELFSGFLKYFPGAHRQISKNLQELLDYIGHSVERHKATLDPSVPRDFIDIYLLRMEKEKSNQNAEFHHQNLMMSVLSLFFVGTETSSTTLHYGFLLMLKYPHVTEKVQKEIDQVIGSHRLPTLDDRTKMPYSDAVIHEIQRFSDLIPIGVPHRVTKDTLFRGYLLPKNTEVYPILSSALHDPQYFEQPDSFNPDQFLDANGALKKSEAFLPFSTGQIFDQKSVGKRICLGESIARSELFLFFTSILQNFSVASHVAPKDIDLTPKESGIGKIPPTYQICFLAR', 'MPSVYGFPAFTSATELLLAVTTFCLGFWVVRVTRTWVPKGLKSPPGPWGLPFIGHVLTLGKNPHLSLTKLSQQYGDVLQIRIGSTPVVVLSGLNTIKQALVKQGDDFKGRPDLYSFTLIANGQSMTFNPDSGPLWAARRRLAQNALKSFSIASDPTLASSCYLEEHVSKEAEYLISKFQKLMAEVGHFDPFKYLVVSVANVICAICFGRRYDHDDQELLSIVNLSNEFGEVTGSGYPADFIPILRYLPNSSLDAFKDLNKKFYSFMKKLIKEHYRTFEKGHIRDITDSLIEHCQDRRLDENANVQLSDDKVITIVFDLFGAGFDTITTAISWSLMYLVTNPRIQRKIQEELDTVIGRDRQPRLSDRPQLPYLEAFILETFRHSSFVPFTIPHSTIRDTSLNGFYIPKGHCVFVNQWQVNHDQELWGDPNEFRPERFLTSSGTLDKHLSEKVILFGLGKRKCIGETIGRLEVFLFLAILLQQMEFNVSPGEKVDMTPAYGLTLKHARCEHFQVQMRSSGPQHLQA', 'MLFPISMSATEFLLASVIFCLVFWVIRASRPQVPKGLKNPPGPWGWPLIGHMLTLGKNPHLALSRMSQQYGDVLQIRIGSTPVVVLSGLDTIRQALVRQGDDFKGRPDLYTFTLISNGQSMSFSPDSGPVWAARRRLAQNGLKSFSIASDPASSTSCYLEEHVSKEAEVLISTLQELMAGPGHFNPYRYVVVSVTNVICAICFGRRYDHNHQELLSLVNLNNNFGEVVGSGNPADFIPILRYLPNPSLNAFKDLNEKFYSFMQKMVKEHYKTFEKGHIRDITDSLIEHCQEKQLDENANVQLSDEKIINIVLDLFGAGFDTVTTAISWSLMYLVMNPRVQRKIQEELDTVIGRSRRPRLSDRSHLPYMEAFILETFRHSSFVPFTIPHSTTRDTSLKGFYIPKGRCVFVNQWQINHDQKLWVNPSEFLPERFLTPDGAIDKVLSEKVIIFGMGKRKCIGETIARWEVFLFLAILLQRVEFSVPLGVKVDMTPIYGLTMKHACCEHFQMQLRS', 
'MDPVLVLVLTLSSLLLLSLWRQSFGRGKLPPGPTPLPIIGNTLQIYMKDIGQSIKKFSKVYGPIFTLYLGMKPFVVLHGYEAVKEALVDLGEEFSGRGSFPVSERVNKGLGVIFSNGMQWKEIRRFSIMTLRTFGMGKRTIEDRIQEEAQCLVEELRKSKGAPFDPTFILGCAPCNVICSIIFQNRFDYKDPTFLNLMHRFNENFRLFSSPWLQVCNTFPAIIDYFPGSHNQVLKNFFYIKNYVLEKVKEHQESLDKDNPRDFIDCFLNKMEQEKHNPQSEFTLESLVATVTDMFGAGTETTSTTLRYGLLLLLKHVDVTAKVQEEIERVIGRNRSPCMKDRSQMPYTDAVVHEIQRYIDLVPTNLPHLVTRDIKFRNYFIPKGTNVIVSLSSILHDDKEFPNPEKFDPGHFLDERGNFKKSDYFMPFSAGKRICAGEALARTELFLFFTTILQNFNLKSLVDVKDIDTTPAISGFGHLPPFYEACFIPVQRADSLSSHL', 'MLASGMLLVALLVCLTVMVLMSVWQQRKSKGKLPPGPTPLPFIGNYLQLNTEQMYNSLMKISERYGPVFTIHLGPRRVVVLCGHDAVREALVDQAEEFSGRGEQATFDWVFKGYGVVFSNGERAKQLRRFSIATLRDFGVGKRGIEERIQEEAGFLIDALRGTGGANIDPTFFLSRTVSNVISSIVFGDRFDYKDKEFLSLLRMMLGIFQFTSTSTGQLYEMFSSVMKHLPGPQQQAFQLLQGLEDFIAKKVEHNQRTLDPNSPRDFIDSFLIRMQEEEKNPNTEFYLKNLVMTTLNLFIGGTETVSTTLRYGFLLLMKHPEVEAKVHEEIDRVIGKNRQPKFEDRAKMPYMEAVIHEIQRFGDVIPMSLARRVKKDTKFRDFFLPKGTEVFPMLGSVLRDPSFFSNPQDFNPQHFLNEKGQFKKSDAFVPFSIGKRNCFGEGLARMELFLFFTTVMQNFRLKSSQSPKDIDVSPKHVGFATIPRNYTMSFLPR', 'MGFSVFSPTRSLDGVSGFFQGAFLLSLFLVLFKAVQFYLRRQWLLKALEKFPSTPSHWLWGHNLKDREFQQVLTWVEKFPGACLQWLSGSTARVLLYDPDYVKVVLGRSDPKPYQSLAPWIGYGLLLLNGKKWFQHRRMLTPAFHYDILKPYVKIMADSVSIMLDKWEKLDDQDHPLEIFHYVSLMTLDTVMKCAFSHQGSVQLDVNSRSYTKAVEDLNNLIFFRVRSAFYGNSIIYNMSSDGRLSRRACQIAHEHTDGVIKTRKAQLQNEEELQKARKKRHLDFLDILLFAKMEDGKSLSDEDLRAEVDTFMFEGHDTTASGISWVFYALATHPEHQERCREEVQSILGDGTSVTWDHLDQMPYTTMCIKEALRLYSPVPSVSRELSSPVTFPDGRSIPKGIRVTILIYGLHHNPSYWPNPKVFDPSRFSPDSPRHSHAYLPFSGGARNCIGKQFAMNELKVAVALTLLRFELLPDPTRIPVPMPRLVLKSKNGIHLRLKKLR', 
'MVLNFLSPSLSRLGLWASVVILMVIVLKLFSLLLRRQKLARAMDSFPGPPTHWLFGHALEIQKLGSLDKVVSWAQQFPHAHPLWFGQFVGFLNIYEPDYAKAVYSRGDPKAADVYDFFLQWIGKGLLVLDGPKWFQHRKLLTPGFHYDVLKPYVAIFAESTRMMLDKWEKKASENKSFDIFCDVGHMALDTLMKCTFGKGDSGLGHRDNSYYLAVSDLTLLMQQRIDSFQYHNDFIYWLTPHGRRFLRACKIAHDHTDEVIRQRKAALQDEKERKKIQQRRHLDFLDILLGVRDESGIKLSDAELRAEVDTFMFEGHDTTTSGISWFLYCMALYPEHQQLCREEVRGILGDQDSFQWDDLAKMTYLTMCMKECFRLYPPVPQVYRQLNKPVTFVDGRSLPAGSLISLHIYALHRNSTVWPDPEVFDPLRFSPENAAGRHPFAFMPFSAGPRNCIGQQFAMNEMKVVTALCLLRFEFSLDPSKMPIKVPQLILRSKNGIHLYLKPLASRSGK', 'MGLPALLASALCTFVLPLLLFLAAIKLWDLYCVSGRDRSCALPLPPGTMGFPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGADNVRRILLGEHRLVSVHWPASVRTILGSGCLSNLHDSSHKQRKKVIMRAFSREALECYVPVITEEVGSSLEQWLSCGERGLLVYPEVKRLMFRIAMRILLGCEPQLAGDGDSEQQLVEAFEEMTRNLFSLPIDVPFSGLYRGMKARNLIHARIEQNIRAKICGLRASEAGQGCKDALQLLIEHSWERGERLDMQALKQSSTELLFGGHETTASAATSLITYLGLYPHVLQKVREELKSKGLLCKSNQDNKLDMEILEQLKYIGCVIKETLRLNPPVPGGFRVALKTFELNGYQIPKGWNVIYSICDTHDVAEIFTNKEEFNPDRFMLPHPEDASRFSFIPFGGGLRSCVGKEFAKILLKIFTVELARHCDWQLLNGPPTMKTSPTVYPVDNLPARFTHFHGEI', 'MEFSLLLLLAFLAGLLLLLFRGHPKAHGRLPPGPSPLPVLGNLLQMDRKGLLRSFLRLREKYGDVFTVYLGSRPVVVLCGTDAIREALVDQAEAFSGRGKIAVVDPIFQGYGVIFANGERWRALRRFSLATMRDFGMGKRSVEERIQEEARCLVEELRKSKGALLDNTLLFHSITSNIICSIVFGKRFDYKDPVFLRLLDLFFQSFSLISSFSSQVFELFPGFLKHFPGTHRQIYRNLQEINTFIGQSVEKHRATLDPSNPRDFIDVYLLRMEKDKSDPSSEFHHQNLILTVLSLFFAGTETTSTTLRYGFLLMLKYPHVTERVQKEIEQVIGSHRPPALDDRAKMPYTDAVIHEIQRLGDLIPFGVPHTVTKDTQFRGYVIPKNTEVFPVLSSALHDPRYFETPNTFNPGHFLDANGALKRNEGFMPFSLGKRICLGEGIARTELFLFFTTILQNFSIASPVPPEDIDLTPRESGVGNVPPSYQIRFLAR', 
'MTQTLKYASRVFHRVRWAPELGASLGYREYHSARRSLADIPGPSTPSFLAELFCKGGLSRLHELQVQGAAHFGPVWLASFGTVRTVYVAAPALVEELLRQEGPRPERCSFSPWTEHRRCRQRACGLLTAEGEEWQRLRSLLAPLLLRPQAAARYAGTLNNVVCDLVRRLRRQRGRGTGPPALVRDVAGEFYKFGLEGIAAVLLGSRLGCLEAQVPPDTETFIRAVGSVFVSTLLTMAMPHWLRHLVPGPWGRLCRDWDQMFAFAQRHVERREAEAAMRNGGQPEKDLESGAHLTHFLFREELPAQSILGNVTELLLAGVDTVSNTLSWALYELSRHPEVQTALHSEITAALSPGSSAYPSATVLSQLPLLKAVVKEVLRLYPVVPGNSRVPDKDIHVGDYIIPKNTLVTLCHYATSRDPAQFPEPNSFRPARWLGEGPTPHPFASLPFGFGKRSCMGRRLAELELQMALAQILTHFEVQPEPGAAPVRPKTRTVLVPERSINLQFLDR', 'MALSQSVPFSATELLLASAIFCLVFWVLKGLRPRVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALSRMSQRYGDVLQIRIGSTPVLVLSRLDTIRQALVRQGDDFKGRPDLYTSTLITDGQSLTFSTDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAKALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNTHEFVETASSGNPLDFFPILRYLPNPALQRFKAFNQRFLWFLQKTVQEHYQDFDKNSVRDITGALFKHSKKGPRASGNLIPQEKIVNLVNDIFGAGFDTVTTAISWSLMYLVTKPEIQRKIQKELDTVIGRERRPRLSDRPQLPYLEAFILETFRHSSFLPFTIPHSTTRDTTLNGFYIPKKCCVFVNQWQVNHDPELWEDPSEFRPERFLTADGTAINKPLSEKMMLFGMGKRRCIGEVLAKWEIFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHVQARLRFSIN', 'MTQAVKLASRVFHRIHLPLQLDASLGSRGSESVLRSLSDIPGPSTLSFLAELFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPTLVEQLLRQESHCPERCSFSSWAEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAAAGYAGTLDNVVRDLVRRLRRQRGRGSGLPGLVLDVAGEFYKFGLESIGAVLLGSRLGCLEAEVPPDTETFIHAVGSVFVSTLLTMAMPNWLHHLIPGPWARLCRDWDQMFAFAQRHVELREGEAAMRNQGKPEEDMPSGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNTLSWTLYELSRHPDVQTALHSEITAGTRGSCAHPHGTALSQLPLLKAVIKEVLRLYPVVPGNSRVPDRDIRVGNYVIPQDTLVSLCHYATSRDPTQFPDPNSFNPARWLGEGPTPHPFASLPFGFGKRSCIGRRLAELELQMALSQILTHFEVLPEPGALPIKPMTRTVLVPERSINLQFVDR', 
'MTQAVKLASRVFHRVQLPSQLGSDSVLRSLSDIPGPSTPSFLAELFCKGGLSRLHELQVHGAARYGPIWSGSFGTLRTVYVADPALVEQLLRQESHCPERCSFSSWSEHRRRHQRACGLLTADGEEWQRLRSLLAPLLLRPQAAAGYAGTLDSVVSDLVRRLRRQRGRGSGLPDLVLDVAGEFYKFGLEGIGAVLLGSRLGCLEAEVPPDTETFIEAVGSVFVSTLLTMAMPSWLHRLIPGPWARLCRDWDQMFAFAQKHVEQREGEAAVRNQGKPEEDLPTGHHLTHFLFREKVSVQSIVGNVTELLLAGVDTVSNTLSWALYELSRHPEVQSALHSEITGAVNPGSYAHLQATALSQLPLLKAVIKEVLRLYPVVPGNSRVPDRDICVGNYVIPQDTLVSLCHYATSRDPAQFREPNSFNPARWLGEGPAPHPFASLPFGFGKRSCIGRRLAELELQMALAQILTHFEVLPEPGALPVKPMTRTVLVPERSIHLQFVDR', 'MDPVVVLLLSLFFLLFLSLWRPSSGRGKLPPGPTPLPIIGNFFQVDMKDIRQSLTNFSKTYGPVYTLYVGSQPTVVLHGYEALKEALVDHGEEFSGRGRLPICEKVAKGQGIAFSHGNVWKATRHFTVKTLRNLGMGKGTIEDKVQEEAKWLVKELKKTNGSPCDPQFIMGCAPGNVICSIILQNRFDYEDKDFLNLIEKVNEAVKIISSPGIQVFNIFPILLDYCPGNHNIYFKNHTWLKSYLLEKIKEHEESLDVSNPRDFIDYFLIERNQENANQWMNYTLEHLAIMVTDLFFAGIETVSSTMRFALLLLMKYPHVTAKVQEEIDHVIGRHRSPCMQDRSHMPYTNAMVHEVQRYIDIGPNGLLHEVTCDTKFRNYFIPKGTAVLTSLTSVLHDSKEFPNPEMFDPGHFLDENGNFKKSDYFIPFSAGKRMCLGESLARMELFLFLTTILQNFKLKSLVDPKDINTTPICSSLSSVPPTFQMRFIPL', 'MGSFEIPEKHTALLQGEGGSLVIARDVPLPTLGPGHLLVKTAAVALNPCDFKTPAAFPNPGYYNGCDFAGTVVALGSDNIRDGGPWKIGDRIFGAIHGANPSDWDSGSHAEYVKAVSVFSYRIPDWMTFEEAAGLSPCCIATMGVSLFKALELPGTFEEPATKPLDVLIYGGSSSVGSLGIQMVKLTGHRLGHRCITTCSPKNFDLVKSYGADEVFDYKSPTCAQDIRKATRNCLKYAVDPFGEVKTMAICTEAIGRAGGRYSALEKFQEDVCDRKTVKRELTMGAIIIGHGLDLGGRYTRPHSPEMRAWGIEWYKSIQRLVDARKFKPHPIRVLKGGFEDMLEGLAMLKRREISAEKLVVSLDPAVSGLTADSTAR', 
'MPVDPFVHLHVHTEYSMLDGAAKVGALFAEAQRLEMPAVGMTDHGNMFGADEFYQQAKKTGIKPIIGIEAYLAPGSRFHKKPVFWGEAKQRGTDEYGEGGDVSGAGAYTHMTMLARNATGLRNLFTLSSRASMEGQYRKPRMDRELVAEYAEGIIATTGCPSGEVQTRLRLRQFDEAMQAASDYKDIFGAENFFLELMDHGLPIERSVREGLLKIAKELGLKPVATNDSHYVTADQAESHGALLCVQSGKTLNDESRFKFDGDGYYLKSAAEMREYWDKEVPGAADNTLLIAERVESYEDVWSFQDRMPRVTDGSGKTERQLLEAEVEQYLPTRYPEGATQECLDRIKVELDVLDTKGYCAYFLVVGDLTRWAKSQGIHVGPGRGSAAGSLLAYILHITNLDPLEHGLIFERFLNPERDSPPDIDLDFDDRRRDEVLQYAIDKYGRDKVAQVITFGKIKTKAAIKDSARVHHGQPGFAIADKISKALPPPIAAKDIPLSGIVDPQHERYAEAAEVRNLIETDPSVSQIFDTARGLEGLIRNAGVHACAVILSSQPLMGTVPLWARDDGSIITGWDYPSCEAIGLLKMDFLGLSNLTILGDALKMVKANHDREIDLSNLGLDDAKTYELLARGESLGVFQLEGGGMRELLKRMQPTEFADIVACNALYRPGPMEVNAHNDYADRKNGKKPVEPIHPDLDEPLKDILSETYGLIVYQEQIMAIAQKVAGYSLGRADILRRAMGKKKKEVLDQEFEGFQAGMREQGFRDEAIDKLWATVLPFAGYAFNKSHAAGYALVAYWTAYLKANYPAEYMAALLTSNGDNKDKMAVYLAECRRMGVKVLSPDVNDSLNDFTAVGSDIRFGLSAVRNVGSNVVASIAKVREDKGRYSSFTDFLDKSETVACNKRVIESLIKAGAFDSLGHTRMSLAQHHEAAVDAVIGLKRQQALGQFDLFGGGDDAGGEESSSPLAHLQFTPDEWPRKQMLSYEREMLGLYVSAHPLDGAERLLAPYQDTGIAELVGGEREAGKDQVKIAGMISGIQRRINKNGHPWAIVTLEDLDASVEVLFFPKSYEMFADCLVEDTAIAVKGRINEREGTISIFASDAVPVDISAAETDPGTSPAFVIKVPASRVDRSLVAELKRTLQAHSGTVPVHVKLQGPRGVTRLALSSDYFVSTENGLQGELKGLLGAGCFETVL', 'SFRLSLSASTYAQRGSFTTPEHDFTLFPHRNHSVTSESRIPSEQTLKSLTDIPGNWRKNWLNVYYFWRSNGLNNAHQWMLDNFNKYGPIYREKIAYYESINIINPADAVIMNKSEGPFPKRIEMAPWVAYRDLRKENYGVQLLNGENWKRTRLILNNSIFAQSSIQRLVPLFNEVVLDFVSMVHKEVEKSRSDYWKTDLTNDLFKLALEVICYILYGERLDLLQRKYNKAPQKFIDSIATMFHSTPIMLYVPPSLLKSINSKIWQQHVGSWDNIFEHADTYLKKAYRQFQQGSKNEHAFPGVLTELLLQGALPFEDIRASIIDVMSGAIDTTSTTVHWMMYELAKHPHIQKNVRSEIMEAHQKTEGDPVKMLKSVPLLKCVVKETLRLYPVAISIQRYLNEDTVLQNYHIPAGTLVQLGLYAMGRNPKIFKNPEQYNPERWLKGEDTHFRHLGFGFGPRQCIGRRIAETQMVLLMIHMLQNFKIETDPMTEVKSKFSLILIPDKPINLKFTPIK', 
'MLVRGLPLRSVLVKGCQPLLSAPREGPGHPRVPTGEGAGMSSHSPRPFKEIPSPGDNGWINLYHFWREKGPKKLHYHHFQNFQKYGPIYREKLGNVESVYIVDPEDVALLFKFEGPHPERFLIPPWTAYHQYFQKPVGVLFKSSDAWKKDRLALNPEVMALESIKNFIPLLDPVSQDFVSLLHRRMEQQGSGKFSGPIIEDLFRFAFESITNVIFGERQGMLDEIVDPEAQRFIDAVYKMFHTSVPMLSLPPDLFRLFRTKTWRDHVAAWDTVFSKAEQYTEKFYQDLKQKRHFDSYPGIFYRLLASNKLPFKDIQANVTEMLAGGVDTTSMSLQWHLYEIARNLRVQEMLREEVLAARRQAQGDTSTMVQMVPLLKASIKETLRLHPIAVTLQRYPQNDLVIRDYMIPAKTLVQVSIYTMGQDPTFFSNPRRFDPTRWLDKNKDLTHFRNLGFGWGVRQCLGRRIAELEMTLFLIHILENFRVEIQHLNDVDSTFGLILIPEKPISFTFWPITRAPPQA', 'RGLPSRSVFLRGCQASLSTAQERLGHPGVPTREGVRVATRSPRPYHEIPSPGDNGWLNLYHLAEEKGTHRVHYRHVQNFQKYGPIYRENLGNVESVYIMDPEDVALLFNSEGPQPERFLIPPWVAYHEYYRRPVGVLLKKAQGWKRDRVALNQEVMAPDAIKNFVPLLEAVSQAFVRMLHGRVQQGVFSGDISDDLFRFAFESMTNIMFGERLGMLEETVDPEAHEFIDAVYQMFHTSVPMLSLPPSLFRLFRTRTWRDHVAAWDVIFTNADKYTQSFYWDLRQKQDLGGSYRGILYSLLGTSKLSFEDIKANVTEMLAGSVDTTSMTLQWHLYEMGAALGMQEMLRAEVLAARRQAQGDMTAMLQSVPLLKASIKETLRLHPISVTLQRYLVNDLVLQDYMIPAKTLVQVANYGMGREPSFFANPEKFDPPRWLDKDKNATHFR', 'MLAKGLCLRSVLVKSCQPFLSPVWQGPGLATGNGAGISSTNSPRSFNEIPSPGDNGWINLYHFLRENGTHRIHYHHMQNFQKYGPIYREKLGNMESVYILDPKDAATLFSCEGPNPERYLVPPWVAYHQYYQRPIGVLFKSSDAWRKDRIVLNQEVMAPDSIKNFVPLLEGVAQDFIKVLHRRIKQQNSGKFSGDISDDLFRFAFESITSVVFGERLGMLEEIVDPESQRFIDAVYQMFHTSVPMLNMPPDLFRLFRTKTWKDHAAAWDVIFSKADEYTQNFYWDLRQKRDFSKYPGVLYSLLGGNKLPFKNIQANITEMLAGGVDTTSMTLQWNLYEMAHNLKVQEMLRAEVLAARRQAQGDMAKMVQLVPLLKASIKETLRLHPISVTLQRYIVNDLVLRNYKIPAKTLVQVASYAMGRESSFFPNPNKFDPTRWLEKSQNTTHFRYLGFGWGVRQCLGRRIAELEMTIFLINVLENFRIEVQSIRDVGTKFNLILMPEKPIFFNFQPLKQDLGSTMPRKGDTV', 
'MVSDFGLPTFISATELLLASAVFCLVFWVAGASKPRVPKGLKRLPGPWGWPLLGHVLTLGKNPHVALARLSRRYGDVFQIRLGSTPVVVLSGLDTIKQALVRQGDDFKGRPDLYSFSFVTKGQSMIFGSDSGPVWAARRRLAQNALNSFSVASDPASSSSCYLEEHVSQEAENLISKFQELMAAVGHFDPYRYVVMSVANVICAMCFGRRYDHDDQELLSLVNLNDEFGKVAASGSPADFFLILRYLPNPALDTFKDLNERFYSFTQERVKEHCRSFEKGHIRDITDSLIKHYRVDRLDENANVQVSDEKTVGIVLDLFGAGFDTVTTAISWSLMYLVTKPRIQRKIQEELDAVVGRARRPRFSDRPQLPYLEAVIMETFRHTSFLPFTIPHSTTRDTSLGGFYIPKGRCVFVNQWQNNHDPELWGDPEAFRPERFLTPSGAVDKALTEKVLLFGLGKRKCIGETIGRLEVFLFLATLLQQVEFSVSPGTTVDMTPIYGLTMKHARCEHFQAKLRFEA', 'MSSKVITSLMAESILLSKVGQVISGYSPITVFLLGSILIFLVVYNKRRSRLVKYIEKIPGPAAMPFLGNAIEMNVDHDELFNRVIGMQKLWGTRIGINRVWQGTAPRVLLFEPETVEPILNSQKFVNKSHDYDYLHPWLGEGLLTSTDRKWHSRRKILTPAFHFKILDDFIDVFNEQSAVLARKLAVEVGSEAFNLFPYVTLCTLDIVCETAMGRRIYAQSNSESEYVKAVYGIGSIVQSRQAKIWLQSDFIFSLTAEYKLHQSYINTLHGFSNMVIRERKAELAILQENNNNNNNNAPDAYDDVGKKKRLAFLDLLIDASKEGTVLSNEDIREEVDTFMFEGHDTTSAAISWTLFLLGCHPEYQERVVEELDSIFGDDKETPATMKNLMDMRYLECCIKDSLRLFPSVPMMARMVGEDVNIGGKIVPAGTQAIIMTYALHRNPRVFPKPEQFNPDNFLPENCAGRHPFAYIPFSAGPRNCIGQKFAILEEKAVISTVLRKYKIEAVDRREDLTLLGELILRPKDGLRVKITPRD', 'MTAVKEVPRVSGGEEEHGHLEEFRTDPIGLMKRVREECGDVGWFQLADKQVILLSGAEANEFFFRSSDSELNQAEAYPFMTPIFGEGVVFDADPERRAEMLHNTALRGEHMKGHATTIEAEVRKMIEGWGESGEIDLLEFFAELTIYTSTACLIGLKFRNQLDSRFANYYHLLERGTDPLCYVDPYLPIESFRIRDEARAGLVELVQDVMHGRIANPPKDKSDRDMLDVLVSIKDEDGNPRFTANEITGMFISLMFAGHHTSSGTSSWTLIELLRHPEFYAKVQQELDDLYADGQEVSFHALRQIPSLDNALKETLRLHPPLIILMRVAQDEFEVAGYPIHKGQMVAASPAISNRIPEDFPNPDDFDPDRYEKPRQEDLINRWTWIPFGAGKHRCVGAAFAQMQIKAIFSVLLREYEFEMAQPPESYQNDHSKMVVQLARPAKVRYRRRVRD', 
'MIAVFSLIAAALAVGSLVLLPVVLRGGCLLVVTIVWLWQILHFWHWRRLGVPFVPAAPFVGNVWNLLRGACCFGDQFRELYESKEAAGRAFVGIDVLHNHALLLRDPALIKRIMVEDFAQFSSRFETTDPTCDTMGSQNLFFSKYETWRETHKIFAPFFAAGKVRNMYGLLENIGQKLEEHMEQKLSGRDSMELEVKQLCALFTTDIIASLAFGIEAHSLQNPEAEFRRMCIEVNDPRPKRLLHLFTMFFFPRLSHRVGTHLYSEEYERFMRKSMDYVLSQRAESGENRHDLIDIFLQLKRTEPAESIIHRPDFFAAQAAFLLLAGFDTSSSTITFALYELAKNTTIQDRLRTELRAALQSSQDRQLSCDTVTGLVYLRQVVDEVLRLYPPTAFLDRCCNSRTGYDLSPWNGGSPFKLRAGTPVYISVLGIHRDAQYWPNPEVFDPERFSAEQRQQHHPMTYLPFGAGPRGCIGTLLGQLEIKVGLLHILNHFRVEVCERTLPEMRFDPKAFVLTAHNGTYLRFVKNSL', 'MLLIWLLLLTIVTLNFWLRHKYDYFRSRGIPHLPPSSWSPMGNLGQLLFLRISFGDLFRQLYADPRNGQAKIVGFFIFQTPALMVRDPELIRQVLIKNFNNFLNRFESADAGDPMGALTLPLAKYHHWKESRQCMSQLFTSGRMRDVMYSQMLDVASDLEQYLNRKLGDRLERVLPLGRMCQLYTTDVTGNLFYSLNVGGLRRGRSELITKTKELFNTNPRKVLDFMSVFFLPKWTGVLKPKVFTEDYARYMRHLVDDHHEPTKGDLINQLQHFQLSRSSNHYSQHPDFVASQAGIILLAGFETSSALMGFTLYELAKAPDIQERLRSELREAFISTATLSYDTLMTLPYLKMVCLEALRLYPAAAFVNRECTSSASEGFSLQPHVDFIVPPGMPAYISILGLHRDERFWPEPCVFDPERFGPERSRHIHPMTYIPFGAGPHGCIGSRLGVLQLKLGIVHILKQYWVETCERTVSEIRFNPKSFMLESENEIYLRFCRSSL', 'MFLVIGAILAGALFVGLLLYQLKFKRLIDLISYMPGPPVLPLVGHGHHFIGKPPHEMVKKIFEFMETYSKDQVLKVWLGPELNVLMGNPKDVEVVLGTLRFNDKAGEYKALEPWLKEGLLVSRGRKWHKRRKIITPAFHFKILDQFVDVFEKGSRDLLRNMEQDRLKHGDSGFSLYDWINLCTMDTICETAMGVSINAQSNADSEYVQAVKTISMVLHKRMFNILYRFDLTYMLTPLARAEKKALNVLHQFTEKIIVQRREELIREGSSQESSKDDADVGAKRKMAFLDILLQSTVDERPLSNLDIREEVDTFMFEGHDTTSSALMFFFYNIATHPEAQKKCFEEIRSVVGNDKSTPVSYELLNQLHYVDLCVKETLRMYPSVPLLGRKVLEDCEINGKLIPAGTNIGISPLYLGRREELFSEPNSFKPERFDVVTTAEKLNPYAYIPFSAGPRNCIGQKFAMLEIKAIVANVLRHYEVDFVGDSSEPPVLIAELILRTKDPLMFKVRERVY', 
'MGFSALVASALCTFLLPLLLFLAAVRLWDLYCASGRDPSCPLPLPPGTMGLPFFGETLQMVLQRRKFLQMKRRKYGFIYKTHLFGRPTVRVMGAENVRHILLGEHRLVSVQWPASVRTILGSGCLSNLHNGQHKHRKKVIMQAFSRDALQHYVPVIQEEVSACLAQWLGAGPCLLVYPEVKRLMFRIAMRILLGFQPRQASPDGEQQLVEAFEEMIRNLFSLPIDVPFSGLYRGLRARNIIHAKIEENIRAKMARKEPEGGYKDALQLLMEHTQGNGEQLNMQELKESATELLFGGHETTASAATSLIAFLGLHHDVLQKVRKELQLKGLLSGPNQEKQLNMEFLEQLKYTGCVIKETLRLSPPVPGGFRIALKTLELNGYQIPKGWNVIYSICDTHDVADLFTDKDEFNPDRFMSPSPEDSSRFSFIPFGGGLRSCVGKEFAKVLLKIFTVELARSCDWQLLNGPPTMKTGPIVYPVDNLPAKFIGFSGQI', 'MQLMLRLNPKTFIKVGREYVLKFGHLQRVWIFNRLLIMSGDAELNEQLLSSQEHLVKHPVYKVLGQWLGNGLLLSDGKVWHQRRKIITPTFHFSILEQFVEVFDQQSNICVQRLAQKANGNTFDVYRSICAAALDIIAETAMGTKIYAQANESTPYAEAVNECTALLSWRFMSVYLQVELLFTLTHPHLKWRQTQLIRTMQEFTIKVIEKRRQALEDQQSKLMDTADEDVGSKRRMALLDVLLMSTVDGRPLTNDEIREEVDTFMFEGHDTTTSALSFCLHELSRHPEVQAKMLEEIVQVLGTDRSRPVSIRDLGELKYMECVIKESLRMYPPVPIVGRKLQTDFKYTHSVHGDGVIPAGSEIIIGIFGVHRQPETFPNPDEFIPERHENGSRVAPFKMIPFSAGPRNCIGQKFAQLEMKMMLAKIVREYELLPMGQRVECIVNIVLRSETGFQLGMRKRKHN', 'MDLYTLLTSALCTLALPLLLLLTAAKLWEVYCLRRKDAACANPLPPGTMGLPFFGETLQMVLQRRRFLQVKRSQYGRIYKTHLFGSPTVRVTGAENVRQILMGEHKLVSVHWPASVRTILGAGCLSNLHDNEHKYTKKVIAQAFSREALANYVPQMEEEVRCSVNLWLQSGPCVLVYPAIKRMMFRIAMRLLLGCDPQRMDREQEETLLEAFEEMSRNLFSLPIDVPFSGLYRGLRARNLIHAQIEENIKEKLQREPDEHCKDALQLLIDYSRRNGEPINLQALKESATELLFGGHGTTASAATSLTSFLALHKDVLEKVRKELETQGLLSTKPEEKKELSIEVLQQLKYTSCVIKETLRLSPPVAGGFRVALKTFVLNGYQIPKGWNVIYSIADTHGEADLFPDTDKFNPDRFLTPLPRDSSRFGFIPFGGGVRCCIGKEFAKILLKVFVVELCRNCDWELLNGSPAMTTSPIICPVDNLPAKFKPFSSSI', 'MMTTSLIWGIAIAACCCLWLILGIRRRQTGEPPLENGLIPYLGCALQFGANPLEFLRANQRKHGHVFTCKLMGKYVHFITNPLSYHKVLCHGKYFDWKKFHFATSAKAFGHRSIDPMDGNTTENINDTFIKTLQGHALNSLTESMMENLQRIMRPPVSSNSKTAAWVTEGMYSFCYRVMFEAGYLTIFGRDLTRRDTQKAHILNNLDNFKQFDKVFPALVAGLPIHMFRTAHNAREKLAESLRHENLQKRESISELISLRMFLNDTLSTFDDLEKAKTHLVVLWASQANTIPATFWSLFQMIRNPEAMKAATEEVKRTLENAGQKVSLEGNPICLSQAELNDLPVLDSIIKESLRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQLMHLDPEIYPDPLTFKYDRYLDENGKTKTTFYCNGLKLKYYYMPFGSGATICPGRLFAIHEIKQFLILMLSYFELELIEGQAKCPPLDQSRAGLGILPPLNDIEFKYKFKHL', 
'MALSQFVPFSATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALSRMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGDDFKGRPDLYSFTFITDGQSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSGNPVDFFPILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDKNSVQDITGALFKHSKKGPRASGNLIPQEKIVNLVNDIFGAEFDTIATAISWSLMYLVTKPEIQRKIQKELDAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPHSTTRDTTLNGFYIPRECCVFINQWQVNHDPQLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHFQARLRFSFQ', 'MMSISLIWGIAVVVSCCIWFIIGIRRRKVGEPPLDNGLIPYLGCALKFGSNPLEFLRAKQRKHGHVFTCKLMGKYVHFITNSLSYHKVLCHGKYFDWKKFHYTTSAKAFGHRSIDPSDGNTTENINKTFNKTLQGDALCSLSEAMMQNLQSVMRPPGLPKSKSAVWVTEGMYAFCYRVMFEAGYLTLFGKDISKTDSQRAFIQNNLDSFKQFDQVFPALVAGVPIHLFKTAHKARERLAESLKHKNLYMRDQVSELIRLRMFLNDTLSTFDDMEKAKTHLVILWASQANTIPATFWSLFQMIRSPEAMKAASEEVNGALQSAGQELSSGGNAIYLDQEQLNNLPVLDSIIKEALRLSSASLNIRTAKEDFTLHLEDGSYNIRKDDIIALYPQLMHLDPEIYPDPLTFKYDRYLDESGKAKTTFYRNGNKLKYFYMPFGSGATICPGRLFAVQEIKQFLILMLSYFELELVESHTKCPPLDQSRAGLGILPPLNDIEFKYKLKH', 'MYQLFCFLAGIIVVYKAAQYYKRRTLVTKFHCKPARISPNKSWLEYLGIASVVHADEMIRKGGLYSEIDGRFKSLDVSTFKSITLGKTTYVTKDIENIRHILSATEMNSWNLGARPIALRPFIGDGIFASEGQSWKHSRIMLRPVFAKEHVKQITSMEPYVQLLIKIIKNHEGEPLEFQTLAHLFTIDYSTDFLLGESCDSLKDFLGEESNSTLDTSLRLAFASQFNKTQQQMTIRFMLGKLAFLMYPKSFQYSIQMQKDFVDVYIDRVVGMSEEELNNHPKSYVLLYQLARQTKNRDILQDELMSILLAGRDTTASLLTFLFFELSHHPEVFNKLKEEIERHFPDVESVTFGTIQRCDYLQWCINETMRLHPSVPFNFRTAANDTVIPRGGGKSCTDPILVHKGEQVLFSFYSVNREEKYFGTNTDKFAPERWSESLRRTEFIPFSAGPRACLGQQLPRVEASYVTIRLLQTFHGLHNASKQYPPNRVVAATMRLTDGCNVCFI', 
'MDLMHRTLLTALGALSVVYALVKFSLGYWKRRGILHEKPKFLWGNIKGVVSGKRHAQDALQDIYTAYKGRAPFVGFYACLKPFILALDLKLVHQIIFTDAGHFTSRGLYSNPSGEPLSHNLLQLDGHKWRSLHAKSAEVFTPANMQKLLVRLSQISSRIQRDLGEKSLQTINISELVGAYNTDVMASMAFGLVGQDNVEFAKWTRNYWADFRMWQAYLALEFPLIARLLQYKSYAEPATAYFQKVALSQLQLHRRRDRQPLQTFLQLYSNAEKPLTDIEIAGQAFGFVLAGLGPLNATLAFCLYELARQPEVQDRTRLEINKALEEHGGQVTPECLRELRYTKQVLNETLRLHTPHPFLLRRATKEFEVPGSVFVIAKGNNVLIPTAAIHMDPGIYENPQRFYPERFEEQARRSRPAAAFLPFGDGLRGCIAARFAEQQLLVGLVALLRQHRYAPSAETSIPVEYDNRRLLLMPKSDIKLSVERVDKL', 'MDLVVVLGLCLSCLLLPSLWKQSHGGGKLPPGPTPFPILGNVLQLDFKDLSKSLTNLSKVYGPVFTVYLGMKPTVVVHGYEAVKEALVDLGHELSGRSRFLVTAKLNKGFGVIFSNGKRWTETRRFSLMTLRNFGMGKRSIEERVQEEAHCLVEELRKTNASPCDPTFILGAAPCNVICSVIFQNRFDYTDQDFLSLMGKFNENFKILNSPWVQFCNCFPILFDYFPGSHRKAVKNIFYVKNYITEQIKEHQKSLDINNPRDFIDCFLIKMEQEKCNQQSEFTIENLLTTVSDVFMAGTETTSTTLRYGLLLLMKHPEVIAKVQEEIERVIGRHRSPCMQDRSRMPYTDATVHEIQRYINLIPNNVPHTTICNLKFRNYLIPKGTDVLTSLSSVLHDDKEFPNPDRFDPGHFLDASGNFRKSDYFMPFSTGKRVCVGEALARMELFLFLTAILQNFTPKPLVNPNNVDENPFSSGIVRVPPLYRVSFIPV', 'MAVLGITIALLVWVATLLVISIWKQIYNSWNLPPGPFPLPILGNIFQLDLKDIPKSFTKLAKRFGPVFTLHLGSRRIVVLHGYKAVKEVLLNHKNEFSGRGDIPVFQEYKNKGIIFNNGPTWKDVRRFSLSILRDWGMGKQGNEARIQREAQFLVEELKKTKGQPFDPTFLIGCAPCNVIADILFNKRFDYNDKKCLRLMSLFNENFYLLSTPWIQLYNNFADYLRYLPGSHRKIMKNVSEIKQYTLEKAKEHLQSLDINCARDVTDCLLIEMEKEKHSQEPMYTMENVSVTLADLFFAGTETTSTTLRYGLLILMKYPEIEEKLHEEIDRVIGPSRVPAVRDRLDMPYMDAVVHEIQRFINLVPSNLPHEATRDTVFQGYVIPKGTVVIPTLDSLLYDSHEFPDPEKFKPEHFLNENGKFKYSDYFKAFSAGKRVCVGEGLARMELFLLLSAILQHFNLKSLVDPKDIDLSPVTVGFGSIPPQFKLCVIPRS', 'MDPVVVLVLGLCCLLLLSIWKQNSGRGKLPPGPTPFPIIGNILQIDAKDISKSLTKFSECYGPVFTVYLGMKPTVVLHGYEAVKEALVDLGEEFAGRGSVPILEKVSKGLGIAFSNAKTWKEMRRFSLMTLRNFGMGKRSIEDRIQEEARCLVEELRKTNASPCDPTFILGCAPCNVICSVIFHNRFDYKDEEFLKLMESLNENVRILSSPWLQVYNNFPALLDYFPGIHKTLLKNADYIKNFIMEKVKEHQKLLDVNNPRDFIDCFLIKMEQENNLEFTLESLVIAVSDLFGAGTETTSTTLRYSLLLLLKHPEVAARVQEEIERVIGRHRSPCMQDRSRMPYTDAVIHEIQRFIDLLPTNLPHAVTRDVRFRNYFIPKGTDIITSLTSVLHDEKAFPNPKVFDPGHFLDESGNFKKSDYFMPFSAGKRMCVGEGLARMELFLFLTSILQNFKLQSLVEPKDLDITAVVNGFVSVPPSYQLCFIPI', 
'MVASSSSATASLLDQLFALTPLADSSAWIKTITVLVLLPLLAVVLNVASQLLLATPKNHPPVVFHFVPVIGSAIYYGIDPYKFFFECREKYGDVFTFVLLGRKITVALGPKGSNLVFNAKHQQVTAEDAYTHLTTPVFGKEVVYDVPNAVFMEQKKFVKVGLSIENFRVYVPQIVDEVREYIKSDARFSALKTRKTITVDIFQAMSELIILTASRTLQGKEVRQGLDKSFAQLYHDLDSGFTPINFVIPNLPLPSNFKRDRAQKKMSQFYQDIVAKRRAAGASTSADDASGENDMIAALIEQKYKNGRALSGVEIAHMMIALLMAGQHTSSATSSWAFLRLASRPEIIEELYEEQLNVYSDGHGGLRELDYETQKTSVPLLDAVVKETLRLHPPLHSIMRYVKSDLAVPPTLSSPTSTKSEPDAHYVIPKGHYIMAAPGVSQVDPQIWKSSDQFDPHRWLDATTAAAMQDSGEDKQDFGFGMISTGANSPYLPFGAGRHRCIGEQFAYLQIGVILATFVRIFKWHLDSKFPDPDYQSMVVLPSKNGCAIVLTPRAESLHLD', 'MDSISTAILLLILALICLLLTTSSKGKGRLPPGPRALPFLGNLLQLRSQDMLTSLTKLSKEFGAVYTVYLGPRRVVVLSGYQAVKEALVDQAEEFSGRGDYPAFFNFTKGNGIAFSNGDRWKALRKYSLQILRNFGMGKRTIEERILEEGHFLLEELRKTQGKPFDPTFVVSRSVSNIICSVIFGSRFDYDDDRLLTIIHLINENFQIMSSPWGEMYNIFPNLLDWVPGPHRRLFKNYGRMKNLIARSVREHQASLDPNSPRDFIDCFLTKMAQEKQDPLSHFFMDTLLMTTHNLLFGGTETVGTTLRHAFRLLMKYPEVQVRVQEEIDRVVGRERLPTVEDRAEMPYTDAVIHEVQRFADIIPMSLPHRVTRDTNFRGFTIPRGTDVITLLNTVHYDPSQFLKPKEFNPEHFLDANMSFKKSPAFMPFSAGRRLCLGEALARMELFLYLTAILQSFSLQPLGAPEDIDLTPLSSGLGNVPRPYQLCVRAR', 'MALSQSVPFLATELLLASAIFCLVFWVLRGSRPRVPKGLKSPPEPWGWPLLGHVLTLGKNPHLALSRMSQLYGDVLQIRIGSTPVLVLSGLDTIRQALVRQGDDFKGRPDLYSFTFITDGQSMSFSPDSGPVWAARRRLAQNALNTFSIASDPASSSSCYLEEHVSKEAEALISRLQELMAGPGHFDPYNQVVVSVANVIGAMCFGQHFPESSDEMLSLVKNSHEFVESASSGNPVDFFPILRYLPNPALQRFKAFNQRFRRFLQKTVQEHYQDFDKNSVQDITGALFKHSKKGPRASGNLIPQEKIVNLVNDIFGAGFDTIATAISWSLMYLVTKPEIQRKIQKELDAVIGRGRRPRLSDRPQLPYLEAFILETFRHSSFVPFTIPHSTTRDTTLNGFYIPRECCVFINQWQVNHDPQLWGDPSEFRPERFLTAEGTTINKPLSEKIMLFGLGKRRCIGEVLGKWEVFLFLAILLQQLEFSVPPGVKVDLTPIYGLTMKHARCEHFQARLRFSIK', 
'MATQEIIDSALPYLTKWYTVITLAALVFLISSNIKNYVKAKKLKCRDPPYFKGAGWTGISPLIEIIKVKGNGRLARFWPIKTFDDYPNHTFYMSIIGALKIVLTVIQENIKAVLATQFTDFSLGTRHAHFYPLLGDGIFTLDGEGWKHSRAMLRPQFARDQIGHVKALEPHIQILAKQIKLNKGKTFDIQELFFRFTVDTATEFLFGESVHSLYDEKLGIPTPNEIPGRDNFATAFNTSQHYLATRTYSQTFYFLTNPKEFRDCNAKVHYLAKYFVNKALNFTPEEIEEKSKSGYVFLYELVKQTRDPKVLQDQLLNIMVAGRDTTAGLLSFAMFELARHPEIWSKLREEIEVNFGVGEESRVEEITFESLKRCEYLKAILNETLRMYPSVPVNSRTATRDTTLPRGGGPNGTDPIFIPKGSTVAYIVYKTHRLEEYYGKDADDFRPERWFEPSTKKLGWAYVPFNGGPRICLGQQFALTEASYVITRLVQMFETVSSPPDVEYPPPKCIHLTMSHDDGVFVKM', 'MLQLSLSRLGMGSLTASPWHLLLLGGASWILARILAWIYTFYDNCCRLRCFPQPPKPSWFWGHLTLMKNNEEGMQFIAHLGRNFRDIHLSWVGPVYPILRLVHPNVIAPLLQASAAVAPKEMTLYGFLKPWLGDGLLMSAGEKWNHHRRLLTPAFHFDILKSYVKIFNKSVNTMHAKWQRLTAKGSARLDMFEHISLMTLDSLQKCIFSFDSNCQESNSEYIAAILELSSLIVKRQRQPFLYLDFLYYLTADGRRFRKACDVVHNFTDAVIRERRSTLNTQGVDEFLKARAKTKTLDFIDVLLLAKDEHGKGLSDVDIRAEADTFMFGGHDTTASALSWILYNLARHPEYQERCRQEVRELLRDREPEEIEWDDLAQLPFLTMCIKESLRLHPPVLLISRCCSQDIVLPDGRVIPKGNICVISIFGVHHNPSVWPDPEVYNPFRFDPENPQKRSPLAFIPFSAGPRNCIGQTFAMSEIKVALALTLLRFCVLPDDKEPRRKPELILRAEGGLWLRVEPLSTVTSQLPWDLLAHPPTS', 'MAAGPQAAMEQASSPGLISATEVLVAAATFCLLLLLTQTRRQHAPKGLRSPPGPRGLPMLGNVLELRKDPHLVLTRLSRKYGDVMEVTIGSRPVVVLSGLETIKQALVRQAEDFMGRPDLPSWQYVSNGHSLAFSYECGDAWKARRKLAQNALKTFSIAASPTASSSCLLEEHVSTEASYLVTKFLQLMEEKQSFNPNSYLMVSVANVICAICFGKRYDHDDQELLSVVNMNTEFGDVAAAGNPADFIPLLRYLPNRAMAAFKDVNARFSAFVQKIVQNHYSTFDKEHIRDVTDSLIGHCQEKRTGEDVRVQPSDESIISIVNDLFGAGFDTVTTALSWCMMYAALYPHIQKKIQAELDQTIGRERRPRLSDRGMLPYTEAFILEAFRHSSLLPFTIPHCTTKDTVLNGYYIPKDTCVFINQWQANHDEKIWKDPPSFKPERFLNAAGTELSRTEADKVLIFGLGKRRCIGESIGRWEVFLFLTTILQQLEISLAPGQRVDITPQYGLTMKYKQCECFQMKKRFPSKGSA', 
'MAALGCARLRWALRGAGRGLCPHGARAKAAIPAALPSDKATGAPGAGPGVRRRQRSLEEIPRLGQLRFFFQLFVQGYALQLHQLQVLYKAKYGPMWMSYLGPQMHVNLASAPLLEQVMRQEGKYPVRNDMELWKEHRDQHDLTYGPFTTEGHHWYQLRQALNQRLLKPAEAALYTDAFNEVIDDFMTRLDQLRAESASGNQVSDMAQLFYYFALEAICYILFEKRIGCLQRSIPEDTVTFVRSIGLMFQNSLYATFLPKWTRPVLPFWKRYLDGWNAIFSFGKKLIDEKLEDMEAQLQAAGPDGIQVSGYLHFLLASGQLSPREAMGSLPELLMAGVDTTSNTLTWALYHLSKDPEIQEALHEEVVGVVPAGQVPQHKDFAHMPLLKAVLKETLRLYPVVPTNSRIIEKEIEVDGFLFPKNTQFVFCHYVVSRDPTAFSEPESFQPHRWLRNSQPATPRIQHPFGSVPFGYGVRACLGRRIAELEMQLLLARLIQKYKVVLAPETGELKSVARIVLVPNKKVGLQFLQRQC', 'MGTSLSPNDPWPLNPLSIQQTTLLLLLSVLATVHVGQRLLRQRRRQLRSAPPGPFAWPLIGNAAAVGQAAHLSFARLARRYGDVFQIRLGSCPIVVLNGERAIHQALVQQGSAFADRPAFASFRVVSGGRSMAFGHYSEHWKVQRRAAHSMMRNFFTRQPRSRQVLEGHVLSEARELVALLVRGSADGAFLDPRPLTVVAVANVMSAVCFGCRYSHDDPEFRELLSHNEEFGRTVGAGSLVDVMPWLQYFPNPVRTVFREFEQLNRNFSNFILDKFLRHCESLRPGAAPRDMMDAFILSAEKKAAGDSHGGGARLDLENVPATITDIFGASQDTLSTALQWLLLLFTRYPDVQTRVQAELDQVVGRDRLPCMGDQPNLPYVLAFLYEAMRFSSFVPVTIPHATTANTSVLGYHIPKDTVVFVNQWSVNHDPLKWPNPENFDPARFLDKDGLINKDLTSRVMIFSVGKRRCIGEELSKMQLFLFISILAHQCDFRANPNEPAKMNFSYGLTIKPKSFKVNVTLRESMELLDSAVQNLQAKETCQ', 'MALIEICLALVVIGYLIYKWSTATFKTFEERKLYFEKPYPFVGNMAAAALQKSSFQRQLTEFYERTRQHKLVGFFNMRTPMITLNDPELIKKVCVKDFDHFPNHQPFITSNDRLFNDMLSVMRDQRWKHMRNTLTPVFTAAKMRNMFTLMNESFAECLQHLDSSSKTLPGRKGFEVDMKVMCNKLSNDIIATTAFGLKVNSYDNPKNEFYEIGQSLVFSRGLQFFKFMLSTLVPKLFSLLKLTIFDSAKVDYFARLVVEAMQYREKHNITRPDMIQLLMEAKNESEDKWTDDEIVAQCFIFFFAAFENNSNLICTTTYELLYNPDVQERLYEEIVETKKALNGAPLTYDAVQKMTYMDMVISESLRKWTLAAATDRLCSKDYTLTDDDGTKLFDFKVGDRINIPISGLHLDDRYFPEPRKFDPDRFSEERKGDMVPYTYLPFGVGPRNCIGNRYALMQVKGMLFNLLLHYKIEASPRTIKDLWGSASGFNFTPRSGFWMHLVPRK', 
'MLDFAIFAVTFLLALVGAVLYLYPASRQAAGIPGITPTEEKDGNLPDIVNSGSLHEFLVNLHERYGPVVSFWFGRRLVVSLGTVDVLKQHINPNKTSDPFETMLKSLLRYQSGGGSVSENHMRKKLYENGVTDSLKSNFALLLKLSEELLDKWLSYPETQHVPLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQKNHGTVWSEIGKGFLDGSLDKNMTRKKQYEDALMQLESVLRNIIKERKGRNFSQHIFIDSLVQGNLNDQQILEDSMIFSLASCIITAKLCTWAICFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLRYCQHVLCETVRTAKLTPVSAQLQDIEGKIDRFIIPRETLVLYALGVVLQDPNTWPSPHKFDPDRFDDELVMKTFSSLGFSGTQECPELRFAYMVTTVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY', 'MDLIPGFSTETWVLLATSLVLLYLYGTYSHGLFKKLGIPGPRPLPYFGNILGYRKGVDHFDKKCFQQYGKMWGVYDGRQPLLAVTDPNMIKSVLVKECYSVFTNRRSFGPLGAMRNALSLAEDEEWKRIRTLLSPTFTSGKLKEMFPIISHYGDLLVSNLRKEAEKGKPVTMKDIFGAYSMDVITSTAFGVNIDSLNNPQDPFVENSKKLLKFSFFDPFLLSLIFFPFLTPIFEVLNITLFPKSSVNFFTKSVKRMKESRLTDQQKRRVDLLQLMINSQNSKEMDPHKSLSNEELVAQGIIFIFAGYETTSSALSLLAYELATHPDVQQKLQEEIEATFPNKAPPTYDALAQMEYLDMVVNETLRLYPIAARLERACKKDVEIHGVFVPKGTVVVVPVFVLHRDPDLWPEPEEFRPERFSKKHKDTINPYTYLPFGTGPRNCIGMRFALMNMKLALVRVLQNFSFKPCKETQIPLKLTTQGLTQPEKPVVLKILPRDGTVSGA', 'MLASGLLLVALLACLTVMVL', 'MDWDYYTLLKTSVAIIIVFVVAKLITSSKSKKKTSVVPLPPVLQAWPPFIGSLIRFMKGPIVLLREEYPKLGSVFTVKLLHKNITFLIGPEVSSHFFNAYESELSQKEIYKFNVPTFGPGVVFDVDYPVRMEQFRFFSSALKVNN', 'MTNQTARSSKKERYANLIPMEELHSEKDRLFPFPIYDKLRRESPVRYDPLRDCWDVFKYDDVQFVLKNPKLFSSKRGIQTESILTMDPPKHTKLRALVSRAFTPKAVKQLETRIKDVTAFLLQEARQKSTIDIIEDFAGPLPVIIIAEMLGAPIEDRHLIKTYSDVLVAGAKDSSDKAVADMVHNRRDGHAFLSDYFRDILSKRRAEPKEDLMTMLLQAEIDGEYLTEEQLIGFCILLLVAGNETTTNLIANAVRYLTEDSVVQQQVRQNTDNVANVIEETLRYYSPVQAIGRVATEDTELGGVFIKKGSSVISWIASANRDEDKFCKPDCFKIDRPSYPHLSFGFGIHFCLGAPLARLEANIALSSLLSMSACIEKAAHDEKLEAIPSPFVFGVKRLPVRITFK', 
'MGLEALVPLAVIVAIFLLLVDLMHRRQRWAARYPPGPLPLPGLGNLLHVDFQNTPYCFDQLRRRFGDVFSLQLAWTPVVVLNGLAAVREALVTHGEDTADRPPVPITQILGFGPRSQGVFLARYGPAWREQRRFSVSTLRNLGLGKKSLEQWVTEEAACLCAAFANHSGRPFRPNGLLDKAVSNVIASLTCGRRFEYDDPRFLRLLDLAQEGLKEESGFLREVLNAVPVLLHIPALAGKVLRFQKAFLTQLDELLTEHRMTWDPAQPPRDLTEAFLAEMEKAKGNPESSFNDENLRIVVADLFSAGMVTTSTTLAWGLLLMILHPDVQRRVQQEIDDVIGQVRRPEMGDQAHMPYTTAVIHEVQRFGDIVPLGVTHMTSRDIEVQGFRIPKGTTLITNLSSVLKDEAVWEKPFRFHPEHFLDAQGHFVKPEAFLPFSAGRRACLGEPLARMELFLFFTSLLQHFSFSVPTGQPRPSHHGVFAFLVSPSPYELCAVPR', 'MDLVTFLVLTLSSLILLSLWRQSCGRGSLPPGPTPFPIIGNFLQIDIKNVSQSLTNFSKAYGPVFTLYLGSRPTVVLHGYEAVKEALIDHGEEFSDRGSIPMVEKINNGLGIVFSNGNRWKEIRRFTLTTLRNLGMGKRNIEDRVQEEAQCLVEELRKTKGSPCDPTFILSCAPCNVICSIIFQDRFDYKDKDFLMLMEKLNENVKILSSPWLQVCNNFPLLIDYCPGSHHKVLKNVKYIRSYLLEKIKEHQESLDVTNPRDFIDYYLIKQKQANHIQQAEFSLENLACTINNLFAAGTETTSTTLRYALLLLMKYPDVTAKVQEEIDHVIGRHRSPCMQDRNHMPYTDAMIHEVQRFINLVPNNIPRAVTCDIKFRNYLIPKGTTVVTSLTSVLHDSKEFPNPELFDPGHFLDANGNFKKSDHFMPFSAGKRVCAGEGLARMELFLFLTTILQNFKLKSLVHPKDIDMIPFVNGLIALPPHYQVCIIPR', 'MSVSALSSTRFTGSISGFLQVASVLGLLLLLVKAVQFYLQRQWLLKAFQQFPSPPFHWFFGHKQFQGDKELQQIMTCVENFPSAFPRWFWGSKAYLIVYDPDYMKVILGRSDPKANGVYRLLAPWIGYGLLLLNGQPWFQHRRMLTPAFHYDILKPYVKNMADSIRLMLDKWEQLAGQDSSIEIFQHISLMTLDTVMKCAFSHNGSVQVDGNYKSYIQAIGNLNDLFHSRVRNIFHQNDTIYNFSSNGHLFNRACQLAHDHTDGVIKLRKDQLQNAGELEKVKKKRRLDFLDILLLARMENGDSLSDKDLRAEVDTFMFEGHDTTASGVSWIFYALATHPEHQQRCREEVQSVLGDGSSITWDHLDQIPYTTMCIKEALRLYPPVPGIVRELSTSVTFPDGRSLPKGIQVTLSIYGLHHNPKVWPNPEVFDPSRFAPDSPRHSHSFLPFSGGARNCIGKQFAMSEMKVIVALTLLRFELLPDPTKVPIPLPRLVLKSKNGIYLYLKKLH', 
'MNLFSALSLDTWVLLAIILVLLYRYGTRTHGLFKKQGIPGPKPLPFLGTVLNYYKGLWKFDMECYEKYGKTWGLFDGQMPLFVITDPEMIKNVLVKECFSVFTNRREFGPVGIMSKAISISKDEEWKRYRALLSPTFTSGKLKEMFPVIEQYGDILVKYLMQEAEKGKPVTMKDVLGAYSIDVITSTSFGVNVDSLNNPEDPFVEKAKGILRVDFFDPLVFSVVLFPFLTPVYEMLNICMFPKDSIEFFKKFVNRMKESRLDSKQKHRVDFLQLMMNAHNNSKDKDSHKALSDMEITAQSIVFIFAGYETTSSTLSFTLYCLATHPDIQKKLQEEIDETLPNKAPPTYDTVMEMEYLDMVLNETLRLYPIGNRLERFCKKDVELNGVYIPKGSTVMIPSYALHHDPQHWPEPEEFQPERFSKENKGSIDPYLYMPFGIGPRNCIGMRFAFMTMKLALTKVMQNFSFQPCQETQIPLKLSRQGLLQPEKPIVLKVVPRDVVITGA', 'MIGTIAVLVIAILVIFAFKKSPSNIPPIVETIPFIGCFYQFAKNPLQLVRNSYDRLGEIFTLHLMGFKMTFVLGPEAQALFFRGTDEELSPKEAYRFVTPVFGKGVVYDSETEIMYEQLRFVKNGLVLSQLKKAVGIIQEETEKYFETKWGDSGEIDLLYEMNKLTILTASRCLMGKSINKSLGQSGQLADLYHELEEGLNPISFFFPNLPLPSFKKRDAARAKVAAIFHSIIQERRRSTDDSVDDVLYTLMNSKYKDGSVLEDEQIVGLMIGLLFAGQHTSSITLTYTIFYLLNNLEYFDETQKDINDIVQKENQGEINFDGLKRMNRLETVIREVLRLHPPLIFLMRKVMTPMEYKGKTIPAGHILAVSPQVGMRLPTVYKNPDSFEPKRFDVEDKTPFSFIAFGGGKHGCPGENFGILQIKTIWTVLSTKYNLEVGPVPPTDFTSLVAGPKGPCMVKYSKKQK', 'MDPSVLLLLAVLLSLFLLLVRGHAKIHGHLPPGPHPLPLLGNLLQMDRGGLLKCFIQLQEKHGDVFTVHLGPRPVVVLCGTQTIREALVDHAEAFSGRGTIAAAQLVMQDYGIFFASGQRWKTLRRFSLATMKEFGMGKRSVEERIKEEAQCLVEELKKYQGVPLDPTFLFQCITANIICSIVFGERFDYTDDQFLHLLNLMYKIFSLLSSFSGQMFELFSGFLKYFPGVHRQIVKKQQELLDYIAHSVEKHKATLDPSAPRDYIDTYLLRMEKEKSNHNTEFHHQNLMMSVLSLFFAGTETTSATLHYGVLLMLKYPHVTEKVQKEIDQVIGSHRLPTLDDRTKMPYTDAVIHEIQRFSDLVPIGLPHKVIKDTLFRGYLLPKNTEVYPVLSSALHDPQYFEQPDKFNPEHFLDANGALKKCEAFLPFSTGKRICLGESIARNELFIFFTTILQNFSVASPVAPKDIDLTPKESGIGKIPPAHQIYFLAR', 'MSAVALPRVSGGHDEHGHLEEFRTDPIGLMQRVRDECGDVGTFQLAGKQVVLLSGSHANEFFFRAGDDDLDQAKAYPFMTPIFGEGVVFDASPERRKEMLHNAALRGEQMKGHAATIEDQVRRMIADWGEAGEIDLLDFFAELTIYTSSACLIGKKFRDQLDGRFAKLYHELERGTDPLAYVDPYLPIESFRRRDEARNGLVALVADIMNGRIANPPTDKSDRDMLDVLIAVKAETGTPRFSADEITGMFISMMFAGHHTSSGTASWTLIELMRHRDAYAAVIDELDELYGDGRSVSFHALRQIPQLENVLKETLRLHPPLIILMRVAKGEFEVQGHRIHEGDLVAASPAISNRIPEDFPDPHDFVPARYEQPRQEDLLNRWTWIPFGAGRHRCVGAAFAIMQIKAIFSVLLREYEFEMAQPPESYRNDHSKMVVQLAQPACVRYRRRTGV', 
'MTVPALASASGLLQVASLLGLLLLLLKAAQLYLRRQWLLKALQQFPSPPSHWLYGHSREFQEESELQPLLKRVEKYPSACARWLWGTRAMVLVYDPDYMKVVLARSEPKAPVLYRLLIPWIGCGLLLLNGQTWFQRRRMLTPAFHYDILKPYVGLMAKSVQVMLDKWEQLVAQDPRLEIVGPVSLMTLDTIMKCAFSHQGSAQTDGDSHSYIQAIWDLKNLFSIRTKSAFLQNDIIYRLSPEGRKNHRAARIAHQHTDRVIQLRKAQLQKQGEMENVRKKRHLDFLDILLLARMEKGNSLSDTDLRAEVDTFMFEGHDTTASGISWILYALASHPEHQQRCREEIQGLLGDGTSITWDHLDQMPYTTMCIKEALRLYPPVPGVSRELSKPITFPDGRSLPAGIILSLSVYSLHHNPQVWPNPEEFDPSRFAPGSARHSHAFMPFSGGSRNCIGKQFAMNEMKVAVALTLLRFELAPDPSRKPTVIPEVVLHSKNGIHLKLRKLP', 'MEDSRLLITLILVFGVIFLKKFFQSNQHPSAQRLSATGVNAHGRPQGSTQNALRRTGRVNGGHPVTTQMVETVQNLAPNLHPEQIRYSLENTGSVEETVERYLRGDEFSFPPGFEPSRAPMGANAAVDNNAAGGGEFNDPRKKNMICAENLLDKFHVDLNEDMSNLSFKDLDIEERKRLLVWQARKNLETKLQSDKDLQSLLT', 'MTAKIFSLDEVSKHKTKSDLWVVIHNKVYDITRFVVEHPGGEEVLVDEGGKDATEAFEDIGHSDEAREMLEEYLIGSLDEASRTKEYNVNVIRAGELPEEKKGSSLRIILPALAIIGALVYKYVIVPKAHQ', 'MMPERFSFDFRPIDQYWTRAKGACSNTKRGNMYILASLALILLHLLVLPIYLYLTWHHKYWRKRGLVTARPLTLLGTYPGLLTRKSNLVFDVQKIYDKYKGKHRAVGVFVTRQPQILVLDPELAHEVLVSNFRCYKDSLQSSYLRHAKWDKYARLNPFWASGQSWRRLRTDAQAGISGSRLRQAYNIWEQGGQMLTEYMTQQVAEKNNILETRDLCFRYTAHVMADFIWGIDAGTLTRPMEQPNKVQEMASKWTSYAFYMLTLFMATIVAPCSRLLLRFRFYPKETDEFFSNLTKESIELRLKAGDSTRTDYLSHLLQLRDQKQATHDDLVGHALTVMLDGYDTSGTALLHALYYLAENPAVQQKLRVEILSCMASEKSLDFEKLSSLQYLEQVIYESLRLSSLIPQYTKVCTLPTVIRLSESKSLDVEVGMTIMIPNYQFHHDKQYFPEPEAFKPERFDNGAYQELMRKGIFLPFSDGPRICMGVPLAMLTLKSALVHILSNFQVVRGRDRLIPKGDSGFGVVLQGDVNLEYRRFFR', 'MVLMILPVIGSVSVSEGLVAMITMCLAYLILRLFRTEIPEGLLQLPGPKPLPIIGNVLEVGRNPYLSLTAMSKRYGDVFQIQIGMRPVVVLSGSETVRQALIKQGDXFAGRPDLYSFRFINDGKSLAFSTDQAGVWRARRKLAYSALRSFATLEGTTPEYSCALEEHVSKEAEYLVKQLHTVMEADGSFDPFRHIVVSVANVICGMCFGRRYDHNHQELLNLVNLSDEFGQVVASGNPADFIPILQYLPSTTMKKFLNINDRFNTFVQKIVSEHYTTFDKDNIRDITDSLIDHCEDRKLDENSNVQMSDEKIVGIVNDLFGAGFDTISTALSWSVMYLVAYPEIQERLYQEMNETVGPDRTPCLSDKPKLPFLEAFILETFRHSSFLPFTIPHCTSKDTSLNGYFIPKDTCVFINQWQINHDAELWKDPSSFNPDRFLNADGTEVNKLEGEKMMVFGMGKRRCIGEVIARSEVFLFLAILVQNLRFHSMPGEPLDMTPEYGLTMKHKRCQLRAAMRARNEE']
I was also given a second list containing the letters that I want to count in that enormous list.
listAA=["A","B","R","N","D","C","Q","E","G","H","I","L","K","M","F","P","S","T","W","X","Y","V"]
Here is the small piece of code that I ran. The issue is that every count comes back as zero.
def FreqAbs (list):  # we created a function that gives the absolute frequency of each of the amino acids
                     # in ListAA that are present in list
    
    dic_AA = {}
    ListAA = ["A","B","R","N","D","C","Q","E","G","H","I","L","K","M","F","P","S","T","W","X","Y","V"]
 
    for aa in ListAA:
        ct = list.count(aa)
        dic_AA[aa] = ct
        
    return dic_AA
print(FreqAbs(list))
Here is my output
{'A': 0, 'B': 0, 'R': 0, 'N': 0, 'D': 0, 'C': 0, 'Q': 0, 'E': 0, 'G': 0, 'H': 0, 'I': 0, 'L': 0, 'K': 0, 'M': 0, 'F': 0, 'P': 0, 'S': 0, 'T': 0, 'W': 0, 'X': 0, 'Y': 0, 'V': 0}
#2
Your problem is that 'list' is not a list of characters but a list of strings, like in the following example
>>> L = ["hello", "world"]
>>> L.count("o")
0
>>> L.count("hello")
1
You could use a collections.Counter() instance
from collections import Counter
C = Counter()
for x in list:
    C.update(x)
ListAA = ["A","B","R","N","D","C","Q","E","G","H","I","L","K","M","F","P","S","T","W","X","Y","V"]
dic_AA = {k: C.get(k, 0) for k in ListAA}
print(dic_AA)
Output:
{'N': 1359, 'B': 0, 'S': 2475, 'H': 1069, 'C': 556, 'V': 2469, 'X': 1, 'G': 2417, 'Q': 1581, 'D': 1975, 'E': 2397, 'A': 2502, 'I': 2037, 'K': 2112, 'W': 466, 'T': 1984, 'M': 1026, 'L': 4447, 'P': 2290, 'F': 2245, 'Y': 1133, 'R': 2374}
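The same point can also be applied directly to the original loop. Below is a minimal sketch, assuming the data is a list of strings as in the question; `count_letters` and `sequences` are illustrative names that do not appear in the thread. It calls `str.count` on each string and sums the results, rather than calling `list.count` on the outer list.

```python
def count_letters(sequences, alphabet):
    """Count how often each letter of `alphabet` occurs across all strings."""
    counts = {}
    for letter in alphabet:
        # str.count looks inside each string; list.count only compares whole elements.
        counts[letter] = sum(seq.count(letter) for seq in sequences)
    return counts


sequences = ["hello", "world"]  # small stand-in for the protein sequences
print(count_letters(sequences, ["o", "l", "z"]))
# prints {'o': 2, 'l': 3, 'z': 0}
```

This scans every string once per letter, so the Counter version is likely the better choice for large inputs.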
#3
from collections import Counter
print(Counter(''.join(my_list)))
Output:
Counter({'L': 4447, 'A': 2502, 'S': 2475, 'V': 2469, 'G': 2417, 'E': 2397, 'R': 2374, 'P': 2290, 'F': 2245, 'K': 2112, 'I': 2037, 'T': 1984, 'D': 1975, 'Q': 1581, 'N': 1359, 'Y': 1133, 'H': 1069, 'M': 1026, 'C': 556, 'W': 466, 'X': 1})
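Both Counter variants in this thread should produce the same counts. The sketch below checks that on made-up data (`my_list` here is a two-element stand-in, not the real list). It also shows one detail of a bare Counter: it stores only the letters that actually occur, although looking up a missing letter still returns 0.

```python
from collections import Counter

my_list = ["MTIK", "MEPF"]  # made-up stand-in, not the real data

# Variant 1: feed each string to the same Counter.
by_update = Counter()
for seq in my_list:
    by_update.update(seq)

# Variant 2: join everything into one string, then count.
by_join = Counter("".join(my_list))

print(by_update == by_join)  # prints True
print(by_join["M"])          # prints 2
print(by_join["B"])          # prints 0 (a missing key counts as zero)
print("B" in by_join)        # prints False (no entry is stored for it)
```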
#4
(Dec-08-2018, 02:42 PM)Gribouillis Wrote: Your problem is that 'list' is not a list of characters but a list of strings, like in the following example
>>> L = ["hello", "world"]
>>> L.count("o")
0
>>> L.count("hello")
1
You could use a collections.Counter() instance
from collections import Counter
C = Counter()
for x in list:
    C.update(x)
ListAA = ["A","B","R","N","D","C","Q","E","G","H","I","L","K","M","F","P","S","T","W","X","Y","V"]
dic_AA = {k: C.get(k, 0) for k in ListAA}
print(dic_AA)
Output:
{'N': 1359, 'B': 0, 'S': 2475, 'H': 1069, 'C': 556, 'V': 2469, 'X': 1, 'G': 2417, 'Q': 1581, 'D': 1975, 'E': 2397, 'A': 2502, 'I': 2037, 'K': 2112, 'W': 466, 'T': 1984, 'M': 1026, 'L': 4447, 'P': 2290, 'F': 2245, 'Y': 1133, 'R': 2374}
Sorry for the late answer! This actually worked! I have been at this for hours!
I forgot to mention one requirement: this needs to be set up as a function, and when I try to define one I get this message in the output:
Output:
<function freqAbs at 0x00B49D68>

(Dec-08-2018, 05:00 PM)buran Wrote:
from collections import Counter
print(Counter(''.join(my_list)))
Output:
Counter({'L': 4447, 'A': 2502, 'S': 2475, 'V': 2469, 'G': 2417, 'E': 2397, 'R': 2374, 'P': 2290, 'F': 2245, 'K': 2112, 'I': 2037, 'T': 1984, 'D': 1975, 'Q': 1581, 'N': 1359, 'Y': 1133, 'H': 1069, 'M': 1026, 'C': 556, 'W': 466, 'X': 1})
Thank you for the reply! In what situation would I use this instead of the reply I got from Gribouillis? I also wanted to make a function with those instructions inside, but nothing prints out except the message shown in my reply above.
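About the `<function freqAbs at 0x...>` message: that is how Python displays a function object itself, which typically happens when the function is printed without being called, for example `print(freqAbs)` instead of `print(freqAbs(my_list))`. The code that produced the message is not shown in the thread, so this is only a guess. Below is a minimal sketch of the Counter approach wrapped in a function; the names `freq_abs`, `LIST_AA`, `alphabet` and the sample data are illustrative, not from the thread.

```python
from collections import Counter

LIST_AA = ["A", "B", "R", "N", "D", "C", "Q", "E", "G", "H", "I",
           "L", "K", "M", "F", "P", "S", "T", "W", "X", "Y", "V"]


def freq_abs(sequences, alphabet=LIST_AA):
    """Return {letter: absolute frequency} over all strings in `sequences`."""
    counter = Counter()
    for seq in sequences:
        counter.update(seq)
    # Counter returns 0 for letters that never occur, so every letter gets an entry.
    return {letter: counter[letter] for letter in alphabet}


sample = ["MLASGL", "MALI"]  # stand-in for the real list
result = freq_abs(sample)    # note the parentheses: this calls the function
print(result["L"], result["M"], result["B"])  # prints 3 2 0
```

Because the wrapper builds its result from a fixed alphabet, it always reports every letter in `alphabet`, including those with a count of zero. That is the main practical difference from printing the bare Counter.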