>gnl|To_rRNA_ML|18S rRNA AACCTGGTTGATTCTGCCAGTAGTCATACGCTCGTCTCAAAGATTAAGCCATGCATGTCT AAGTATAACTCTTTTACTTTGAAAACTGCGAACGGCTCATTATATCAGTTATAGTTTCTT TGATAGTCCCTTACTACTTGGATATCCGTAGTAATTCTAGAGCTAATACATGCAACTACA CCCGACTTTTGGAAGGGTGGTATTTATTAGGTATAAACCTTCACGCTTCGGCGTTGATCT GGTGATTCATAATAACTTTTCGAATCGCATTCCCCTGTGGAGGCGATGGATCATTCAAGT TTCTGCCCTATCAGCTTTGGATGGTAGTGTATTGGACTACCATGGCTTTAACGGGTAACG AATTGTTAGGGCAAGATTTCGGAGAGGGAGCCTGAGAGACGGCTACCACATCCAAGGAAG GCAGCAGGCGCGTAAATTACCCAATCCTGACACAGGGAGGTAGTGACAATAAATAACAAT GCCGGGCCTTTACAGGTCTGGCAATTGGAATGAGAACAATTTAAATCCCTTATCGAGTAT CAATTGGAGGGCAAGTCTGGTGCCAGCAGCCGCGGTAATTCCAGCTCCAATAGCGTATAT TAAAGTTGTTGCAGTTAAAAAGCTCGTAGTTGGATTTCTGGCAGGAGCGACCGGTCACAC ACTCTGTGTGTGAACTTGTGTTGTCTCTGGCCATCCTTGGGGAGATCCTGTTTGGCATTA AGTTGTCGGGCAGGGGACATCCATCGTTTACTGTGAAAAAATTAGAGTGTTTAAAGCAGG CTTATGCCGTTGAATATATTAGCATGGAATAATAAGATAGGACTTCGGAACTATTTTGTT GGTTTGCGTTACGAAGTAATGATTAATAGGGACAGTTGGGGGTATTCGTATTTCGTTGTC AGAGGTGAAATTCTTGGATTTCCGAAAGACGAACTACTGCGAAAGCATTTACCAAGGATG TTTTCATTAATCAAGAACGAAAGTTAGGGGATCGAAGATGATTAGATACCATCGTAGTCT TAACCATAAACTATGCCGACTCAGGATTGGCGGTTGTTTTTTGACTCCGTCAGCACTGTA TGAGAAATCAAAGTCTTTGGGTTCCGGGGGGAGTATGGTCGCAAGGCTGAAACTTAAAGA AATTGACGGAAGGGCACCACCAGGAGTGGAACCTGCGGCTTAATTTGACTCAACACGGGA AAACTTACCAGGTCCAGACATAGTGAGGATTGACAGATTGAGAGTTCTTTCTTGATTCTA TGGGTGGTGGTGCATGGCCGTTCTTAGTTGGTGGAGTGATTTGTCTGGTTAATTCCGTTA ACGAACGAGACCGCCGCCTGCTAAATAGTTCCGCGAATGAATTTCATTGGCGAGAGCTTC TTAGAGGGACGTTCGTTCTACAAGACGAAGGAAGATGGCGGCAATAACAGGTCTGTGATG CCCTTAGATGTTCTGGGCCGCACGCGCGTTACAATGATGCACTCAACAGGCATATAACCT TGGCCGAGAGGCCTGGGTAATCCCGTTAACTTGCATCGTGTTAGGGATAGATTATTGCAA TTATTAATCTTGAACGAGGAATTCCTAGTAATCGCAGATCATCAATCTGCAATGATTACG TCCCTGCCCTTTGTACACACCGCCCGTCGCACCTACCGATTGAATGGTCCGGTGAAGACT CGGGATTGTGGCTTAGTTGCTTAATTGTGATTAGACCGTAAGAACCTGTCTAAACCTTAT CATTTAGAGGAAGGTGAAGTCGTAACAAGGTTTCCGTAGGTGAACCTGCGGAAGGATCA >gnl|To_NUC_proteinmodels_ML|x1 XKVGVLLLNLGGPETGEDVEGFLYNLFADPDIIRLPPILAPLQSLVATIISKRRAPKSRE AYDSIGGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX XXVILPLYPQFSISTSGSSLRVLQEEFAKKADIYGPQKMFHTVVPSWYERPGYVKSIANL IQKELDSFSKEELQEGTSDTQTIPKHVLFSAHGVPASYIEAGDPYKAQIEDCVERIKALL PSEKDGVKIHLSFQSRVGPVEWLRPYTDDVLPELGEQGVKNLVVVPISFVSEHIETLEEI DIEYRELALESGITNWRRSPALNTDPTFIDDMADMVADALNEPSQSITEACVANNVGNIE LETQASDLQLEISSAGVAGVGSX >gnl|To_NUC_proteinmodels_ML|x2 MLLLALLVPLLSLVASFGPNKAAWTIHSQRNTSTRRFADCGVEAAADTASTPGSTSTLTV ALTREVGKNTKLQKTIESSERLQQLFATGNDSEPSIQTLELPCIAHADGPDTDKLPSTLS SKQFDYIAITSPEAAKVFASAWKEAGQPQLGVVAAVGKATKEALSSLGIDVGFVPSKATA ATLVQELPFSQNSQDEGRSTTLLYPASAKAANTLQDGLEGRGFKVDRLNTYDTVTATWTX XXXXXXXXXXVACFASPSAVKAWLANSANSYPLAACIGETSAKACRENGWKEECIFFPEK PGVAGWADSVADALDSMX >gnl|To_NUC_proteinmodels_ML|x3 MRTSTLGSALIVSTVSAWSGEPKFGTFQASVKSAVVGAVGAAVSAGVVVSPASAGDVARG QQIFNMNCAACHAGGQNVIMPEKTLEKDALEKYLSGGRSEASVVTQVTYGKAAMPAFGGR LTDSEISDVASYVIQTSENGWK >gnl|To_NUC_proteinmodels_ML|x4 MQLLVYFGIVAGLGRCLAWTVSPPDAVDCAPRAIKNAVAGALTAGAVAISPAFAAGDVLA GEQVFRNNCAACHAGGQNVIMPEKTLEKDSLEKYLTGGRKPESVVYQVTNGKNAMPGRLQ EDDINNVASYVIAKSDAGWETDSTVSTVVQTTGLVSSTDRGNRVGGDVLSGEQIFVNNCA ACHSGGQNVIMPEKTLEKDALDKYLAGGRNKDSVITQVTNGKNAMPAFGGRLSEFDIEDV ASYVISTSETGWR >gnl|To_NUC_proteinmodels_ML|x5 MALMTILSTLVVTASAAHAFAFAPSPPGFVVRSSTALDVSAGVFYTTQTGNTETVAGYIA EAADLEMEDIADVEDDEIEELDTLIVGAPTWHTGAETERSGTEWDTWLYETLPDIDVKGK NVAVFGCGDQMSYSDNFADAVGELYDLFEAAGANMVGATSTDGYEHSDSKAMRDGKFVGL LCDEDNQYDLSEERAKSWVEQLKSEGAL >gnl|To_NUC_proteinmodels_ML|x6 LHSPSELVHHHHFNFESHMQQPEAPNGDDKRHHSHGRGGDFDRIHIIHDKLFFYGLALPI ALLTVFGLFVRDTNHEKSGGGAAYTTSEDGDGVNKSRVLVNTAGEGDVNVKKWTLIWFIL PLYLIFLEGARGHGVFHASERQATRHESLYVRVCMSMMSVSGYAATWALALFLIPVTKHS PILDCLRVTPIQALAFHRISGWVGFWNSVLHGFLHLRHLMDVLNRNHGRSNWEQFKWLIF PDDWGKCLATQSPWIFVRGQAPYYQGSDAEANQCWLSLVNGTGMVSTISFALLGLTSMAY FRRRFYAAFYFIHIPAAWLMLITAVWHYPTVRVDVDRLWCYANFSFYPELLQCALILIPN LVYYLSFNIPVYLDRLGSRWSKKSALTEAILIPGGCIELTFAAKEEPKRHESSYVHVFCP EVSMISHPFSAFSPACLMDADATPGNHSTKSILLRSESSFTDKLKHALLSPAEGQKNMPT IQFDSFYAGSFDWIGQAMNTHDKILIFAGGVGITPFLDFLPALQNSIRLRSQQEPYGQTA GPEAVHLHWCTRDVKLASYVWYKYLCHHICTWENDPDCRGKLRVHFHLTRLHSSVEGGEE LLEDTGFISAKTVHLQDSQVLVQDMGRRLLPCFLVASGILLHWWWYTQFTIKDQYRRNNL VIRSHPVIFSTLLAVVVWAVVEVLYCYKCDQGQYSILSAANEANKEASHQEDEETTLVES SVSDSSFSTYNSVGSLKIKINTGDVVAVSGGRPSISGIAHGIVETKKP >gnl|To_NUC_proteinmodels_ML|x7 MSGRPLSDKSGKYLGQLEERGYGDDPAMVALKAATVKYANVTGYCAAFCLALFLVPVSRS SPLLRALGLSPALAVGFHVWAGRLCWALSSVHGLLYAVDVPLYHYSGEEGGFWARAAGTL VPTGTDCLRWVAPWEFLAPDDEGGGHCYKPWRNVTGIASLLFLLLLGVTSLDRIRRRRYR VFLVCHVVFGSLMMLFAVFHFYWIGLYLVPGVLHYLCCTLPAVSGQLFSGALGDKSGGVT VAGAVDIPGSNGCFELRLLASLPPGEGARVRYPAYIRIGVPEVNGRGVAGRLLWHPFTVA SPVSALPPYGRHQQRERLGEELRLLIRGTGRFTNGLRELVRSDRRRAAPSGAGWRPSYTV HVDAL >gnl|To_NUC_proteinmodels_ML|x8 MISTTLLVTATVFAGLPPFSMGISQTNTDQYALNKPASDQFVGNSVWLFGSDGVHVYSPD GSNHQHHVTNAQICEDPDTFSESLISVLRVSLLHTSPTLLCTDHTHMKRNHAAGKSWQYC RFNDVQSDGKRFVWAAKRDGQISVLDIDTGSLVGNFKACVGPQDLSVHNLRDEIWIRCSG HDDVNSTESHTHLDVMSASNPSGEIQTDIMLKDSAKDAGLKSDGHSVIHHTLGDVGYLTD NGKPLLFKVDLTSKEIIKTVDLYPAAHGLNDMAYSPRNGHIYVRALMCCTCGTAESDVTS CGRSGASKVSPTTGKGAGLTDVDGVCGRSCSGVEGVDSVGVYEFDTKSESVVATHVLAEG IGGEPYASHDGILFSPRRICQEYIVLLGKNGGSTVRVLEAGEPGLPSTLVADIELDFSRD SDRYDNLSVARDFAFVDIVGKTYLAFPSGTSHTIAIVDFDNFNVRKVTLTEAVFENTAPH GRYRGVEWAVGTPYVWTNDSTEDEHYVVDVINAKVVNTIRDVDRSTLVSVQNWARVREAE EREKMMTEIKTMQEQAIQKQAQATLVTVTDQEKAREAATKEDILKQVRR >gnl|To_NUC_proteinmodels_ML|x9 MKFSAYAAATLAVATLAPSTSAFAPAPAARFATPLSSSVDDKTEVREYFNNEGFNRWNKI YSESDEVNNVQLDIRNGHDQTIQKILNWVEADGDIEGKSVCDCGCGVGSLAIPLAQMGAK ISASDISDAMASEAAARAKSMGINNAKFYTSDLESVTGKYNTVTCVDVAIHYPTDKMAEM VGHLCSLAEDRVLISFAPKTWYYELLKKIGELFPGPSKTTRAYLHEESVVREALSKAGFE VAREEMTGTNFYFSRLLEAKRV >gnl|To_NUC_proteinmodels_ML|x10 MNAATRRLVPSLARGSSRSSVACNALRCLSVEVTKTSASSGKRKKPKPPPPPFSYSPLFD LSPDTSTVYRQILPAEAVDTVDLPDGTVLLRVSGEAMRTLSSTAFADIAHLLRPAHLAQL RRILDDKDASDNDKFVAMELLKNANIASGRVLPGCQDTGTAIVMGKRGHLVLTDGKDDEH LSGGAYDAYTELNLRYSQVAPLDMFAEKNTGNNLPAQIDVLGTKGDEYNFLFIAKGGGSA NKTFLYQQTKALLNTASLEAFLEEKIKTIGTSACPPYHLAVVIGGLSAEQNLKTVKMAST KYYDNLPTEGDASGRAFRDVAWEDHILEMTRNLGIGAQFGGKYFCHDVRVIRLPRHGASC PVGIGVSCSADRQVKGKITKDGVFIEQLETDVSKYIPEVLDEHLDSDQGEVAIDLNAMTM DELRQTLSNYPTRTRLSLTGTIVVARDIAHAKMLDKIESGEGLPQYAKDHIIYYAGPAKT PEGYASGSFGPTTAGRMDAYVDKFMQSGGSFVTLAKGNRSRQVTNACKKYGGFYLGSIGG PAAILAKNCIRNVEVLDNEEDGMEAVWKIDVENFPAFIVVDDKGDDFFKEWLG >gnl|To_NUC_proteinmodels_ML|x11 MKFTAIVASLIFAGSEAFAPAASNVRSTGMCEHGSVDTTLSDSDFDLLPFAQRTQKSAEK KLTQSFLFRLVLNAEASRMEFLQQSAAAVAAFTMLPGQANAAKYGGFGAGSPEVIDPKDA LVDEDILKSEPVQKALEAVKGYKQSTVDLKTVLSSDNQADIGAKIRKDFDFSVIRTDLNA INAALDEDTQRGTDRIVRAILQDITELEVSQKQKPGVPRSEKRLGNVIGKLDKLEKSFDD YLAFAN >gnl|To_NUC_proteinmodels_ML|x12 MKSVIIAASVASAAAFAPTPVAKTTSALNAFEDEIGALPPVGFWDPAGLSDGISQEKFDE YRLAELKHGRAAQLAVLGYIAPETYRFGYDLVPGQLSTNDVPNGIAAINAIPFLGWVQIV AFVGCVETYGWFTSPTGVLDLPDDILAKRQTAELQHGRVAMLAFLELIRHDSQNLAVPGF DGYDNLITGASCS >gnl|To_NUC_proteinmodels_ML|p1 XACRQNFHEESEAAINKQINMELYASYVYLSMAYHFDRDDVALAGFFKFFKKQSDEEREH AQKLMSYQNKRGGRIVLHDVKX >gnl|To_NUC_proteinmodels_ML|p2 MAAPSPNLLAKGLPILTPTALNPHAPVGKWILGTSSLVCAMVHIGGVTRLTKSGLSMTDW KPLGSRPPITNDEWLEEFERYKQFPEWQQRQSMSLDEFKYIYFWEWGHRMAGRFVGLVFG GGWLYFTWLHRPKLQLDARSSLALPKNVTSAIPPGYQGRLALLFGMGGFQGLVGWWMVKS GLGQDRMGDRREIRVSPYRLTAHLGMAVSTYSLLLWTGLNVLSFPVDKYVLKSVAGDAAK VGTKTVSYLQEYARSLTPSALAHARRTRLGVLSTVGLTGLTILSGGFVAGNDAGCAYNTY PLMDGELIPWGDLVDPQVQPAWRNLFETTAAVQFNHRVLGTTTALSALGVAAYGLAKGQA RSVTPQVRRGLLALGTAATGQMSLGIATLLNYVPLHLAASHQLGSLVVLTCGIYSAHSLR YAGHGVIAKVGSRAVSSLVSGGGGVGSAQKSTVAVSRVINNVKI >gnl|To_NUC_proteinmodels_ML|p3 MAVMRAAASMLLVLSPSGAFAFAPVARTRSRPLASGPLFSTATDTDASIETEKNPRLSGL ALMLDDGTRKSHSIAENTQFVTGFFAGLADRDSYRSLMTSLFFVYEAMEICMDTTNEERV KYLDSPQLRRLPSIRKDMDFFYAEELGSDWDKKIEPSQASKQYVARIQEISETKPHLLIA HQYTRYLGDLFGGQMMGGMAARSLDLNDGEGTEFYRFEGIESTSAFITEWYKDLNKLDLT EKQKEEIVDEANLVFALNIAIFQEIEGSPIKAMFTLAISTLKQKLGIS >gnl|To_NUC_proteinmodels_ML|p4 MYAAVLLLFALVPNHSGAFVSTRPTLLRQESLIIRQSASTETTFKPILEELRDVAMKLHT REQAPKEGKAEEPKKPAEPFAPTQSDYLQFLVDSKEVYMALEDIVNGNDKLAPFRNSGLE RTGALEHDINWMATEFGLERPQCGRAGTTYAEDLRQMIKSDDDIPAFVCHFYNFYFAHMA GGRMIGKQMSKLLLDGEDLEFYKWDENVNELKSRNKEAIENFASSWTREERDRCVNETPN TFKGGGSLNGYLFGGSPH >gnl|To_NUC_proteinmodels_ML|p5 MKFSAYAAATLAVATLAPSTSAFAPAPAARFATPLSSSVDDKTEVREYFNNEGFNRWNKI YSESDEVNNVQLDIRNGHDQTIQKILNWVEADGDIEGKSVCDCGCGVGSLAIPLAQMGAK ISASDISDAMASEAAARAKSMGINNAKFYTSDLESVTGKYNTVTCADVAIHYPTRIKWQR WSDTCVRSSEDRVLISFAPKTWYYELLKKIGELFPGPSKTTRAYLHEESVVREALSKAGF EVAREEMTGTNFYFSRLLEAKRV >gnl|To_NUC_proteinmodels_ML|p6 MVTQDLKKFSSPVFPFTAIVGQEEMKLALQLNVIDPKIGGVMIMGDRGTGKSTTIRAIAD LLPEIEVIKDDPFNSHKSDLDLMGNEVKTAIQNGETLETEFIKIPMVDLPLGATEDRVCG TIDIEKALTEGVKAFEPGLLAKANRGLLYVDEVNLLDDHLVDILLDSAASGWNTVEREGI SIRHPARFVLVGSGNPEEGELRPQLLDRFGMHAEIRTVKDPILRVKVVEERTSFDQTPMV WMENYEVQQQELRNRIVDAQKLLPTVQIDYDLRVKISEVCSQLDVDGLRGDIVTNRAAKA HAAYNKRDKVTLEDIESIITLCLRHRLRKDPLESIDSGDKVSKVFKEIFEIE >gnl|To_NUC_proteinmodels_ML|p7 YLCEEAIRAGCQGGRQITSEDLRLAVKLAIAPRGTFINTPMDEDDMMVPQEFMFDVDSTP MDPDLIEFSSRERSGKGGGRGLIFSQDRGRYIKPMLPKGKVIRLAVDATLRASAPYQRGR RERAKGTKDEGRGVFIEQSDVRTKKMARKAGSLIIFVVDASGSMALNRMNAAKGAAMSLL TEAYQSRDQICLIPFQGDRADVLLPPTRSIAQAKKRLEVMPCGGGSPLADALQAAMLTGL NAQKTGDVGKVVVVCISDGRANVPLCVSKGEEFDPDADEDSKDGKPSRQYLKDEVLACAK QLGVLPGFNLLCIDTENKFISTGVAKDIADAAMGKYHQIAKADGSAIASVTNQALNAIKN E >gnl|To_NUC_proteinmodels_ML|p8 MASFKLAAAALLTLGSTDISAWTTNRAFGPKHQSIGKTAVETNRFYSQSALMERTSTALN VVTDPTDVESGRGGMFTASNAENRRIVPEDVRGRPTMKIVYVVLESQYQSSMTAAAKRIN AGSDSMAVECVGYLLEELRNEDAFEQFKKDVADANVFIGSLIFVQELAEKVSEVVTPLRD QLDAVLIFPSMPEVMRLNKVGSFTMKNLGQSKSVVADFMKKKKQEDGSSFEEGMLKLLRT LPKVLKFLPSDKAADARTFMMSFQYWLGGSPENLQSLLTMVGQDYVGPIKSAMEGKEKAV MEEPILLPDKAIWHPVAPDIVFETNEDYFRWYNTQHCPEAGIDPKTAPTVGIILQKSHIN TKDDTHYVSLIAELESRGSRVVPIYSGGLDFSGPVEEYYYDGFGKPIVDTVINLTGFALV GGPASQDHKKAASVLKKLNVPYMCAVPLVFQSFEEWQASELGLHPIQVALQVSLPEIDGA IEPIIYAGREGATGRSVPLADRVNLLADRAMKWSALRTKKNADKNIAITIFSFPPDKGNV GTAAYLDVFDSIKAVLKQLKSEGYDVGDAPDSKELIMESVLNDPEARINSPELNVAYRMN TDEYYELTPYAKDLEENWGPAPGNLNSDGQNLVVYGKQFGNVFIGVQPSFGYEGDPMRLL FAKSASPHHGFAAYYTYLEKIFKADAVLHFGTHGSLEFMPGKQVGMSGTCYPDRLINSLP SAYLYAANNPSEATIAKRRSYSATVSYLTPPAENAGLYKGLKELKELISSYQGLRENEGR GPAIINSIVSTAWTCNLDKDIEDLPDLETYDAKNDTVERRDEIAGAVYAQIMQIESRLLP CGLHTVGVPPSADEAVATLVNIAQLDRPEDGIEGIPRVIAATVGRDINDVYRGNNNGVLA DVELNEKITMASRAAVSALVNQSTDGNGRVKEVKNMFDEVGGFFGGLVGAKKPWTQAIID AGFPDVSEERLAPVFAYLEFCLKQVVANNELPGIMELLNGQFLMPAPGGDPIRNPDVLPT GRNMHALDPSAIPTQAAVEVAEDVVRKLLEKLADENDGAYPESIAFTLWGTDNIKTYGES LAQVLALAGVRPVADSLGRVNKVELIPLEELGRPRIDVVVSCSGVFRDLFINQMNLMDRG IKMAAEADEPLEQNFIRKHAIEQAEELNVSIREASCRVFSNSAGSYSANVGLAIENGGWE DESQLQEQFLTRKGFAFNADKPGMMEQQADLFKSALKTVDVTFQNLDSSEISITDVSHYY DSDPTKVVEGLRDDKKKPMSLMADTTTANAQVRTLSETVRLDARTKLLNPKFYEGMLSTG YEGVREIQKRLRNTMGWSATAGEVDNFVFEDANSVFIEDAEMQQRLLDTNPNAFRDMVTT FLEANGRGYWETTDENIERLQELYAEVEDRIEGV >gnl|To_NUC_proteinmodels_ML|p9 XIGGSRDPAIGSMDDDDIIAAVDIDLRKVLLRSDAPDPKVLGIKVWPTAIPQYELGHLDL MAELGEMEGKNEGGGLWVCGNYRSGVAFPDCVTFGYDHAKVVKEYLDGR >gnl|To_NUC_proteinmodels_ML|p10 MRNLLLSVSCTATCVPSALAVAVEPSSSVLDQGLGSTSISQKEQASRGVDLDGEVFRPQS YRIFDLLISEERLYRTGRMIFGASTLLVFTAALSAAKGFAPITGRSHVRSTGLEGTSSPT LEKETFQRSLLEAQLANKNGKKAGADAPPVNIGWDSHKPVDIVPDSLVRPGDGPDGNYPM RSKFESMIREAQISITDAIEKIDGKGKFQEDCWTRANGGGGMSRVLAGGDVFEKAGVNLS VVYGSMPQEALQAATERGVDRAKGMAPGERVPFFACGLSCVMHPRNPFCPTMHFNYRYFE TDGGVWWFGGGTDITPAYLNEDDMRHFHGTYKDVCDRHDPEFYPKFKAWADRYFVISHRN ETRGLGGIFFDDMNDRDPEELFEFAKEAVNSVVPAYGPIIEAHKNDPFTEKNKQWQLMRR GRYVEFNLVYDRGTVFGLKTGGRIESILMSLPETARWEYDHHAEPGSPEAEIMDAFKHPR DWV >gnl|To_NUC_proteinmodels_ML|p11 MFVCALVALALASRSSDAFSASSSSRAIAATAADIEEVFEDFAQFLISQQADIISEIEEA DGKGTFTNDRWGCFDDGASDDGNTSGGKTRVIEKGDVVEKGACSLTIIRNGKLTEERAKT IQGRQEHTDAGFTIDAGDEYCAAALSIVLHTRNPWVPTFRSDVRIFLVKSKDGCQSSAWF GGGSDLTPYYLNDEDVAGFHAHLKETVEKSFPPGNEYNLSHAQMKESCDDYFYLPARSEH RGTGGIFFDDLPATRSTLEFVRDVAQGWMPSWLPIVRKHASREYSDEQKHWQCLRRGRYL EFNLLYDRGVKFGLANANPRVEGVMVSASPQIAFDYNHVPKPGSEEERLVRVLKQPKDWV >gnl|To_NUC_proteinmodels_ML|p12 MKLLAALLALSAPAAHAFSAPAKAEPMPQTNTDASGAQIDPLLIRAARGEKTERVPVWMM RQAGRHIKEYRDLCKKHPTFRERSEIPEVAVEVSLQPWRNYQTDGCILFSDILTPLPGMG VEFDIDEKVGPVVKPMRTWEDVEKMHLIDPSAAAPFVAEALRTLREEVTPETAVLGFVGC PYTLATYLVEGKTSKEYLEIKKMAFTEPKLLHAILKNLAESIAEYALFQIENGAQLIQIF DSWAGHLSPRDYDEFAAPYQKMILDRVKERYPDVPTVVYIKHSGALIERMAATGVDVVSL DWTVDMAEGRERIENARKKAGLEGRGGVQGNLDPGVLFGDFATIKDRAEEIMRKAGPTGH VMNLGHGIEAATPEENAAYFIETVRNFRHEDA >gnl|To_NUC_proteinmodels_ML|p13 MKCHAASALLLLTRGESFSPASPTSRAPMKSSANTQQSTAPRRHFSSTLDAPPSSLNLSS EEDLQLTRQIIMAHVEKLGLDESILDDDDDENFRMVADPSSGDDDDLMDDDDYPENDLMI RAALGQKVERTPTWLFRQAGRHLPEYHEYKSKVGRNFLEMLSYPDDVAECTMQPCRRYEV DAAILFSDILVIPEALGVKVTMPGGVGIQVPFPLVGPADMRDRLPAKSEMTAEFVEDKLG HVLESVRLIRCRMREEDISIPLIGFSAAPWTLLFYMVGGSSKKNQGIGMEWLHEHSDASA ELLDLLTTIVIEYMSAQVDNGAHMLQLFEAMGMMIEPAEFDKFALPCLEKIATELKSRHP DVPLMVFSRGASFANEKLSELGYDVVTIDGGVDRSTARDVVGGRTGLQGNYDPAELIEAN GKTVETVRSTVRELLESLGPQRLIANLGEGLGGRESPELVNAFVEAVHEESEAMISAEN >gnl|To_NUC_proteinmodels_ML|p14 MLIRSTVALCCAVAASGFTAPSSPRRNARTTALPSTPVAEAESAASTTPCEGDRDILVRS ARGEVTERTPVWLMRQAGRYMSAFRQYSDKYPFRERSETPGMAIELSLQCHRAYGMDGII MFSDILTPLPTLGIDFDVIKGIGPVISTEVKTEEDVAKLGNAEDVDFDETLPFIREILGT LSKEAEEANTSLIGFVGSPFTLASYTIEGKSSKHCLDTKKMMMADETGESKAMSQFLDKI AVMIGNYACHQIECGAQVIQFFESWAHQASPKQFSEFAKPAAQKAMAIVKERYPDVPVIY FANGGSSYLELQRDMGADMIAVDWSVDMAEARKILGPDVPISGNIDPTVLFGTKEQIEQA VRDCIDKAGGPGNRHLLNLGHGVMQGTPEEAVGWLVDECRRYKGKDA >gnl|To_NUC_proteinmodels_ML|p15 MKFISALLFVGSASAFAPAQVSQGRSSSALFMAEDGEVKPLRIGTRGSPLALAQAYETRR RLIENFPELEEDGAIEICVMKTQGDMILDKSLMELGGKGLFTKELDTALLGDEVDICVHS MKDVPTWLPDGTVLPCNLPREDTNDAFITANGEIKTIADLPDNSVIGTASLRRQAQLLAQ NPTFKCVNFRGNVQTRLRKLDDGVVDATLLAIAGLKRMEMQDCATAVLDWEEMLPAVAQG AIGIQCRSDDERSLKYIDALNCMDTHVCVNCERGFLEALDGNCKTPIAGQARIIDGKIVF RGLIAMPDGSEKYETEATGAIEDAVEIGRKAGEELKERAGDKFFQMMVEMSPQQVLGQIT K >gnl|To_NUC_proteinmodels_ML|p16 MKIPAAIVLAISATANGFTSSSPTVSRSALFGAGNTDPVDPTMNGIDDNLGYEAFDPTAG DSPAVARNNNGGVWVKQRARPRRNRKSAGVRAMVRENIVTPANFIYPLFIHNEDHNTDIV SMPGCQRHCAESMLKEIGEALELGVKTFVLFPKVEDELKTNLACEAYNPEGIVHRSIRMI KEKYPEAIVCTDVALDPYSDQGHDGVVENGVILNDITVNQLCKQAVSQAKAGADIVAPSD MQDGRVKAIRDALDSEGFTNVSILSYTAKYASAYYGPFRDALDSHPGFGDKKTYQQDPAN GREALIEAALDAEEGADMLMVKPGMPYLDIIRRLKDASDLPIAAYHVSGEYSMLKAAVEK GWLDEKDVVLETLTCFKRAGADIILTYYAKQAAQWIKEDGLY >gnl|To_NUC_proteinmodels_ML|p17 MKLTVASALLAIVGSASAFSAAPSASGSGSALRSTPTATETYTFTKSEEIFAEAXXLMPG GVSSPVRAFKSVGGNPVVFDKVKGAYAWDVDGNKYIDYVGTWGPAILGHADDDVLDAVKA TMDKGTSFGAPCPLENVLAKAVIDAVPSVEMVRFTNSGTEACMGMIRLCRAYTGREKVIK FEGCYHGHADAFLVQAGSGVATLGLPDSPGVPGSATKATLCAEYNNLDSVKALFDENKDE IAAVILEPVVGNSGFIKPDKEFLQGIRDLCTDNGAVCVFDEVMTGFRVSYGGAQGYFGVT PDVTTMGKVIGGGLPVGAYGGKKEIMEMVAPAGPMYQAGTLSGNPLAMTAGIETMKKLSQ DGVYEELERKSKKLVDGIVATAEKHGLPISGDYAGGMFGWYFVEGPVKNFAKAATSDAEL FGKWHRAMLERGVYLAPSLYEAGFMSLAHTDEDIERTIEIADEVMSQL >gnl|To_NUC_proteinmodels_ML|p18 MKFIEALIAFVSIAGSSTAFQSPVAPSKAVSSRGRNTVVSGSALLAATLDGTAVNVPTGV KRKKTKQVEVREKLAIPEAEWNTQSNLICSTGQVEEAAILSTCNRFEIYFAASNAHEANA RVMEYLAERSGLPVSVLRRNIFMLEGDDAVWHLMRVSGGLDSLVVGEGQILSQVRQCHLH SIEDDGCGGKVLSRLLNNAVAAGKRVRSETNISKGSVSISSAAVELSEAMCMQDLNLPFS EARLAVIGAGTMTRLLITHLASRGLERISIVNRSMARPKELQEQFPDVDIEIVLEDGLWD VVGRSDIVFTATSSPDYVIDKQLLEANNLNSGRAIMLVDIAVPRNIGPDCSELPAVTPYN VDDLKAVVAKNTAMRQREMIEAENLLLDEKNSFVGWRESLSAIPTINQLQERANMFRQEE LKKCTRKLSQNDNFSDRELEAVERLSRGIVNKMLHGPMAHLRRAESVEKKQAALSELSSM FRLDDEDSGRGRRRRR >gnl|To_NUC_proteinmodels_ML|p19 MSGDAPTVDLEELRSKIAALGDSIKQLKSSSEPDTTAIGTAVAALLEAKRSCEYPSIHRE SSTSSLSADISIVSCDCDAPPLLSVAKNNGGIGVDGKEWQEPMTKSQKKKAEKEAKRKAL EAAGGGNQVSEANAAKKAAKKAEAKAKKAAMKAAAKDGGGGPPAAATKAPEAQKQAAALP PRPMLTKSRLRPNQISFNPNVSLEDRPVVALTVAILAGSIVDYELISDHTRPGCALGLPS GNGEVSGDLAMARFIAKRDGGGSSSALLGGPSEEDAAHMDQWVDYALSVSKFGLARRALS VQRTLDSVLVSGTYVVGHAISLADVALFAALGFPASDDARAGIASILPGGCPTLRWVDTM ARHPAVAEAAQLASNVARNDEAALEAGSCVEPILPGMAYLEGANPGCVTTRFPPEPSGYL HVGHAKAVLLNNYYARRYNGRLIVRFDDTNPSKEKDEYQTSIIEDLAKIGVKPDVVTFTS DYFETIRLYALSMIENGLAYMDDTPQEQMQDERMKRQNSKYRDQTPADALKYFELMCSGS DEGKAWCLRAKIDMTSDNGTLRDPVLYRQNTTPHHRSGTKYKAYPTYDLACPIVDSIEGV SHALRTTEYDDRNAQFMWVQKALGLRRVRIQTFARMNFMYTVMSKRKLTWFVDTGRVTGW DDPRFPTVRGVSRRGIDVDALKRFMCSQGASRRIVNMEWSKFWAENKKEIDKYAKRFMAI DKKNHAELTITNAGDGDEYLSTEYLPKDPSFGKRLVRTAKKVLVEKVDTEGMEVGESIVL VRWGVVKLTKVEDGVIEGVYDPDGDVKASKRKISWIADVPENVNVVLSEFDNLISKEKLE EDDKFEDFINPDTQAETEVIGDPGLKTLNENDIIQLERRGYYRVDRPYMGSGKPLMLIMV PDGKTKSMSKLDGKLAHR >gnl|To_NUC_proteinmodels_ML|p20 MKSVVSAVAVATALASGVSAFVTPSSRTTSTDASQLAVPPADEKYIPVDSTRASYYPFGA STSSDRLISLSARPPKSNEEVTDGSANVRQLLGLKGASETTDIWKIRLQLTKPVTWIPLV WGVMCGAAASGNYHWIWNPFDPNDRDVMLGLEDTAKGFVAMILAGPFLTGYTQTINDWYD REIDAINEPYRPIPSGAISEGQVIAQIWFlllgglGIAYGLDAWAGHDVPTVlllsifgs fisyiySAPPLKLKQNGWAGNYALGCSYISLPWWCGQAVFGELDRPVYFILPILYSIAGL GIAIVNDFKSVEGDRQLGLQSLPVAFGIDTAKWICAXSVTVTQLGLAAYLQSINETTYAA ILMGLLLPQIYYQATLLIPDPVANDVKYQASSQPFFVFGILATALCLGHHDF >gnl|To_NUC_proteinmodels_ML|p21 MKLAIASLCVGSTTAFSSFMGQNVAHAPATSSSALSMKYKVAVVGGGPSGACAAEIFAQE KNIDTVLFERKMDNAKPCGGAIPLCMIGEFDIPETTVDRKVRRMKLISPTNVEVDIGDTL QPNEYIGMCRREIMDKFLRDRAISYGAEPINALVTAIDVPADHVENPDAKYNIKYSEFVE GSRTGTAKEMMVDLIVGADGANSRVAKAMDAGEYNFAIAFQERIKISDEKLKFYEEMAEM YVGDDVSPDFYGWVFPKYDHVGVGTGTVVNRPAIKQYQKAIRDRAGDKIAGGKIIKVEAH PIPEHYRPRRVQGRMALVGDAAGYVTKCSGEGIYFAAKSGRMAAEAIVKLMDGGRRLPTQ ADIERTYIADYDKLYGPTYTVLDILQKVFYSNNGAREAFVELCNSKYVQQVTFDSYLYKK VQGNNPLDDIKLLGETIGCLIKGYSIAKPDAEFSNPVESMKRL >gnl|To_NUC_proteinmodels_ML|p22 MRISIALMSLPAASAWSTLTMKQSGVNSSRRDMLQKGAALATSIAIPASANAYAVPDLLY PFEALEPYIDAPTMKIHHDKHHATYVANINKATEGKPEVDILDLQLNALEAGPVRNSGGG HYNHAFFWDEMAPPDQAAKTKPSPELEAMINKSFGSFDEMKAAFEAQAAPGAVFGSGWVW VCLSSDGEELKIVGTPNQDNPLMKGVADEVMFPILGIDVWEHAYYLKYQNRRPEYVSNWW NVVNWDKINENFTYVTEKKAGVPVRG >gnl|To_NUC_proteinmodels_ML|p23 MRLNLPLLFAPVATTAFAPSCPKARPLASRRSTSLNNYSTVKWSPRGGGTAFSAPSSYSS ADLDEEPVKKMFPEISYSDLWILASYVGLEHTGGPVIDFTPGRVDHLDDXDAESGTVKGW EGLCTHVRNEVFYRMGFNDQEIVALLCGGHVYGRCHPNFSGYAGPWVEDMTKFSNEYATD MIEDDWTLVSNGDTWLDDMGAGELRPAPGKRQFVNKVPGRIDDEPNQMMLLSDMILAWDP NFRYHLEQYAADEEKLKHDFGVAFKKLTELGCGF >gnl|To_NUC_proteinmodels_ML|p24 MVSRSATILASLEAIDPVVTDGSGHEVEQDIIAYRAAFPPGPSPSPGGEAVRIRDIQFSA TSEAGKSDRSLRKNSSRYPRKNHEHIGADKSTTDGTRWIFACFCTGLCSATGANTTTPPP QTTETLSTTTMSLEDIKSDIKAIVAEKDCGPIFIRLSWHDAGVFSTGKLTGGCPNAAMRF TDGGEGTFGANAGLPTVALDLLKPVTDKYCPASISHADLWTLVANVAIETMGGPAIPTKF GRKDAATSAESVESQVGRLPDGDKGCPHLREIFHPKGFSDKDIVALSGAHTVGKCHGDRS GFDGKWTENHLKFDNSYFTEMLSKEYADETTAAGCPQKKHAASGTIMLISDLALLEAPFR EHVELYAKDQEAFFKDFVTVWVKLQENGCTGLRDTL >gnl|To_NUC_proteinmodels_ML|p25 XDFMDYDHADSRNPMGMDGCLDWASPNNAGLNSIWNDHTDLHRLWESTYSDISIPDFWIA SANAVVRSTSDGALDMRHTYYWGRPKADSCEGSADRLPSAKSCQEVEETFLERMGLTWKD AVALLGGHTLGRGHTEFSGHHGTWMPNDRLATRFNKQYYEELIRRAWSPRNVGTNIEDWS STPTGSTNPKMMLNTDICLYYDVEDGANGDCCTRTDAFRPDGSSRCPSFPNRECATIDPN SNHPRAEAAAAVRRYLGGMTVNDNQQPFYDAFSLAWFKATTNGFSDLKSVRDTC >gnl|To_NUC_proteinmodels_ML|p26 MKLSAAITVAAVSGAQAFTAPVNKPLRPASSLSVTQGDLDGAQSMIDGILTEKNCGPVFV RLAWHDSGTHDVSLADKEWPASGGAIGSIRFDPEINHGANAGLAGAIKLLEPVKEAFPGV SYADIFQMASARGIALAGGPEIDMKYGRVDATSPEECSPEGNLPDAEAGPEGKFGGPGGT ASTEDESAAWHLRKVFYRMGLDDEGIVALSGAHTFGRAYADRSGVGAEKTKFTDGSATKL ADGSETTAYTAGGSPWVEDWLVFNNSYFTTINDASTDEELVKCTSDKCLWEDAGFAPFAN KFADQEAFFESYAKAHKALSELGSKFEPIE >gnl|To_NUC_proteinmodels_ML|p27 XRSNRPQVRGLTIKFEITSVSGERIALWLLVSDTQTRDTHQTKDRRESRICCFLWASYEI QPFHRRSSATWRSTPGKDMRKVKKEAVLSVSRPPPAVYDMGTNPPRPCAGRRRKNQRFGE IKQRQDVEIGVRAGVCGVRPCADFEEELALLSQRYKSRGFNVLAFPSNDFNQEKDTNAEI LQYVNDHFPEVKFPIFSRGSLADSEVFKLCKEMTGESVRWNFHKYLVNGKGEAVKSYGHR IQPMAIEEDIVALLEENDGVRLQKPMVM >gnl|To_NUC_proteinmodels_ML|p28 MRSSLVVAGAASALLVLSNHAVAVAGNSAAECDTWAESGECSLNPKYMLQHCADACARQA ELDSEMAEAIESKIGHVTSFFDLEAQDIDENVITFDEYKGKVTVITNVASHCGYTESHYR GMVKLYKRFSGSAVGFNILAFPCNQFGEQEPEACPNIKRFAEKKGVEFTMMNKILVNGLD AHNVYHFLKKVAGPPSIAWNFATYYVVSPDGVVQSLSGVX >gnl|To_NUC_proteinmodels_ML|p29 MGVFAKLQDASTRMIFGSEQETHKESFYDIVDKDMDGNEVSKLYDTYGSRGFKILAFPCN QFGGQEPGTNEEILKFVDDKFGAGTKDKFVWFEKSHVNGKDTRELYSYLKKALPSTDGTR DIRWNFAKFLVDSEGIPFKRYGPKTNPEEMCADIEELLKKAGK >gnl|To_NUC_proteinmodels_ML|p30 XSLYPEGSTDGGLRLGNIVPDFKMETTMGDFDSFHEWKSGKIGRLALKYDKLKEMDCLVA TLSVDPVKSHTDWLNDVVAHCENEIEIKFPIIGDSDRSISTKYGMIDPGTSDEQSLPLTI RAVFIINPENKLMLSLNYPACVGRNMDEIVRCVEALQLSYQKSIATPANWPNNHADIPMA DGTRSTEFKGSVFLLPTVSEEDAKKSYPNYHSCEVPSKINYLRLVKKEDVEAAA >gnl|To_NUC_proteinmodels_ML|p31 MAATLLLLAATVVSVEGFSSSNTHVGSQQMASSTAISAKENENDRRSFLANAAAAAGASA IAGIPGQASAKVFLDPAMYGDQELRVSAVDSLRESVRRSLLQKPSLTPYFFELALLDSLS YDTQTNEGGPDGSIIKAVISSKGTDAHTKALQECANVLIESKKNLKKLSSITIADAVALA GAEAVNAVGGPTLPTQLGRTEAAAKAPIPSSMPPLDLFTGAVSGMDVYDVFQSSGLTDRE MTALLGCLLSIDTIEKETPEGSWKEASKPKFREAGKIGRMSEFKKLTDEDIANELAKDDE DDDGESYTVSGDDGWYIADTFGTADDKFGKRASGEKKIELSSALKNLSKVSPSTTQYSWI KDLLLSKDLPTAQTYVSKYGSKPLVYEKDLKVCYNSVTQLGAEFTGGKYESLLKNKKRKT LNDDELDFLK >gnl|To_NUC_proteinmodels_ML|p32 MAPKAESACPYHQTATVDASPAAKCPFNHGKKQLVEATTNGDWWPNQIDLRHLSQDPSSA RPSSVPHPTSSTVSSKLSSPREATTYKSKFESIDLRQLRRDVHAAITTSNPAWPADYGTY APLMVRLAWHSAGTYRVFDGRGGGNSGNIRLAPLNSWPDNANLDKARNVVLWPVKKKYGS KLSWADLIILAGNVAIESMMGEDFESVEPLWFGGGRVDAFAPETDVYWGIEKEWLQDERH GDAGGELEEPLSADQMGLIYVNPEGPGGNPDIQASGRHILSTFGRMGMNARETVALIAGG HTFGKAHGAAPDSHVGPEPEGAPIENLGMGWKSDYGSGKGKDTITSGLEGAWTAHPDRWD HGFFTNLYRYDWEQTKSPAGAIQWTPTDESIERNGGLDAATAVVDAHVDGLKHLPIMFTS DLALRYDPVMGPISEEYHAYHDIFTEEWKRAWYKLCHRDMGPKSRHLGPYVPKEDMRWQD PIPEPSSAIDVDDVANLKATVLKAIDESKASVSDLVKAAWASASTYRCTDHRGGANGGRV RLEPQTNWVANDPSSLAKVVTLLEGIQADFNSKSSRQVSFADLVVLAGGVAVEHAAKKAG SEVTVPFVPGRTDASQDATDVQSFNKALKPSVDGFRNYNESSDRPEHDLFDRAHLLSLSA PETVALVGGLRVLNANSDSMQVGVLTERPGVLDASFFENILDLSTDWAATGDGKLYLGTR ADGGRPWVASRVDLAMASNSQLRAISEVYASDDGKDAFVQDFVDAFAKVMNLDRFDLLA >gnl|To_NUC_proteinmodels_ML|p33 MLYMKGSPNQPMCGFSATVVGILKNSGADFASVNVLDYPEIREGVKKYAQWPTIPQLYVD SEFVGGCDIIKDLHESGELKEMLKVPEKEGE >gnl|To_NUC_proteinmodels_ML|p34 MAPIDIKDDDEARGASLLADLADVKKILFFYAEWHEPSAEGGPFDLVVKTLANQGPGEVR FYRVLAEETPSLSNKYNVTTVPTFVFLNADGSISDRIDGGEDVSRVTQCYARLSGASSIA TKTLASGQTTSREQEHDTAPHVQQSLNDRLKSLINSSPIVIFQKGTPTEPKCGFSRQAIE MLNDANVSFGYFNILEDDDVRQGLKAFSDWPTYPQLYVRGELVGGLDIMKEMADEEGGLV EQLELKEFVIAKSISAPASDEKDLNERLKELINRHRIMLFMKGVPSGPRCGFSRQMVEIL DSFEVSYDAFDILSDEDIRQGLKAYSDWPTYPQLYVDGELLGGLDIVKELQEGGELEETL RC >gnl|To_NUC_proteinmodels_ML|p35 MGRITIFCVNECNFCRQTKAALTAQNVPFVEINVEMYPEKRKDMQSLTGQLTVPQVFFNE KHVGGAKETLEILEEWDLETKSKYCPDRNVREHYIRMVGEQGEPTDKRLSIPKPKSSPSS ETESANVSSSRTRDLFKVDDKHWTTLEFTETLMQNMPRETLSYWGSHYFNTFKGCDGVTA LQKTFELKSRDEAAQLGQTLQRKQYIHHVTKDHPFGDNSYYYRLQPFHTPNVLNTFRVWT DEVDEPLNVIHRLAKLWSKLEARHLNSDGMVDHSHIRDDPYYWKFEEEVCELQGVRMAQL DDNARKAFVINVYNLVIRYASVKVGVPASAATRSVFFDQVCVNIEGADFSLNDLEHGILR ANTRHPFQFTRSFGMTSSKQSLALTKLDPRVHFALNCGARSCPPIKKYTSANIDEELEVS AQAFCEQDDNVEVDMVDGTLTLSKIFCWYSSDFRSEIPGVVAGFLSGKKKENLESLIDGG NLKVKYFDYDWSTNDVSNLTFERSEIRSRCIVGGKAPTDKYRMPK >gnl|To_NUC_proteinmodels_ML|p36 MRSAVIVGLAAAASAFQSSSTPLSRRTSALSSSADDVALPVNPAIKVAANGMSLLKPIFA AEASLQSAVLGAIGGVDKESIAAEIQALKEENKVLIYTYGLSPFSTEALALLDSAGYEYK NVKLGEEWFLLGGEGSETRVALSKEVESGATSLPKIFIGGKCIGGYSELAALAESGELDS TLKSAKAKKIGDSSEKPNFFTNLFA >gnl|To_NUC_proteinmodels_ML|p37 MKGLLLLAIAALAHGFTPAPSPRQANTKRMASPLDNILGFIKGGKIGLVKSIAVLDEKNA KYTAIELDKDEDGKAIRAELGDILGRTSVPAIWIDGKFIGGCNDGPMGGLITLDESNKLD EMLKAAKAL >gnl|To_NUC_proteinmodels_ML|p38 XAKAEIAANDVVVFSKAYCPFCTSTKQLLEQLKIDAKVYELDQMEDGAAIQGALLDISGQ RTVPNVFVKGKHLGGNDDTQAAARTGKLQELLK >gnl|To_NUC_proteinmodels_ML|p39 MRRIQILHSLTAIFIVERGTKRQRSARGDITGFEIRQFSNGYLHAADSQVVVWSKSYCPY CARTKNLLSERNIDAKVFELDQMDNGAELQAALLEMSGQRTVPNVFVKGEHLGGSDDTQA AARSGKLDEMLK >gnl|To_NUC_proteinmodels_ML|p40 MVLRLVWSSAFTRLPSVSSSGKRLTGRRKNNENFEQAAIMAGCLCFGGFCIPYSAAMPLL VYLLQTVAAKLAKAGLLPRYVARRIDRVNGVASKAPRSRGRGQRQRTKEGDAPRDRASTQ ATEEDCDDWCCRLTSFLPPFLLSAASSVSSETSLDESDVEEDESRDGIVELASDSTSFNP FVLDPTCRDSMHRITSLEQWEELRSTCRRRRLIAKFTAEWSKPCLVMQPPYEYIASVNSK DCAFATIDVDGKGCDAISSDNKVGLLPTFICFNGDGEEIDRINGANSSHKLRLWIEKMGR I >gnl|To_NUC_proteinmodels_ML|p41 MILTSSHVARLASLLMLLCMASASRPFGVTSGHRSRRFLATRTTLLNIRGGAVHESQTIS DLESRLQTAALQNKLTVIDFTVPLMIGVSHMQPRCNCSNTSSALTRTIETISDMIAPIYK DLSDEYGSRAQFLKVDVDTNQAAAQKFKVSSMPTFLFIKGGEVVDQMVGANPQRLRELID ELAF >gnl|To_NUC_proteinmodels_ML|p42 MLAVSRLSRRGHNIIRHTSLSANHRWMSAVVNLSDLDAVTKFRNINSKSVVYLTATWCPP CKMISPIYDELSKDDAFHQVAFGKVGPYKNWNFILARIHSLHFKTSTKTRKFSCHVRLIY FRSFDSAAMKFEVSAETHNTYSKNVQVPTFVFSNNGTDVVNKFSGADKEQLKTLLTDLKN S >gnl|To_NUC_proteinmodels_ML|p43 XPQVTGEELELMLQEWDTPLVVDAYATWCGPCLLMAPEFEEAAKELEGKVRFVKIDTDLE PEISGRLGIMGLPTLLFLGKNEDTAAVEEGKAPMAVLKQRIEGALQKKKIVDVCNFVFFD GPQPTLG >gnl|To_NUC_proteinmodels_ML|p44 MARSVVRIAVAAASVTASMAFAPSMHVNTSGRTAPSRPLQAVLDIDSEAAFDDKISSAGD SLVIIDYSTTWCGPCKVIAPKFDELSDSYPDSIFIKVIGDATPDASKLMKREGVRSVPSF HYFKNGEKIDVVNGANAEAIEAAIKKHQ >gnl|To_NUC_proteinmodels_ML|p45 MVSSRIRTAVFLLAIVEVAALSVPTIRRIESRGLRICNRQQPTNRRMRRRPSLSSSSHLF STTALHPMEEGEREMNKTRTGAINGVGTRNKWRRLMPKVLRNDGTEDETAITTVDTLEDY KREVVDVKDKIVVVRFYAQWCKSCKAAFPLFQKMKADLPSVKYVMVPLTKDTAYIHSGLG VPSVPFGHIYHPDVGLVEERKINRKVFGEFRESLESYVRESCDLPSDDEPSAGALPDMED ESEVFQ >gnl|To_NUC_proteinmodels_ML|p46 MRLGFAVLALAAAARPAAAFTIPQSTSSVQRATAAFNIFAPSTDSVTSTALRMSTAVDAD PQVTIDEALKAAGSGVTLFGKSGCPFCKKTKKALYFIGVHPTIVELDEVEGGAAIQKKLE ELTGKSTVPNVWLDGKFIGGSEEVIAGVDDGMFDAVEKKEIILMEDEEKIPIVQGPDALK VGDKVPDAKVWAGFASDDFVSLADYGKDKNILVVGLPGDEDKLKEAGVDSVIIYCVNDPA VMMAWAKDQGIENWEITESDGFVSFVSDPKSDLTSACGMTMTHPGPLSVGLFERCKRFAF YAEKGTVKVVNVSEYEGDPAGDDYPELTLPASMIEAIKKA >gnl|To_NUC_proteinmodels_ML|p47 MAAIHTRIESANRSAKQWEADRDLLAEIRQSIPLKELVPELVCDNEKRWNVFLDGVVVDS NRSGPVQCTTDCYKEDDGDWEGDDLLLKRLTLYFKQSVMKWCNQPPCSNPNCKGNEDGKQ MEAKGVRGPISDDEKAGKASRVEVYSCRLCGAETTFPRYNSPRMLNKSRRGRCGEFANLF GVSIFDYAYACLQALCTDMHNRKPSDLLSRAGYILDLTDHVWVEVWSVRQQRWIHADSCE GIVDRPSMYEQGWGKKLSYVIGATHDSVADVTRRYTRKLNSDDFLARRREFTPDETTGDR AFVQMDLTIRQVDNLPKGRLEELDKRVANEKKYFGIVQSSGVWDKDYYEGRLSGSLAWRA ARKELGNDSGEAMDDDENAEISSFLVESFYTSSGKGDNLSIEVKVPAAVSEFGGPCDVLP SSCIVVDGVPCAAALNKGGTCVAVIDERSGCILQSRGFGQWTSFCSFIDTLTDGRVLAIC HVKPAGGVDDQRAPPTKETSESLLRLGGFKIEDAVNAQYMIVGQLNHRPAWTTMSTAKGI KVTIKLNQSARPATKLRSEMNVVPSTLSSRLPESIMPIKEQLLASEYQKRVAFNAYMSRD GSNPSVVGYCTGGNKPVYLIDNGAVPFKKAEKDSCWSVYHHLPTYLVDDDDDVTQDDGKE KSDAAKFDIPIADDHFTQLLGNQLLSMSSSGGTSETDTSAAIANTRLVALYFSAHWCGPC RGFTPMLIEFYNVLKEAHPAHGIEIIFVSSDRDEPSFLQYFSTMPFLALPFSNRALAQQV KSMFGVRGIPSLVVLDSMSCQIVVPPDRSRQMVHQSCQRGEDAIEHLFKTWIDLVPAESK AMLEILAMSCQEAEQGSDGVGIKLRTNAHKYFARKAEESKSLSKEEFSARVKTIFGELVA TGLGPNEAAAEAINRATNEQSKTSTPSDEGALSGTGEVCDATVPEFTSIASALDEMCKLN NDDRSSVASVLKTARKYILNVKKDPHNPRYRSFRLSNKVFDQITSKPGSIALITWLGFSI YSSDTDFYACIPLTIDLDAFASVLDKAIQDYS >gnl|To_NUC_proteinmodels_ML|p48 XSAVELTPDNYDSMTSGKSVFLKFFAPWCGHCKKLKPDWDKLIDEINDEKRLVADVDCTA EGKALCDANGVRGYPTLKYGDPSDLQDYEGGRSLDDLRTFANENLKPMCSVKNIDLCDDE KKAQIQKYQGMSKADLESAVSVEEKKMQDAEEHFKNEVQKLQDKYTELSAAKDAAVADVK SSGLGLMKSVLSMSGGTRDEL >gnl|To_NUC_proteinmodels_ML|p49 XNGTDRLADMFAYDFETNHWSEVDCSLGERPSGRSSLVAQVHGNSLYIFGGYNGSTVLND FYKFRLQPVPIPPSGLLNDLRRLMIREDLSDVIFVVEGQEVHANRAMLAVRSQYFDALLF GGMSESIGVDEEGDRKPIVLNDVSYECFKQVIEFLYTDRVQDLTWDNGVPLLIASEQFML DRLKALCEDQIRRDIAVENVIGIFIASHRHNALGLKEIALEFILRNLTDPAIIAGLSDLK TEPDLLVEIITKNASNPFLPPADAAAVEFTNNEWAAR >gnl|To_NUC_proteinmodels_ML|p50 XMEARVRSSRSKKTRRSKPSLVPYVDHRHKVCHNYHDHSYDAFDESITAVVEQKRGSRGG VSVPFPTKLHVMLSSVEAEGQSDIVSWQPHGRCFVLHKPQEFVDTIMAKYFKQTKLTSFQ RQLNLYGFARITKGKDRGGYYHELFLRHKLFLCQKMTRIRIKGTGVKGRSNPESEPNFYS MTFLTEKDEDKINRDLEEEIKHEIEETSDRSSPPSGARHRASSQLLRPRAISDAENTDLK RVLSESDEDEEDDDEEWLPTSKVAPEKKIMHMGEAVIISPELSPKPVFTTPSSMTIDKLA NSYEGMLQQRQLNRHNISTTFREVDVPLEQAQPVTEAQMMSFGSSIVEGPESPRSGDEIS FEGQRFYYLDSFTVAASGPVTTVQNPIQSRRAVPSRLSIARSIQEPILTPSASTSSIYDG KEEEAVVINPHTIFSDKEESDMEEEWDVGHGRNIEVDEELTRLAAF >gnl|To_NUC_proteinmodels_ML|p51 MMVPPADEEDENGGTKKQPPASPAKEQARTGSPKAANYNDLIKKNSKKNGGKKGKKKKNT DIPPASIKQVLSFLPGTADRALLAAGVVAAVGNGLVYPALAYVFSNSFSDLGQASESLTQ VTEIAMTFLAVGAFAFVVAALQNFFFLIVSTRAADAFKKEWFRSLLRQDASFHDVHSVSG MATALSSASSKMKRGLGRKLGEGVQFGTCFLGGIAYAFWAEWRVALIILALLPLVSFAAF ALMQLNQNQTTNAQRAYTHAGSTAYALTFPLHWHAFGNRNYSGAVSSIRTVLSLNAVEEM IRQYSAATQEAYENGVRPLLKVGFVNGSMLGSFILLYAVLTLYGAYLLYGEVTDKNCDPS GSMSSLFPGMIETCSVSGPDVFGAMLGVAFAAQGMSQLANSVEAISTARAACATAMQAID RKLGSEATNVTKKVGTQKNEVDGTEEDIEETYTLPRYEIDSSSHHGLKPQKTEGEVVFKN VKFFYPTRPASVIFNDFNLRIESGKTVALVGPSGGGKSTTIGLIERFYDPVSGTVTLDGV DMKDLNVNHLRSQIGYVGQEPALFATSIAANIRYGKPDATMKEIEEAAKRANAHDFIMSF PEGYETQVGDKGGQLSGGQKQRVAIARVLVGDPKLLLLDEATSALDSESELVVQEALDQL LEREKRTTVIIAHRLTTIRNADVIVVISGGKVVEQGSHDELMLADTGHYRSLVQKQDVEG GGGDGSDSNGPSRNSSAKNLQSLVAAPSSTNIVGKSFATMTQIKFDSVSFAYPTRPNKPI LDNFSLSVRRGETLALVGTSGGGKSTTIALAERFYDPDGGAVLFEGVDLRDLNIGWYRDQ IGIVSQEPTLFSGSIAKNIAYGFQGATREQIEAAAKSANAHDFIMGMAKGYDTEVGEGGG GLSGGQKQRIAIARAIVKSPKVLLLDEATSALDSDSEKVVQQALDVLMESHERTTIVIAH RLSTIRRADRIAFIANGKLKEIGSHDELMERPNGRYRRLVESQRRQSTVSVSAIKKDNAS ALAEDGNEELDFEKEEEELASKAFNKADARKFAAPELNYYLIGSVGASIAGGVFPAWGIV FAEMIGLLFYPVFPCPLPCAMDPTQCESLDGTYVPMGHDTCENYYQESADKMQDMSFEIA LYWAGIIAACFVGNILVFQGFGSATERINKRIRDKTFGALMRQEVAFFDKRSVGSITSQL QDDVAFIFAFSGEPIRTLVINVSSVVTGLTISMIFMWPFALLSIGVIPAMGFATALEMKR FLGEDEGGETVEDGRDSPGGIIVETLLNIRTVSALTIEEQRFDDYCQALEKAEGNMAKEA AVSGVLSGLSIGIQQWVNALQFWWGGWLLYNFPDQFTFEDFLISMFALLFSLFALGAAAQ GAADKKKAEAAAGRLFYLINRTSANDPLSTDGKKLD >gnl|To_NUC_proteinmodels_ML|p52 MAFGASGMGIAANWVAAAAKGKAAAVRVFELFDRRPPIDSQPWNEDGSPSDIVVPEDSGK RGEIEFKNVKFAYPTRKTARVFDGMSLKIPAGQTAALIGSSGSGKTTVMSLLERFYDPVA AVVDRGENNGSDQIEIVIGDKPPNLDDSNGVVLVDGIDIRTMDVKYLRSQIALVGQEPVL FDASVSENIAFGKPGATEEEIISAAKVANAHEFISKFEGGYDYNVGTRGKKVSGGQKQRI AIARAVIKDHRILLLDEATSALDNESEKIVQRSLDNLIAGEKSAERTTVIIAHRLSTVRN ADCIYVLENSGDGAVVVESGCHDELIALNGKYKALLNATMKSDDD >gnl|To_NUC_proteinmodels_ML|p53 MMDYLMKELTSLCTAFAVVGLICLVTGFGYVSIFSYTGEMQSLRIQKAFVRASLNQDAAW FDTHNRETLPTKMGTALVHINNGIGRQIVDVYSNAISAAGCLAVALVLNTKLALVMLCVV PFAFIILMLFNFCIRRVKRRASAENSSAGGIATESLAGIKTVASLCAQPHFTSLYSHHIS ESSRLNIRASFLSSLVAGVTGGLFYVTYIFAFYIGTEQVAEDMERITIIRCLFSQEQNCR VTGASVMCCIYGVILCVVTFFGLMGPGLSVVNLARSAAVEVFDTLLRQPPIDPSSDKGTR IDGGVSGKIEFKKLCFTYPSAPDRPIFYDFNLTIEPGQSVALVGPSGSGKSTIARFLLRF YDPNQGSIIIDEKYPLNTLNISWFRSQVGYVAQDPLLFPGTIRDNIAMGLNAHGETATNE QVYQAAKDACAHDFILGLPDGYDTYYGGTSVQFSGGQLQRICIARALIRNPSILILDEAT SALDQMSETHVQNALAHIREHKKVTTVTIAHRLTTIIDSDTIAVINGGKIAELGDHKTLI EKENGIYRTLCESQGIKPEDDATSGLATSTDDADIETKSDVEDGKAEGSAMPLDDEIPQD DPTEVPQASMSSIWRQIGGARAFVELLIGIVGSGLVGCLSPCESILTAQIVTNFYTVDVD DMLQANKEFIVKFFYFALAALVGNMMVGYGLGRSGNKLAQKLRIKSFGSMMNRSMGWFDL PEHTTGDLTSILGADVEAVSGLVGLPLGYRVRALVSLLTGISIALKYSIEVGIVAVMCVP LIMIAGLVQSCCVRRRFASNTEGLSPGTIMEQGLRGIASVQAYGLESKVGADYEVALEPE SNGKVKQGMVAGAVFGFTQFSVFVSFALVFFVGSQLLIQVKIDFISFFTSVLAVMFGALG ASTVAADFNSRQKGLVAAARIYTTFYGPSDGSHEDQGTVMPIEGDITFQSVKFSYPSRPD SSIFYESDTMDGVSLHVAPGESIGLVGRSGSGKSTVLQIVMRFYDITGGSASLDKKYEFA DVNVKTLRSQIGYVGQLPTLFNGTVKENILLGKEDATDDEIIAACKLAQCHEFVLNLADG YDTHVGPGGNILSGGQKQRVAIARAIIKQPRILILALDNESQKLVQAALDDLSADYTTLT VAHRLLTVKNCTKIAFLGDGGVLECGSHEELVALKGNYRELWNMQGGGSEEAE >gnl|To_NUC_proteinmodels_ML|p54 XVVSDDLPALCLARCGAAVRFFSCLSCTCSNSILSNDTHNRNDSRFAVSLFKHMQVLGAA YHLERHAGEQMSILSRGTSATSTIIDALLFTLLPTFFEAFVISSVFAKVLRLPLIGLTTL ASVALFLIYTVKVTNLRLDQRRRVIEKNDAVGRIETETLVNYETVAMFGREEKEVGSYGE VRSEYKEERVKMlslfsllqlgqqsIRLAGTCIGLWLAGRATVYGTSGSGGDELLSAGSF VVVQLYIQQLFQPLTYLGFTYRQLTEALTDLEKATTLLKKKPLIVDSPDAVSWNEALELQ GKSMGASSGDITFDDVSFKYRVRKRSMQDGAKSgrrgagfghgrrggggrgmwsghgkgN FWIKSAAAKDGKPNGEDDEAKIEMGGITNVSFHIPAGKTAALVGPSGSGKTTIVRLILRL YDADTGSVSVDGINVKAMQQTSLRSNIGVVAQETILFHSSLRENTGQVIYGKedasdeev deavrvsaLEALVKSMPDGLETLVGERGMKLSGGERQRVGLARCVIKQPKLVLLDEATSA LDSGTEREIQRNILSQQAEVCKGRTTLMIAHRLSTARRADIILVLDKGSLVEQGTHDELL ERGGLYAKLWSDQMSGETLDEL >gnl|To_NUC_proteinmodels_ML|p55 MCNNGRIPSSSYRKLVGQLLLFLRVYCETGKLVQEGSPKRTLPLSFAEALMVVFSLSWNV KGMFFLTVVPACILATLPPLIAHLIGAFIRSVTRWNPDGTPNFDEKGALTSLLALMAMGY LLMPVLQFTKNFFQARYVSQLGSWCRKRMLNVMMKGGTEYSEVYRGGKLCDAFSNQLTQV EMFTQIAFMNMLQYSVTVIASVITAAIDLRIRASVKQASVDGRFAGMISSTVECKPVIRA CNAGSWTQTQFSDMMDETQMAHKTAFFRSTLIGDVFQLYFALYALAITVPLGLRVIAGYV PIADFTTLIGLLAPLTGALLFLGQLNSQVSLYSGAIQTVRDLTKSDFDEEPSSSSANKTT LAPFAKSLVMSDIKFRYREDLPNVLTDVNIEFPKGSYTCLCGGSGSGKSTVLNILMRFRT PNEGSVEWDGQDIFKTSLASFRDNVTVMFQKTMILQATVRDNILFGFDETPGAVEAAARN AEIHDAILLLPHGYDTLIGGDSLTNMSGGQLQRLCLARALYRDPSVLLLDEATSALDKLS EDAIIDTLVKLRDEKGLTLISVTHRPSTCIKANQIIVLERGSISECGTYDELVSTGGLFS RLVAAGEEE >gnl|To_NUC_proteinmodels_ML|p56 MMTPSAEEPADDVSPDIDIEKGSDSTAASETKKKSTASKKEQEFETETPSLLRFVLLARP ELPALLTSLFLVLAADGVNLVVPIVIARAYDALVSPVFDEDERGNVINQTMILVLVLTAV GSILGWIRGFLQGLIGERVVARMRLRLYRSVLFQDISFFDEHKSGEIVSRLGSDATLLQA SLSTSVPEVLSGMSKAIVCVVLMFYLSVRLTSVTLGGVVLISGLSVPLGKALGDLSRRYQ DALGVAQTRSTETIGSMRTVQSFVAEEKEFGRFADAIGDPDKGNILLYPKGTEGRKDGGR NTLQVGYSKATVTSTFFTLMFGGGMIVLYLSLWYGFHLVNGGWMSIGDLTAFQSYVFQVA FAIGQAAGNIAKLIEGIGASGRMLYLINRTPAIPKQDAEKEPFVPNERMRGDIEFNKVSF AYASRPDNTVLKDYSLTIPANTTTALVGSSGAGKSTVVSLLQRFYDISGGSITIDGHEIT DLDLSWLRSNIGFVQQEPHLFGLTVRDNLLYGVNREVSQKELESVCRDANCLDFIESWPE HFETMVGEKGVKLSGGQKQRISIARALLTDCRILLLDEATSALDAESEHLVQSAIENLML RSSRTVIIVAHRLSTVQKADQIVVMHDHKIAGVGSHQKLLEENSRYKDLIKRQSVMTIPM WSTSSLSEEKKSD >gnl|To_NUC_proteinmodels_ML|p57 MTALPTIAAAATFLVYVYGSAGEISASILFSSIVAFDLIRMPLMMYPMALAQFSQCKVSL TRIAVFLGYDEVNEIGYTRDESADGEIIIEGATLYWSDPTKPLPRSALEKTSKLDDSQQS SRKLSLPGRLSRSSSKTSLSDLEEAAEEQSEILYPKAVLSDVNLHVGTGELFGIVGPVGS GKSTLISSILNESVLGEGSSITLNGKVAYVAQSAWILNKTVRENILFGLPFDQERYDKVV DACSLRKDLELLEHGDMTEIGERGINLSGGQKQRVSTARAAYCDADVFIFDDPLSALDPE VAESVFEECILKLLEGKTRLLVTNQLNCLPRCDSIVALGKHGKVLEVGKYENLIADKRSE VSRLLRGVTPSSRRIVKDKKSESGDNNNDKAGKELMTKEERNTGSVKLGVYKNYMRAGGG LLSAFLVFVAYLMSTGANVTSTVWISMWTADNQYTTHTLAFYIFGYALISILMGLMSWIR SYGLASFGVRSSYHLHGDVLRSIFRAPMSFFDTTPTGRILSRFSKDLYSIDNELSDFIDI FVFILLQLIVVMITIVVITPYFALVLPFLSTMYIFAMMYFRRVSRETKRLENITRSPIYS QFSETLGGLDTIRAFGKSSQFSDNFDSMLDANTRTMYCNKTADRWLSTRLESIAAGIVGC AALFSTQVVVSQGVSVGGDSSSFASLAGISLSYAITATGMTQYVVRSFAQVEAAMNSVER VVHYSENIPREAARTSLELEAAQNSVVESSASKAVKASNGEVIHTEESWPEHGAINLTNL QMRYRNDTPLVLRGLNVDIKAGQRIGVVGRTGSGKSSMLLVLMRIVEPYLSPGVLEKYRP PLSIDGLDALRIGLLDLRSRIGIVPQLPVLFSGTVRSNLDPFDEYSDDQIWGVLDACRMK EAVEKMTDGLNSLVAEYGTNFSQGQRQLLCLGRALLKQCKILLLDEATSSVDFETDEAIQ TTIRQCFKKCTIITIAHRVNTIMDSDKILVMDDGLAAEFDSPQQLLKREDSLFTEIVRHS QGHVDE >gnl|To_NUC_proteinmodels_ML|p58 XRPWPEDEDPGASSLAAALSSFYNRWTYSYMNEIFAKGALQKKDRTVQLTQDDLFMTPEL NEATRLNERFWVCYEETNRNFARTLWLLSKPTFIPAGVCQLFSLTAQLIIPLLVRKLLQA AEKFSGVGNIIDETKYYVLGIFLLSMTNALCTHRYQLLSYQTGIVIRTAVACAVYEHSLK LSPKGREGLTSGNVTNLVATDTQKLFEVFQQGHMIWAAPLGISIVVVILFVLIGPSSFVG AAILIGLVPLSKQVVHVIVRIRRKRVAVADERIEIINAMLQHIKVTKLNNYEDRFETRVR EARAREMALIRKEQIVWGFTMVIRVFTPVVASFATYTTYVLVDEGNMMTASTVFTIALLL NMIKFPINEAGVLLSKAALGVQAMHRISMFMKREVNDAHAVSTNDVTTKQVLQVDGAFLI GRRLGDPTNNPDEVATASFTLSGIDFSVNRGEVVAVVGSVASGKSTLLQGVLGDVDQADG TTVARDDNVAYASQTPFVLSATVRQNILFGSPFDEDRYERVLDACCLRPDLLQFPAGDLT EIGERGVTMSGGQRQRVSVARAVYANPallllddilsaldaGTSQNLFDNLSGIXXXXIR SGLLKESGVLLVTHAVHVLPQVDKILVLNEGQQVFFGTYEELQVFESNNPRHMSKLKSIR SSMNLSKMDISSKSSERHKGVCNSPAQSAVARSVDAKKGEIIAAEQREHGGAALSVWLLW FRYAGGFMFILTQVLFMTGDRGGYILIDWWLATWTTSANQDIEVFGVTFPSQLDGNQTPY LLVYCGMVAFMLTFLTARSQWAIWGGIRACQRVFETMTHRVLHAPMSYFDTTPLGRVLNR FTYDVEQVDISLSQAMSIFIIACSWLVAGQIVMISLVPYLAVVNLCIFSLYIIILRHYRW SAADLQRLDAVSRSPIQASLAEGLDGAFTIKAYGKNSFFAAQFQQHNNGNSAAMLNFVAS RRWLAVRIESMGAVVILCASLFISVFTEQLGLTPGLTGLLLVWASGFTVVLSFLITSFSE AEAAITSMERMHDLEQLPQESCMETACENAVVETWPAQGELAFHDVSMRYREGLPLSLED LSFTVEPRQRCAVVGRTGAGKSSLTAALFRLVEIERGKITLDGVDLSTLGLKDVRGRRNG MFILPQDPAVFSGTIRTNIDPFNIHEESDILNALASVKFPGAQDGVALLDKVVEEGGSNF SVGELQLLCLARAMIASPRLLVLDEATSX >gnl|To_NUC_proteinmodels_ML|p59 XFQQLIIARWTEVGKGGSVAAAMSGRYLNQLVFVAGLVSVGMFARSYLTMKLGVRASRTL HEDMLRSVFRAPLSFFSATPSGQILSRFGKELEVVDRSLPDGIGSVMYCFLNIFFSALAL AGAVSPGMAAPLALVGWFYVGIMrrfrraarDLKRSESKSRSPIYTHFKEALRGAETIRA IPTGRESWSGRYRRLTDDNNAVFLSVKSLDRWLSIRLESLGNVVVLAAATASVFLTRAGR LRSGSAGWGITQALSITGLLTWAVRCLTDSETHFMSILRIAELSDLDTEETEIRGFENRD KTPNGGQSYENDLLKSGWPWRGEVQFNDVSMRYNPGSPLVLKNVSVNVPPGSTLGIVGRT GSGKSSLLLSLFRLVEVEGGSIVIDGVDIRSLSLNGLRDSLSIIPQTPTLFSGTLLYNLD ASGRSSPEEAWNALESASPELARQFREX >gnl|To_NUC_proteinmodels_ML|p60 MAVLDQEISPSVSTPQQKRRPPHPFTNASLLSKLLFFWPYELMKKSDRVDDSNERCGFRV NSGDRAPIEEGDLPDVLEQDSSERNLQWFHRIWEAEKSRVARRNGSTSKQERPSLQRAIA VDFMKSLWYVQPLMLCSSAARLVQALALGLLLESFETSEHNPQAGKGYLWSGVLVVSGFV VLMEHHHVFFYTWRRGMQYRIACVAAIYDKSLRLSSCASVELGATARRNQKGVSSSSNAS SGNVVNIATNDVERFLLATLFASYIWWAPIQSVAILIIGFINVGWSFVAGFGMLCVIFAP LQVWLGNRFAKMRSKIAAVTDERVTLVSQAVSGVRVMKMSGSVVGLNKTQTEGWEDKFNA RIASIRERECRQIERVNLYRALNESIFFVSSVTISTIIFLLHVANGGLLSPRNVFSTIVL INVAQMEITKHLSLAVMGVSECQVSISRIQRYIESPELVKVDSDANALDEDEIKNAAVVA KNITCYWNGNGRSSSQSTLQSEEEVQVPNPLLITALKDVTVHFNTASLTCVIGAVGSGKS ALVQMIAGELTYSSGVLQRAKNNTVAYAPQDPWIMDGSIKENILMGLDLVPEFYERVVTA CGLNVDMAQLRGGENTIVGDRGVQLSGGQRSRIALARVFYRDADVLLLDDPLSAVDSKVG RGLVYAISKLAVDRGKCVILVTHQHQYIGDNRCVLMDGGRITCVGSYQDCLHASNGKMTF KAQHPSAPDLEKLDKSKDEVKDTAPDNVKRGDEASDEPAQDGVDDHKELSKVGVVDTFLN YLQAMPGGIMTGWLMLLFTATQGSVLGTVAFIGMWSEMSAADQSSTQviaivvalvvivi ilalaRAFSSFHLTVQASKNLHDAMTRAVLKARVFFDTNPLGRILNRFSSDVGSNDDLLP NTLFDFlvvrylyvvalvSSPRLNVSVSHELFVQISFlvagallsavvvlpvtllvvppl CWYFVGLRSTFVTTSRELKRLEGLSRSPIFAMLSESLTGISTIRPNGAIDYFQKKFFAAH DAHGRSFFAFIACSRWLGFRMDSLMFVFLAVASFTAVLVQQNEWFSIDPGILLSGLFQWC IRQSAEVVNQFVAVERVIGFRDLPSEAALSNERDREVKDWPSKGEIDVSDLCVRYRAGLP LSLRGVTFKVEGGGTVGVVGRTGGGKSTLVQSLLRLLEADSGLITIDGVDIKSLGLHKLR NAISVIPQQPMLYGGVSIRDNLDPFQRFSTERINEALDDVHMSDAVQALHDGLDTIVAEG GTNFSVGQRQLLCLARAILRRNRILVLDEPTANVDSRTDTLLQEAVAKSFHDATILAVAH RLDTIIEYSRILVLGGGRVLEYGSPRELVSRRGEFYSMIQETGEESAANLIARAKGVSES R >gnl|To_NUC_proteinmodels_ML|p61 XGLGLFGESYIPFRSGVLRPIWEKLYPLCFDEFDTSQCYTRGGYQSYQSITYSAVVGIMI GMVAIGALATKVGRRHGSILTAAMMSGGAPSLTLCSLFLSGSPSALFPCMsisffffgig vgGEYPlsassaserallkLKQRQDEEKVLLVFSCQGLGIFMNSlvltlllattrTSgdn agdddnlnDESDYQPSTLVNIWRITYAVGTAVLIYVFVSRLSHLAESEVWLNDKAQREED KIERDLKQTKSGDKGSYFGPYERQINKSYSGKEPVINHTMSSITMRSDFDQLGSTNVESG CCGGALTKLSDGKMILLLRHYGVRLFGTSAVWLLWDVSFYGNKLFQSTFLLALAGENASL AQITGASAVNSFVALLGYYASAYIIDDPDCGRLTLQQLGLAITGILFLVCGKLSDQLSSA WLIIIYFGSSFFGQCGPNCTTFLIPAEAFPTEMRSLCHGISASCGKLGALIAAIMFHLVD EHDLFLCSAYSSFAACAVSFLTIPETTSLDLFEIDKQWRLILDERQNEYEGPATDPRHLS FWERNQARMCFWK >gnl|To_NUC_proteinmodels_ML|p62 MTDAVRYPPMPARTGVSLLVAVFIKRKKRGGRLYTTTRPITNEPRRHDERQLPLHRIAGR CSIASVYARHFAHSGCPFLASQAVELELVQSKVSGFSNEAREASPEPRAAVPSKRNSKPA AISPEPSVLPSNRNIESKAILLDVPLFSLASRPFACMLAPVWRALCATRWALQRPLNAPL LPAPAMCGSVPYLSKVPYFTIGEALFLAPIALVMVMAVNSSLRNMDVEGSGEVATIPLVV VFLTANKSNSLITFALGVPYERMISWHALWSLAAVAAAGLHLYCAFYLGESDDRRLEEVD DGIIVETEDNEPAIAGGRRLVSRDLSGSVDSIYGLNGPNPDLIKFSLDGNTNFTGTVSLV AMAVLVLASVLSIFRRFGFELWYIVHIGGALLAGWYAFIHGADELGAVLVWWAVDMGARY VLMAGVLYPHKASLRKLPGDIVEISFPKPATFEYEAGQYAKISIPRIGFGQFHPITISSS PRDPVVTMHVRGLGRWSRRLGKLAERQEEVAFLMEGPYGKLMIELENRKRYKMVLMVSGG IGVTPMVSIANDLLYEHQSERREVKKIKFVWALRSVELMRAMADRNAGITGGANVLDPKS ASEVVDLSVYLTKCTGAVDEEINDIDAGVTKSGRPDMDEIFLEMKRAAREAGEAHIAVCV CGPSKMVDACRQASRRFSDGVFTRDGVKFDFHEEKFEF >gnl|To_NUC_proteinmodels_ML|p63 MRVGKSESATVIPDTMKAARVSDYGDDAAKVLTIEEGVKVPTLEDTPPAEFKDSMLIKVL SVALAPGDVRVMSGKTRELQgppggppytpgGDVCGIVVQVPEDEKCRFRVGQRVAARFL NKPMGALGEYALVSMSVCDVVPDDSISSNEaaalvssatvavilADRIKKGDRVLIFGAG GGVGSHLCQLARLRGASFVAGVARDTQRLLEKPLCCDQAIDYTATDPFSIQTFKDDPFDV IVDLSCGNFPKIVaskskskksiiksasKGGRYYTTNPDKPVFELHSWYGAMKVFLFPAL WRAIYSRSIYRYSLPSYSYVFALPETSDIVTRTLNLANEGELQASIDPQAPFPFTTEGLR DAFKLQGSYHAKGKVVISVSEE >gnl|To_NUC_proteinmodels_ML|p64 MLRRGFIAIFNLTSVSGTPAIITQRTAGSGFIANPPAAFFTRRFLSKSPIVMSSATLHSI PPEHIPAGMTVHVNKPFLPTSVSGAGFAFCDLYPKRRIIILTSAFFEEIEKLGSKKVFVV ANRSSVKFIEGDGRLIQTLESKDLLAAPVCTSIGMGGGEEGLLQACDAAYECGADCILTV GGGAVQDAGKLIRLWLTTKDNGDVAEASSAAASVEGIQAAQKQDPMPALHPQIAVPNSFA MAEATKVAGLTTKANTKSGAAHEDMIPNVLIYDPALSAGLPDWVRFGTALRGIEHAVGAV THPKSNEDIRSRALTGLAILKENIEKLKETPECSVAQSNVYVGGFMAIRALNTGCYPALG HLVENQYSARFGVHQGSCSGILCARIMDSHYEKSEELQKRISAAMGDASTPAPRLVRDLA GSLPGVSHEHTQVNVTDDMLGEFTQWMFDNHLPRYNSLSPKGFSRVDDILGMMTKPLDKL >gnl|To_NUC_proteinmodels_ML|p65 MRFLLGLQSMQTLLNNGDDLYERGQFSEALHRYELALQKEDAHSSQAARIQHNSPCALTC IIETNSGPGSQDAAETTAQMIRILDKIRLESGVGERKFVKGAESTHVGVDVGMNLLEWAD YKEAEKTLKECLAENGNDDAKSSAVTEDEKVKAICTLGKLSQAQGRYDEAKRLYLEALKN AKQIAGPESEIDEQIVESIAGYAEILRKSGDLWQAEALHKKVRSMLLVLKDKMDDSDSVI SEEVSIEHDLRLAVSHTQLGCTYFELRRYDSGLGEHQAALQIRLRHVDGNDALVSESLNY TAETLCALKQYPKALPLSMHAVAIRFREFGPSHPAYAHSLCVLARCYRGVGRSSAAGPLV DRCLDICGATFEDPLHANFIPNLLLRGDIYSEFCNFDEAIKSYTRARNCHTANFKIGQRE HILEEISTKLLEAKSKSNGSLSSGGVCTDDRERERAGVPIIVITDVGRDIDDAMALILLA SLKRMLLVRPLAVITTLPPEQDRARLARSILDSLGLHDVPVGIGTQIDMPDEKLNLSCFE GVHVGQNDFEWGDVLMSRVLRDAKDCSVKILCIANLKDVSALMVKHKDLCCAKVKEIVLM GGAIIQDGNLQPDPLAFNNHTHIQSANHVFAECLHHNIPTTSVSRLAAYDLQFEVSWFDK LQQSQHQLAKRIRELNYKAMETLWRRCHVPPWLRFPGRGLPKRCTRDWFIDFFDVEIDKN TSVPLWQSTSKVFLYDVLTLIASIDGYRDLYFTPKCVTIGGTPHRIIGYLDDDGKRRGCV VNQDHLLKEVQELVKHALEISLEGMVPLSADGETAMSKNETPX >gnl|To_NUC_proteinmodels_ML|p66 MLALPAILLLLSLISADAELVEYKFEVYPRRATEKDSQLSPDCAVNNKVLLLVNDQLPGV MPIRAKVGDTVRLTVQNNSPTDVLTVHMHGLTMKGQPYIDGVSSVTQCGSPPLSSQAYEF EVFDVGTHYWHGHVSFERSDGMQGPIIITDPESEEEMQLEEMYDDEAVVFLQDWYHAGGQ MRRTGLDTNPFIWIGDAQSFLINGGGIYSPCLDAEEGSLNCASDCSASNYIKDILVEEGK TYRLRLISGAELIGFNFAIPGHKMTIVEVEGTIVEPVVVDNLDIVPGARFSVLVKADQAP GNYLATTKVRYRSSGPMGYINLRYSGVTDEINIENATLSDHPAWDDAQAGLNQQEKLLTK SPSSFDDADVLTANPDSIRRLIIVGTQANDEVLGMLRWAFNNVTMHLSGEPLITTAYEAV NAEGAKQWPNTEIPSTVVVPDKPPTTFDYHKLVQGSVGTFNGERGRSYIALEDTEFVEVV FQNALALNGVAEMHSHHLHGHSFWVVGQGFGTFDEATDPETYNLVNPVRRDTVTLLPKGW VAFRFRPQPGVWAFHCSQNAHLVMGMGLNFIVSPDKLGAPPPASTSCLHNGFNPEALSAG AWSNLGWRPLLLGVAFGVSMSLFV >gnl|To_NUC_proteinmodels_ML|p67 MDEEKLQQDVEREDFSPVGNPGQQQREDDAVRNGGGWCLLASYKYLIAGAIGWILAVTFV GLYWKERRVSRDREVGEILPTPVILKLAKVSYPENAQYERMIIQGFFNGTNDNDNGDEAG ILRWRNQNGFWYLEVLPVTYSRKDCPIPNDENLYGRLEPVGVGLNPTSKSSCNLYDFRLV IFNDLEEGATVHFHGLTPPSNEDGVPFVSNANIFPQNLQRYRFKAFTYPGFHWMHAHTGF QQAFGVAAPIVLQHSNLYCRANKFQREDDLIVMFEEGFIYPRCAYSGHWWYKHECSGAGI DQTDFGKLAFFINRREEPIDHTPGRDVENIRIRFLNGGSEAPWRIDGTNISSDSAMEILA TDGQDVVRDGKRKSTFILGLANRIDALIKVDPSRDLLITGIQMKHSGNVTHPALRHIVIR GRDTPSTERIDIAGLPKYGNFNSTILKNFDLIRDLSAAHPLTNRSVSRSYTVWNRGGDQY GGFPLTIFTGLLTPENLENYNSTPLHGPIQTYXCLPTRCIGIGKPTSSLALDLRELHHDK IWNKKRPRGSYEIEYSDKNADGDDTCCWEWCDVKNCSGFELEDVETYKPNNNYIPVCFGD RVRILFINSASFEGSEGHPMHLHGHNFVLRELFNVSDDGQQLVYSAEYGKDQYNISGPRV DSIWVPFNQAVAFDFDAYNPGEHLFHCHNDFHLENGMTTTIRYMHDEYCRNSLPEFKGGK NNYPTQFCEMDNCSPP >gnl|To_NUC_proteinmodels_ML|p69 MVLVRSKGAVSESSVDLTTLIKPSTSITNLKRHESFVKKEHTCTELYGPYPSSVPVPMTH KDGRTPSPYVVAKAKTMWDMQSFPDHRDVGTPDEWIPRDGKLVRLTGRHPFNVEPPLFVH QKHKFITPTCLHYVRNHGACPNLAWETHRVRVGGIVPRPLDLSMDEIESMPDRELPVTLV CAGNRRKEQNMIRQTIGFNWGAGGVSTNVWKGVLLRDLLIAAGVSESNMTGKHVEFIGHE DLPNKVGPGPFKDEPWGKLVKYGTSVPLARAMNPAYDILIAYEANGERLQPDHGFPVSFS MFVSHFTENLECQHISLPSLDLVAQVRLIIPGYIGGRMIKWLTDINVLEHETKNHYHYHD NRILPPHVTAEESLTGGWWYKPEYIFNELNINSAMTAPDHGETIDLAKNIGSTYEVGGYS YTGGGRRITRVEITTDGGKHWEVCKINQIEKPTDYGMYWCWIWWTYELNVADLVGCKEIW CRAWDEANNCQPNDPTWNLMGMGNNQVFRIKVHLDQIGNKHVFRFEHPTQPGQQEGGWMT KLAEKPDSAGFGRLLEQGQVSAPEEKAAPAAKSPAGGKLITMAEVRKHNKEEDVWIVVNN KVYDCTEYLDLHPGGADSILINAGEDSTEDFVAIHSTKATKMLEKFYVGDLDESSLEEDA GIEERICEKTGRKVALDPKYKQAFVLQKKTVLSRDSFELDFALQSPEHVLGLPTGKHIFL SGEVKGEMVMRRYTPITSDYDIGHVKFVIKAYPPCERFPQGGKFSQHLDSLKEGDTVDMR GPVGEFDYHGNGKFVKEHEDCTATHFNMIAGGTGITPVMQIASEILRHDDDQTKVSLVFG ARIEGDLLCRNILDEWVAKYPDRFKAHYILSDAAPEGWEESGHSTGFVGKKLFEEVLYPA GDHVYNLMCGPPVMLERGCTPNLKALDHKESNIFSF >gnl|To_NUC_proteinmodels_ML|p70 MLAGALTPMRRPSSPAVVGALLASATSIGRAASSPAAETQTAECLTGSASFPEELQTYHV MNNTRVSPDSNILRVGFQGRNYLGFDDRTPTCISVYSPNAKAKSYSPISLPDERGTFELL VKSYPMRPGGGVGAYLCSLNAGDTFEAKVKSKRVVHRSSDIVGRWSQVGLVAGGTGIAPL YQLLQILLRDDTSVISVLSINKREEDILLKTELDELARKHPGRLRLTYSLTDENKPGYES GRGNVEMALRALPKPSLKREVMVFVCGKDGFVESYGGRVEREKTSDGSKGKKIQGPLLGI LKDAGFDESQVFKY >gnl|To_NUC_proteinmodels_ML|p71 MKGGLDTVDADAGWTPLVLSTIAGMSTCLGALIVFCHPVEEIDDENGEDVKLVNALPSRR ARGQRNVSPSTMAFSLALAGSVMVTVSVVSIGPECLAASSMPSNAAMADEENTFFIFGIT LMPIFSMTFLHRLISFGAGCMLYILLSKFAFPEPDEILSHHLERNNTMDSERTDKSSDDE SGRGVELQRPTGSNEKTAGNSSLQRRGVADDDVELHVQPDSSSKPRTIARRPPNERCCRN GPIASCVNSMRVFSKGSDLASAEARRANRVAMLLFFSLLIHNFPEGLAVAASALESDQLG LTVTIGIMIHNIPEGIAVAIPCLKARPXADESGQSGGEGSLENVLSFVAGIMITVSLLEL LPEARRHVDKTCKKPYWLGIAVGFIVMVVTELGMSV >gnl|To_NUC_proteinmodels_ML|p72 MVYVDGFRSALGAEILTDPTACGRNNIHGGPAVFRMLSPSPHHRRNTIHGRRPIRLGTRD AFGRRDEDQAILIHSCTTTKLTVPTPFLQIFVKTLTGKTITLDVEPSDTIDNVKTKIQDK EGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGMQIFVKTLTGKTITLDV EPSDTIDNVKTKIQDKEGIPPDQQRLIFAGKQLEDGRTLSDYNIQKESTLHLVLRLRGGN >gnl|To_NUC_proteinmodels_ML|p73 XMRGGGESIEIDETCNNEVVRPSSDFIQSAQPVKQQFAQRFSRLSKRLPKRPVDCCSDCQ DCMSSDGMIEYGCDALMANIPRGGHLGGGTTDAIGGVIATPCGLPLNAWKVIFQIFLTTI NVVCWLVPLRSKKISENKLGLSLANAFSGGVFLSLAFGHLIPECIHGFGEVNEALPYMIV LCGYLLIFFVEKVAFDAHDILHEMEGVGALRdaeesspaeedsSNGFSGRSAVILLGALA VHSILEMTALGLADTFGDsalltlsiaLHQPAESIALLVAFLKSGMPKHQIVQFLSIFSC MGPIGVAIGMAVNSFATPIVDATMLAVVAGTFVYVGATEIIPEEWEDSDHKWTKFASLIA GIVSILVITQYTATLEGX >gnl|To_NUC_proteinmodels_ML|p74 MATAMPDKVKAVLEKLEVKDQVVLRSYLAGLRDELKGWKTRVEHPDDDEHAHYHGHERCV QKKILLSTTALKNSSPALSCTADHGHDGADHDVEMEHAHEHEHKHEHGHDHKEVACTHES HDHGHGHGHKHEEEHHEGHEHGHDHEKKDHSHEHEHAHDHKEEKKDHSHEHGHSHEKEEK KEVPEWKKRAMESGADASAAPFGGSWNTESTTSATKAGCALAAEMLKDSSVQTQSLKRSA CLGSGSGGCRRALRTKQQCARTDDPMRKSQLESHNIKGDLIX >gnl|To_NUC_proteinmodels_ML|p75 MSHNVFARASPQNKIRIVKALQAEGQVTSMTGDGVNDAPALKAANMGVAMGKEGTDVARE ASEMILADDNFATIVYAVKQGRVVWDNLRKVLLVNTPINNAQGLSVFVALCIGLPDTPIT TIQILYSNFICAVTLGFVTAVEPAEEGIMDIPPRRVGKRLIGRYLFLRIILGTVVLTGCV VASGAIVSLTDIYADQDDEFRLDMMRAVSFNVLDFGAMSIMMSARFAYNSSVHPRVFRGN FAALGSIAIVTVLQIMLTYIPGLNKFVFGMRPMDGIGWGLVFASMFVVFVIMELEKALRR VLKAQGADTDDIVDNRPVVKGGPEAMSMPKGASKLNLQELSK >gnl|To_NUC_proteinmodels_ML|p76 MRSLIWPFDGCRSPILTAHGRPAEYIHIPSRWSAVSVVPSSYSDNIRGRRGCPSVPEAVD GEGADDANIPPLNPDDPLFGLRSSQVQASREIFGNNEILVPETPIWRLFLNQFIGFLPLL IELAAIVSLAVQDWQDFGIIVAMLLVNACLGFREEYHAKKSLDELSEQLESEIAVRREGD TMVLNTKELVPGDVVLLVGGTIVPADVQWLKGDVVQVDTAALTGEPIPRKYPGENGDIIM SGTTIVGGECYGRVLYTGANTEIGKAQESVLADKTVRVVSVFQQKIMTVVQILVSISLVL VLAVLLVQGLLYDGFETDPKESILAALSILIASIPIALPLVLQVNLALGASFLAKEYNAI VTSIPALQDIASMSMLCSDKTGTLTTANMSIIEERTFSTDGFTNDDVIMYAYLCSNADKK DDPIDRAIVNAMEKSSASADGWTQTEIIGFNPSVKRVVAFAKDQSTGNVVTIAKGLPAKI IDTSAGAEDDGELQWAVAQAVDKKFVERVHAEDKALSSSGYKTIAIAICQGNARELGDSA VWNFAGLLPMLDPPRHDTPATIESLNHANINVKMITGDHANVGKETARLIGMGTNIYPGE TMREAPAEQKNKMIFDADGFAAVLPSDKREIVMTLRNHYGLVTGMTGDGVNDAPALSAAQ VGIAVEGATDAANNAADLILTEPGLSPIYGAVLESRRIFSRIKSYVIYRVAASLILVLSL SIIMFVKGCAVDSTFIIILALLNDISMIPVAYDTADATTKPQLPRARALVYQSVFYGFTH AALTLAFFFGMNFAALPNSVDLAMCFGTGYGETQGFVWFHLLLVTELAIFSVRAPGFFLF SIPSLYLVASVGLTVIAGALMVTLIPSFGLHGDNLGYIFAFNAVTLVVVDLLKIQFRKMI GEEPGDIIVGDELIEPKPKTEAQKTTEKALRNVVAMDAKLDPEDRERVVQVRKRQGSILA AFFDVTGEEMATNTGFIRQGGLHASLVGGQPVGDVPRTKRSRQTSSPY >gnl|To_NUC_proteinmodels_ML|p77 MLYPYSRSEECAWQPEEASRKLDALGGAGGVFYDSYYTRRNHSQTNPDDPRTIPPQNQST LLTRGRNSADIRSLRREFGANALHGDAIDEPTETTNPRSRCLAGCKVLSPVAKAFYDQLK EPLIIMLLASASISLFLGNSADALSIALALTIVSLVAAVQEYRSERALEALSDLVPHTCT VLRDGRAIENLSAKEIVVGDLVLLSTGDRVPADMRLIDTVELSVNESSLTGENNPVNKIS QSLTVLEGGVTTGNGSSDVAPTPPPLTDQINIAFMGTLVVSGRGRGLVLAVGERTEFGKV AKELGEVEARKSPLQIKIDELGRTLAYASSAGIAMMALVGYLLGRGLLETITVAVSLAVA AIPEGLPICVTVTLALGVLRMARHAAIVKKLPAVETLGCATVIASDKTGTLTQNEMTARS MYSPAFPFTFSLTGVGYDVKKSGGFIVRSMSMENMQSENSVESVIMPGQSRVTNQCPEYA CLAALFGTASICNNASVAEDGKTLGQPTEMALLVGSKKAGVPDPRPTYHRIQEIPFSSDR KKMEVRCRPVGGSHSCLAFALSAKREESGKVISPDGSVYFVKGMPESILAECQTRVAADG SSVTLTDSEKARALSRSREMSGRGLRVLAMAFGPSLETLSFAGIIGIEDPPREGVVESIR SLHKSGVKVLMVTGDSKETAIAIARRCGILCSSTFTQNSSGDIDPANGQGRCSFDSSGDE RLDLDAHSDIPLDEEYGQYALSGRQLDSIGQQYLPDSIAGVKVFYRVAPRHKLALVRALQ KRGEIVAMTGDGVNDATALKASDIGVAMGLGGTDVAKEAADVVLADDNFTTITHAIAEGK GIFFNIRNFLSFQLSTSFAALAMESVATVFSLPSPLNAMQILWINITLIMFLTLSVFSKE LDDGRVTRRDTTMTFMTFVNCDLFNAYACRSAEKCFYELSPWSNPSFLWAMGFSILGQFA VIYWKPLQEVFQTEALSIGDLAFIVCLSSTVLLLDTVRKKFLRPYCSDDSKRRLFRRMKK PRNX >gnl|To_NUC_proteinmodels_ML|p78 MEQLFALPCAPCPSAPKAKKEVLSCRSRLRVEGICCSAEIPFVRSLMKTLPGVRKVGVNV ATKVVFVDHNPHATTAQVMANALNEVKFGATIITDGCAELAKREKVQKEDASSSLQSMMD LPASRFVESTFIIPGLATYSRERIDSCPIGRLLQQNFFKDHLRAFHLHAASRTLKIEHDP TKLGAGKVLSALLTGLNPEDWGSIELAHDGQAEGLILPVLDSDNEKETDDEAAKRTILGG LRFNVAISGVFWLLSILTHLPGGRLENFKYFGIVSVIFGMPPVMLKAWATLRRFQFDSNC MMASAAIGSFVLGEYDEAASVAFLFAISDYLEGKATRRGRKALGEIVSLRPEYANVIMPN TGEVKIVPAQEIPIGSKVSVRTGDKIPADGVIVEGTSSVDESSITGESRPVPKETGDEVC SGSINVGETQLVVKTKQSVGDSTLSRLIQLVEEAQANTSETEKLVDSFARKYTPVVTYSA GLAAAAQRGIVIKGGSKLEALGNVRTVLLDKTGTITKGRFALSHLESVGNSMNRLELLEM LVVMEAPSSHPLSACLIAAAKSEGVMAPAGVSLREHTILKGEGVTALVDGAQVYCGNERL FKRIGMFDANQGYAAQVEKWGGEGGTVGFVGIDGKGIVGSFCVKDSIREEASDVVSTLIS SDVDVVMLTGDGEGAARAVGLEVGLPEDNILSDLLPEDKLHYVSNQKEENVVATASSLSG RRQLVMMVGDGVNDAPALSIADVGVAMGEGATLALEMSDVTLMDSNLNKLLFSLNLGVKV INTVKENIAITVVINLVAITLTFLGKMTLLAAIISDVGTMLIVTLNGMKLLSQRVIDSIE VGRHEPTRRVLRRKGSTKKGGVVYSKPSDQDEDSCEEDQSDGTKLDPRFEIV >gnl|To_NUC_proteinmodels_ML|p79 MLIGTCSGDVYVSAACPVSAPKSVWGFLGRHTVELVNVIADAQAGDQNLFVYINVSLRTR IPVLVRLLHRERRHRHFRPPSAPHPTSFGGRSPYPHVPEGNRSWFIVVLIVDDDTSAWFC TCLEAILGCIVGGRPGVASLSVLDRRRRIPPPGRSRDGSCLEDRAPARAQGRRFDVHPAD PAKCDISMSDHEPVGGRDSSSKLIRSTPSEYGSTSQTRRVYVASFALDGLTCATCVNAVR GAVESLEGVERETVDVRLLPDATLTLQYDGDRLNDDDIAEEIEDIGFGAVLSSKHELGRD LERGSTEERTKTLYITTEDQCGVMSALLTLDGVDSVEYSKQQQGSANDDKSNEQAGFLRT VQNLMDKVFGRAGTQGYAPVSSPSPANGSGTLEVTYHPSQTGVRSIVDFVTSITSQPVQA WDSMSYQMKQKKIEARRQHEITAWRDQLLFATAFALPVFVTSMILMKIPASHSYLRRIAF FGINREELVCWALATPVQFISGGRFYRESYHSVKNGKLGMSFLIAMGTTAAYLYSIAAVA YNAISRCCGCSRPVLMQSFESSSLLITFVLLGKYLEALAKNRTSKAVSALADLAPDSATL VGTFDETSKSSSSEPERIIPLSLLQKNDILLVRPGEKVPTDGVVKSGSTTIDESMLTGES MPVIKIEGDKLIGGTINLQGSVNMVVEELGEDTALAQVIQLIETAQSSKANIQEVADSIA AVFTPFVIAAALTTYCVWAILLNCGPLDEIKDTWPYRRQGFNDWTLPLLFAISVLVIACP CALGLATPTAVMVGTGLGARLGILIRGGEPLESSKDLTCVVFDKTGTLTRGEMAVQDILL LSDRLAVEMHDEKSRCGSEMSDIDTVRDARRVATANMFYYAACAEQGSEHPIATAILSKA RDYGIGNGLDRPLDEVDGFEADAGKGVKCTVNNTAVHLGNRRCLAANDVNISPGTFDAME YLENKGQTAVVISINGTSEAVVGLIDYAKEEAALTVNVLQHVLGIQVFMLTGDNVRTANA VARDVGITATNVVADVLPSEKILFVKKLRQEGERVAMVGDGVNDSPALAEADVGIAIGSG TQIAHEAAGIVLVNSKLTDLLVAIDLSRTIYSRIRLNFYWALGLFYPLTHTALPPYIAAF AMALSSISVLVSSLSLNSYX >gnl|To_NUC_proteinmodels_ML|p80 XLPTNTAKVAFAPIDAVNAAEICSKEVYQQLAELCASTVTKGGYPTEILNVGSSSSEEGT SLLDSAARMEQSRRDELHGWKTLLLTSLLFTVPLAVIKMSRMRSENGALSDSNDGTVAEL QDVTNGMTEMPPTPLDWTELLLATPVQFYVGARFYKSAYRGLIHGCTMGMDFLVALGTTS AYLYSVIVFVIQILCKYEMFGVHDISIMKLRTTFDTGAMLITFVTLGKFLESYARGKTAG ALQSLMELQPVSATRVILPTTLMEKLMNLQNDLVGDEEIDYALAFEDANLNSIATEEKDI AEVQIGDFLLILPGGRIPTDGILVAREGTGKIKQPSLDSLDESDKTDHGGCAYIDESAFS GEPFPVAKRPGDSVYGASVNQLSVILIRVTATGSSTVLSRIVKLVDDAQTNRAPIQAQAD HIAGVFAPCVITLAVITFACWVLLSGDKFDLEERYFKALMSAISVIVVACPCALGLATPT GMYVAVMVGTGVGAMNGLLVKGGAVLEMAHHVKTVVVDKTGTLTTGRAVVGKRIEYMSQI TESDANSSIGRLISCLPSTVDRSDMALWFACCAELRSEHPLGHAIVNSGRELWGHDILKP VRKGQSGIANETELSILDFQVVPGRGVECTLVGMQERSCTVRVGNRAWARGIEEDANGLL AEVQTDQADEDVDFLRGQGQIGVYVSVKFSDDTDETNFIVTGVVGIIDPIKTEASSSVAA LQRMGVDVWMCTGDHAVTAHAVASQIGIREENVCSNVNPEGKADLIKRLQKRRDRRNRLV KNRVAVVGDGINDAVALAQSDVGIAIGAGTEVAVEAADIVLVRSQLHDVVVALHLSRVVF DRIVSCFAEPFSMPFANASPAQRLNFFWAMAYNLFALPCAAGLLYPFTDWTLPPAFAGLM MAFSSVSVVTSSLLLRTYVKPEIGNDGQVNERGCSSATSGFIDIFYTCIFDNPIMSIFRG NEPKHMPVEVESVSDDEERYLRQASPMRPSQGQWMEPLSEVELSNMGIV >gnl|To_NUC_proteinmodels_ML|p81 MTVKLGINGFGRIGRLVCRAALEHEGDVMPVAVNDPFLSLDYAAYLFQYDSVHGKYPGTV TADADSNSLIIDDGKTKVSIKFFAERNPSDIPWSSVDASYVCESTGVFTTTEKAKAHLGG GAKKVIISAPSADAPMYVVGVNHKKYDGSADVVSNASCTTNCLAPLAKVINEVYGIQEGL MTTVHASTATQLVVDGPARGGKDWRGGRAAVANLIPSSTGAAKAVGKVIPELNGVLTGMA VRCPTPDVSMVDLTVKLKKGCTKDEMLATLKAASEGDELKGVLGYTDHAVVSQDFVHDNR SSIVDGTACIALNDTFHKVISWYDNEWGYSNRLVDLAVFMSTVDK >gnl|To_NUC_proteinmodels_ML|p82 MARRRSASLIIIRDEQRRGAYSFSGTSLKAASNGSSMSMATGMGVNGFGRIGRLVTRIMM EDDDCKLSAINAGSATPDYMAYQYKYDTIHGIAKGTVEVDGDFLVLNGEKIQTSRCRDPK EVGWGALGADYVCESTGVFLTKESAQSILDGGAKKVIYSAPAKDDSQTIVMGVNAEEYDG SENFISCASCTTNGLAPMVKAIHDEFDIQEALMTTVHAMTATQAVVDSSSRKDWRGGRAA SGNIIPSSTGAAKAVTKVIPSLQGKLTGMAFRVPTIDVSVVDLTCKLGKETTYEEICAVI KAKSEGEMKGVLGYCDEPLVSTDFESDSRSSIFDAGAGIMLNPSFVKLVAWYDNEWGYSG RVVDLMKHVAAVDAKVAA >gnl|To_NUC_proteinmodels_ML|p83 MVTCAVNGFGRIGRLVFRYAWDDPLLDIVHVNDLCSVESAAYLIKYDSVHGTWDKEVEVT PDGKGFTVDGKLVTFTQESDFNKVGYKEMNVELMMECTGKFLTVDKLEEYYTNCGCKRVV VSAPVKQKGALNVVLGCNHELLSDELKLVTNASCTTNCLAPVVRVIKENFGIKRGCITTI HDVTGTQTLVDMPNHKKSDLRRARSGMLNLCPTSTGSATAIIEIYPELKGKLNGLAVRVP LLNASLTDCVFEVNKEVTREEVNEALKKASEEGPLKGILGFETKPLVSTDYTNDTRSSVI DALSTQVIDKQMVKIYAWYDNECGYSKRMAELCNIVAAKYIANVEPSFKYE >gnl|To_NUC_proteinmodels_ML|p84 MLLRLIVAALLCSGKLAEGAASEPLLGGKYVPTSDVGHIEELTELIDAIQNEPSQETGNT FYTNILREKISLESLSNEVESDSVQIYLNPATNPLYNVYMYALYRSPNGSGYEKDNTRDE MKFLSKDVDNYGHTIIQDELKKQSGYDRNLTTETIRVTSVWMSAIQSLYDAVKVCEQGGG LGVLEAPEFVNPVDAAAAFWFGDLDDENDTTGGSMYAWAARARDNFVLQSDLTGFNANKD LITDLNTMQAMLNDCMDASKTPDEETRNEAAKEMRELADGMCAKMTVPMVQNFIHYGTLK ATSTSADIIDQDYMILYALSTLPPIGVCDGASFDSLYDMIVTNSDDVTTSVWREALEMIQ SRFICLGVSCEQVGTSKALQAGLEMGDFPECRIPDYNLDGYTGSDPEVMVQFGLVDLDVQ TIGTLVAMDAPTAASDIFKYGRFSKPSSLGLGYFSVEDRSKMKNNERNTVLAAFQDYNDG SLVDTVAVVYAILGLDYFEEATTKQRRIIAEGTMVMTTLHLFAIDMFFDAIDVCENEDAF DNTMSADSIWDAAVASLTGWAEGDATVEGLLFFTVSRALCTDDTCKDGKNQINDLLVTAF KDGKSELQKNNCAEAESKIQDIEKLLQTILTDLVALFAENIAQRDNADDLAEGYGVVSAI LPFVNKVDAGAASILKENMGTYKSGAFQDKLPVFSALQSYVEKSNIDCSLLTRDICGDRA EETYDDDPGDTPVDDSTPGSDSDGMPGSLAVTSPAPPSTSGGDYWDDSLCRSGVSGGLPN NIDNDEASQSLLGGAYVPTSNVDHIVKLTESVEAIAATASLSEGEEIYKRADENGVSLQC LSTGVSQAEIATTNPLYVQFMYGLWTSESGDNAADGYSSKTFDDGNIFYFGDTTVQDEFN KSSGYDPKLTAETIRVMNIWMAMTTELYRSTALCRNGYVDDASASFNPVDFAAALYFGTA TDPDLVNSLDSNGSLYAWAKRAGNQFVDESVGVNDEIISKLKDLQKSFQDCRGLSDEDRE RKGLTMRDSAREITNLMMVPMVQNFLDHLASQSGLIDEDSPDQRNFMVLYSLATLPYISI CDEDAYDNLFEDLILELDSYDSGTFNDHMETFQGKYHCMGVRCDQVGSLAAAGDNWPQCQ EDAAAVSFAGYTPTPGAAPKANDAAYDVYNNGWYVRGGDGKPFQSIGTMSTQLQSRAKSL YEMFTENDGIDYTQVTRDAIQATGAFQYTSPLLNSAAASLAMTSIDLHLHIVAKMYLAVA QCQGIDGVNTTDSPAMSWDSAVACTVGSAEGEIEGSVLYDLAEEMCANFQSCNEDGKSQV NVKLIEEFLNGQNKLAGNQCEEAKNSAQLIESYIQAILVDTLAYHAQYASTKKRHCTMAN VATNALVPLIQPVCDTCAKKLSADFGEQNNASGNCAVDDINAVYQALKQYVEGAGDRLFT SSEQGMQTSYDLNPEEDETTVESERHFILNREYEPISDVKKISVLSSVLGNICNAADGST AGEIYSNDESIGMSLKSMSLAGKRSMVDELLFNQYVYALHDDVDLSDGSLQFDERPATDY ANTITSDALETNIPLGCESAKILNVWMWIAHKLNEAVQECKATEKTENYRPLDEAAALWE SGLLFEMAEQLGPKFGHGNIDGMIYLNRMIVDRLVAGRDIISKNNNMCSEQDTLDLRIIT RETISYMTAVLIQGMIDAVFGELIWDGMFSYLQNHLLIAKFTGSSSEERTKERAELFGFA VLPRIIACGHERMYGVLYDDLVVDGYSKKMNTDIMNFLHNYLNCLGLKCDDIGRHSAADA SNFMCADDLDIASYSPVNATRTNMAAKIDLDVVAIHQMMSIERYSIALSIYTDGLNYYDY DSSDYNFVSLQDLTNSQTIGFTDFTTYTLFNDYFGPGYLDQLMMETISGINRFANATSKQ LDLATLTAVSSLLSFLASLEALSISISQCESGRTETALTAFDGGVALLIGSVEGSRVYGD AYSEGKMLYSVGKRVCSYFRNCMAGNAEVNQKLIDLLKAGQELLKTGDCDVAQANLDDIA MTMRVPLLQQLLFFGERYSDDAFPDDSAAAYIGAQAALPIIDEISATAAVSIRDAVDFPK TEDDSISGIVRSAFTDFFSGSTEMDCLLIAGKLQSSICGQEGIVTGDSADDTSTIDNGES ELPSVDANQPAEEVVEERPASPQAIPNRDEPMPISDGLYIATTYVGDRSAIALDVKEIKE RLKVGDVDEAEFTYKHGRNSKFFDINGLPTGDLRSIRDFSVDSASTMLSDPTYNIFLHGL ADQNVEFMGSPASLYADTFITSLLYSKTAAASDAMVAVSIWMQVAHSLHLSYAACKDSFL SSDSRELKPIKDPSLFVDEAAAYWIGDSQDTGSASEGHLLYALTEFISSKYEDIPEESQS SINTRVLGLFNKAKNHLAITQGCTTSKDSHLSLREIVNELIPVMAVPLLRSLFYYISIED SLMVKVYAVAVLPLFSACASSTYRELKSELIDHDIYEFEKAYLYSKIESMFSCIGVTCDE VGFMPEDDFTRCTTTASLNSLAGYRYGSNKEKVTKRAHIDVDMRKIDILMTNGIAHHSGV DGGTDKLFTAAFEVYKYGTSPSVQDSLVNLAKDTGREIVPVFESFKRYYQFEANYADQMI TKAFRGQDVFQEASFDDRRRVINFVLQYMVPHMAILQQLYTSMDTCKTQRKSGSALALDL AAASYVGSLEGKSDGGSFDGSLLYGFAMRMCVHFGTCTVKYNARANERIISLFYSAQGEA EAGACDALGKTVKKIESTMLVPLIQGTIFVANENVDGVGYYPEGFVLAQAILPIIHDADQ NAAVEISGAMVEGFPDPDISAVAQPAKVFRAVQAAIAKLDGINCNDVGTIHGSNFCPGAT YDNFSPAVGARVATLIQIATLGVIVYFLG >gnl|To_NUC_proteinmodels_ML|p85 MESSLPAAARTSRLAAGANNTGRVLPSPSKAIRCQSALTREDVVDSDGLLQFKTLHEMNR NATIAFRDNELFGTYRAPPQEENFVKDQEEKPGKFEWMSYGEYGELVDRCRTVLKDLGIR EHDKIGLISNNRWEWAALASAAYSLNAAIVPMYETQLPKDWVYILNDSECAALFCSTDEI FAKAVKEVVPNTPSVLSTLCFDTPAGEPHSLLTALEHARGQATSVIEPTPEDLAGLIYTS GTTGKPKGVELTHDNFVSNVYAVRGMADDPRDFIRSSDRSLAFLPWAHSYGQTCELWCAI SHGGSMGVCRGVPLILEDLALVKPTVLFAVPTLYKKVYDGVQNLIGSSSPTKQKLMRAAL DLGRKKKESGGSLGLVEGLKHRVLDGLVTSKIRDRFGGNLRHGFVAGAACPKEILDFMDD IGITICEGYGLTETSPIIAINAPYEGKRKLGHVGKPVDGVEVVIIDPDTNQEVPSGQEGE ICCYGRNVMRGYYRNPEATAEVISVAPDGKSRLFHTGDMGNLSEDGFVAVTGRLKEQYKL ENGKYVVPTPIEESITMSRFIAQTVVCGANREHNVALIVPDWVAIRSTLNIADDASEDDL VNNPEVRGLMDEEIRVNTYKLKKYEVPQMWAFVAPFTVANNMLTPKMSIRRKIVIKHYDD IISEMYGDDLTTSASDGQVAA >gnl|To_NUC_proteinmodels_ML|p86 MNAFKLASRLSSRAAVAQFAPRPAAIGVGISNATRCPALPALSQQQQAFISSTTPASKSP ELYELGSVDHGFTDDRIDHVAEPERRAFTYLLLSGVRFAYASTARVLVVKFVSSMSASAD VLALASAEFDVGGVAQGKTITVKWRGKPVFIRHRTPAEISAEAGVALSSLRDEETDDDRT LKPEWLVVLGICTHLGCVPISGAGDYNGWFCPCHGSHYDISGRIRKGPAPLNLEIPPYKF TGDTKILIG >gnl|To_NUC_proteinmodels_ML|p87 MQHSALLCCFVAASLVGDAAVGAFSVKEAESGVPVIPTGDLKLFDPNESALLQGTNVLSE RVSSGSKFTIAPSQTIVDRPPAGAAVRDAQHFLEHLDAHGELPLNFAKPNQPVTATVLGR TKLIDDDAPGDIEHVILKLPEGFHYVEGQSLSLIPPGVDAKSGRKHKPRLYSIASTRYGD VLDGNTISLCVRRAEYVDPVTGEKDPAKQGVCSNFLCDVRAGDEVSVAGPVGKTMLLPKD SNTDIIMIATGTGIAPFRGFMHRLFMENTLARHMFGGRAWLVLGVPVTGGLLYKEEFDCM QRNAGADQLRIDYAISREMTAKTGGKMYVQNVIAENGREVFDRLDNGAVIYFCGLKGMMP GILDSLEEVAASQGIVWSEKLAELKKNHQWHVEVY >gnl|To_NUC_proteinmodels_ML|p88 MKFNQSTLSITAAAFFAPAHLVVDAHGYLKSPKSRNYYANTDGKWYGGTASDPAPENCPH CLNRGGTDGVCGISSNANYEQPPNAMGGLMPPVVQACYSPGSVVEFESVLTAHHKGHFEF RACPVSSGEVPTQECFDSNPLTFVEDPLYGAVPDTNHPERAYIPNAGSDDTQWTYKHRYQ LPSNLEGELVIIQWYYLTANSCNPEGYDDYNFAPLGIPDNSLGQCNYPLPSNGVPGKPEQ FWNCAEVKISTDCGSTPFPTKQPSVPITKSPVAFSTARPTISNAPTFNNAVPTKQPSVPV TKSPVAVSTPRPTISNAPTFKSAVAESREDSRLIAYLANWEACPTDDMLDAYTHIVIAFA VSYTWSAAKNNCDTSCSVSAPPTCGNQVRQDLIDKWRGQGKKVVLSFGGAGMGGSWPGDN NNCWDYCFGKEEEVSTQLVSIVQSQNLDGIDLDYEYCYDTEGTQSGLCTAKDTSLFPTEA SFDTAAQNFLTGITSNLRQKMDALGDDYELTHAPMDSDLAPTSKYYQILKDQSENLDYLM PQFYNGYIKVVSDGFTGTGAGAYSAESVYSNLAMDLFPNRPDKVVFGFCVNGCSGTGSNA NGQQAVSVLQQVKEFDQGQYSCNGGAFFWVASADASGAWSDPVAAELALTAGCLDGTTPP PPSPPNTTSNPTSGVTAQPTSPTTSQPTDAVTSKPTNAPIVVGPVVAVFDAVLGAPRCAS ATSSCESGDLLKRMSCSEPNGSNSVDECVDGSKAIQSIERITAASLSGEVFKEGDMVVIS ADVHAWLDGQSDVAVDIYHAVSASYPIWNLVHSEVVNGGMATVSANHELPGGSPTQAFRF NIRHGGQPETGCSGGQWDDVDDLILHVEPEVITTTTTTTTPRTQNPNCPNKSGSGWGSCG PDKPCADGACCSQWGYCGLSNDFCGTCCQNGPCFS >gnl|To_NUC_proteinmodels_ML|p89 MTRYTPSTATLLLSVMLAGCTSEEVREPEYVYGVCVHNVNNGSRYHLYDAGRYYLCAYND GGRLYIYNRGNDDYTIAATTDLTTSLESTSTTNPTGGADVVVVDDMIEIESGSEPLMIDV LGNDYIMSSGDSEPSGDSKLWWWFQASPSTTDQSTLSIKSVTEDSPDSPVSCEAMGNEVM VKVTDETYVGGADCFYTVVMRDSLGNEETVDQQGTINISVTPGCEPIYEILSSDSFSILY DAVLAAGLDKVLANPNLDPKLTLFAPPEMDERLLESAWLPQLLDNLLYYMLEGRELSVDL GFIQGFVPSVNFQREQVLILAEPPTIDGVELDPTDTLACNGVIHEVFDELQPASSDSGND IVEMLSRNQECKTLVDLIGPAGLVSDLQGEGPFTVFAPTDAAFLDLNQDYPGILDYLTEP ANIANLTYILQYNIASANALSPPLDEGVLLNSPQPPCPFKYRSRHRQGFASSGLCTACHN HDDYGELVQPLPLSTSDIHALILDQFQFTKTTTSTVTVTTTNAACQDRGFYFVTNADGSD GLCTNAVDSSNNGCNATQTGSGSGPCTMQSCCSIYQHSGDCHIFDNCNVVPTIVYVNDFP LFQPTGDVDKPAHFLYDVLAGDFIEDRKLGDFLVVDKVEGAFEFPLPPPVYDEFGNCVNC VNAGTCIAAKKHYNDKYNTMIKYTAPSRFAGKFKCRYTALIKRPQVQPDNTTKYVTVRPF KTRTGQPNNHRSRELQIIGPQPVPDPNFGFPQDPVGVIIGGVFNAPTDSPTTSPTKGPTT SPTISPTISPTLSPTLNPTLSPTLFPTLYPTLYPTLNPTLEPTEEPTAPPPTELPTLSPV GLPVVPDVSAVVSVNGTTLIFVLDGAHPGVPLPNSLKVTDIVVQGIPATKRRKALDALDM ITDDTTTEGDGATIEDKNIILEEGVEVVTDIEAEVELDINGTVGADLDINVTVGAELDIN VTAELELDINGTQSREPDGSFCAVGPDGRSVQYVSGGTYVGPYRCTFETEDERGFIVLAD LNLDVVLFVDPPDIEPLPPKPKPPTPKPVVQPSWGGGYPQTSMPTHKPVKPWQPAWKPHW SAPEKPDYHEPHGPIFWGWGKSGKTKGTKTLKAGKSKSYKTFKEWSGSKSSKSKSSKSKS SKAKSSKGMFTWWRGGIDGGSGIVFNKDGTAEGAAIKEEPLLASTNTEEASGLRSRGIRS TLVAAIVCSLHLLIDDATLRLVSRSSQTVVPDFDPPRRLSSLRYADFQESATISVNSHTT TSMHPTX >gnl|To_NUC_proteinmodels_ML|p90 MALRLSSAVAILSALASALTAAAEDEIVIEDFSSPVHTWATMNDPVMGGQSSSSLDISDG VANFSGNCAIVPFLKAPGFITVQTGGYGQSSEGNFPDVSSCTGLKMVVRSKVDYDGYRVS FGKKRSGEHGMGYKAPAIELSTSEKFEEVVLPFEQFSLKWDEGTGDVITMCSDDESVCPD VETLRNLQAVSIWGEGVEASFPILGDVDLDIKSISAVGCDGTLKVDETTSEAVSEEQTEE EHEGGATEGSARWVLHEAKWGTLTTVAADTLNGGEHDSKNPDDSQPLFANILPYATDKAT GRIFFYMMGSHHLHKSTLTVSQASVNPDLFSVGGCGTSTKSVVDAQDPRCAKISVSGALH PCDSHDVGERCDEVGRAAIFGSHPVMKDWPEDHHFTVHELVPKNDGFWMIANFGGGAAVT GELYSQVEKFDQPHEIQGGTAISSKPFVGDGTPQTMPSWNARANRARWVVHNSLFTTVST LSEDGEGTFGNIRSITDGATLGHSTGRPVFNLPDVDPAAVNLHSNEMTVALTFSEAAIYE RVTSDGETCAGQDAGSPTCAQVVIYGKAKPLAEGSAEYKAALKNFGKSHPLASWLSEGGS HMSGSYYTIEPTRISILDFFGGAVDVKVDEYLGVTFDDEKPKVYASPKTTFMNVMFGVVI GLFFGCCGNSCIKCIRYKGDAKYEEVKNDYGSLAMAVDVNGKPSSEDVNSSEMAKAAIV >gnl|To_NUC_proteinmodels_ML|p91 MASFIADKQININRVTPAARTEAFTIDRSNTAYLSASIDPPLEGRSEPLLIAGYLDYGEF GFGEGPKRHLTYTFNAQRSSTNPSQDWEGFFGALDNDAGYFDEGANIISPFSIFFRNFGP SYEIAVRGSMPGHSVDVCVYDSKSKIFCSKGNEVKESISIIGLPNKEVMQPCDTYWAVAN TTKDGDDGLEWSMWSTTSFVFNGPCPNFQEESLSHDDANVVGRSLTVGVSLFWGVMILLL VLSAVLQRRGQRFPRHKFLTSQESSDETAVLSDGSSSGSDGNEEHSNPVPVLELS >gnl|To_NUC_proteinmodels_ML|p92 MRRIRCLVAAVAAVGVEFGSAFTPRVPPRVDARTPPAYSAAAPPLGSSPADVEPPTTAAS DYMPPEDAIIMIKEKAMDRLVELKQKQADPSSPLILRMGVRNGGCSGLSYVMDFSTEEEI QDEDEVDDYSQEGIKCVVDAKSLLYLYGLELDYSDDLIGGGFKFFNPNAEESCGCGSSFG V >gnl|To_NUC_proteinmodels_ML|p93 MVFAKQPKDPPIFPTHKTEWWEEDSSPPLQASKDKATNEKTNSNTGNNYGSTGQFDEEEP AAFLLDNIVNVSFPSTGSSSGNAGLYDEFGIKDVPSDISNNSCSDEEDGSSGYSDEDDLE PPHRTCGLIFFDFLRFVAVSADMRLINTQIFPVFLDWKKMDILHISLRVLMTVLTLLLLL VEFPSFFRFLRLDYVPGETPPFSISNWIPRGIFYIFLSLVSFEQAIVVRAMDEEKHADTF SRFFNSFFIFVSAWCMLVVGLLYVILGAFCVQKIMIKVRREERKKWREYYERLHILDQQI DDEEESAWMLENPDSEAHGTRKPLYDKDASTNRGCSRHASSVRGDDGVYHARLSALYTPT GPLGRSVASATSNNHRCTDAADNHRSVCSDVRATHTHNLLFDSTTMVYNLVYLALFLTPS VLGYAARDSSSSVTKSFKFKDDLFLKFHPLDNAIGPSDIGTLMQALEDFAPTFKRIWKQQ VETKLLGGGCKVDRMRVPTFSYCEWKFPDRYDTEELPIAGQCYAHEDGFIQRVYYNIEKL VITCDADFDYADLDRQDYIETIFEAEEFVNLLNPPESLSDPLSLFSTLQSVEASTFDECN AELYINRYVDVPVFLKCLDLEGYGPDVANQLIEALDLYFVGEVIPALSEKILLQAPTGHI YNIVSPYESDREYGAHGSWYTGGNLVNQQTDCDGVLVTEFTYGFDVIWFKPVDFDCQDEQ LLNDVPQIIQEVFQMGSNFLDFVKSEFPGVDIIQDASVCDVVDPDLIEITFAPSRQPTVK PTSLEEIGGPCDRDSQCVDGLVCNQSTNTCVCNMDTNEGCSPGLFCRFSCVFLSNAPKCF DDQELRDCEIQYGPGHVCYDANQDGVIDALDKASGCAYFSPTASPSTDVPTKEPSTAAPV ESPTVSPSVELTMFPTIQVFRGVETLPPTEYPAPSPRPTCPKCEVQIPNDTTEDECPLVT SGTCGGGNRGDGICPFQGYCCSRWGWCGTTDDYCEDDFDAPTPSTVEAAPPAPTISIDAG MCGAQDVGDGLCPGENMCCSDFGYCGAGENYCFSTRVWTGDNPGVEDSTGKCGAGGIGDG LCPVVGGSQLCCSRFGFCGQGDLYCTGNNQLAQSDVAEGEVKLKSSPVPSDLRAAFGFRC GLTEADARSNCKPECTHHTQCDGDEECWGVQLNYCHTFEEGEHPICTDLDKADNDSRCGV DETSARSHCGAKCTDDSECATGEFCYPVLSNYCTCHEDNDKEAPLVFAKAQALISPYFVE SEMGSRFVDGEPEGVPRSSAKALSWTTFGTLLSIGVHFLLF >gnl|To_NUC_proteinmodels_ML|p94 MSSSSSYTKPPVAFDLSAESPCYTLNSGHKMPVVAYGTFRSEPGEVGPAVLAALSAGYRH FDLAHVYGNEKEIGAALKQAFDEGIVKREDLFLTGKLWNSDHDREIVPQACDHSLNNLQI EYFDLYLIHFPICWKHTGLSTPSWGKSELGDTPLIDTWRAMEKLVETGKCRSIGVSNYPM MLMHDLVTQAEIQPACNQIEVHAYYTRESLVNYCLARNICVTAHTPLGGGQANADAWSTP SPLKDSVVNEIADRHGKSPAQVLLRFLLQNGVVVLPKSVKAHRMAENIDLFNFELTDTDM ARIHKLDKYVSYKTNPNPLASFIGGKDAYSAEGTDIFD >gnl|To_NUC_proteinmodels_ML|p95 MGGDTTRQDGTKARGNGSVFDPSSSVDRQCLGRGGGGGMSDLLTVAGGPPSVVVVFFCQL TNVISFIQRSTVSSEGELAVSGPIPGVSRRLLSSRRLSNELCTAWVAALENVFSNNIANG FNGTDQQLISFKITSIDCETGTRGNFTELYELVSETFQSCTIGDEGCSETGKDDTEILNN TLSSVKDAVDNSISGSFNDEFSTAFSAVASEAGINDSDIADVQAAIAAKPNNIANFEPND IANFEPNEFANLEPNIESLEGRRNKIADAVPHNPSSDNFVPYYPVANTVSVDRIANNIAN YAGSCDSVAYPSSDNFIPDYAVSDAVPHNPSSDNFVPYYPVANTVSVDRIANNIANYAGS CDSVAYPSSDNFIPDYAVSDAVPHNPSSDNFVPYYPVANTVSVDRIVFAASAISSKSDIY SDWHSTLQPTTSAPTVHPFTGYEQVGGTGAGECADKDGGLFSYAQYNENASAEDCSVKCS GLATSAQVGLGVTSSACRCYFDGSAPDDEGQIFTSVDRSGTGPVAGVNNVSGENWQCYRL I >gnl|To_NUC_proteinmodels_ML|p96 MDSSAVTLGDAKINARRDLHPSPVMITDTLSDVRVKYHINPKEIGHGHYGVVRKCIDRET KACYAIKSIRKSKVGKVDVLRREVALLKECDHPNIIKLIEVHEDQKYLHLITELCTGGEL FDRIIEKTQSDEGHFSERDAAKLVKSILDAIAYCHDQKGIVHRDLKPENFLFSTKEEDAV IKIIDFGLSRHDDMQQGIMNTKVGTPYYVAPEVLNREYTKSCDIWSIGVITYILLCGYPP FYGDTDNQIFDSVRTGRFDFPSPDWDNISSTAKDFICSMLKLDGSKRMTASESLRHKWII EMTEVQGQGGRRNQRTSIVFAPRAIAFKKYRGMQKLKKAALTYLAQNATNEDIDELKAIF RKVDVDNDGTLTLEELDDCLKNAHFLPNVTSDLRNLRGELALSGEDSIKYRDFIAMMMDE KKVVKEDNLRMVFEHFKGSDPDCILASDVAALVGGEKHALEIMKMVDANSDGRIDFKEFK AMMEADGSRS >gnl|To_NUC_proteinmodels_ML|p97 MQAGSEIHACRPCRLTRLESVESDLDGRFGERDRMKILPFSVTSVVVACIGTTTSFVHPF GGSTSMRTRNYPKQQHIMSARRTSLGSAIAPPDDPDRQLGSDSNPSTKFGSPISDALKEL NKSSIDFLKSTVFDSFFEGEDRAYARFYALETIARVPYFSYLSVLHLYETLGKWRRAKYL KLHFAESWNELHHLLIMEELGGSERFLDRFLAQHIAFGYYAVVILLYVLNPVQAYNLNQD VEEHAFETYDKYLKDNEEKLKNLPAPKAAIDYYVDGDMYMFDEFQTGCEFRRPKIENLYD VFVAIRDDEAAHVQTMEQLQTELDVASIHDGECEIDIF >gnl|To_NUC_proteinmodels_ML|p98 MSLFYYPGSAFVFSELENESKRQHQNKCGGPSPAKKHCQYPSKHLPLVKPSPHFQVRETS THIQLALDVPGVKLEDIKAELVNGGRVLHLSGSRKVGAGSSFEEAKFEKRFSLGKDVDAS KLTAHLADGVLTLTAPKSEPKAQEIAIVRGHAPELTTTESKQEEVTAPTKTDEMEEAHGN GEQDKEYVMT >gnl|To_NUC_proteinmodels_ML|p99 MSLFYYPGSAFVFSELENEPKRQHQNRCGGPSPARKHSQYPSKHLPLLKPSPHFQVRETP THIQLALDVPGVKLEDIKAELVNGGRVLHLSGSRKVGTGSSFEEAKFEKRLSLGKDVDAP KLTAHLADGVLTLTAPKSEPKSQEIAIVRGHAPELTTTESKQKEVAASPKADEMEEAHKN GEQDKEYVMT >gnl|To_NUC_proteinmodels_ML|p100 MSLSFYYPGSYHPESAFIFSDLENEAKRRRQNECGGPSPAKKHCQYTSKHLPLVKPSPHF QVSETTTHIQLALDLPGVKLEDIKAELVNGGRVLHLSGGRKVGAGSSFEEAKFEKRFSLG KDVDASKLTAHLADGVLTLTAPKSEPKAQEIAIVRGQAPELTMMETKQQEVTPPKKADET EDAQEKEDDKE >gnl|To_NUC_proteinmodels_ML|p101 MNFKGATAFLTSLATPALTHGFHVAGKEGASRGVSSLAARRATLFPIVRRRTPFFRDMDQ MMEEMDTMMEGSLAMLQRPLAAPQLRRPLGGFDVSQDENEYRVSIAAPGVDANDLSLSLD SDGRVLRLQGQASSKEGGMAISSRFEKAVLLAPEVDTGKMTASFSDGTLTVVAPKVDPTA ALEHAQAKKIEIGVDESPTPAQLSLDENQEEAAADNDDTTIAETKLADEDPKVLRAEGKG GEDAEEKGKKWPARDFPY >gnl|To_NUC_proteinmodels_ML|p102 MSLFYYPGSAFVFSELENKPKRQHQNKCGPPARKHCQYPSKHLPLVKPSPHFQVRETPTH IQLALDVPGVKLEDIKAELVNGGRVLHLSGSRKVGTGTSLEDEKFEKRFSLGKDVDASKL TAHLADGVLTLTAPKSEPKAQEIPIVQGHAPELTTTESKQKKAAASPKTDEMEDAQENEG EQEKEYVMT >gnl|To_NUC_proteinmodels_ML|p103 MVSFGSTATGATTLLAVILSTSDAAAYVSHRTPRRHYSTALGARSRVPLFSRDVERVFRD FDYMFDSMMGDVESNFYSPLSLRRMTTPCLQQVPAKRQLSAPRATYQTSQDESEVRIAFN VPGAQASDIDLQLDEDKRILRITGETKLEEGDLHLSRKFDQTFTLARDVDMTKATAQFKD NVLTVTAPKVESKVRNLAIDVVESPPAIEQAADSGTTQETQENQEAAQNENANEKSLTVA VETDEPSSKSVIDLDAAKE >gnl|To_NUC_proteinmodels_ML|p104 MNFKRATAFLTSLATPALTHGFHVAGKEGASRGVSSLAARRATLFPIVRRRTPFFRDMDQ MMEEMDTMMEGSLAMLQRPLAAPQLRRRPLGGFDVSQDENEYRVSIAAPGVDANDLSLSL DGDGRVLRLKGHASSKEGGMAISSRFEKAVLLAPEVDTGKMTASFSDGTLTVVAPMVEPT AGLEQDQAKKIEIRVDEPPTQAQLSLDENQEEVSDNDDTLAETKLADEHPKVLKAEGKGG EDAEEKEKKWPARDLSY >gnl|To_NUC_proteinmodels_ML|p105 XMLKSLQLQGMQMSSFLVGTSVYSFLVQLIYTFILLSLMYATSGFRTAAEKECSDDGFPC YTKFGDKPDVDRGSISVMTEYNPDDGESGITLDAFYAPGGYVMMFGVIVLFSLSAPGAVL SIGFLPGHRMTLVLVTFVILAICSAPSFLHISYMTDTTLMAEYTKCSMSMIETLKCQDIR SLSPDNIDGLDLENFVECLGYELTASYNPFLFCATPTITSILPQFGAFNALSYMLVSNIV FISDPPEYAVDTFIPLLEAAGASCSGARCRFNYSRELFKESLGWMALGSVLLLLLGVSIA SIMVFPNQWVLKAKRGLLGMVCRARNAKTSITRDDHREAELPEVDQERQAVRAIVAETGK KEQGEAVEDREGSHAGNVIIEAQQAFPVLADVKAGTEAVTDVVPAQIPVLMSQLCKTFPP LGGAPAMIALDNLDLHVEKGEVIGLLGKNGAGKTTALKILAGIHDPTSGLGLVEGLDCQT DRNQIYARLGNCPQFDVVWPNQSVRRHLEFFARLKGIDAPLDAAHEIAEAVGLGAPEVYD RRAGALSGGMKRRLSIAISLLGAPSTLLLDEPTTGLDPSTRHEIWSLISSFATDDRAIII TTHMMLEADALANRIAIVKKGRLRVVGTQQHLKNTYGSGYLLQVNLSDDKEEAIEALLQF VRENIHVEAKKITKQARTIHINLPRDVSIQTIFYTLYSDEANKVAINQFLVAQSSLEDVF ISVG >gnl|To_NUC_proteinmodels_ML|p106 MADALKHEVECRLAEVKFTNEEDKDKPGTGGMKRKVLIAVALLGDPEVVFLDEPTAGLDP YNRRIVWDVIAEAKAGRSIVLTTHFLDEADVLSDRIGILKNGRLISCGSSLFLKHTLGAG YSLKWEGALFQVTNFVDDGNLISSENNCHEWSLPFGCEKCLPDLLLALSESGAKDIDLKL TTLEDVFLKTGSEEFDDEADDVENTQHDDGGDTDIEAGDGSKDILLSRAWDPRAPTKSLS FFQKLRLVESFVRTNAMKMKGSIALNISIPLIYLVVGLVVVSLIEVPPSGETVSNAAIQV SSPWTAGGFFGVKELTNRSIAPLQPMAEPESLADYFSGSLPALGGIFSGNETLQYAPNVD GFALQFGVAVLANYSSWLAEEPGEGISASVQQLPYVLDNPFRFDLLFLPMMLSFGFAGLA FSVLDVLLLKDNKTIDLFRVVGITEWLTYLGVTSYKALTTFSPFFVVLVVLGASIKSVLF GNAGRWLATLLICFAFAFSGTPISILMAKKFIRSSYKEATSWFPGVYYTLIALPYVAWSS ALQAAPSARTAILIAGDVLSIFPYFAFQRGLAGVIASSAEFHDPTLSWSNVWSFDSRIWY CILLMTVAGLMEWYYLYRLAHQREPKTELSEDETREHCQPVDVSNNVDLFAEVERSVADD EGINARELVKTFIIHKKIKDVRKKQRVIKQAVKGVSLGIRQNELYALLGPNGAGKSTAIN ILASQFSPDRGEVALAGRAVSEDDIASDSLYESGKVAFCFQEDALFEKKTVDEHIEFYAT IRGLNWEHEETQNHLDAIVKLLSLEKHRSKASSELSGGFKRRLCLALSMIGYPDALVLDE PTTGLDPSARRHVWDVLKPKRNGFDVPAILMSSHNMEECQQGTRIGIMINGEMVTTGSFN RLKDLYCNALFVEIALSPDITDCSASEKAMLDAFAGIDMKAEVYESLPYRFKLRVALRAT DEASMTTQLAEAFRLLEDRKTFVGIHFYSVSPLSLEQIFINLSRKQFEADEEGLLSD >gnl|To_NUC_proteinmodels_ML|p107 MATPWFRQVEVLSIKNGRQLRRQPVHLFILIFSSTISVVFAWLAGRDARGPQGELPPLTD CGTVPLEYIASLDDPYYSQHKVSMNEAWRGGLPQWLMTLGPTFCGISIYLILRDELSSKR WGMLKAADTSAKWISWFTVFVLLSLVNSILGSITAVALPDIHVFESVNFGVIFGLLFFLH VGLVSASFFLSAICGTVQSTALSIFIITVMIVASAAPSIATSIALGYGKYNANFSTDTGG SFWRYASTESWEDQYNLSDSSHSIDSCQSPIVSFDQSRRFKTAEERQAMQRDEWFVGCYT QAGASVLHTETFFLWFFIPQSYFAMAWGNVAGYTSLPGNEFSFEHASQSPESLSQLALRN VKGGAEYDQSADEKQLFSQGAMLFTEENWDGMSSRYYQWAFDENAPQTSNCPPYEESFCG EEDPNNPRGSTFYHPCANAKPGYPSSSPSVNDSLGLLFSLTVIYLLLAGYVSSVMPMGNG ARLKPWFPFLLRYWAGGCRKGRSDRYGESDEEIADTDDDNVGIVSRNVRKRYGKVEALKP FSITMKPGTVTSLLGHNGAGKSTFANMLCCVQNPTGGDILVNGNSVTTDQYTVNKIIGEC KQDDYIWPNLTAREHLELFAGIRGVSKAAMPAVVQKWLESVDLTLVQNQRVRTFSGGMKR RLSVAMSTIGNSIVVVLDEPTTGMDPVSRRYVWNHISEIKNGKTILLTTHAMEEADLLSD YVAIMANGTIEAYGSPLDLKTKYGSALQVSLICDKEQQVVVKEAVDTIFADSLDSVSLKM SHSGYSTLTIKKVRKESGVVTLEDWFASQGLPFLAASVDVLNDQGVTSVGDLKFLDHEVF LRLFTSESKTVQTKAASAWNELSGGDGVAPGGDGVSAASLTSFVGWLEEENSPVQEFGIS NSSLEEVFLTVX >gnl|To_NUC_proteinmodels_ML|p108 XSGHTGGSWKDFWLNASQPGGGDIDLIAGHLRNGVDPNYQHPEMESTALCEAVRAGNVKV VDFLLTYHLDDEGVVRVVDPTIPSAYEDQTPLEIAMEMKHHAMVDMILRSLPPDSYAEEA RYCRQILVSIPDGANVLHHKDATTVATQVLGLGHRMMALAPPSTKGQEDGSIDGVDSSVF LSNTANETGNTKFWPMSSVEEMVSFLSSSQTDASRKSPIKIDTWLCVLGGAKRKDDASLL ELMKSEYIQRSSTCEGNSTPDRVWLMIPSTCVTKQLSRQQLVWFLQRCRSRVAGNKQSTP KVNAMIIPCSWWDRLWRAGSPYWIEEGILARLLEMNELDGTASEFATGKVYNQILEPLPW AKTTRCPDPAAVSLWEEILSDSS >gnl|To_NUC_proteinmodels_ML|p109 MAPNLNPGADVEAAMPVVPPPSHAQSKTNRTGGRNLRWSRITKEVQVKEQSAGLLKGSIA NQQIAADSSMKSCKTKIILNAVSGSAEPGQVISLMGPSGSGKTTLLDVLSGRSSFDGGVI TLDGEVVDERIMKKLKKKGLDSTSAVALMKILDKLARDEGKTIITSIHQPSSAVFFAFDK LMLLADGNVVYFGTPIDSLEYVRELGMECPAGYNAADHHMDLLVVDSAIDDEDDSTIEEA ASSVSSGVKHRSIGGTTTKQKLIASWDAEALAKKVDEEAKADFVAVGPPRQLSRRQSSLI LRESSFKSTWFTQFMVLVHRSMKNSRSAIFTKLNLVKSGAIGLMSGLLWFQMPYTERTVY DRSSYYFFTMTFWVFDAMFTAYLAFPLERAIIFKERSSGSYRLSAYFFAKTTSEAPSRLA LPSLYMVISYWMSGVNNNFGVFIASTLCSLLSVLAGESIGLFCGAAVLDMEKX >gnl|To_NUC_proteinmodels_ML|p110 XPLTALALFDIIRFPMFMLPQIINRLVEAGLSFERVREFLCCEEYSKVGEGMLKEDGEVW MNKGTFVYDSKKPRLGDGKDRNHQKGIGGLIQQQRRLMQEAALDRKWEAMLLKAQLKDAE DKIAELQKQINPGLVCSLSDADLQIGDEEDDWTPSSLLSLRRVCMHIRRGEFVAVVGAVG SGKSTLINSILGEGRPLYGSELAVKGKLGAFLQTPFIMNDTVRNNILFGHVPTEDDGNQS AKAAGEAVDEARYQLSLNVSSLLHDLKLLSHGDQTEIGEKGITLSGGQKARVALARAVYH DADVYLLDDPLAAVDAHVGKDLFHKCIVDELLLGTSKRSPGDNNGANKSDDKGKPGWSAS LFGGPSTSSSNTSKPERNTTVILVTNALQHLSHPLLVMAESSAATLDEGDGSESVEIEDL SDEDDDKDALDLGNEPEVEEVIHPNRSPARTLTRRKSSVRKKSVGSASDLEEKVGTLMTD EFKERVTGSVDKEVYIAWAKAGGGLSIGVAVLGLFAVGEILSVTSKWWLTAWSGSSDGSR VFYFLGIYALVNFSAIASTFVRIMLFITAGLRASRLMFVEMTDAVLNAPMSFFDTTPIGR IVNRFSKDMYTIDEQLVLSGRSYLATMANVFSTIFVVVTITPAFLIGLVPIIIFYLHQQN FYMITYRELKRLDSITRSPIYALLGESIDGVLTIRAFNAEDSLNKRMVKMVETQQTAYHL TFASQCWLSIRLEFAGTMIVMCACLVAVLEQPRYGGNEHFAGIAGLSISFALSVTQTLNW TVRMASDLEANMVAVERVQQYSQITSEATRNSPSDKTTNWPTEGQIEFLNVELKYRPGLP LVLKGLNISIPSMSKVGVVGRTGAGKSTLMVALLRIVELYQGSIKIDGVDIRSIGLKKLR SKIAVIPQDPVLFSGTIRTNLDPFGTSSQESLTMPILTDESPGEYNDERLFEVLTLVGLY EKKLERRGSSNSLASLASSTDKGAGGRTQPVKSLSEEVSEGGTNFSVGQRQLLVIARAML TGACIVIMDEATASVDADTDARIQRVFRSEFKNATCITVAHRLNTIMDSDYVLVMDNGQA AEFDKPSSLLTKEDGMFKRLVDAWEEEN >gnl|To_NUC_proteinmodels_ML|p111 MKGFIDSSLEGVHGETKGEKRESNTTFETADLVDEFQWCSASADQEERPPNPLENASPLS RLLFLWTPALFRPKGENGIHEQDLPQIVANKDGSEANLKHFQDLWNEEKSRASHALANRK ANEGGGTAWSSKKNFSPSLKNALSSDWVRNSKRVQPFLFLSSASKLLQALSLGLLLQSFE DDDDGNGQYYFWAGLLVLSSLLLVLSHHQVFHEKQPNRAANDAERYILAANYGFFVLVFM LVPAQLFLIKCFCSKRGCCVKAGVAALTDGRIGLVSQFVEEAKVWKMMDFTSLFLDQMTS IRSCEINRILHINRYRAMNEALFFVATVVTAIVVFLVQVCSGHKLSTLSVFTTLSLVNVL FLETKYMTWGVMGMGDAAVSSRRIQKFLSVENLSSEFDPIPRDDKAHSISLSHVTAHWQG SQNTSEDNEKASCSPSVVALDDVTLDIGAGLNIVIGGCGSGKSALLHLLAKELPTSSGVY RRKDGASIAYFPQQPWLFAGTVYENILFGLPYDEVRYREVIQTSCLASDLEYMEEGDETP VGERGLNLSGGQRARVALARTLYRDADIILLDDPLAAVDSKVANDIYKSIKNVAKNKCVV LVTHQLRHITDELCILMNEGRVLCVGSLSTVSDASGGKIAADVAGGGSSSKGTDTEESPV QPVVSTSDPTKVSTVSKPALNDSTKVSNKNTFKNYCGAAGGIGPGLLLVFLFTASQTSMI ATIRMAGWWATVDDQTDSKVVSTVVTITLAVILFSVARALYYYRCVITASKVLHDRMSRS VVLAAASFFDTNTTGNILNRFSAALALGVYRLRDAFIAASRQLKGLEMHARSPILSALNE SLCGIMTIRANDASELFHRRFFDAHDAHTRAYFASMSVVRWFGFYMDLIMWAYITVISVS AALIHDQDWATFSPQDIGISLVLSVQSAEAENLFLSVQRVLDYCGLSSEAPLSTAEDNGI ASDWPTVGAIEVKDLSVRYREGKPLSLKALTFQVDGGTRVGVVGRTGSGKSTLLNSIVRL LEAESGQITIDGIDIATLGLSRLRNSISVISQTPVLHGGISLRRNLDPSQTHKDDEIREA LLCSSMLDVVDALPSGLDTVVNDEGATSCFSAGERQLLTLSRSLLEKKKILVLDEAMHLL TSVPIAANIDNYTETKIQQVVSELQGCTGCTILSIAHRLDVSELVFTFHQLDYRPTSCIC LARQTIIDHDKVLVLGNGSMLEFGSPHQLIEEKGAFFDMVQETGNGEALAARAKRSSDKL EFNRARAHRLLTVK >gnl|To_NUC_proteinmodels_ML|p112 MSDARAAVSEALSELTSSELVDEDAAEYIISILEGDARDTESRDTVASILSSIVEDEDEG AVEQFFERLDAKLSCGVTSSMEAVSIADDQENTLRKLDTAITMKQHDVQTFASGLVAEKD PTMDQEGQSNIQAFYANMIDVTNNPKALSERNRRKERQKQMRQKMEDDERKRAVEDELRM LEEDLLKSSNEANAQELTTANDNAADVHFIRFSLPNKKGGGPDLIVDSNLTLASGRRYGL MGRNGCGKTTLLTALASRQLNDGRNGVPKNMSMLLVRQEIMGNQMNAIDTVLKSDVKREG VKRWIQSIENDLNRLDNPEEGASTTEEPDEGKKLSKSKQKLKDRKRKTSSAKKIQSKKTS VDSAELAEDRRKMLNVKLADAYERLAQIEQEEGDPEPRARKVLNGLGFSTEMQDKPTSEL SGGWRMRVSLSAALFANPALLLLDEPVVTDVVHFHRAQLTTYRGDISNFTAVREENRKRQ IRQFEQQEAKRAHLQKRKYIDLHAQAGENGVKAAKQRKSRMKKLDKIGVMAASEGKKFKA SYDGEAEEIEEYQEDEEVELSFPDPGSFNGNIVALNQVSFGYAPDKTLLKDVDLTIDMKS RMALLGRNGSGKSTLIKLVVGALQATNGSVSIDPAAKIEYLAQHQLEQLDPDSNPMLSML ERYPGDGGNHHKGELRRYLASFGLGGEDLPVQKIHTMSGGQKCRLCLALSMYRKPHLLIL DEPTNHLDLETTEALIQAIDKFEGGVLLVSHDQHLLTSVCEHLYVVHQGEVELLKHGITK EETFKRYKKDVIAGRR >gnl|To_NUC_proteinmodels_ML|p113 MSDDAVLTSILFGVTAYIWLSRGFNKGRGGKSREEADEQLQGHPVLQPRRSTPPNGLTPS TSKRFSNEGVIANDKAVRGSPHRSPRTPGFARAPAADGDGTEERRRRADSLDERRRQLLE RVAYSDSVFSRENPLDYHPKNSNWSHFEHYTPEYTPSGRLCLSKHGETNSFTNPASVLDG VEEEDDDDGEDYLWCEPARTSSLANDPDDPPVSANIEYENRRLSSAGASVPCLPSKLILV RHGQSEGNVDEKLYTTKPDNAMRLTDLGWEMARAAGKALRDQIPKDETVHFIVSPYVRTV ETFHGLCSAWVEPDDFAHIGDRCKRLRKWYSRLMDSGLTWHEDPRIREQDFGNYQDRQKM KQMKEERHGFGSFYYRFAHGESASDVFDRVSTFLDSLYRSFESGRAQNYVIVAHGVSIRV FLARYFRYSVDQFHMLANPRNCDLVVLSHDGHGRLKLDGRCELELRDEGEEQDGSDSNEG RRRSRKTVVGYRKLAKLRTIPPQYVHHRTARVNEHDTV >gnl|To_NUC_proteinmodels_ML|p114 MMDYDQAVPVHCHMTSTTSTTVTEQAVRVHDSRQNFERETATLEPAKAQHQRNMTMEADA PLLTTVKVPDGSAEDQVVNERASRTDDPFDDPFEGHQSPNTGEDQQGAASSDTDALLKRI ESLERLQSGRKSVWNLEEFDLPESTYTLLITEPIFSAGFLTGIIALGMSITCLTLVLIQE LDNTSPDNIFGVPAGVETPVKIVQYLSIIIGVLMESEIPTGLEIMGKEIEQQQMGRRPFN RVRVFFSCILRLSVGYLFLLTLWMIVLQGDEVIKIFMDMLALEFVENIDDIIFELCKRGF LGRNLRYITNQTHSYEAPGNQLHRGSVVTQSSTESTNQPLTASTIASFNLESFSIWCKRI IRFIFYMNMLVMLAGLAYVSVTQDTGEYRCKSISVAFGVDRCEDDVWEGAWVIKEGEEPE DRLLIYSHFNGIYVEETLSLKENLASVEDRPMFTERNKEDGDRFQKTVPAEFMYCEDIEA WVFRHPQIRTKNTAVTGGETENECNWLLRSPRTDSFDLIEVAEAGGWEVWKGVKDVDYTI SITCNDCGDDADCNYNGKCVDQKCECYDELDFFGVYCELQKPCGEIRSEKDPNTTLKLAA DPYDDTSEFVEVYNRPLYYAHMSGIPFGLMRGADEDLAKYADMLGRRSLNNLVFDDDDFF DIHLREEAFLNLMKNYTFVLGFTGRRWYGQIKPWTPIEETGNDKFKEEEYHAFWAKTFTG LEKDDNATLIISEPTLEASPVGLDFFELRRRNKQVFGGGKYDYDYGPFGVLIPMIDYDGA GFFHCNRDDE >gnl|To_NUC_proteinmodels_ML|p115 XARNVLFGWHDFISSAEESTSAPSSLRYPSFPIREKPLDEHHVDVVVTWHCESIEWLSDI LPELSGQMTIILLHKISDMCGFRISAVTRTAPSFTLATTTRPCQGTPFFSKRGHHYTLSN SWMPSAQGYESNAEALNHFVHLALETDPTFLPIMPTLNGQPILFVDRDSNDNLEEQDRDD PPTLKKFDACAHDIPNRVRETHRILFGASPCPMKTFPFVPGFQFGVRSDSIRARPKSIWE GIEKLTYGGCSDIGYAVERLSINLFNSSELVSSPDTWDVPLYCKDDEKPYFNEPFDASVA HEFWSKYWRCSSEVHGGVSEMKRLGSPVYTSSVSCGGHVAKSCGECPSGNGESWCNGDCE WDASTDTCANSPSRLHKAYTSLIKNRLFQPVVNEHGEYVNIILVRSPFETKEQEDMYERF KDEILFLGIMSMEAYPLRSPNPHAQKFVPEDYISRFPGERLYCIIVHLTPSSNDGQLTSF LSTSWLNMYRNPDAIFSPDLPVIQMSHSDFSVLDIDYEDEVAQGKHDKFYDFIYFMTNTE DEVQSDCTGWGSFAKNWPLAKQAMDIMCGEMDYTGIVMGIVDSHGESCDLPPSCSEKVLK VQYANYFESLDYMRQSKFLLLPQVYDASPRVAVEAMSLNLPLLMNSEIVGGWKYINEQTG EFFRDDMSDFRDSLAKLMSNLDTYSPRSYVNTNYGSKKAGAKLRSFVEEYFAGRVTLPKR SGLLIPSEPAVPDVANSEDAKPEAKTFRLFSVFSTECSPFQDWQAQTLIHNHRVQQIQGY LVRLLACDDPHYVLPEHSYDKYRVVRTPNFGARGDDVYSPRNKPFSIAYWLDGFSDDDEL PDDDDVVAIIDPDMIFLSSDMNIESIDSGRGVASGYNLGSTWVGEWAWKFCDGKCDRISD DSDDVSFGAPYVLKANDMLRLANLWRTLVDEMRVIDQSWQLEMYAAIIAALRLDIAFTVE KSMIANAEDDAEPWDIAMWGASSAPEVGKLDLTATIQVAHYCQSYSISSFRWYKHDYHGL DIRKCDPLNNSFSTPSSNDLRLMAENRGEDLQSKAARNVWLLDSTMAVARDAIENYYAEF CTTSTEIVVNNEASPPPSMFDFRSSLATLGRKIMAFVDANLGLIPSEPMSIGRATSGEGA YDASQDSTETISIRSFVIGSHETKFKDFLQANLGSEGEFNWVHEENGWDQRVVDEWVGLA GGPTFDVSTYKPFEKDQFGPHAAGCYMSHYKLIQHLDEVGEHSHLASAASSAYIVFEDDA RCIRGWQNEVLKTMTALPDDWDILFFGGRPISYFHKFNGTKPGTSTPSSLRHDICSGMFG KARGPLAPDGSRNISNHDPYWRASYLVHTHAYAVNPRRIEHVLKVLEGKQGHEPIDARFA KAMDSGMLAAYMTPENICEQSSLMSDYTPNWDGLNPQPWLGHFGFPPEAVKQHPGIHEAH MWGRILLDGEDSPACVGVY >gnl|To_NUC_proteinmodels_ML|p116 MSSVGLDITRAQHESLDSTDALASVWSLAVQEAVESGKTVSWGGRGRGRGGRDLSRKTFQ PLDITVEDVCLEYVNDTSLSGLGKGGSKVLLDNAYLKLLPGRVYTLVGRNGVGKSTLMKR IAACKIPGFPPHISSLLVQQEVFGHDELTPIDILLRNQETIMKQSKESNKFSISQLEEEM DALDLDADDYQESMEAICNKIAELEESEDAENDGLTERAHEALQFFGVPESMFHKPTSQL SGGIRKKVSLASALMSRPQLLLLDEPTCHIDIGGILQLRKLIADCAQSNATIVLTSHDVD LMNDVATDTIHFDEKQHVLSYYSGNYRCFLKYRNESLAHQMKQAGALEKQRSAMVSTIDN LKKKSARSDSRQAKKKINKTIKSKTKKLERHGVEKNELGHRRTAQSDGGIRKGAINGLPA EQRNMSHKKLLKAAEINIGPVPDKAVQFDFEPVNCTWGDEPLVSVMDVGHSFDEGGDKGA DFIFDCVDLSVREGTRTVILGENGSGKTSLLRIISGELSPTVGSVHFASGVTVAHFHQHS VDELMYDYEDGRNDVVTALTLLSERYPAKTEQDVRGMLTRFGLSPKQADTNVRFLSGGER CRLCMADLMLQSPHLLVIDELSNHLDVESVDALIYGLGRYNGTVVMASHDAHLIRSVGGD AFVLFGGSLQRVEGGIDEYLKVFNQKYNR >gnl|To_NUC_proteinmodels_ML|p117 MNADFALPLNGAIAVLEGQSGIGKSTLLRTITSLHSGFEGSATITLGGEDRLAFEPSEWR KRVLYITQDGASSIPGTAAMFIESISLSLDEVAQIVGAWGMPKTSLGQPFSSLSGGEAQR VLLAIALASTPSILLLDEPTSALDESTKLLIETSLQEKSKSCSIILVTHDDDQKRRLGTM MLTMEEI >gnl|To_NUC_proteinmodels_ML|p118 MKRRCIALVLASVSSVSAYTQSIGVRRELSSSAAATHHPTLRHHSRNLRHFSSCRRRIGG REPLLQSLSSSNGDSFTPKTQTVIQSLTYFLRFVVQTIQSRRVIEERQDGHEPEPPKLGI RASFRKLNDSRKSLIRLVGYDSSLLVPAFSFLVLGAFMSSVIPHYYSSCISCVAAGEPSR EKIVMAIAGLGITTLLEAIFTGCRGALFWIAGTRANYNVRVKLHRNLLLQEAAFFDETET GFLLSRLNNDVNKIGMVISFHVNIVLRQLAQFIFGSAFLLRIQPKLALVAFGGIGVVAWI SKVYGEFARVLAERLQELFAKSSAVAETSFSLSETVRAFDGVSIESDRYESSQYDALQLE EVQAWAYGSHKFVSDAVQAALQCLLLFSCWTLGRAGSIPAARLTSFLFYTNYVLESSNEV GDQWAKIQSAIGASTNVFDLIRRIPKVRDPLEVTNTLREVNAAQNLINGANDHRPVIKFS DLTVAYGTMEKPALKKVNLDISRGDKVAIVGRSGSGKSSMLRTVLRFYDPLAGNCTLDGV DLKAMKRSDLASKVVVVEQEPHLFPMSLMDNVLYGIEMDAVDESTGEKSYSDKFRVAVSE ALSLAGLSVTGEDNQLGLELDTRVGEGGRTLSGGQRQRVAIARSLVRYPDVLLLDEPTAA LDSQSEKVVVEALESAMSKTRCMLMVTHRLGVIRSLGINKVVVLDRGKIVEVGDPEELLR KADGLYSQLALEQGIRPSSTNVSLSPA >gnl|To_NUC_proteinmodels_ML|p120 MGFTRTRVVVAVASAISLNTVFLGVSILFLFFPVAAVTRFLYPCLSTIEEEKSLDSIVEP VVEGDVIRLAGGLFLSFVVASSGLIVPLLPCCHCENGENQKQLRGALLRTLLLLQGCLGL ALVGTGLLNLASSVENDDYSDDGGGNSTEATRAQQNKSPVHHARCSSLMDQNVLWLGVGT VVLTSVVMMTSFWPDQRSTEDQFEPHRRPSEGRRCCLRRNRQAVPDVISIVDPLLPDDSS RENERMDEARSIEDPEIDGRSDTEQTSRLQGTMRVLKLAGSESLYLWVGVAVLLVRLPFS LAIGTDGACTRPNFVSAAITSLIAKDFEGSRRSVVLIFLCGSVDSVLDYWCIYLFGKAKE NIAKTIRIDTFQAMVRFEAAFFDANKTGDLISRLAADCGEMATVRIVGISTYMLIRSPIL GGCTLAIVPLVGIANKLYGDFLAENAKGIQSALAESTSVAHETLGSIKTVLTLACEKHEI SKYRKKIDRFYDLNMKQVIASGVYFMLVSTFIINTCVQSALLMLGSLFVQQGRMTADVLL SFMLYQGQLQEYTLNLFQSFSSLVKSSGAGDRIFYLLDRKPPPPATASLLVKAQEQDSSD GVRAGEDITLDNVSFAYPTRKHVLALDAITLNVQKGKVVALVGASGCGKTTLVSLLQRLY DPIAGCIRFGGVDLRSLNLSLHRQNIGMVTQEPTLFSGTIQENIAYGTSASLEDVVVASK IAHAHAFIEDFPKQYQEQVGERGGRLSGGQRQRICIARAIVRKPKLLLLDEATSALDPEA ERAVQEALEMNSTLRTRSSREETPPTSYDPQRQGETLSSESTLKKRDRKPAAAGFGWIH >gnl|To_NUC_proteinmodels_ML|p121 MLGETDEVPGLFSVQTKFFHVYFLPLWFWPIQTYLVLQGRTTRAVAIPMSVKSIMVAWMG GIMWICVVTGIILALCGAAFAGLVLSLGALVFQCLLMTRSIRHASHKRAAELCSALGPID GAALRRLVDRNFQQRLGTVTAEAVKDDDANCEEGGGSGGGEASDDLESRV >gnl|To_NUC_proteinmodels_ML|p122 XKTTQLRILAGELEPTAGDVVKSSKDLRVAMLRQEFVDELVLTRSLRDEFMSVFVEEAAI MDDLAAAEKELGEMTGDDPDAMQEVLDRMAKLQDKADTKNVGALDSRVAKIMDLMGFEKE EGDYAVSAFSGGWKMRIGLGKVLLQEPNILLLDEITNQSRRPTNHLDLESVEWLEAFLRN QNIPMVIVSHDREFLDQVCTKIVDAEGGICTEYNGNYSQFLKLKKARMDSWQASYNAQEK KIKEERQWINKFRIKQPQAVKQREAQLEKLMKSPDYVQKPPFVGKPFRFRFPDAPRLSPE VAEVRGLSHGYGDGANRLFEDSDLFIEKGDRIAIIGPNGSGKSTLLRILMGKEDPDEGSA KFLGQNVFPSYFEQNQADVLDLDMTVVDTVQAASNTQSYNELRALLGQFLFKGDNVEKKI ENLSGGEKARLSLCCMMLKESNLLILDEPTNHLDIPAKEMLEEALQHFDGSVVVISHDRF FISKVATTIVAIEDKKLVKYGGDYKFYMDKSKDFKKKVEARYFAGGSRIGNAPVIDLEEL NKPKKKNFGGAKNAGMVTRKDKGVKNAKRQNRK >gnl|To_NUC_proteinmodels_ML|p123 XMTCPEWAQLKFSEIDYQDIIYYSAPKTKKKXTFDKLGITLSEQKRLTNVAVDVVFDSVS IATTFKQELAEYGVIFSSISEAIHDYEELVEKYLGTVVPIGDNYFSALNSAVFTDGSFCY IPKDIICPLDLSTYFRINDENSGQFERTLIVAEENSQVSYLEGCTAPQYDNNQLHAAIVE LIALNNASIKYSTVQNWYSGDEKGQGGVYNFVTKRGLCSGTASKISWTQVETGSSITWKY PSCLLVGENTQGEFYSVALTNNYQQADTGTKMIHIGKNSRSRIVSKXELPLEFASEADRL LSLKLEGSVG >gnl|To_NUC_proteinmodels_ML|p125 MPLGDVDGDAIRDGTTNPIVFPMVMEFPEGLAKPTSQSRYNQLTIDDSDDGVPQSGEIQF SRPVLSPNSEFYQSKRSPHLTNSKHRSLESKYTYILSINDVSQNFSRDSSIFELGRIRGS SWLSQDSALEELGCLRRYQQSFPSFQFLTSLSSNTRCNSDNCPAPEYCHHCLNTNTGVCG KSPSNNYEYGEYKDRAGRPMVWESQATYIEGQYIDTVTYLDTHHNGHHEFRACVVHNEAT DCLKQEDFGKIEEGADPHLLEYIADYNIGGDGATVPDPNYWWRGMYSGGQAGATKEFKRK PFQFCLPSLIVHRSNFFIVHVSYTQNGNPADKFKLPDGIYGEKVLLQWKYITANSCSPPG YEGYFATHSYPASWWTQGNCAEVTILPSGNSEPTTAEPTMSMSPTSSPVVGGPQCLAKFA DCTNSPNGCCEDLGCRQVDAAGYSKCLDTCTNGPPSPPTNSPPTNPPTNGNNPPATNPPT NTPPAPTPPTPGSGCCSRDLGTCTHLGDAFCNANKENCEGPCGKHWLENGDISDTCSPLW TTCSTDSDCCLWSHCSAGECDHDGTWKPPAGSPQPTSPPVLPSPTPPPTPNPTSPPASSG GYCNWNGCNGAVEGGPWCNENSDRCTNGCSGTWCSGGGGPPPTPRPTPRPVTVPSPTPPT AGGFEDGLVMTHYWDCSGQSCDATTLQPWNKDKYRSPKGYQPQDPANFGGAVYGEKMWLT AAIMNIDMGPSDSCCGELDSGEGGCGKCVLVQNPDSINPDWTALVMKKNTCGSCVSGPHI DVNVPGFDVLQYSLANTCGAEGTGLTKDESTALGEWYKTYQNTKQAQLLCDRLPLQYQEG CQLFTEWGWTTGNPKNAKYRVVECPQAYKDFIASQFDENGITPAGMP >gnl|To_NUC_proteinmodels_ML|p126 XRVNQVGYLRAATKVGVIVSSASTPLEWQVQDDSGSVVLSGLTTPFGLDEASGDKVHQAD FSQIEQLGSFRLVVDGVGSSLTFSVAPSLYPNLPHERQCYPETQAMNYFYFHRMGPDDIL AEHLIDERYARIALHPKDTAVPAYPGWCQGCDDFDLLGSWADAGDFGVYTVNHAISAWTL LNLHELFPSAFTDGTLNLPESGNNVHDILDEVDYGSRFVRGMLPKTNGQSNGELASHKAH NHAWSAFTITIESENAQNGAGTRSAMGSSTAATFAVARVNAQLARTWHAQGNDAAYVTLL WEAAEDAWNRAYGTNKIYNAGEASPGPAVGGGDYPDSQIADDEYAAACEMYLAALSLGVS DADFFKSIVVNSSFFGKMEQWDWASVAGAGTLSLYAVSNDLSPSQEQTIKTNILAFADKI KTAIDEEGYPSNLNFPSEFGKYPWGSNSFIVNRMIALAYAYEVSGDISYQKYLMRSMDYV MGTNAMDISYVTGYGDKAETDTHDRWAWTIGQDEFWPRGWLSGGPNNELINDYETPGGVA AAKSYADPGTAPHAWGSKENTVNWNSPLAWVAWYIENKVVPNLGGCDGNCAPVAKSQSSK VQMNESVSLVLVATDYDGADTALAWTITDTPKLGSLSGTAPNLVYTPFSNTRGVDTFSFI VTDNNMLASNKATVTLTIRDCDYMDIFQLPSSYPPLTVNYNYVHVSEDGPNFGQTRTPFH KVQVGSQIDQFAMEFDQSPYYLDLKRCMSSSSLASPASISLSGCGISGLDGDYWIGQDGG NSIWVEKSNRWAIIFTNDVNLSPEFCRTTGPPPPPTTASPTSKPTPAPTSEPISPTAKPT PSPTSPPVVVPTTVEPTKNPTNNPTKNPSAGPSNPPTPRPSLRPSIKPTPKPTSGSNVCC TNRNLGYQTCDTNNWCNANANNCSTCGGVVMEVPLERNGCCTWGGDCATWNPENNRGCQY KQSDCESDCGGTWQFF >gnl|To_NUC_proteinmodels_ML|p127 MKFNQSTLSITAAAFFAPAHLVVDAHGYLKSPKSRNYYANTDGKWYGGTASDPAPENCPH CLNRGGTDGVCGISSNANYEQPPNAMGGLISSGEVPTQECFDSNPLTFVEDPLYGAVPDT NHPERAYIPNAGSDDTQWTYKHRYQLPSNLEGELVIIQWYYLTANSCNPEGYDDYNFAPL GIPDNSLGQCNYPLPSNGVPGKPEQFWNCAEVKISTDCEASTPFPTKQPSVPITKSPVAF STARPTISNAPTFNNAVPTKQPSVPVTKSPVAVSTPRPTISNAPTFKSAVAESREDSRLI AYLANWEACPTDDMLDAYTHIVIAFAVSYTWSAAKNNCDTSCSVSAPPTCGNQVRQDLID KWRGQGKKVVLSFGGAGMGGSWPGDNNNCWDYCFGKEEEVSTQLVSIVQSQNLDGIDLDY EYCYDTEGTQSGLCTAKDTSLFPTEASFDTAAQNFLTGITSNLRQKMDALGDDYELTHAP MDSDLAPTSKYYQILKDQSENLDYLMPQFYNGYIKVVSDGFTGTGAGAYSAESVYSNLAM DLFPNRPDKVVFGFCVNGCSGTGSNANGQQAVSVLQQVKEFDQGQYSCNGGAFFWVASAD ASGAWSDPVAAELALTAGCLDGTTPPPPSPPNTTSNPTSGVTAQPTSPTTSQPTDSPPTT TTSPSSSVIPQPTRPNTSQPTDSPPMTTPSPSSSAIPQPTRPNTSQPTSAVTSEPTRASK PPTTSPPTATPNPTSGVTPSPTISKAPTFKEPVTSKPTNAPIEVGPSFAVFDAALGAPRC ASAASSCDSGDLLKGMSCSEPNGSNSVDECVDGSKAIQSIERITAASLSGEVFKEGDMVV ISADVHAWLDGQSDVAVDIYHAVSASYPIWNLVHSEVVNGGMATVSANHELPGGSPTQAF RFNIRHGGQPETGCSGGQWDDVDDLILHVEPEVITTTTTTALPRVKARHALFD >gnl|To_NUC_proteinmodels_ML|p128 MAEEHHGFVVHQKHRCDSCFVKPIVGKRYTSAENKNFDLCGRCFAKGKMEGLTVAPALDK DRVASRDFVLKLKLEKGAAVQVRRIRVSELFVQGKNLSYDKLMGVAAGFLDDIGKTTLAA KSQVTYIDSDGDKINISSDEELNDSFEGILKKLPIITPFRITVSVSSNKDKPVGVGGRIP IVGATVPRRSGRNVSYSPAATLFANQPKAAQGPTADNALFIHGRHTCDSCSASPIVGTRY HSTKIPDFDLCTKCYDNYEGDKETDFQPQIHDRDLKMQKRWSKRQNRRAAKATGNVAGMW HRTNGDLVEFLKNIQEQTGTLIESAEIIQIPKDEGTKKTEESDAKVQDVNLAAARSPTVA EKKNPPVQAGGAADSFLDDAEGSIAEAIGKTLDVCMQAIEDAADKSTAGGDGKMPTINIK KIDAITSDALSIASSAITGVTEALQQMEEVNRDVVSASTEPLTSAVAPGSKSDERKASAK VGDILSDDEESEQWSIVSDDKARAKNAAEIVRDFSLLKEEQEEDVSILSVEPLSPVLVAK WDEELRQLRELGFRNERQIVDVLEGLEAAHLMANSDDKISLDAVINKLV >gnl|To_NUC_proteinmodels_ML|p129 MRIKSLSPEQQRFAQAYRSMQLSSSVFGVCVVQIKPQLEALLGLPTDALDKEMKLTQDLM ELFVEYQVPSDLLSYNGHSESAALEDKIANVKANVKAVTDVIESQKEKQLKDERARTDMA MEEMLQQEEQMPPPAQYYGASASSAKLKRKVKGGRGRAHPPKSIERFRSLERAAPSMAMA ACADEGDYMYEAADSISIDDEGGAGGYKGAADQTENGSRTNDIDGQGTTSGQSDQNDSTS EGVDFTLIPKQLDQSVERKGDAASLRSTTIKTSSNWTRNRQANLLSKPKKHGLDADEIRK EKAKAFDLLDALSRSGSLAIAHSELHVVVAMTHCFEKDLMGTVICDNVNPIEKLEGSTLL LASAVHGLSARELVRDANELQRLRMTVPRSLEG >gnl|To_NUC_proteinmodels_ML|p130 XQIDTLRNETRNCKALYRSSYNNRRVLEEGVRSLDEESNLYEKRNGRTMISFFYQNPEGG SVFTPEVLRSIHSFEDFVMNLPQFNEYCYGNAGGCFPFNSIAPYFFTNGELVDDIDEVLM KFPGSKLALMISFFYQNPEGGSVFTPEVLRSIHSFEDFVMNLPQFNEYCYGNAGGCFPFN SIAPYFFTNGELVDDIDEANNALEHDAKWSGAALIFIAMMIFVKVRNIIAVLAGVLGLVL SFTSALYWQYHFDFNELTAMHVAGIFVMLGIGADDIFLTIDSFEHSKLDFLKDSKGERLS EEHLSEPPIIKNRMILAYKTAGSMMLVSSLTTAICFFSNVSSAVISIRDFGSYMGCVVVL NFVHVMTILPSALLVDELWLKPLRRKFWAAICLRTKPSPLDEKCNVGDDIFDPQCPVSVS PDCCNIEGASGGSVGPTPVQTAVGLATHNFDEASGEDVTPETSSTSAEELQSLQNQHGGA TFLNHIDDMTKLDRFFVTRYAPSVRRLRLCIIFISLVVSVTLAVLASSNFSVYDGTIIVF KSKYNLGRVQRVVGEYYPDELVEMYTDENSDVIDEISSDAVGSVNVVQGRPEGEVSEGDS TMIGDIFSTLAPIASPNQKPTSQQAEPSNESPQTQTSSPTESPVTDAPSDSPVVPXATVQ ASNAAAWPTLKPYADSTCAFKWPTLKPYADSTCAFKLCAFRPEAHYGDSSFAAYAFKSGK SSTAATVTGDNSAESAXCPEGITNEKFCLKNSNTEIFQKREYYSVSLVWGIPPVRENSMV WKIVDDFTGTTAIEKSRGSSVDPSDPRIQLTLLEIVQKARSNERLRIHPQLVWIEALRDF AKTAGIGFPVKKELFFGIIEVLKIQSSAFRKSVELEIATKGTGLAGDLLFTSVSFFSEVP IGEERYQRDAKELWTDFAATVNENVVAEDMPLLEVQSDAFLDSQRTDAIVQSTLNSYFVA NGLYLVVMLVFTGNLLLTLMVTLALILIFMCLAGLTFAAYGIDFGPVESLGVSIFVGLSA NYLLHIAHAYHRSTISDRSVKIQRAIYVTGSPILWSAISTMGGSAFLFACRTWLLTELGI LIVSIIGLSLTFSLGFLLALLSWVGPIPIDGNLHSWDLVAVIRFCLRCIRKDGRGQEDSD SVVDWVYVVTMNL >gnl|To_NUC_proteinmodels_ML|p131 XSSLSLVDLAGSESAKNTGAVGTRQKEGQYINKSLLTLGHVIHKLSEXXXXHIPYRDSKL TRLLQSSLGGNAQVCIICNISPVLAHLEESHNTLKFALRAKKIEQQARITEVVDEKTLLK SYREEIEELKRQLKEAKASSADVPITSAGQTRTMLDEDSDDDQDDTHVLVSAIQNLESLI LKGGNRGASKTDTAEGPNRALDSALMQAESSAIATPASNRPPVASPQTHLSTDGDEQNNN LFDELHRIQDMLGSVMRKRKGGNSARKPATGAKLDFFKTPQRDAEVEKLRLQLQEQEVAS TMRKADSSFLQSQLNEKEELLKDVSVLLQALENRHLELETENRQLKEDLAQAASLLEEGE LKRIELEKCSREKG >gnl|To_NUC_proteinmodels_ML|p133 MLFKKLAVAAVAVLGAVAPQAYADDEASMGTVIGIDLGTTYSCVGVFKNGRVEIMANDQG NRITPSYVAFMDNGDRLVGDAAKNQATINPENTVFDVKRLIGRNFSDKSVQADKKLVPYS IVSDQNKPVVAVTVAGKESKYAPEEVSAMILQKMKATAETFLGKEIKNAVVTVPAYFNDA QRQATKDAGTISGMKVERIINEPTAAAIAYGMDKTGGESNVLVFDLGGGTFDVTLLTIDN GVFEVLATNGDTHLGGEDFDQRVMQYFIKMMKKKSNTDISGDKRALQKLRKEVERVKRAL SSQQQARLEIEDLAEGFDFSETLTRARFEELNNDLFKKTLGPVGRVLEDADVSKSEVDEI VLVGGSTRIPKVQSLISEYFGGKEPSKGINPDEAVAYGAAVQGGILSGEGGDATSEILLL DVTPLSQGIETVGGVMTKLINRGTTIPTKKSQTFSTHQDNQPAVLIQVFEGERSMTKDNH LLGKFELTGIPPAPRGVPQIEVSFEVDANGILQVSAEDKGTGKAEKITITAEKGRLSEEE IERMVREAEEFAEEDKKVKERIDARNGLESYLYNLKNTLDDDEKADNISAEDKKELQDIV DETLDWMDENPEADKDDYDAKQKEVENIANPIMRNFYAGGAGGDADDMGDFGDDEL >gnl|To_NUC_proteinmodels_ML|p134 MMPAELCVLNKGRARSKTTEKSRGPIVGIDLGTTYSCVGAMKNGKVEIIANDQGNRITPS YVAFTESCERLVGDAAKNQGTVHPQNTIFDAKRLIGRLYSDKSVQADKKLMPFKIVPDRD KPMIEIGCNGKAMRYAPEEVSAMVLQKMKTTAEAFLGQEVESAVVTVPAYFNDAQRQATK DAGTISGLKVERIINEPTAAAIAYGLDKTGGESNILVFDLGGGTFDVSLLTIDSGVFEVL STSGDTHLGGEDFDQRVMQYYIKMVKKKSNVDISGDKRALQKLRKEVERTKRALSSQQQA RLEIEDLVKGVDFSETLTRARFEELNNDLFKKTLGPVQRALEDAGISKSDVDEIVLVGGS TRIPRVRALITEFFDGKEPSTSVNPDEAVAYGAAIQGGVLAGDDALNDIVVLDVTSLSQG IETVGGVFTPLIPRNTPIPTKKSQTFSTSVDNQPAVLIQVYEGERSMTKDNHLLGKFELT GLPPAPRGVPQIEVSFGVDANGILQVSAEEKGTGKSEQITITSDKNRLSQDEIDRMLEEA EQYAEEDRKIKDRVDARNGLESYLYNLKNTLDDDSTGGSLPSDDRKELLDIIDETLDWID DYPEADKGEIDTKLRDVENVANPIMRAFYAGNDDADGDANFGDDEL >gnl|To_NUC_proteinmodels_ML|p135 MASVTGESVGIDLGTTYSCVGVWQNDRVEIIANDQGNRTTPSYVAFTETERLIGDAAKSQ AAMNSSNTVFDAKRLIGRKFSDAGVQSDMKHWPFTVVSGTGGTPIIEVEYKGEKKQFKAE EISSMILVKMKEVAEAYLGKEVKNAVVTVPAYFNDSQRQATKDAGAISGLNVLRIINEPT AAAIAYGLDKKGDEKNVLIFDLGGGTFDVSLLTIEEGIFEVKATAGDTHLGGEDFDNRMV DYFLQDFKRRHRKDMSTNQRSLRRLRTACERAKRTLSSSTQAHIEIDSLFDGIDFNSTIT RARFEDLCMDYFKKCLDPCEKVLRDAKIAKNAVDEVVLVGGSTRIPKVQNMLTEFFNGKE PCKSINPDEAVAFGATVQAAILSGADQSEKLSELLLLDVTPLSLGLETAGGVMTTLIKRN TTVPAKKTQTFSTYADNQPGVLIQVFEGERSMTRDNNLLGKFNLDGIPPMPRGQPQIDVC FDIDANGILNVSALEKSTGKENKITITNDKGRLSQEEIERMVQEAEKYKAEDDANKNRIE AKNGLENYCYSLKSSIEGEEVKDKIPEEDKTALLNAINDATTWLDANQSAEKEEFEEKQK ALEGIAMPILQKMAGGGGGMGGMPDMGGMGGGMPDMGGAPPADDPAGGPTIEEID >gnl|To_NUC_proteinmodels_ML|p136 MALQLGLRRTLTSTSFATSKCASQSALGGFASQAMRLKSTDAGDVIGIDLGTTNSCVAIM EGRNARVIENSEGSRTTPSVVAITEDSSRLVGMAAKRQAVTNPENTFYAVKRLIGRSFGD KEVKDIQGLVPYKIVKSNNNDDAWVADTRNEKFSPSQIGSSVLGKMKETAEGFLGRDVSK AVVTVPAYFNDSQRQATKDAGKIAGLDVLRIINEPTAAALAYGMDKVDGKTIAVFDLGGG TFDVSILEISGGVFEVKATNGNTMLGGEDFDEELLEFLLKSFKQESGIDLSGDNLAMQRL REAAEKAKRELDGLAQTDVSLPFITADATGPKHLNIKVTKAQFENMVNDLVEKTVEPCQK CMKDADVSKSDIHDVILVGGMTRMPKVQETVENFFGKKPSRGVNPDEVVAMGAAIQGGVL KGDVKDILLLDVTPLSLGIETLGGVMTKLIPRNTTIPTKKQQTFSTAVDNQPQVQLKVMQ GEREMAADNKNLGEFDLAGIPPAPRGVPQIEVSFDIDADGILNVSAKDKGTGKEQNIIIK SCGGLSDDDIERMVRDAEVNAEADAQKKQAIEAKNEIDSLIYSTEKSLNEHKDKLDDEAK KEVEKAIEEAKSVKESDNLDELKSKTEALSQASMKMGQAIYGQQSSGDASEGEEKKDETT VDAEFEEKKKDDDKGDEKK >gnl|To_NUC_proteinmodels_ML|p137 MSVVGVDFGAKHSVIAAAGRGGVDVILNGNSQRLNPNMVGFDQSRSMGEAASSTALSNYK NTITNIKRLVGLSYDDPRAQAEMKRCPFKCVPYAHPSGPAGIAVSVRLQDDQRTIPIECV AGMMVKHMGQVAALKAAADSPGATTAECFPRDWVVAIPGYYTDAQRRAFAAGCEMAGVKG VQRFMHETTATALAFGIFKDIRKEFSKDRSTHVMFVDMGATTYSVSIVDFQPGRLVVKSA QYDVDLGGRDFDAVISDWIATKFEEKYRGKLSGAVRDNTKVMLKLGVAAEKAKKTLSPAG VKEARINLECLMDDLDFGISLRADEYKAMCEPLLARLAGPIERALAETKLTAADLSSVEI VGGATRVSSVKLTLAQVLGLDASAVNNGLSTTMNADEAVARGCALQSAILSPRFKVLPYE VVEYQPFPIKIEWDGAHEAGMEVDAEAGDATPTNSVVMFERGCNFPIVRRVTLRRSGKFT VDAMYDESALGNGFPAGSSRAIATFNINSPADADCKIRVNVKQDISGSLTLSSAQMVEEI VEEDKPAEEGGAESKAPEGGEGDAAKEGGEKKKPKLKKTNLEFSILRPLDWTPAEMQKEN ELEVEMENVDRVVRETADARNELESYIYDMRDKIISESQLMPYCTDQEKADFGKMLESME NWLYEDGFDATKSVYAKKLEELKVVGGPIERRSYEASARPAAMSTLQKTIEKYTSWLNTS QGDEAYAHVTDDERAKCSDKCDKASAWMYDMLDRQGGLPANADPAVTVEQIYATNKGVND TVSPIMHKPKPKPKKVEEKKEDATPPETKEGGKEEAKPEPMDTSGSGGDPEPMAE >gnl|To_NUC_proteinmodels_ML|p138 MRVMHPLTQLLGLAALPAAVNSKAILGVDLGSLYMKVALVQRNSPLEIVTNMHSKRKTEQ MVLFDAGSRFYGADASSLMARKPHLTPSQFSVMLGRDDEHPSVRVLKERHYAFSPAYNET RSGVCLTVDGVQFTPEELVAMVLTHAKDITAAYGVTSPLKDCVLTVPAFFTQHERRALLD AALLADLNVLALINENTAAALHYGIDRIDEKPVNYLFYNMGAGSLQVSVVRYLSYPHKAS KYAKEKTVGGFEVLGAAWDATLGGASFDARLVDFMAEEFNAIWNEKRGTTDKDVRTVPRA MAKLMIQANKVKHVLSANADIPVFIDALTDDVNYQSHISRAKFEEICHDLLERAAAPIEK ALKLANVTLDELDAVEMIGGAMRVPKVQEAVSDALGGLELGMHLNSDESMALGASFHGAN VSTSFKVRHVGMVDVNTFPVNVDLTDLTVGKEKKKTGGGLFGIGKKKAEEAAEEGWAKHA TIFKLGSKLGVKKTIAFTHDEDVHVEVSYDESDTLPLGTGLSIEQYDVSGIADFAKEMKE KDPDLKPKVSLQFELDGSGLTRLVKAEAVVEETVMVMEEVEVDDDEEEEEAASDDKKEDE EAKSDEPAAEEKQDESEEKKEEAESAEKEGEEKTGDTKEDGAEKKEEPKKKKKKTIQVEK VSYRSRERNTIKSTEFHSAHQEKKKKHTRTLKVAKYHVGPIQPYSAEIMAESQAKLDTLA EADKARLLLEEAKNKVESYIYHIRNKLIDDEEEIGKVTSEEQREALRQSTEDAEEWMWDA EDDLKTWEDKYTELSEPAEKAFFRMAELAARPKAIEALTTKLGKIEDLMKKWETSMEHIT EEERQEVLDKVQDVRAWIEEKVKAQEEADPTGDAVFTSEEVPGQTKKIEGIVARLMKKPK PKPEKKNETSAEKTESDGAEPADSEEKKDGEESSEEKPDSEDSKEDEPAEGEADADSAEG DEAKAADEEAKESDEL >gnl|To_NUC_proteinmodels_ML|p139 MLDIVINSLYTNKDIFLRELISNASDALDKLRFLSLTKPELLGDKPEMEVKIDYDPEART LTIRDSGIGMTKDDLVKNLGTVARSGTTNFMKSLKESETNDISQIGMFGVGFYSSFLVAD RVSVASKSNDEDTQYIWESLNGESSFHVGADPRGNTLGRGTEITLHLKEDADEYLSEYKL RELISHYSEFVTHPVSLREIKTVQVPVEKPESDEETTSEEGEDDDIEVSDDEDIEEAEPE MEDVTTYEYVQINTDPAVWARDKDEITDEEYQDFWKVVSKNEPGDAEEWIHFNAEGNINF KSILYIPSQVPSALQQGQLEQMPMGLRLYVRKVLISDEFDLMPRYLSFIKGVVDSDDLPL NVNRETLQESKIIKIIKKKLVRKAIEMIRKLSQKTMPEDDEEPEEAEVDADGNVVVKEEN NKPAKIHPYIEWYKKFGLSLKMGCIDDSANRDKLLKLLRFKSNKVTGENEFVSLQEYVDK MPEWQKDIYVFPGESIKVLEESSFMDAFNDRDLEVLFLTDSLDEYFIGNVREFDGKKFRD ITKEGVKFQDEDEDMAKRRNKVYTETFKPLTKFLKKLYGSDVQRVSISKRLGKAPAIVSS SEWGQSANMERIMRAQAFAHGVAPGENNLPTGIIELNPRHPFVIKLLESLPEDEDAEVSD ELKDSAWILLDMAIMSGGFPIRDPKKYSARMTRVLKGSLGVESLDLADEIEPPEEEEEPE EPEFDMPGMDMDGIQMMDIDDIDMDSM >gnl|To_NUC_proteinmodels_ML|p140 MSDDQSESYAFSADINQLLSLIINTFYSNKEIFLRELISNSSDALDKIRYQSLTDSSVLD SEPEMQIKLIPDKANNTLTIEDSGIGMTKADLVNNLGTIAKSGTKAFMEALTAGADISMI GQFGVGFYSAYLVADKVEVVSKNNDDECYTWISEAGGSFTITKTNDSGLGRGTRIILHLK EDMSEYLEERRIKDLVKKHSEFIGFPIKLYTEKTTEKEVTDDDDDDDEDEGDDDKPKIEE VDDEEEAKKEKKTKKIKEVSHEWEHLNNMKPLWMRKADDVTQDEYAAFYKSISNDWEEHA AVKHFSVEGQLEFRAVLFCPKRAPFDMFEGGAKKKHNSIKLYVRRVFIMDNCEDLMPEWL AFIKGVVDSEDLPLNISRETLQQNKILRVIKKNLVKKCIEMFSDLTENEDAYNKFYEAFS KNLKLGVHEDSTNRAKLAKLLRYHSTKSGDDSMTSLEDYVGRMDDKQPGIYYITGESKRS VETSPFLEKLKKKGYEVLYMVDPIDEYAVQQLKEFDGKKLLSATKEGLQLEEDEDEKKAF EEAKARTEGLCKLMKEVLDDKVEKVVVSNRLADSPCCLVTGEYGWSANMERIMKAQALRD SSQSAYMSSKKTMEINPTNSIITALREKADADQSDKTVKDLIWLLYDTSLLTSGFSLDEP ATFASRIHRLVKLGLSIDDDDADDDDDDDMDDLPALDDDDEGESAMEQVD >gnl|To_NUC_proteinmodels_ML|p141 MKLSVAASIVLSTAVPSCAFVAMPPSARRGVTSTKFAGACGSAAGARPLMMSEVVDAETV EAPAGETFEFQAEVGRVMDIIINSLYSNRDVFLRELVSNAADACDKRRFLSITSDDAASV TPTISIKTDKDAMTVTIEDTGVGMTRSELQNNLGRIAQSGTKKFVEALGDGSADVNLIGQ FGVGFYSAYLVADKVDVITKSMQDGSPQLRWSSNASSSYTISEDDGEPIEGSGTRLVLHL KDDALEYLEPTKLEGLLQQYSEFVEFPISVWKEKTEYKQVPDEEANKDLGEDEEPKMKTV PETTEGYERMNTNKPIWLRSPSDVTEEEYKDFYQSAFRAQYDEPMAHTHFSLEGQIECKS VLYIPGMLPFELSKDMFDEDARNIRLYVKRVFINDKFEDIVPRWLKFVRGVVDSQDLPLN VSREILQKSKVLSIINKRLVRKSLDMIREIESDEDDSKYIMFWNNFGKYLKVGVIEDQRN KDDIVPLLRFFSSKTADEYTSLDEYIENMKEGQEQIYYVAADGRDKAEMSPAAEKVRSRG YEVLYLTEPLDEIMIESVTNYKEHKLVDVSKEGLNLDGDDGEERKKKEEELNENHKSVCD FLESSLAGKVSRVKMTDQLAGSSPAALVQGAYGMSPTMQRYMKAQTVASGGSDAGMMGSM SQAVLEVNPSHPIVQDLEAAIKENGDGEDNESARNSAVLLYDVAALTSGYDIEDSADFAK RILSLMSSKAGSSGVQDAEVEAASEEPAVEETEVEAKAVVPEVVSDDEA >gnl|To_NUC_proteinmodels_ML|p142 MLTHHILTKLPRLCPVTHRPSAHWSHTRRSSRPIVRDRVDCASETIMKQVFASQTFTPGV LAKLRPTTAPVDGAITSLPSRVESDSMGSLEIPPGSLWGAQTQRSIQNFPIGGLESRMPF EVVYAQALLKKCCASYWSDEANGAKLPKEVARAIGQAADEVVAGKHDDQFPLVIYQTGSG TQTNMNVNEVLSNRAIMILGGEVGSKTPVHPNDHCNMGQSSNDSFPTAMHIASVVTIMNR TIPGLKTLHEALKAKSVEFETLVKIGRTHCQDATPLTLGQEFSGYAQQVEYAIQRIEHSL DALYRLALGGTAVGTGLNTTEGYATEIAAKIADETGLPFTSAPNLFEALAAHDSLVEVSG AFTTTAASLNKIANDIRFLGSGPRCGLGELSLPPNEPGSSIMPGKVNPTQCEALTMVCAQ VIGNHAAISVGGMQGQFELNVFKPVMVANLLDSARLIGDASASFAERCVVGITPNLERIS QLLHGSLMLVTALNPHIGYDKASKIAKTAHEKGQTLKEAAIESGHLTAEEFDKWIVPEEM IGPKPRK >gnl|To_NUC_proteinmodels_ML|p144 MVVGGAVKVGVSEVTQQEVVTPGASEPTTGGGMRGLLQPLSSLSFIRTDSTIEGTRSGST MISSAPTGKRSYRRLALASRITLVVFLLVLATLASLLLYRVLKWNETRFAKEQFVSMGDR ASQSVLNLVSRKRLSMASFSAVISEQHPNPEMWPNVHVDGFGRIVDVIAKSGLHEQMGFA PILSYDGLREWEDHAYAYFLSRPDYFPNNTAVNPPFGRGVWRIGPNKTRVHDNDELTRDR GLLLPVFQCCIDNPGDRIRLFNLYSEPKRRKAIDGVINCTRSLRERYLFNESVNRFPDYS CGSQTEFTRIVRFEHRGPAAVMMQPIFPADDPWNIAGIVLSPIVYDEIFEDVFDRDLTGV HAVLESEESKYTYTITDGLVESFQKGDVHDAQFNDLAQSTYLTWTDLEDIAPMPRLKLTM YPTDGYIKAYSTDNPLNAALAALFAVIFATMLFLLYDFFVRSEISERRQLLDAKRRFVRY VSHEVRTPLNAVIMGLACLRSDLDNQRLVNSSEEENLNGLSESLGLLEDIEMSANSAVEV LNEVLQYDKIERSALKLELSMVEIFDTVEKTVSEFKLPAAKKEIHLKVTYEADNSNNTRH ATQAGDEIVVVSSAGDLPRHVRDLYAVADSSRICQIIRNLVSNALKFTSNQGRVLVRISY CNAPIMNSLKHRQQTRSGKRTSKGSDGSKVTVKLNNGDKIDVEPRGHVLMTICDSGAGMT SEQLDSLFGEGVQFNADRLQGGNGSGLGLHITKNLIEQHHGTLDAHSEGLGRGSTFTLSL PLFSAPDYKSFQENFVPCSPNDSFTQRNLRILVVDDVLSNRKLLCRLLERKGHRCDMAEN GQRALDAIKSDTEGGYHCILIDFEMPVLNGPDAVRRIREDLGCDVFIVGVTGNMLTEDVE YFISRGANCVLPKPVQLPILEELLVEHGV >gnl|To_NUC_proteinmodels_ML|p145 MVLLLSRFFPPQRVTLSEKVLTRQSCVACSVCLPVGLQIGRLHLAILLVFGDWVRRRGAM FVSACIAMAFGCRLGEALALGSPLQSAARHSIGGRRSGSAVFAESQPAGSSATVAMTLDD IQEAKDSALRGNVTSASQLVNVLVANRETTSYEQYLDSMLDEQAPFWTRLPLAKFSRRAR QRRLCELLDMTTPMDADXXEEDVDSRLKRRRRALFILLRNISTSSDYEGISKLLLAAKKD AKSNMSQEEMLRRTPDLETPTYEVLSRGKDGLEIRHYLRFSVASVKMGELKSTGSDQESI QKISNPQLAGASSFGALAGYLFGKNQDATAMSMTTPVYSTGEGMERTMSFVLPSDYWEDE GKAPKPIEDSAVKIAPVDGCDRAVIAFSGLGRKGDVDKQRRKLIELLKSNDDWRAAEGVP VVLAQYNDPFTPPWKRRNEVSVEVSPQSQ >gnl|To_NUC_proteinmodels_ML|p146 MGCCTGCNVCSRIQFGVGAIGLELDGCEQPPPRQPTHCPCTVAWSSLLYLVGESLFHPRA RGPWKMLLLPALLTLGLSTPGGLGNPPGLCCFVPGSGDFGGRRCRATGRVSCIAIRYCVH GPVRCQRHILLHAKQKGVVSDEAGGNSERSGIISKIRKIFAKTDEKEADTPSPREGRITF DSLRLKMARAMSNVNVFRSRDEWVVACTKARVGPGQIVPCVVNGLDIVIFASRDGTRLDA FANSCPHLGSPFDLATVERKPVVQEKGRSGDGTGDGCVDCIVCPVHQTAFEIQSGQVQGE WCPHPPILGSIMGYVKPKQSLVKFAVRLRGKNVEVRVSTSVDRVGLGDEGPTGLTGIK >gnl|To_NUC_proteinmodels_ML|p148 MHVTILYGSETGNAESIAKDLGKSINEAAKEGGKSEFTTANVFEMNQFKRKKLMETWAAS PANGMARHALIVVCSTTGNGDAPENAGRFIRFIKKPVPESLSKGCADPTKPLTNVAYATL GLGDTNYDQFCASGKLLDRKLNEWGAVRALPLVCADEATGLDSAVEPFLDSVLDGLAHAC DSGERKDEVDDALGGGVAAVSLTQDEEADPKPDIGRESECSAAIDEEKGSGSPSEGLANE SNGTSSSKPKTEQSTSTSAASKSASPLYILYGSATGNAEHIAKDLAKTYETYLSNPSFLG YFTSVQCCELNKYKKNCLDTWSSSPDSNNLQKVKHGIIIVCSTTGNADAPENADRFVRWI KRKTTKGDTFQHCAYAVLGLGDTNYDVFCAMGKLGGTRAVSLGMADEATGLEEVVEPWVG NVISELAEVCKGHTSIGSLNQGMRLPEANKRPIRSSSPTDEEKKMEDGDLVPFNSRSTGV RTIRALMSIPITDPLPDVPNSELPTMVSSLSSCRLIDETTRDRSESLAAENMTVSSASSG FLFTASRPYESKILNARYLTKTNTACAAKVAEVLDGGDGKLVEALDMYSHHFPLSEDKNG KRVIEMTMSLPDDFTLEYEPGDSVGMIVPNSRDSIGFVLEMLLRNHGIQRSQSISVDEGD PVTVEEALRTTVDLSGTMKKKRLLLLANFAKDAEEERALRLLSRASKPGEPDLYDTFVEQ QRRSVVDILHEFPSAQSISLEGLLGCLTAIPPRYYSVCSSPLKDRQDGNECQVKVAFSVV DFNTPVVQCDVTSGRRVRGLATSRLECVSSPFLSGRQPSAHATLLIFPKPTHEFRLPADI STPLVLIGPGTGIAPFIGFLSHRLAQIASLESTEAAHVASEGTWRGSYELNPDELNISKS DVKGLNVAADYLCKQKTGDVDLFFGCRHSDHDYLYQHELERFRDVGLISNLYTAFSRDDK EHKTYVQTLMLTDETCGKRLVEMITKKQASVYICGDGNAMGRDVQNAIVSLLAKDLMDSG KCETAEQAKSGGIAKIDQMKSFGKFVLDIWS >gnl|To_NUC_proteinmodels_ML|p149 MIHVFMVFQHPPETPTCSSIDWRCACFWAAADEAPPAPPAAPDDPIVNPSGISDADTSAL TNLSSPCRNQTMFAIDRVDQQGVTSQIYLAEKSMRNHPARFDASRERFATQRIHATPNNA VTKDKTSRNGPREPQGAFSDGGDTQQMQQPNGGGPSRGAAYLSPAASATLSYDLDDADTV PTVLSSTDSSAPPSPSGSQLGDGPERSSSPLFYGLPRDGRTAGEDGAHYDAEDDPYADVR ERMERLSRGDRARHAYRDERAGDGTDGELFDPHRDITVFAAMTTYLGYIVLIITGHVRDI CASLFRKGRYFRSSKRAASAGAASGVYPSDDPGRYAPLLKSWENFYTRRLYHRIQDCFNR PIASRPGASIRVLERVSFDGNKTMSLLGRLSDLGDGRGAGRGAAESYSAGRHYGETDDGR AVRRCLNLGSYNYLGFADDWDVTCRDGVLSSLETLPSSVGSCRLEYGTTSLHRQVEEIVA RFVGKEDAVVLNMGFNTNATIIPTITSRGDLIVSDELNHTSIVNGARASGAAIRTFRHND PADLDAILREAIVYGQPRTRRRWNRILVVVEGIYSMEGEYCDLRGVVTVCKRYGAYLYLD EAHSIGAMGRTGRGCCEYTGVDPSNVDVMMGTFTKSFGGMGGYVAADRGVIESLRRDCAG SAYHGSLSPVVCQQIISSFKVIMGEDGTSIGRRKLTALRDNSNYFRMRLSDMGLHVLGNY DSPIMPVMLYNPTKIAAFSRECLKRGLAVVVVGFPAVPILMSRARFCISAGHTRRDLDGA LAELEEIADLLMLRYSRSTFG >gnl|To_NUC_proteinmodels_ML|p150 MSNSSSSSTTVASTSSSASILSEAHVVSRTNSFEQQQMDAPNTPSRKRRHTTAPRSGPSS LQPSSALPLPKGHIRRTPSEIQLADDMARAEYEDVRMFSRLVVGMQHQVIRDHAAGRGVN PLSKKSLAGVVRTKHAKEDDLRGGSDETDGEEWVIDGIPESDESELSPSGENGTECENSH DCIDGGAADDDGGVFSLDL >gnl|To_NUC_proteinmodels_ML|p151 XIETSVILLDYVASAQRSSRKNCFRSILQGYPKAGKVVYSGTGKEVPIELKTNVILDEVR AGGSRSSSAVPYTSRAQSSLEGKENFSDIWTCRSSLGFTLAQKMVGKAYIVEGXPPGKFC EPNMTTVGSQNNTGX >gnl|To_NUC_proteinmodels_ML|p152 MTLTPEDVVGVLQGRGWEATIVKQSECSDLVPVESSGYLKCVDGRGVDHTNTRGPKMLGG VYAIAHNRGLKTTDDLQDICREVSEKGYIPSVHGDGDGNMLGCGYCKLWLTGKFADLDPV KGAPPTYSADDGAAAVKAKGQVEMCKGSHAEKFVYINFVEDQTIEPNHDDQKFVVDAWAA MKFDLDVPSYLVTAAATVERLGGPKIAKLVVP >gnl|To_NUC_proteinmodels_ML|p153 MKVFATAIVALALNADAFAPPQGQARSSTSLNSLEKYADELVATANAMVRPGRGLLACDE STGTVGSRLESIGLENVEENRRDWRELLFRSPSLGDYVSGAILFEETLYQNAADGTPFVD VLKSQGVIPGIKVDTGLKPLVGGVEGETWCSGLDGLYDKCVLHYAKGARFAKWRTALRID VAKGCPTDYAIDEAAHGLSRYARICQEAGLVPIVEPEILIDGDHDIATTARIQEVVVSRT YQKLKEVGCLLEGTLLKPSMTVAGVECSDKPTPEDVAKMTVQTLERSVPCSVPGITFLSG GLTEEDSSIFLNAINSIERKGPWAMTFSFSRANQSSALKAWAGKKENVEAAQKALLARAQ ANSEASEGKYVAGSQPSDQESLYVKNYVY >gnl|To_NUC_proteinmodels_ML|p154 MSMPEKVKNADGFIAALDQSGGSTPKALSMYGIEESEYEVNTVSMFDCIHKMRTRIMTSP SFGGDRILGAILFQDTLNREVEGKPTAKYLWEEKKIVPFLKIDKGLATEENGVQVMKPIP DLEEVCGAAFSNGVFGTKMRSNINSANAEGIKAVVSQQFEVGKVIISKGLVPIIEPEVNI NAKDKGECEAILKQEILTNLDTLSESQNVMLKLTLPSSDNFYKECCDHPRVVRVVALSGG YSKDVANEKLAKNSGIIASFSRALTEGLSAKMTDDEYNKSLGESIQSIFDASKC >gnl|To_NUC_proteinmodels_ML|p155 MKTAVFATLAATAAAFAPSQTGRASTSVSETKADLEVLAKKLNPIVGFYDPLNLAEADFW GQGNEATIGFLRHAEIKHGRVAMFAFVGYIAHANGFKFPWAMQMDGTPFPDANGNPPAVW DAVSDDANTTCAVGNAGRPSRTSSLDLTGIPHPVPFNFYDPFGWSKNRSEEAKASGLVKE INNGRAAMLGIFGFLCAQTIPGSVPLLSGVVPTYSGEVMAPFSTNYIGEPFIN >gnl|To_NUC_proteinmodels_ML|p156 MKLSTKMVAAASAARALAFAPTSYQARSSTALDASAGLFYSTQTGNTETVAGYLADATGL EMKDIGDVKDSEIAELDTLIVGAPTWHTDSETERSGTEWDSWLYDTLPNIDVKGKNVAVF GCGDQQSYSDYFCDAVGELYDLFEAAGANMVGLTSTDDYDHQGSKAERDGKFAGLLCDED NQYDLSEDRAKAWVEQLKGEGVL >gnl|To_NUC_proteinmodels_ML|p157 MGLLGDVLRCLSPLIVTGSLLGNGVMMLMVSKISALTNASTRMGFRGIPGFRRRTALSYP CTLRISARLERWREKQTCWLELFFPILSFVLKNLLMARDWFGRVLGQDNHRRLIILSTLA LSPSVLYFLSGAKSALPLTSRINHVILANWFGFLGVLPVNLLLYTTARFSVFVRALNISE VTILRMHIYAGTVCLIGGIVHGIYYTAMWLQQSRSLESLFPLTWECWTSLLDEDGGRDAG CHRKFVNLTGVVSALTWLVLILTSLWKVRRRYYKTFYFVSSTLTRIIENMKARDNVNAVT ILHAHAVESWYKSLFQRGLQVTSFTEVPSSGGCIEMSFATASSETDLYESALKTIGRYTR LTVPAVSIKSHPFSVFTHPERCGDITVLFRPCGPFTREVARALSSYQDGECSTADPIKFT SSGLHMGTAHQFEDAMSHENNVIFAGGVGIVSYVSLLLAVHTKGQARDQSDSSQQERKTK IFVHWICREEGLIAHVLSKHLRSICQDLSSRVSITVHYTGSREGTNASNPNSDSVQLGSP SRAFTESMYDNSSRTMSAAIVPILTQGVIMFGGMWLLQRNPGSRSLLKYPVAMLQVFSTS LLVSAAFIGLSRFAGIISLRCCKKYASLDTSEPKEQDVCESDHTSPISESESWSEEEGDS TLNLNISRSGNMDEGDVEVQTYGSTDAAFANVTHKTGRPNMGSILTQVMDDSVSKDVGVF FCGPSAMMNGVYSSANGIRKDNGKTKHCLPVQGSGRISLYPEAFEM >gnl|To_NUC_proteinmodels_ML|p158 MMKGFTFFAVALSAATTDAFATPPQSQPARAVSPSTRIYESTVPANLAEQAKEISDEPLM NPDNPNLPALKGdydwdaeyaadddWITENVPGKIVVNEVDLAAQVTELTKLEDKYRKIR EDQEYEDARIIGFVPKAEMYNGRAAMFFLVVGLLTEYWSGISITGQIEEMARVGGFIGPD F >gnl|To_NUC_proteinmodels_ML|p159 MKFPVALLASFLGASSAITHNADIYYHEKQEIGPAMWIFNPDGLTIMSPAGETILVQEKS KVCPNLRTNYQGVETDDCSFFDVQSDGHRYVWAANHDDSPHKIDVFDINTGEYAGYAPTC STPLDLNYHPLRQEMWVRCASKSEDSNGEVDVFSTNSLSSNHELVNFNATGSRAYGRMET HSTMGNVGYATTYNSPYLTKFDLSGKDVMTEFEIEKAHASYDSTFSHANRHVFAAVRVCC TCGFEGADVESCGRGSGSPVLVTTGPSASNEMQNGTCSSGCKGSAADTIGVVEFDTVKEV FVGNHNSVAGNGGVPKSAPDGKTIALLPFDGGATLRVLKTGQNGAESSVAADIPVDFQGG TPGKQAISDLAWVQDGSRNFIVVASNVDNSLVVADMDDGYRMVKIPLSNNEEATAAGNRQ VEWAIGTNYVWVNGGQTEEVYIVEIGDSIDDTKVVKTLQGVPDGKILYVKNYEREAQVQD MLSLLAEQQQEQQPQVTVTAAPAETSAVELRQEFSAAMDDLQGLNEDSNTETLAIAGLAV GCVALTWAVVATVYFTMMKEDSKPSPAPLASQSLDVETAKPAVANGAKEQALPVESDAVT LGSKQVA >gnl|To_NUC_proteinmodels_ML|p160 MRLINAFSFLVTGTLLPIACGQRPPGCKVPFGPDNCDVADYCRIDLDQAAMEGELAKLTE DSFQKAREIYENGGHSMPLAELTLATPLTQVALKGANVTGYSEDGLYVAGRVHIDRDAGS ASLQFEYTDPLLCQGWGTGEGDVRVSGCLAADGVVSLEFGDYQYAYNPLEGNKNGRTLAT ISQMSGPDSCVYGSCAELYTDYYGLEFFANEWINAAFEQRQTNFENGVVDFSQIGFDGRA KAISKGTVVLSTFVEINRLLTKSVESCRDLNMDDALHFLRAAKCMYTGSLVGHSEDYISG GELLFTLADKMCVSFRTCGANGDELSGTSQVNSVITMAFQFGEDALSMPSISEGCPIFIE IVEPITSWLYVPIIQGALRSSYRMESNATDEREYVEGTTYAAALLPX >gnl|To_NUC_proteinmodels_ML|p161 MYLVTALLLSIGGLSSSFVAADGHDGLCEATCSSVEGSEDEVCVFTTKVNLFAGELGYYQ FEECGDEVNPTLAMEVGKTYMFSQAERSNYYHPLGFAYYADGAHADADELEPGIVPPDSS SSCGDDMSCPAPMYFRRQEYLGTYSNNAMLVPSTTGEDNFGLDDYEPLYFYPLPNWVELG PFDVYLKFDVDDFEQDIFYFCHIHQYMTGRIKLTKNGEVISESDEPAIDYEYDSVSGHDE TCGTYGLNEFQLPHHECPEKFVCDVPEDNPELANFADCINSMDCAMMAGMTTKADDEVSL FLHQMIPHHQNAVNMAKALLKTGKLTCDDLADEDTEQADDCALEVILREIINNQNAQIQG MRSILEGKGYPETNDCX >gnl|To_NUC_proteinmodels_ML|p162 MKSVAALALIGSAAAFAPAQTGKASTKLNAFEDELGAQPPLGFYDPLGMLNGDCSQERFD RLRYVEIKHGRIAMLAFLGQITTRGGLHLGGNIDYSGDSFDSFPNGIAALIGPDSIPTAG LVQIIALIGLLECGFMRDVPGAGNEFVGDFRNGYIDFGWDQFDEETKLQKRAIELNNGRA AMFGILGLMVHEEIVPLGYDADLPIIGHLA >gnl|To_NUC_proteinmodels_ML|p163a MKSVAALALIGSAAAFAPAQTGKASTQLNAFEGELGSQPPLGFFDPLGLLDDADQERFDR LRYVEIKHGRIAQLAFLGNIITRAGVHLSGSIDSAGNSFDSFPNGWAAINGPDAIPGAGL AQIVGFVGVLELFVMKDIEGTGNEFVGDFRNGSLDFGWDTFDEETKLSKRGIELNNGRAA MMGILGLMVHEQLGGEIPIVGAM >gnl|To_NUC_proteinmodels_ML|p163b MKSVAALALIGSAAAFAPAQTGKASTQLNAFESELGVQKPLGFYDPLGLLDDADQERFDR LRFVELKHGRISMLAFLGNIITRAGVHLGGSIDRAGDSFDSFPNGWAAVSGPDSIPTAGL VQMIAFVGLLELGVMKDVTGENEFIGDFRNGAIDFGWDTFDEETQYIKRGVELNNGRAAM MGILGLMVHEQLGGTIPVVGQM >gnl|To_NUC_proteinmodels_ML|p164 MKSAAVFAAGLATASAFAPAKQAAKTTSINAFEGELGVQPPVGFYDPLGLLDDADQERFD RLRYVEIKHGRISMLAFLGNVFTRAGFHWDNKIDLAGHMCKDYPNGLAAIFGPDAIPAQG FAQIFSLIGLMELFFMKDIEGTGNEFVGDFRNGWIDFGWDSFDEEEKLFKRGVELNNGRA AMMGILGLMVHEKLGGSIPVVGDLDALRVWTPADGVTIL >gnl|To_NUC_proteinmodels_ML|p165 MMKLAVLATLAASAAAFAPAPAAKTSSALSAFESELGAQPPLGFYDPLGMLDDADQDRFD RLRYVEIKHGRVAMLAFLGQIVTRAGIFLPGSIDKAGDSFDSFGVGYAALDGPNAIPQDG FCQILLFVGFLEASIMKDIPGTGNEFVGDFRNGSLDFGWDTFDEETKLSKRAIELNNGRA AMFGILGLMMHEQLGGEIPIVGSM >gnl|To_NUC_proteinmodels_ML|p166 MMKSILVAAAVAGASAFAPAQVGKTSTSLNAFENEIGAMVPLGFFDPLGLLKDADQAEFD RLREVEIKHGRISMLATVGYLTTASGYRFPDFPADVPAGLGAWKALASTSDGQNILGQMA IFFALAEAANRNVDWNDGWEPEFPGDYRNGSLDWGWDRFDDATKIKKRTIELNQGRAAMM GILGLMVHEMMGVSIIFPNPSGL >gnl|To_NUC_proteinmodels_ML|p167a MMKSVAALALIGSAAAFAPAQTGKASTQLNAFESELGAQAPLGFWDPLGFLDNADQETFD RLRYVELKHGRVSQLAFVGNLITRAGVHFGGDIDKSGNSFDSFPNGLAAINGPDSISTAG LLQILAFVAFLEIRVMTDVTGESEFQGDFRNGYDFGWDKQSAEWQTQKRAVELNQGRAAM MGILGLMVHEQLGGTIPIVGEM >gnl|To_NUC_proteinmodels_ML|p167b MKSVAALALIGSAAAFAPAQTGKASTQLNAFESELGVQPPLGFYDPLGLLDDADQDKFDR LRYVEIKHGRISMLAFLGQITTRAGFHLPGSIDTAGDSFDSFPDGFAAIGGPDSIPSAGT GQMLLFVGLLELLVMKDEANGAAPGEFIGDFRNGYIDFGWDSFDEETKLFKRGVELNNGR AAMMGILGLMIHEQLGTDMPIIGQL >gnl|To_NUC_proteinmodels_ML|p168 MKSVAVIAATLATASAFAPAQIGQSSTQLDAFQNELGAQAPLGFFDPLGMLDDADQARFD RLRFVELKHGRIAMLAFLGQITTRSGHFFDGNIDYSGHAFSSYPSGLAAIFGDDAIPKAG IQQILAFIGILELAVMKGIDNDQGNEFVGDFRNGKFVNSWNSFDDATKMRKRAIELNNGR AAMMGILGLMVHEKLGSNMPIIGQL >gnl|To_NUC_proteinmodels_ML|p169 MMKSILAATAIAGCSAFAPAQTGKATTALNAFESELGVQPPLGFFDPLGLLDEAGQKRFD RLRYVEIKHGRISMLAFLGNIITRAGLHLPGNIDLAGDSFDSYPNGLAALFGPDAIPTDG FLQIVFLAGFLELFVMKDVTGENEFVGDFRNGSLDFGWDKFDEETKYSKRAIELNNGRAA MMGILGLMVHEQLGGELPIVGQM >gnl|To_NUC_proteinmodels_ML|p170 MKSVAALAFIGSAAAFAPVAPAGRSSTAVNAFSLETMPGALPPMGFWDPLGFAEKADPNT LKRYREAEITHGRVAMLAVIGFFVGEAVEGSSFLFDAQITGPAISHFTQVPDGWDALIVT MIGAAEAQRAQKGWVDPAEQAPDQPGTLKDDYYPGDIGFDPLGIKPEDPEELDLMITREL QNGRLAMLAAAGFLAQEAVDGLGIVQHFQVGMIEKAVQL >gnl|To_NUC_proteinmodels_ML|p171 MKKSLGAILLTAASSQAFTSSTCETRRVPLHSTVIDEAESMTIPSAGEALLAGESYARQE TITPKSYLDDGFIYGLDDSGLERPKGKNANVVVEGDSLETTPFQVGLVSSTFAVQAFFAA NAIHTLFEQTNGDVGQTAAFSVATVVASWVAADFGSGIFHWSVDNYGNGRTPIMGNIIAA FQGHHSAPWTIAYRGFCNNVYKLCIPFGIPTVAAINYLSGDNSMAALFFTFFCWIEIMSQ ELHKWSHQTKAEVPSIVNVLQDLGITIGRVPHAKHHTAPFEGNYCIVSGLCNETLDRSGF FRWMEHTVYKINGVESNAWKLDPELKERTLSGKYAMSE >gnl|To_NUC_proteinmodels_ML|p172 MLASSSRIVLRTSAGIGRRAALRAAVARCAAEAPHLASYSSLAESGRAPHYPSITLASKQ QVARKSALPIEEDPIRGDYDSSAIYSGPDACARAGITSLGINGPTTIYKNQTFEELFQHE VANAEGEVADAEYGDTFCVDTGKFTGRSPKDKWIVKNIGSESEALIDWGDVNQPTSPEVF EELYEKAVDHFNSIDRAYVFDGFCGANPKSQRKIRFVHEMAWQQHFVTNMFIRANDESDL EGFEPDFTVINACSQVNEEFQRHGLNSEVAVVFNIEKKCAVIFGTWYGGENKKGIFSLMN YWLPNEGSLPMHCSANVGKKGDSCLFFGLSGTGKTTLSADHHRALIGDDEHGWDEDGIFN FEGGCYAKTINLSEKTEPDIYRAITKDAMLENVALKKEGNHLVPDYFDTSKTENGRVSYP IFHIDNYHKPQMAGHPKNIIFLSCDAFGVLPPIAKLSPEQAMYHFISGYTAKVAGTERGV TEPTATFSACFGAAFLTHSPTRYAELLREKLEKHGAQAWIVNSGWSGGPFGVGERMSIKT TRGCVDAIIDGSIENTTWETDPLFGWELPTSVPGVDEAVLKPRNTWPNPEAYDAAEKKLA EMYVSNFEQYRGKGVDYSVFGPKV >gnl|To_NUC_proteinmodels_ML|p173 XDLDQLMSLQPPSKENPFANTRVADLFREQARKTPDAIAVEMGDESMTYKELDQRSDLLS SALVETGVGPNDIVAMNLSKNVYQAVGILGVTKSGGAFLPMDPSWPLERRQYIVKDANCR VAIIQGQFKGEYDGWFDGRFMVVDDPVWDDLLQQAQSDESVLGEQPTFSNPNDIAYIIFT SGSTGKPKGVMVKHLGLMNYQYAMVSLSYKDKGWRCALSFNYYFDAYYYDFFSTICFEAG TIVYFENGLALDGMTEDDNIQCLFTIPSLVAALSSIPDSVEIIFVGGEALTGAAIDAVMK SSAKLVTAYGPTEASNMVCTRTVVDSNNITSLGKFFPNLQTYVACPESLALKAFGEWGEL LIGGDQVGLGYLNRPELTAEKFIPNPWGAGTLYRTGDLVRFGADGQLEIGGRIDFMIKFN GQRMEAGEIENALMSIDGVDEAVVLHRKDLDGPDALCAYVKPSSAVVDGGANLKANLVLP KYMFPTAFVGIDSWPRNANEKIDRKSLPKPVAIKKVALAGDESNWEARNEMDNQLRKIFA DTLKLDLSNVASIDSDFFDIGGDSLGSMKLARAIQKDLGVKLPIAKVMKCRTVAALSDEI SNMSKCSSGSELSMPPVVASHAVDDNATELFYPAHSGQSYILKLSHDYKAAKSTAYSCPM ALWIDGDIHLDCLEQAMVSMKKRQAVMRTGFMRDPYSSNQWLQVIKPCDNQSHTFIYGLE EVETEEQALALYREDCATDFGAYALTTGNLVRSRLVHVQSTGRHLLLIQIHHIIFDGLSH QLFWHDFSTLYAKAVSEKSGVSDDVVSSCLKVPQLEPMPLQLIDLTQWQLDTSAHPEMKR QRQYWRKQLREGRLPLLAFPEDKPRPNERTWKGWSIPLVIQPDVGRRLVQLGKEEKCTPF QVILALWSLVLCRHTGQEEVVVGCAFGGREQDAGLSNVIGFMVNYLAMRVEVPKDTSSGS IRQYLRSTRETVTNGILNGALPYNQIVNECLPSLNYESNRPQVFQTMLSMAGTDGWANHL DAGQFASAGAEITPIAQCTWDRCKSDVRIRAHQLSNGGFKGDIEFNSDVYSHERIKALSE SLVELATSFAFAADDDSVWSIPMKKYREPKSGQLARQRSSVRRKSSRRTSFLTRQASSLR SICDAALAYDEERRQSLICIGGKRNDSMSAILVA >gnl|To_NUC_proteinmodels_ML|p174 MTVPTLVHEAFHAQAASTPHAPCLVDAATGTTWSYREAQLRVLTLASELRSSGTTTDRVV AIYMDPSPRYVVTMLAALSAGGAYVPLELAYPTPMVEKVLFDARPSAVCTTLEHVGKLPA AAGEIAIVCEDDGAHHQDGGKGGFDQAELDRLLAAYRRNWSEDESAGPEAVAGPDDLCFV VYSSGTTGQPKDIANPHRAPAVSYDWRFRTLSDYAPGDVVACNVFFVWECLRPLMRGGAV LPVPADVIYDGERLTVMVERFGVTEILFTPSLLENMLVSVDAKDVNSRFGKVKTIYLNGE VVSLALRKKVIDCLPSVRLLNLYSISECHEVAALDLTAPDVDLSASDKFCPVGYPSNEFC YILDEECQPLPFGEAGELYFGGDMLARGYLNLPELTATRFVPDPFPTPTEEHKTRPALMY RTGDRARFLPNGQLEILGRCDFMVKIRGYSVVLGAIETALVDHVRLSSAVVVADGEEGSE DKQLIAYVVRDHSDKPDDTRLADFQIDGRNGACPEIRRAIDGHIAHYMVPSVYIEVESLP VAAVGAKLDRKALQAQTKDRRAKLRSLQLNYETHTVSAGASGPAGPPPASLGPVRKLAKY LRVPRDTSLPEVESAMTLLWESILVSDDTIALSSESRFQERGGHSLSAARLVSLVNKCFG SSLSAARLFRDNMSVQQCALEVVKQWSEVPDEEDHSALSSNGSSDRGSWTVVESAEKDAQ IVARVREDAVLPQDIVVRPCAPEAITTAGSARSILLTGATGYLGAFLLAELLRSNPVATV TCLVRSSDPDAVRKNCDRYGLNDVDHSRVVLEAGDLSLERLGMTQSNWNRVASSIDHIVH CGAMVSLTAPYDGKIRDVNVRGTLEVIRLAAECGEGTSLVYVSSNGIFPSTADEVFMENE GISCLPDRLDSNNGYGLSKWAAEQLVTEAGRRGLPTLSIRFGNIGWDTESAKGNALDFQG LILNGCATLGKAIDLPGFNFECTPVDFASKSLVQLASHAPTLKQGHVLNCTQDGFTPFRD IYNFFSASTGSKLDSVDFNLWSNSLEDEALNSKDETIGALFSFISGLDNCMAYLQNVPEL DCSTFDKVLGEAGSTMKRKGLVSSSYFENYFKSILPQNQRKSDAAQIDPSGTEAAAGPLA GKVAVVLGASSGIGRAITVALAKAGCNVGMGARRVEQLEKTKQLCLEECRGTTAKAVISK TDVTSLDDVKSLVAKTEQSLGDIDILVNVAGVMYFTLMATANFDQWERTVDVNCKGTMFG IGSVLPQMLARGKGHIVNITSDAGRKAFPGLGVYSGSKFFVEAMSQCLRSETANSGIRVT CIQPGNVETPLLGLSDDAEALKLYGEPTGAKVLEPDDIGRAVIYAATQPEWCAVNEILVE PREEPA >gnl|To_NUC_proteinmodels_ML|p175 MKFVLAACLAAAVSAEGLRKAEPEIESNHLDLELEINGERQLFPLLPGTKCPTGHTCRTR AVEGGVSPMINSLKRNIKTPLAMSMDWIDMNGELETVVMSDNFCTRRNAMAKAAGLAAGL SMAAVSAPAYAAQTVEVKMGADSGLLVFEPAKVTVCKGDTVKWINNKAGPHNVVFDEDNI PDGVDQEKISMDDQLGEPGDTFEMKFDTAGTYGYYCEPHRGAGMQATLVVQ >gnl|To_NUC_proteinmodels_ML|p176 MSRSPSSLLCPPWADGVVTPLHWGALIAIAATIFSLGTLVGYNLGRRRRGRPTKTKGSAK HDGVRATGRLPKRKNIHSLDGSTMVLNGSLELMASKWGGDEFQNPREVAVVEYLSPEEMS RMLFKSVTEHSSDSTLSLDESDRDLSPKSSATDLTSLVPTFDKSRSIPTEPEKFIELLGL IQKYSVNTSHPYFFNQLFGSLDPIALAAEIVALSVNTSVYTYETAPVFSLIEREVMGQIG KLVFGPTSGKQISFESDDKFEGEGLMIPGGSLANLTAMHAARHRWKVMNGFIKQATEDTE QMLGTDWSTFGEEKKSDDVFPTTAKMQGEDTETGETLCDYIRSVPDLVAFVSSEAHYSFS KSARVLGLREDDLVIIPTHPDGRMNVHELSKRIEEIELESASHMDARIRVPFFVACTAGS TVRGSFDEIEEIVKVCRRYEARAKSSEARSIWVHVDGAWGGSAMFSSRRHIRDITHMDEI RHADSFTFNPHKMLGAPQQTTAFIVRHRHALKRANSAGAKYLFDPRKNGAEYDLGDLSYT CGRRTDAVKLWAMWKYYGKSGLGERVDQKVDELQLFVDELRGRPSFALACAPWPFNVNFS TSLQGYERFWRLAGWSKTTGRARMQWCKYPMILHKTCQMFRCNLSCVCTRRAR >gnl|To_NUC_proteinmodels_ML|p177 MKLLSLLTAALVGSASAFAPAPRVQQSRAVSPVSMAAAEGELYIDQNRRNLMNLILVGSA AVTVGGLAVPYILFFLPPGSGGGGGGSPAKDALGNEIFSKAYLAEKKPGDHSLAQGLKGD ATYLIVKEDGTLQDYGLNAVCTHLGCVVPWSAPNNKFMCPCHGSQYAPTGAVVRGPAPLP LALAHLDIDESDKIVFSPWTETDFRTEGKPWWT >gnl|To_NUC_proteinmodels_ML|p178 MKLAIIAACVASAAAFAPMAPVRQATSLDYSVKVFNEEEGIDATFECADDVFIVDAAEEE GVDLPYSCRAGACSTCTGKVISGEVDQSEQTFLDDDQMADGYVLTCVAYPKSDCEIQVHM EDDLF >gnl|To_NUC_proteinmodels_ML|p179 MKVAACLAASIATASAFVPQSAPKFGTSLNLKQDFLEYTPYYDHSAVKVNTHKNKAPFTG KVVSTKRIVGPKATGETCHIIIDHEGDFPYIEGQSWGVIPPGTREKDGKPHAVRLYSIAS SRYGDDMTGKTGSLCVRRATYWCPELQADDPAKKGVCSNFLCDTTPGDELKMTGPSGKVM LMPEEDPNTDYIMVATGTGIAPYRGFIRRLFFEDTPAADVYKGQAWLFLGVANSDALLYD DEFQDAKARYPDNFRIDYALSREQENKKGGKMYIQDKVEEYADEIFNKLDSGAHIYFCGL KGMMPGIQDMLKAVCEEKGISYDEWLKGLKQAKQWHVEVY >gnl|To_NUC_proteinmodels_ML|p180a MKTAILAATLASAAAFAPAPTGPKTTAMNSVWDSYVGGQDFKGGEWKWDPLKLSETYEPL VPFFREAELRHGRTAMLAVVGFIATDFVRLPGEAFSFESIPTTVGAHDALTGTALQQVAM WIGVFDTIVTAPAIAATMKGERDAGDFGIKGPKNDPDGSKMKRKQVSELLNGRLAMMAVG GIATQSVLTGHGFPYL >gnl|To_NUC_proteinmodels_ML|p180b MKIVNALLLAGSAQAFAPQASLATSKSALYDAATETATEMLDSSRASQPASSAFCNGLVG GEGPEPMPFISNGERTSVNFDPLGFTERSPELINWFREAELKHGRQAMLATIGIVAPEYF RVPGEQFSFEAVPNVLDAHNALIDTSMKQILLWIGLFEAMTFAAVNNMNEFDRMPGDYGF DPLGLKPNDPEKLKKMQLRELKNGRLAMIAIGGMVVGASITGKPFPYL >gnl|To_NUC_proteinmodels_ML|p181 MKSILLAATAASCAAFAPSSTGKANTALNYAADLDNMIGVGPETGGRVFDPLNLSDYVPT DWARRAELANGRSAMLAVVGWVFPKVAGTFASTDVTTTDPIDAIMQADPQWWAQFILLCG VGEFYKYQQELEGKSFLGGAEPAADYLKLWPTDPAKQEEMRMKELKNARLAMIGIAGFAA NHFIPGACPVPDFA >gnl|To_NUC_proteinmodels_ML|p182a MSKLAILATAIVGSSAFAPSNTGVRSATSLAAKSTALPFLEAPPNCEGYVGNAGFDPFRF SDFVPVDFLREAELKHGRICMMAWAGYVAVDLGLRVLPVPEALEGVTAATAHDASVEQGG LSQMFLWFSLLEVLDLIAIKQMLDGSGRAPGDFGLDGGLLKGKSAEYVEDMKLKEITHCR LAMLAFSGVVTQSVLTQGPFPYV >gnl|To_NUC_proteinmodels_ML|p182b MKTALLLSTLSSAAAFAPSSTGRAASSLSAEKSQSIPFLPSPPNLEGYVGNVGFDPLGIS NYFPMDYLREAELKHGRMCQLAWLGYVAVDIGLRLPGYPEAMTGATSATAHDAAVEFGAL GNIFVWIAFAEMTSWIGISQMLQGSGRDAGDFGLGKQFLKGKSEAQINEMKLKELTHCRA AMLAFSGVVTQSVLYDKGFPYF >gnl|To_NUC_proteinmodels_ML|p183 MMKSALIASLVGSAAAFAPASTGKVATEVNSMMEVGDASDSTVGSDSMSASLPFLTKPKG LDGYIGDVGFDPLGFAEMFDIKWLREAEIKHGRVAMLATVGFLTSEFVTFPMFQDMHVDD SNLAPTAIGASAMFQIIFAAGFEEIRTNKGAITMETMFEDPDRVPGDLGFAVNRLAGKSE EEVNKLKLQELKNGRLAMLAIGGMIHHNFVTGEPLIMI >gnl|To_NUC_proteinmodels_ML|p184 MKLAIAALTAATASAFAPAQTDRSSTALNVQWTEGTKALPFGSSPDTLDGSLPGDVGFDP IGFSTQPFASFNNPLYVEGNFMSDLEWLREAELAHGRIAQLAVVGFLWPGLFGTLPGAEG YADFSELNPFKALSTVPESAIYQIIGGCAWFEYQRVVRIKEQGANRVVGDIGIGYPGAFN PLNLNYTPEEFAEKQLQEIKHCRLAMIAAFGLICQANNSGTDVISQLAPAFEAPEYAAKA GYFLPQGI >gnl|To_NUC_proteinmodels_ML|p185a MVSSTTLAIAASLASSATAFAPASTNGAAKTSLDAMPDRMWDSMVDKTERSAALPFLPRP INLDGSYVGDVGFDPFYLSSIPKNFAGFIQPPQWEDVEGIPTLYWMREAELKHGRVTMLA WLGFLATDGAFGPVPLRFPGAIYHDVANSYEAHNAMVEQGSMGFLLLVVGFVEFCMSGAL VEVAKGESDRAAGDFKLDPLQFLKGKSEEEVNAMKLKELQNGRLAMLAFSGVVTQAALGG TEFPYLVPYTEAGLPDVAVTW >gnl|To_NUC_proteinmodels_ML|p186 MKTAALISACVGIASAFAPAKNVSLVSAKTMAIVADMAGATMPFKAYDPLNLASIGSDST LAWFRAAELKHGRVAMLATTGYIVQAAGYHFPGMLSSDVSFESLSAMKPFDAWDAVPDLG KAQIYFTIFFAEIVSESKGTHYTKGGDLPTIVFPNVNFAPEDPAAMKVQQSKELNNGRLA MIAVMSFAAAANIPGSVPVLADNPMF >gnl|To_NUC_proteinmodels_ML|p187a MKTAIIATLISTAAAFAPNAQPAAQTALNAEAADRRSFLGAAAAFGAAAIPAAASARVDY ENVAYLGGGSTVDVNNANVRAYLRLPGMYPSAAGKIVTHGPYKSVADVYNIPGLTSAEKE VIKKYESKFVAKDPAAEYVIDRVNNGLYR >gnl|To_NUC_proteinmodels_ML|p188 MRSVIVASIAASAFAPVSQKAQSTQLNAWIQDQVVGVTPPMGYFDPLGLSYGKDDATMMF YREAELKHGRVAMAACLGWYLDAAGVHPAFNSALSNDPLKAAVELPAVGWLQFVLGCGAI EWLGQQIKERPGYVPGDLLGASYWVDNSDEGWVDYQNKEINNGRLAMVAIMGIIYQDVST GEYGDMMYRQLVQ >gnl|To_NUC_proteinmodels_ML|p189 MKTFQLVTLFALVACSLAFAPNKAPQVAQRVVESGFDMKKAAIAAIPAAIATPAFAMDSI VEALPTQTLSLEVQFGAYLAVLLGTLLPVLFLVNLYVQTESRKAGRMGGQDSE >gnl|To_NUC_proteinmodels_ML|p190 MKLAIALTLAASANAFAPTNMAGNHATQLNAMKDDMAKVGAAALAGAALFGATLPAQAIT KSELNQLSYLQVKGTGLANRCSEVIGEDTITPKSGSRLTQMCIEPKAWAVEEEIGKAGKT EKKFVNSKVMTRQTYTLEGIEGSLLSEGGNIVFKEEEGIDYAPTTVQLPGGERVPFLFTV KELVAKGNGGSFKPGFQMGGDFSVPSYRTGLFLDPKGRGGVTGYDMAVALPGLQSGEEGD AEMFKENNKTFDIGAGRIEMEVNKVNSEESEIGGVFVASQSGDTDMGSKVPKKILTKGIF YAKVE >gnl|To_NUC_proteinmodels_ML|p191 MARLSLLVAVAAVSSTAAFSVSENNNAVSRRETFGKAASLIAGAASIVASPASSLAEVTD ETPKVTTRMGGLLEKYQDSRGWTILAPSGWNSFDGEVGAYDKKWQDLVSQTDNVKISSTP VKSTTTSVDILGPVEDVGKSLASKRSAKLIAALERQTDGILFYTFDFALDDGTHQLLQLC VNKGKIWSVDANTLEKSYAKKKDLYYNILGSFMPKLN >gnl|To_NUC_proteinmodels_ML|p192 MVQARDGTAREAGAISLSQASTNFGHSTARLRRIKKLQFGIINPEELRQYSVTQAITVNG RKIPAGVTRYETYMAGQPVYGGVNDPRLGDIHDKSDPGYFGHIDLARPVYHYGFLDVTLK ALRCVCFHCSRITVEDTEFKFKKARGIKNRKRRLNAMHELIRPKKRCDHCNGFQPKYTKV GLHVEIEYADEMERMPGTSGDKKQFFSAQKAVEIFKRMRDEDAKALGLDVTWARPEWTCI SVMPVPPLHVRPSVVMGGGAQSSEDDLTHQLVNIVKSNIALRTAIKNGEPNIIVEQFEQA LQHNVCAFMNNELRGMPQVTQRSGRPLKTLTQRLKSKEGRIRGNLMGKRVDFSARTVITA DPNLGIHQVGVPRSIAMNLTVPTRVTPFNIHELSALVANGPTEHPGAKHIIRSDGLRIDL RYVKNKSDLLLANGWIVERHLNDGDIVLFNRQPSLHKMSIMGHQAKVLDWSTFRLNLSCT SPYNADFDGDEMNLHVPQSLPARAEAELMMHSPKVIVSGQSNRPVMGIIQDSLLGVQKMT KRDIFIKKDLLMNILMWVTDWDGRIPPPAIYKPEELWTGKQIMTLILPKINLRGKANNGG PKPDTFNVYDNQVRIMEGELIEGTIDKKTIGSGMGGIIHTAWLDVGHEDTTRFMNQTQQV VNHWILQYSFSIGACDAIADVSTMEQIANTINKAKLNVLDLVRKGQRGELETQPGRTMIE SFEQLVNRVLNTARDHAGKSAQSSLSETNSVKAMVTAGSKGSFINISQIIACVGQQNVEG NRIPYGFKRRTLPHFQKDDLGPESRGFVENSYLFGLTPQEFFFHAMGGREGLIDTACKTA ETGYIQRRLVKAMETVMARYDGTLRTSGGQIVQFLYGEDGMDAVWIEKQSFDSLTLNKVD FEERYLLSSADPDFGYDDQGIPFIQSDTIDDCRHNSQVQQVLDREIEALKDDQRILRVVM ANREAGREDDPSSYAPGNIKRVIQNSLRQFQIDKGQPSDLHPRDVVEKIDALLRRLVVVV GDDVLSVEAQKNATTLYSILIRSYLSSKRVLKEYRLSEAALNWVIGEIETRFHHAKVSPG EMAGVLAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLKEIINVAKTVKTPGLTIHLK PQVSGDADIAKMVHSSIEYTVLGDVTKLTEIYYDPDPMNTIVEQDREFVKDYYEMGEETQ EDLSRLSPWVLRIELNQAVMVDKKIKMNEITSMIANEYGSDLNVVVSDDNADDLVARIRI VNDMPMAHSIDENGNAMMAEDVELGQEDDVFLKRLEKSMLQTLKLRGVEDVKKVFMREEK RVKWDDERGFQRIDEWVLETDGSNLMSVLGIEHVDASRTISNDIVEVFTVLGIEGVRAAI LSELRNVISFDGSYVNYRHLACLVDVMTMHGHLMAVDRHGINRVESGPLLRCSFEETVDM LNDAALYAEEEVLRGVTENIMMGQLARVGTGDIDLLLDEQKVVRDAVEVVVDNLEGDKDL GLVGGGIGMPTPYASTPFAASPSMNQNVEMSPFVDAAGGFSPAVGGFSPAVSGFSPSYSP AGGSGTSGSYGEFGPSSPQYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSP AYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSP TSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPAYSPTSPA YSPTSPAYSPLSDPEDKK >gnl|To_NUC_proteinmodels_ML|p193 MMKAFVGVLLLASAVVNGFVAPSVSRNVERVPTHTELPMKVVSSIGALKRRSKDCQVVRR RGRLYVISKNGRFKVRQGGAKMKKRRKQK >gnl|To_NUC_proteinmodels_ML|p194 MLSRLTVVVSICIALSLGVDAFSPSLGRRLTSTSMSTLRTPTWRLASEAEDSTTAVAEQA QEDEADAEPVDEATRLQLEKQKRADELRAQEVFMKRTTGVYVCKSCDWEYDPEKGDSFLI GGMIKPGTAFEDTPSNWRCPTCRASKDQFEEVVETIPGFEVNQGYGFGTNSMTTGEKNTL IWGGLAAFFALFLGGYALS >gnl|To_NUC_proteinmodels_ML|p195 MNFRGTTTFLTLATPALTHGFRVADEEGASRESRDVSSPAAQRATFFPIVRRRHPFFRDM DQMIEEMDAMMDPRATLFPIVRRRPPFFRDMDQMIEEMDAMMDGSLAALQRPFAARRPLG GFDVYQDENEYRVSIDAPDIDVNDLSLSLDNDGRVLRLKGQANKKEGGMAISSRFEKAVL LAPGVDTGKITASISDGTLTVVAPKTHPTAAIEQAQVKEINIRADEKPIQAQLSLDEGQE GVIDNDDTTAAKLATKDSKMEKAEGKGGEDAEEKEKKWPARDFP >gnl|To_NUC_proteinmodels_ML|p196 MKSVAALALIGSAAAFAPAQTGKASTQLNAFESELGAQXXXXXXXXXXXLNDADQERFDR LRYVEIKHGRISMLAFLGNIITRAGVHLGGSIDQAGDSFDSFPNGWAAINGPDAIPGAGL AQMVALVGALELGVMKDIEGTGNEFVGDFRNGSLDFGWDTFDEETKLSKRAIELNNGRAA MMGILGLMVHEQLGGELPIVGAM >gnl|To_NUC_proteinmodels_ML|p201 MSAPPPFPQPTHPQSTASRPLSLAEKSRRWSKLQSRRYSHRRRFTALNSSGTPLAHKELL PPEHVRKILADHGDMSSKRYDADKRIYLGALKYVPHAVFKLLENMPMPWEAVRTVPVLYH VTGAISFVNEVPKVVEPVYLAQWGSAWVMMRREKRDRRHFKRMRFPPFDDEEPVLDYADH ILDVEPPEPIRMEFGDGEDDDSDALDDVEKFVASWLYEHKPLSEPVDYDDFEDSDDEDED SDEDGMVQKRDRGEGDVSGFKVPGGRYTNGPSYRTWRLPTPVMSCLYRLAAPLVSSHLLD PNHKHLFALPEFLTAKALNVAIPGGPKFEPLYRDVPEKEEEDWNEFNDVNKIIIRHPIRS EYRIAFPHVYNSRPRKVVVGGDGYHHPQLCYVGEDDENESGALAFTSYSSALNPILRVEE RDYRLESGRDTEMGDEDEFNENLDLPMVGETDDDYSGEIEDDVWLRHDELDDAEEDIDAP DGSIWEDDEEVVASSALRPLLSRRPLSTSRTGPGIALYFAPRPFHMRSGRTRRAIDVPLI GHWARERVSRDLNYPTKVRVSYQKLLKNWVLNQLHSRPDVRKSKKCTELDWVEVGLQVCR QGHNMLNLLINRKQLNYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLTREILRLTKLVV DAHVQFRLGNIDAYQLADGLQYTFNHVGQLTGMYRYKYRLMRQIRMCKDLKHLIYYRFNT GPVGKGPGVGFWAPSWRVWLFFLRGIVPLLERWLGNLLARQFEGRHSKSYAKNVTKQRVE SHFDLELRASVMHDILDMMPQGVKQNKSRVILSHLSEAFRCWKANIPWKVPGMPAPIENM ILRYVKAKADWWTNIAHYNRERIKRGGTVDKTVVKKNLGRLTRLWLKAEQERQHNYLKDG PYVSAEEAVAIYTTAVHWLESRKFTPIPFPPLSYKHDTKLLILALERLKEGFGLQVRLNS AAREELGLIEQAYDNPHEALSRIKRHLLTMRTFKEVGIEFMDMYSHLSPVYNIEPLEKIT DAYLDQYLWYEADKRHLFPNWIKPADSEPPPLLVYKWCQGVNNLEEVWDVDDGSGDSVVM MQSQLEKMYEKVDLTLLNRLLRLIVDHNIADYMTAKNNVSVSYKDMMHTNSYGLVRGLQF SSFITQYYGLVLDLLVLGLTRASELAGPPQRPNEYLTFADVETEVSHPIRLYTRYMDKVY MLFKFDGEESKDLIQRYLIEHPDPNNENAVGYRNKTCWPRDCRMRLLKRDVNLGRAIFWD IKNRLPRSITTLDWDNSFVSVYSADNPNLLFDMVGFEVRIRPLRSTAKSVMGGGATPTQT VHKDGVWNLQNETTKEVTAQAHLRVEEEAVQAFDNRIRQILMSSGATTFTKIANKWNTAL IGLMTYYREAVLNTQELLDLLVKNENKIQTRIKIGLNSKMPSRFPPVVFYCPKELGGLGM LSMGHVLIPQSDLRYSKQTDTGVTHFRAGLSHDADQLIPNLFRYLQPWESEFVDSQRVWA EYALKRQEANAQNRRLTLEDLEDSWDRGIPRINTLFSKDRHTLAYDRGWRVRTIFKQYQV LRVNPFWWTHQRHDGKLWNLNAYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKAS GFEESMKFKKLTNAQRSGLTQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIP TLKISLIQIFRAHLWQKVHESITMDIVQVFDQELDALEIESVQKETIHPRKSYKMNSSCA DLVLFAAYKWPVSKPSLLHDTKDNYDDGNTSNKYWLDIQLRWGDFDSHDIERYSRAKFLD YTTDNMSIYPSPTGCLIAIDLAYSLYSGFGNYIPGGKPLLQQAMAKIIKANPALYVLRER IRKGLQLYSSEPTEPYLSSQNYGELFSNQVIWFVDDTNVYRVTIHKTFEGNLTTKPINGA IFIFNPRTGQLFLKIIHTSVWAGQKRLSQLAKWKTAEEVAALIRSLPIEEQPKQIIVTRK GMLDPLEVHCLDFPNIVLKGSELQLPFQAALKVEKFGDLILRATEPQMVLFNIFDDWLKT ISSYTAFSRLILILRGLHVNVDKVKIILRPDNSVVTEPHHVWPTLSDEQWIKVEVALKDL ILADYGKKNNVNVSSLTQSEIRDIILGMEISPPSLQRQQVAEIESQAREQSQMTATTTKT TNVHGEQIVVTTTTQYEAATFHSKTDWRVRAISATNLHLRTKHIYVSSEDISEEGLTYVL PKNILSTFITIADLRTQIAGYLYGITPPDNDQVREVRCIVMVPQVGNHQSVTLPKALPNH ELLDELEPLGWIHTQPHELIQNGAQVLPAPDVVMHADICEDNKKAWSGQNEIIITTSFTQ GSCSLTAYKVTDAGMQWAAKNKKTVGGVANAEGYSSSCYEKVQMLLSDRFQGFFMVPDGN GIGWNFNFVGVKHNRNMDYALKLDTPDRFYAECHRPQHFLSFVQMEDNAVDEDDADIEDF LT >gnl|To_NUC_proteinmodels_ML|p202 XREKSSEEDVDDEAISKICSGKWYGSGDMISSRQKNLVAAWKTDDAESKVPERSVLDPDA LYDICIAEQNTLAALSADDLCYKCDEGCISPYSLVLMARLFLIDEDFADVSTLPQLISCD DLRSLWTPSVQEQFTSALDICVNWSVKLATSDGFNGTITIQDTTISVSNPCPLPIKFRPT LVDDAYPTSAENLVRYTTSYFATKKGNNDLEAMYENSENNKYDRSDNDMLAGVYDTTDED FYDFYSDAIVGRDMTLAIGSAGVTALAMLVHTKSPFLTMMGLFQIIFSFPLAYFVYYFVG GVFFFPFLNFIGIFVVFALGADDVFVAVDKWKNARNELPSGTTEQVAALALPDAAYSMLL TSLTTAVAFFGTAICPVGPIVCFAVFVGLLITLDYLMNILLVFPALCLYDKWKIRGFRNM CVSFNWCCGKKDDTSKASYEIAQRRSAEDARRGGTTLTTLQAENNERKELLDVEHKAFIH RMLDGYYNVLHKYRWVVLLVCVAGMCACIVVAAQLSLPTSSVVALLPPSNEYERHRLWSQ NLLSTELAKGNGGVASIAFGLTAADTGDHLNPDSFTTLVLDDSFNPSSEEAQKFLLGFCD RLFANDFASPPKNDYECPINRFDEWLKGQSSSESPSDEYVQNCTGASSVPVPSEVFNPCI IGWSKLVGESFVLSNEGRVKILEVRSRTTVVYDSPFSELEEEWNKYENWLQQERESAPSG VQKPFHSDIAFYWFDTNRQMLRTAYGAAGIALICAAVVLFVSSKSFILTLFAGISIVYVL VAATACLVGLGWELGFLESILFAILIGISCDFVIHFGHAYTMYSGTVNRHYRSNFALTHM GPSILAAALTTFAAAVVMVFTEITFFRKFAVMLFMTILHSTLGCFVVFIVLCDCFGPSQP KMGYYTMRSKLCGSM >gnl|To_NUC_proteinmodels_ML|p204 MFLTVLTLAPDCALAHGLVALCHSPNYNFKGEAYYESTDHPEEELAVVNEPGANVQDTAA TRGHYPYPSQQVAAQHAQMAMDRVDILKRAQRKRRSTASAADEEIDEEDNMPQPISDVEV ELLSAICILTCQPGIDPKEAEKAVGRPYADALREIHERYPHDAEVCSFFAESLMVLNAWN LYEYPTGKPISKDVAEIQSVLEKALEIHPEHAGLCHLYVHLCEMSNEPHRALTACTALRS KFPDAGHLVHMATHIDVLVGDYESCVRYNLAALRADKSIMRASPDTAGPESFYFGYIVHN YHMLVFGAILGGFEKIARDCAAELNEHLNEELFIRTASLRFARGLALANLMRIKEAEEEA DEFSKLRANPSSKLRILHNNNVYRLLEVDDPMLRGEIAYHSNRIDDAFELLRTAVTLQDE LNFDEPWGKMQPVRHALGGLLFEQGKVDEAESVFREDLRFHPGNPWAAGLSDA >gnl|To_NUC_proteinmodels_ML|p205 XGSATRMSYMFLRATAFNQPLLSFDTAEVTDMFSMFGGATAFNQPLSLDTAKVTGMGSMF CGATAFNQPLSLDTAEVTDMWGMFWGATAFNQPLSLDTAKVTDMGLMFGGATAFNQDLCH FGDNFSQFIDVRFMFQDSGCSNKNDPTSASGPWCAVTTCQ >gnl|To_NUC_proteinmodels_ML|p206 XTPIFGPGPDFYEPFFMQNGGSESYFLQKSGIGFPDGALCGGKTNDGIDGDKFSFLYLGM GESQDLTNLTLGDLESLSYDFSVLECPTSDADDICPEQFYVNVYLRESIDSANFCDCNMV FSAAGGGEENGGYTTVTVTPFTKVSSTFPCSGGDGCGGKTSLYEYLREFPDAVLGQGGFP TFGAPFQFNVGDTAVARNGIFGCYDNIRIGLKDEATRVYELEPEMEEEETFVVEGLPTTP IVGPADLAGSCGLMFDAEDCPYFFEPFFMQNGGSESAFSFIADPDSPLGDGALCGGKTND GINNDKFSFLYLGMGDSQDLTNLTLGDLESLSYDFSVLECPTSDADNICPEQFYVNVYLR ENIESENFCDCNMVFSAAGGGKENGGYTTVTVTPFTKVSSTFPCSGGDGCGGKTSLFEYL QENPDAVLGQGGFPTFGAPFQFNVGDTAVSRNGIFGCYDNIRIGLKDEANRVYEL >gnl|To_NUC_proteinmodels_ML|p207 MNSMRTTLAPLARRAASSAHSTSAKSGATIRLSGSSSAGAALNSRLPDVDPDLCRLIEQE KARQRSSLVLIASENFTSRAVLDALGSVLSNKYSEGYPGARYYGGNENIDQVELLCQRRA LDTFELDTEEWGVNVQSLSGSPANFQVYTALLETHDRILSLDLPHGGHLSHGFQTPTKKI SAVSRYFESMPYRLNEETETIDYDEMERSALLFRPKLIVAGASAYSRLIDYKRIREIADK VGAFVLADMAHISGLVAAKVIPSCFPYADVVTTTTHKSLRGPRGAMIFYRKGQRGVTKKG DPIMYDIEEKINFAVFPGLQGGPHNHTIGALSVALKQANTPEFVQYQKQVLKNCARLSDE LQRMGYDVVSGGTDNHLVLVNVKSSKAIDGARVERILELACIASNKNTVPGDTSALNPGG IRMGTPALTSRGFGEDDFARVAEYFDRAVKIAVKLKGTEQGKKIKGFREMCAVGPSVDPE LVQLRKEVSDFASSFPTVGFEEDEMEYEGEYNVDFVA >gnl|To_NUC_proteinmodels_ML|p208 MTDATTKALQEALVIARDNGHSQAEPIHLASALFAEDDGIGSRVVARSDAGCGPSSIIDV RLVRQGLSRAMLTRPAQNPPPHEASMSSSLQKVIQRAMALAKSNADSLVALDHLLVAIYD DRSTKDVLESAGLTKKVVEKTVQSIRGKRKITSTSAEETFEALEKYGIDLVKEAEDGKLD PVIGRDDEIRRVIQILCRRTKNNPCLVGEPGVGKTAIVEGLARRILDGDVPVTLQGVSLR TLDMGALVAGAKYRGEFEERLRAVLDEVKQAEGNIILFVDEVHLVLGAGKADGAMDAANL LKPMLARGELRMIGATTLEEYRQHIEKDSAFERRFQKVLANEPSVESTVSILRGLVDRYE AHHGVRISDAAICAAAQLSDRYITNRFLPDKAIDLVDEAAAQVRVQLDSRPEKIDVLERK VVQLEIESTALSREKDKASKKRRKEVHDEIANLREELEPLNQKWEEDRGRAEELKNAKEK LTRLEAKVASAERVGDYEKAADLKYGAIPDLKAHIETIVREEEKRKADQAEKMGGCEDED SLALEVVLPKHIADIISRWTGIPANKLTQTERERILKLGDRLKERVVGQEEAVGAVVDSI MRSKAGLARASQPDSSFLFLGPTGTGKTELAKALFSELYDGDERSLVRIDMSEYTEEHSV SRLIGSPPGYIGHEEGGQLTEAVRRKPYTVVLFDEVEKAHKKILTVLLQVLDEGRLTDSR GRTVDFTNTVIILTSNLGAQFLLDYDKTSEISRDLARKSVMSAVKSHFSPEFLNRLSSVV MFNSLGADQLGKICQKSLCSVKRRLVEQGIRVVLEKSGAEAIIDNSFDPSYGARPVERYL EQTIVTKLSKMLISGELESGYTVFIEGVDDEDDSFEVVEPEKKRAKTLSYRIEQFPRNMA LNCEGELFPESGAMEVDN >gnl|To_NUC_proteinmodels_ML|p209 MSSLSCLVVGAFVAVLLVLILVPLSFSYIDYYDYGLVQRKTTGAVDVDNVYTSGRYMLGP DKKFVKYQADAHVESFENFGVFSATLSNESIGLEFWVDVDFTFFLVEEEIGLLHKQLARN YRNVILSRAREGIKNAAAQDVTFTEFFQNRSEVELLFRKAVQDRWDVAPSLHCELDQFHL GRIRIPQSVARKQLESKVQNERNQREEFSQRAQIEREQTQVEVNKINLSTNKELRTAHAE ASLIKTKAIARAKLIKAQAQINGTSLLLEAAGIESQDHKTVFSYIKTLRDREQLSIDVSY LSEDNVLRTNPL >gnl|To_NUC_proteinmodels_ML|p210 MTNNLQLAHPRGGASANVVATATLNCDDGAPDESGNNSFGNLTRVDSTRSREEPSGVDFV GPAKAVKELFSLSGGFNSDENVSVALHNLGNGTILLESADDIGEEGEQGYSSPRKRNLRR PRPEWSSDGEDDGGIERLTDEKGSENLLKSLSLMLGEEKRQEAQRLSIRPPSDGVISTLS KSDERTAEALIIPPSNALTVSDQIQRQSGQSSADDDCDDALLKLNPPQHYMRHVVSPPPE PRQYLDWRFKDMNLVVASDALICNRDPNDDVEARGSVQGNATSIVVRVEDAIDLKAQLET FKQHESRKRDPVALLPSSYADALTASPAPKKVENVSDSSDEEAASDENDLRLKMMCIPAS NISPMWSDMGFSMISTNEPASADNGAFGPSPAAGATGAPPRAPVCTVLDTYLDNIMANVP QLALILQEHGYVQNIELLRTEDIPSLMMHPSTLGMADDSGGSKPIFSPEIVETNAAMLLR FLKTNCTSDSSTYLLHRNKGETSIQLFDISSISQTRQRKWVWWLALCSYRFACRLEQLQA NTLAPDDNVTRRDYRKRQRSLLYNTLNLLEELSDMDGGRHETISAAVCEHLADSYLWRTA DEDMTHQPQASNATGIQTYASSKQPYRNVNVDCLNKAQDHLNNALAFLMPLLSKAKEDNS PLEIEALTAQLYGIHHKIINVCLRLADHHLGSYSSSNLIGTLRTAAKTLADAASLLGTMS AFNAGGDNDDQLYGMSIIQQHAWLWEYCGHFGRSFAADELWRDRGHTSGYDLCSLFREVE AACSSKIMRKYLDGERSDLNSASNGQVSLTSLCGVVILPNDFEEIESSVLSKEGCQEAIR VSKSILGHQTEIKRDARLVLVAACLCYCNSIASHDKLAQQTNSEPDGDVKSVSSPTKRDK KATGREQEVNPLLRQRLGDACNEIGKALLNESRAVLTSDGKEAGGDANMSHVSAVLLNSA TFWFNQGLEQFQLAEDLRNLALLRCNLCMCCKIRANTNVILPGGSSQRPFSIAEQFLNEA IGHLESAHESLGQRDTDAACWDMVSEELASTLLVLGVRRRQNSVLGKTSNPILLQAISLT PGDERRIVEPMERSCGIYESLRTVRANHQAAAAHYQLALLFSKIWTIQRDEAKTKEKLSA AFSHFSSAYQYFFSHIRGNETTFVVLSLDLANLYSTVSGEECLAKALGCCLDTKEAFNDV DPSSLDQMIVLADNIESRVSKLLLSLVKLEKEAKSGQVDKYKGMYRHALGYKMHSAKSGD EADDNRKRFDRLYELLGLLGSLTK >gnl|To_NUC_proteinmodels_ML|p211 MIPLPDSESRSKNLPGDRFLNTTVLTIISSKLVEEQEESQSTTRKAPKSMGARVFDQMEM DRMNSNNNGTTSLVRIIKLQKWATLEKVMVSGAVDMIPFDDSPLEQRLAGTDRILHLACQ FQAPLRIVQLLSDRHPMSMRTLDDDGRYPLHIACARGAKPKVINYMLDACNLAVRWQDKS GKLPLHHLGESWRKGWKTVRNTRTQPEQESMLAVTSMLLFEYPESPIIEDNNETNSIEYA IQSGADIKVVKRMQNESRNSWRRMKKRNESMCHDDLRLSVRSSTASSIDLEEISGLDISM MDERNADFAPGSMPPLPSSLEEFIQSARMA >gnl|To_NUC_proteinmodels_ML|p212 MTVISALFVLMVECGEDAGKVGSDDLHSGNLQALWLRRIEVRVRASFIRRGHCDIKKSQI VCRSKIVCRSKREDRVSIKIAGRVSIEERRLCVNRRRETKRPIVYQSGERFYITNRRMMV RSDETLESKYAKDCKQRPNETQHELARLAIGELGEYDSDNESINELGFFAMEHIGGDELQ GGEIIVEETKNEPYGPEIIDCHDGPGESELVQNDATTNSFQQFFGKGVTSAADSAKKLGE INNHWTNEYVYPTHVRRMSSSDDEDSDANGQDVIASNQKRRHYGQWTHKAYRYPYGTCT >gnl|To_NUC_proteinmodels_ML|p213 MGDEEYISQPYLPTKSVINMRRLPRFGYGSDFTDVFSSDQDSQAEYTMGLVLLFAFLLIF FALWTIAIIVFKIMGENSGFLSGRPFQIPDDSCVDVKSLKRPRRVRITFLVATGFCMAFT FLFVVMGLTNVNNATTTMTESLQTVGDLISNTKKIAYNLRIVGSNSLLIRDAAVAELDMI CPENPDIAETVGIDITGITDQARSDLTDLATFINEGLAVLDKNLESTETFSKDLDNGLQA AQFWSWEFKLLSAGLFILPSFFVVGVGLAMLDIDFKTYTNALTYFFMPVFAVTIIVCYIV CAAMLPISATAADACIGGGVHHGGPDDTILTVYRNIRGVVDDDNFLLFLGYYTQQCDPKY YPFGFIASYLAQLDEAIRSTDEAGVTIAGNLDLLKQQCGREFDTVIRLINDMNRNLVLLK EQADLSLDLVKCRNINELYINTVHQAGCTYSVDALTWVFASSLVISVSGLIMIMLRSSYY PEEYVPSSCDWKX >gnl|To_NUC_proteinmodels_ML|p214 MKLALVISAVIAALSSDPGAFAEETAVAVSNGTKLRGYAQTIASNHATPASDEDEGPIDR YDSDEEDEGPSLESDVELERRRGRGRRGGGRRRGRGRGRGRRRRRGRGSFRSRRGRRFRG SRRFRRGRSFRRGRSFSRRRGGFRGSGRCIDRCLDRSNRSVRSCENRCGFFSSESIVDLF YATGGFSEPPTDEDLGLYLSATPSSISDVPMNDPSDEGEHDDYGDYDDEPDYFDDEFDDD GFEDSDMDDSIDGPQDMTKPAAVSLQDAQKDTTDDYYYDEPNDKDFY >gnl|To_NUC_proteinmodels_ML|p215 MPATFRRPFPHTTELTGYRRQRLDDQRPHPHGHYRSERDFYRVAASHQGFCGGYWTGEAV DRHDLVRLEPSQLQQNGLGLCLGDQIGVLYEGRWVAGFVGVIGDPFDAWSTMLSASTEVL EVNGEVLKVTGPPISSDLSPLVPSPSIRERTYGPDADGTYSNWKQFSYPPPDAPFIDITV SVADGQGMTGDRLELREFLADPRKWRDNFCCSFYVVRLYQPEWDTFLREIDGATVVINRE TKEVLRRLSTSPSQQEAFLQRMPVQRRYPEESVGRNYGRSGHRDGLYGSRSQREPSRNRW SASRMPSDGSCRQEAPRFTFSGGTFRGCPVGPNYKPASKSQKPVVELKSASKSETAVESE PMVELKPVVKSKPAVVSEPVVESKPAVASTPALASKPVVKSKPAVAPTSVVNPKSADGPV SSFLKFVQSLADPTPAAAVPKPAVKSPSVVQSTPAVKSKSAVSPKPAALRSPVQSKSNID STSLVAPTPVVELPPRPAAAPDQQDAIGFAEGWTSRTSRARREPSPTSDNLHSLAVAECV KKYVKVQPRHKPRPRPRRLRATSHASKKDGDQFHAARQTSLPVGTTARRQFDGRSLDRRD ERCRGKSGRPPTTRPPSATFMASDDEVFFNADDGQAYRLGALSMDPKLVDSFLHSSLTLV ADPVVESVVTPVSVPTTPRVRKPLSITKPSSVTSPTATPVVPAPVSADDIGDVQPPSSEL VIHSSEGVDAVAAQVVPAPERVATENRVPEGAVDSTVDPTSAASTKAAVTTSSRRASKAP AIVSEGACADQGTMAVNPRAAISRFFIESLRLPPWMSK >gnl|To_NUC_proteinmodels_ML|p216 MTTLHLALPLLLLILAIPRTHAADPKSPALSRDAFSALFLTEADRLDAATWDAAVDFEWD TALAGQAETGLAYPYLLCNSDPSISGYRRVLALQESANTTNFRTGYNSDEITCAIASVLP SDAVGMEGEFFQIQPWLPAMKLAHTSWQTIDDEIEAANSGNATYYLPSVDVIMCPQNTEP ENDTDAGEWTEDLTESLLEKIGGNISENFYITSAAYLESGLAEEEVAEVSSRLKQILDTY QFTAECNDVFSTRVTYQFAQATSASRDLSTVLVEFNTTGITDQDTGCIVTLVLGFAVLPT VCSIEPRAQTNIRNVEAQWITQSYKEDERPFFDTGLRGEGQVVAISDTGIDQNNCYFWDS TTQASLLFNPDARKVAQYIPYVNDDDYIYGHGTHVAGTVAGKRAVKGDIESNGAADGLAP HARIAFLDIGGPLGGMRIPSFDRVMSTGRPHAKIHTASWGSDFNSYTLETRAFDQYMFDE DDFLLLVAAGNTGDNNRLNTVGTPATGKNMLSVGSHNNGGNSAPRNSPGVDYVSGFSSRG PTKDGRMKPDLLAPGNYVLSAGARPGEVGECDPEDGSLPRLGLKKEGLNYMAGTSMSSPA AAGVAALVRQYFTDGYYPSGTMTESDKMENPSGALIKAVLMNGAQQVRGVDNTFIGITSS KLYDENQGYGRVSLADSLFLPGSTNVQLNVFDREVVMDGNKKTYTLNIVKSEECNYQNLS VTLVWMDPASLPFCQSCLLNDLDLSLARGGDEFHPNGRRNADRLNNAERVIVGASDGDEF TISVKAHNLIGPKQKYALVATGCFKGTEEIRREPTGSSSNGGSTANGDVPLTPGDSSPTP SEGRGYPSPEQGGNASSAITCRFGLLLATSLFLWFVIFP >gnl|To_NUC_proteinmodels_ML|p217 MGVALVEFLDQAVAHKPLDASDCTTLEFAKRELSRLRRVTAKVVKVLRDADNAVKQHAEH QNGFMKENSVLNGQAKANKADLLEKYRPKSTRSRSAPASNTDRDQKNPQEDEAGRRDSKA SSSTPMTGCSQSTEKRNHVKGRSKSRTKSRSKSKARNKKKKSNKKRKNLSVDLSYQKYIK QTYDKSPEDRDLIKSALKNNQLFENFRDAQLEEFVDVFSPESFEEGSTVVRQATHGNTFY IVKSGTLKIYVDTIIDGRKMETQVGEPYGSGSAFGELALLYDSPRVATIRASEACVFWVI DRTAFKGLQLQLKQADHNIKLEHLKRVKVGNQMFADVLDDDQLERMALATQFENFKDGKA IFKEGEIGSNFYMIVTGQVDVYKDGGDYPINHLGPDTYFGERALLQSDGGCRTATCVAKT DVECLTLSREDFTDLLGSLDEIMVGRRQSLSLRQSSGDLIAPKAADLLSDDLGGDQGFTS YTLDGLKIKGVLGEGAFGRVHLAKGRQDGRLYALKAQGKHHIATNGSQTKVISEFGIMTD LNHPLIVRCYQAFQDTRYIYFLMSLLPGGDVLDLLEDKIKFSEDWARFYGATVVLTFEYI HSRKIAFRDLKPENLVLDADGYAHLVDFGLAKQIAGKTWTVCGTPDYVAPEIVKAEGHDW GCDYWALGVLLYELVSGDPPFTNDNPSVIARNILKGKYETPRHFSPDLTALISGLLTKQS KRLGRIKGGIRSIRKHAWFGDFDFRALLRKEMEVPYEPKLGTLSDLGGKIDNSCWEDAPE SDWEPLLNPNYRQMTSAQGVARKLKQSLPTRPSKGMANLLGRSVVIVEEKNEDDGSY >gnl|To_NUC_proteinmodels_ML|p218 XLGSTPPLPTPSIPTTIEFTTSCEGDESSAALSQYRRPSSSLPIRVATSLEPTLPIPIRS APDADDLWNRREDEATSAAQFDFMTWQMYQRISTARRLRAFARGGGHPAYDPSHPVRHRK EAQEQEDLTIATADISEVLFDGKNGVEPDEGVFVLDLGV >gnl|To_NUC_proteinmodels_ML|p219 MMRERKELTLLCLCCDELSVEEVVASRPSGGSGGGRPSGGGFGGGGGSGGGGGRPSGGGF GGGGGSGGGGGGSRPEPTPDDIQSKLDEKCGEFDCDSVDPGSVDCARFDGIMSGAVEVSL REVSLRGGRGDRRKTMLYCGCGCREGGDADFAEAEGRVGEGSPSLFESTIADGEGGDTSD SMSMSEDSADSSADSDGEEAVVEFSNGSLGSGRPGGRPGGRPFGAFGGGGGGSRSDPTVA DIQAKLDQQCGDFDCDSVDAENVNCARFDALMSGTSGTSGGQVRRRERKGTMLYCGCGCR AV >gnl|To_NUC_proteinmodels_ML|p220 MGKKKGGKGKATAKAKADDDDWAFLDQAAAENAEAGKDEAKATEQPSASADDVKGESNND NNAGGADAAAAFLAAQGISEGGGGGDNKKKKKKKKKGGGGGGGDDKKKDDKVSAKGKLIA ERMRLQREEEERQRAAEEAERKRIEEIERKEREEEERIQAEKDRKKQKQKEKIERQKREG TYMTKKQKQQAALAKQRLEAMKAAGMEVPGAQKPARKSKAEMEAEKKAKEEEMARRKEEE RLAEEARLAEEARIEEEKRLEEEKKQEEDAKDDWEDSSGDEWDADSNDSGPALDDALEKR LKLASTNEDSDDDDDDDVDLIEKEKRKEQKRLAELGKKRAEQERIQAEKLAEMQAREEEM ARQDMIAAQKREEGKKRRLEQEKANLAARSKDDLRCPIVVIMGHVDTGKTKLLDKIRQTN VQEGEAADWCNLLREKTLLAQTAKLNETEKFDLTLPGMLVIDTPGHESFTNLRSRGSSLC DVAILVIDLMHGLEQQTIESLEMLKRKGTPFVVALNKVDRCYDWKSCKDSPIRDALKHQS EGTIQEFRSRAADAKLQLQEQGVNSNIYWEMGEDDWANSDFVPLVPTSAISGEGVQDILL LLCQIAQRKLWRQLMWCANLQCTVLEVKAIDGLGMTVDVLVVNGYLKEGDRAVFCTLDGP IVSEIRGLLTPPPSREMRIKSEYIHHKEIKGALGVKVIGNGLDKVMAGTPVMVVGPDDEV EDIKSEVMSDLTKLQEKLSTDKKGVMVQASTLGALEALLQFLREETKPPIPVSAIGIGTI HKRDVTKISIMNEKGCSEYATILAFDVPIDKEAREQAEEVNVRIFTAEIIYHLFDQFTRF MEELTEKRREDAAAVAVFPSICKILPQHVFNTKDPIILGVEVIEGILKVGTPLCVPAIGG LHVGVVTSIEQNGREQQTARKGTSIAIKIVNENNPTLTYGRQFDASHSLYSTLSRASIDA LKLHFKDKLENEDWRLVVKLKKVFNII >gnl|To_NUC_proteinmodels_ML|p221 XFRNPPHFLSLLSDGTSKSELTRRDAQYETEAVLEHLFYHDNVAPFICGRVIQRFGISNP SPRYVKVCVDAFRSGSYESGSIEFGSGRYGCLEALISSIVLDREATDPAASFDSSFGSIR EPLLRYIQLFRSMEYKSSFPVSSKIDSSFPGDFQIRLANLPPKIGQGPHDHPSIFSFFLP AYIPGSGPLATSMLHSPESLMLTGPNTLQLLSGMHSLIKFGLSDCDGGFGLNPNVGSCRS SRGSYSGRLYYEPDGSSLLEKARDLSLVLAAGRVNEMNLSKIVSACSAEPDVAQLICLQQ LVVSTAEFHTNTETTPTMRENHEKRKVDRESLRSNLAYKAIIVLDLKGGADSFNLLAPLT CAPIDVYENYRSIRGKTADEEGVGMPRSRLLEISANADQPCESFGVHENLPILKELYEKG EATFIANAGLLLKPTNNDSHRIDTPVNLFSHNDMSHEMKKEDVRNQHTGTGVAGRIATAL TEAGIISESISVDGQQVMFSTAGRGPPQVTIGEDGLPDFNEGEESIPNEVLLSLNGPVTP SSGQFAHTWSTNMNEAISQYEKLKILIDGAVSTEIFPTTRISAQLELVTKLMQSHQLRGA QRDIFYVSQPRYDTHQNVDERLVDNFSELNGALEAFIAETKVLGLYKSMVLLQFSEFGRT LTPNTNSGTDHAWGGNYFLIGGALNGGKVLGHYPAEFERSESNKITLTRGRLVPTTPFDA VWMGTSEWFGVPSSQMEYVLPMHKNFPAGTLFSEEDLFRSEEVLPVSGHFQ >gnl|To_NUC_proteinmodels_ML|p222 XEAAQLEIRTLEEMGAWEVVDRTPDMNVIGGTWAFKIKRFPDGLIKKFKARFCARGDQQL KGVDFFEVYAPVVQWSTVRLMLILQALLGLKSKQGDITCAFLHADVPEDEKIHVEMPKGF EQKGKVLRLRKTLYGLRQSPRAFWQYMTDKLDECGMKQSELDPCLFIGPKVICIVYVDDL LFWARDDADIVELAHGLRDLGVDLEEEQDAAGFLGVSIEYDSATKTMEMRQDGLIDRCIE ALGLDADTSNTTTRPSSGQPLPRDSEGEPASEEFNYASVVGMMMYLAGHTRPDIAYAVNC CARYMFCPKRSHELALKRIGRYLLATRDRGMILDPNQELADLLTVDCYPDADFAGMYGHE KSSDPTSVKSRTGFVITFGGCPVTWQSKLQTETALSTMEAEIVALAHSCRVLMPIIDMTM ELLSAFGMKLDSTTMKVSIHEDNAGALVLGKTLPPGFTPRSKHYAIKTIWFREQIVKRGI KLLKIDTKEQLGDIFTKGLPEPAFIYLRQKLIGW >gnl|To_NUC_proteinmodels_ML|p223 MVVEPITSIRENDIVLGRKGFALKHRGNQAYRKLVNLNKELYATCQKCKKIRITKSIVAT MRVNGGRFLQRIDGKTSMTPDEVDDNGDPVVYFDIGDKKAIEKTSQALREGQPKLLQRIA ARRQREAAQTAFAQLQTQQQNYLPSSEPQSPVFGHSSSSPTNSSLSATNFASIPPLRLER RRRGLIEPMPLGLSDPFNEPIEPMPLGRSNGLSSLSLANQSMPPLPPSTIPRSGAREVTP ASTIKVRVSPPTDLLQQFETNDLLMALIDRLQQR >gnl|To_NUC_proteinmodels_ML|p224 MVGGIGIAGGAAIATLHSVTGSADDFYDYRFETQQDPDDLASFYGGEELMELFCIFPIVG QMMMRNAHFDDQGNVQTLGFPGKMTVSMVFSDESNEETGQTDWFNKRERFTNTLFGVKLW DMVVNFGFRTKEDGTVECYHFGEYFHGSAPVLSQAMLLIFKLHARWVAWSTEHHINHFAF GSALAGDEDVAEEFEHASRANMPLFLLKNYAWSDLKAMVFGTYDSNKPPSFLLKERDEEG RQWDELRERHLGEMAKLRAVRDEIAATEELLPFQRKSIQIQIAEDIEDDRTVIKSMLAHR DTKGFSDVKSLLVKHHTIALKRRSTVRKGGTGEEIDIYKIAKVSETNDVLSRMRHNLNLN LSMTILRTLPLKGLK >gnl|To_NUC_proteinmodels_ML|p225 MASDEDSDASSSSSSSSSSSSDEESVASQEQQEQVEREDVNDEREEGRDGYQGDADDDRL KAFLSRIPQTFNEDSVKRLLESKFGDGCAAEVSIVYEVPEEVPEDEGNPGESDAAGNNNG ETAKHRGFGYVRFSTIEQQQAALEAGTVRGKAKENSKRKHTLYIQPVVREHEGSSAENRN KNICFLFRKFRCPYGDQCKFEHSGEGGCLTKNNESKTKKEDLATRTKERKNRKKEAFANS DVCKKARDKSEIHCINWKNKGKCRKKDCPYLHDEAVKEKVLQKKQGGSKKRQRVDKDRQP LSIRVFGMNYETTEANIRTFFKDCGPIVELTFPLFEDSGRSKGYCGVRFQSPKAVAKAVA LNGKELQGRWLSIQEGKMFLRKWEQNETLRREDQKDASLDCGEERGKPLVGEFGQKMKKR KRHGHKVFP >gnl|To_NUC_proteinmodels_ML|p227 MIVGVTLVVALASAVTIAVATRTNYWNTDSLQRGLSTSKDYDVVIVGGGLAGISAARSLA KDGFDVMILEAEPSLGGRAKSYYALTDGMYDRPIPTDLGAEWTYSDYSTLESVLEQEQLF EYALDKSKEVEKYYMQTYDEATGELAKAEEFSKSSYSRVWKKFKKFKSKMTKKQDMSYEA VLDAFLESENLSNDKRQYMNLILAMGEADYAGDDLLQSSREIEYYFQIPGYHDRMHYYPH RGLGGNIELLGRTLDSDVDISLSSSVSEINYEDSDQVIVTYELEGEQLELTSRSVLVTAS LGVLKSGSIGFSPRLPVRKQRVIDNMGFGTLNKLILYWESDSAVVWPLDTGWFMLATADD ESSNDFVTVFNPTKEKGVPCLVLWVGGFDAVLKEDESDDEILRDAMNSLTAMFPSISNPD TVFFTRWNSEVNFRGSYSFATVGREFASDAAVLKESIGGLWFAGEATNEDGWHSTTVGAW QSGEDVAKSISKSLK >gnl|To_NUC_proteinmodels_ML|p228 MAATTTEDSTIQLDGTDPPADDASGDAAKAKDYLLAKRALTAVNFFVETNRLEPYAAVYL VDKMGWPKVDFGIVSLVMNVAMVVFQTPAGDLLDKTKRGKKIITTAAILVAAFTTVMVVW TSNFWAILFGKTVEGICSTIFLPALMSLLLGISRNEEEIPRFIATTEVSNKIGSFIIVVT CAAISYYTYPDVESMFYLLGAGGLAAAFFTFLIPESAIDHDKARQLDDSGSATDTDELDL EESPEEAPEEAEGKTKASPSRYRDLFKNRSIVLFAVLTFTYHLANAGPAPLLAQYVASIT PDDTSLTWTSAIMILWFLPQAATSFLMTYAIDRFDHKRIMTVAFVTVPVRCAAIALLVEF SNSPWALVSTQTLEGIGAGVYDVMIPIVVQKMTKGSGRFGFAFGFIVAMWRIGHGCSVLL GESLAQSFGYVASFTTLGVIGFLNLVVFVLFFNFDQEPSPVKEEKEIEDRRTLSVTSSSR DFGRTLSVSSSSRDLTRCNSSSRNFGRTLSVSSTRELTRSNSSSRNFGRTLSVSSARDLT SSNFYYKARILHFCRQLF >gnl|To_NUC_proteinmodels_ML|p229 MSDSAEQSGLVVTEAVPADPSAVEGGGGDAKPKLSPKNYVNLAAYAANIALVYGIGNAGW LGTPTNGDLSDKYQTLVTPNSSAFSIWAVIFMFQGLFAVVQFLPRFRSHPMVADGLSWWY NAVVLTQIGWTISFAYEIVPLSLAFMVLIWLSLMTILYRQYYAKSDGTLLEFWLLRFPFA VHAGWITAASALNVNVQVVYMGEPAYVQLSVAIVSLAVLHAVSVWVLFNIPRPNWTVACV LTWAFGWIYNELSDPLALITETFSTDTILGVQNAAIAVVVVICSQMAVRLGMLFLPSWNG YRKVESDSDSATQEEEPMKEADV >gnl|To_NUC_proteinmodels_ML|p230 XEGWYLIFPYDTPGVSQHPMSRWENRAVSPNVVYVGRYGDSVAYRDLPNELKTFDIAEVY EPTPDIVTGGIIICGSESEVGNDPSMREVFNVLSDVESADFGLGTSSQRSIVFYDAVRSA KDQLRQRVAWALAQILTVVPINIDDSLDRTEIYLKFYDIFVKHAFGSYLNILREISYSPL VAEHLSFLNSKSHAYIYRTEDGRIARADELTSYLSVGLFVLNDDGTVVTDSQGLPKEIYT NADIEEFAKVWTGFLRNDVRGNYEEMRQFANYNRIDPGSYPLCEFLPDKPHLKAGAKFRL IGGSSLPLYMNDADQYANDEANGVNRVVITASSGLYDHLKSTNGVYKLSVTLESDVSCPP NSDEVECKVEVLRVVRIDDVYYEYVERACVQLSFYEGKLVQPRNSEFKGKFCANKELTVA REACCVQENFRQVRTNAVLAQNITHFYDGERMKWSTAKERCIEYGRDLCFWNRADFQPQT RNADDARIDQWKHNYQWTSEDCSINLKIRPEDGYVALVHNVTNTYGTSQTDFHIDEEKSL NWFRVFWGGSGEYPGSNSSNTCSDYGCRVSLDGHSCYCKAAVEGSPFFSSLDNITAEGIL SNLFIGTFGPPDASSSLAIQAEGYTAYVVDNVVDENTVFLVAKNGRDYYLRNLVSTVTVE GWQDLTPQIYEAEDAVLTNVTSTSAAGASGGHYVPRLVPSDTSGIGWAIEWNVSVASGGL YYVQFRYAFTGNEAGANKSGPRPHRLSVNGVDIFKEPLNNNTETFNTIANYNSNQDITLG RCEGRCDKVKCGTGLVCLVKTATNFVPGCNGTLVSSGTGSGLARENWGYCASVEDFVDVF TPYSTGSDTASWHLSDKVLVNLVAGSNLIKLSMPGTNRYSAGIDYLRVEGLPLASSASFR NPPHFVNMAIDIDSDELGEVNLRDVEHETEAVLDYYFRHNNLAPFLCVRMMQRFGFSNPS KRFVKNCVDSFRSGSYSLGNHIFGNAEYGSLEAMVAAIVLDREATDPSARVDPSAGSVRE PVLKVMNVLRSLEYKGTPPALEDGSLLQSTYSTRMRNLQAKIGQGPHDFPSVFSWFEASY VADAGPTTAARLASPEAVMLTMPNVINLLNGLLSLVKYGMSGCNGGFAEAYAYGACVDDG SYSSSIGYLSISLDEANTTAIAQDLSLMLTGGRLSDENRIMIEQACENQPNNSTSRCIMQ LMMMTPEFHSTNSVTRSGEARSISDVNNAAVSEPYKAIVYLYIDGGIDSFNLLAPHTCPD TDLYNEYRTVRGLSGESQGIGMELNNLYSIPANNVTYQPCTHFGIHEDMSLLKQLYDVED LLFVSNAGLLARPVDIDNYKGETPVSLFSHNGMRDETKRVDVFDMFVGTGVGGRIATVLN RRGISTNIFSIFGSQAYLTGIAPDGPAPYILSTQGLSTFNVDPTLSSMNDIIKDLNNAST SDSGWFGETWSTKLSEIIVNQGTLQTLVENVTVNSPFKLEDEINEIADQLELVTRLMQTA NERGVGRDIFYVEDSGYDTHLDVKYRLSLKLPPLNRALDAFCSELKTIGLWNQTVLIQFS EFGRTLAPNSGGGSDHAWAGNHFMLGGSVKGGKILGAYPSELREGDTEGRILSRGRIIPK FPWDAMLQGVAEWFGMNTTDLDTVLPMRNNFPAELLYNSTDLFIG >gnl|To_NUC_proteinmodels_ML|p231 MSSGAGNIIQLEEGWNNVIKKGAIDVLEKTLDDGFDTCFEPKEYIRIYTFLFYGRPHDQW VRCTAANPPSPKTSCAACIKINRTCYDMCTQRSPYNWSRDLYTRHGETIEQYLRNTVLPA LQNKTGQGGTILLQELKHRWTNHQIMNKWLKKFFTYLDRYYVKHHSLPTLEQAGLQHFKA EIYMNSKENSTSAIISLIDEEREGEIIEKSLVKSIVELYESMGMGSLDAYTNDLEQPLLE GTRSFYGRKREDWIAKDSTPDYMIKAERALGEEKARVTDYLNPATEPKLRRVVEDEILQK VQTNLLEKEGSGCTVLLANDKTDDLKRMFQLFSRLDDGLQPMADIVQKFITSQGEACVEK RESRLKNEKDKNDDPEFVKSLIDLHEKYLGVIRETFASHHLFQKALKNSFEEIVNHDVGQ YSNADLMSTFCDRILKSGGEKLSDTEVEQKLDQIVKLFSFLNDKDVFAEIYRNQLAKRLL NQRSASNDAEKAMIAKLKLQCGTQFTSKMEGMLNDLAVGAEQKSEFDQRMEQLDTKLGFG VQVLSNGNWPSYQAPVVQLPPQMSKCMEVFQEWHDKKHQKRRLTWVHSLGNASVKATYGK KTYDLQVTTLQAVVLNAFNDNKSYGFNELKQKLNVDDKTLKPIMHSLSCGKHKVIEKSPK SNKIQSTDKFSPNPKFSSNMRKIRIPVATLEQSHNKNRVEEDRGVAIEACIVRIMKARKT LAHQQLIAEVLSQLAFFKPQPRVIKKKIEALIDREYLERSQDNSQQYNYLA >gnl|To_NUC_proteinmodels_ML|p232 MVRGKQLVLSLLLPPLALGDGGGAYFPDLTAEEKSPYTAWLSEASGRYARHGFLPSSGSD DGSDGAAIFWTIDEDGSGDGNGTASFAVASGGVGRLRPERGGGMRGSDVAIYESSTGVLT DAHVVDELAAPVADDCQSWDLADAAVDGDGWLIVEMTRALDTYDAQDHPIRDDVGATVPP TRLIAAWGDGDSVAFHGTNRARGAYSLHSDSVLPEYDLLLKRLEEESDGYFEIREDEHEV KAEDTEYHDVCKTADELGVEIPEGNDGITMIGYVPVIDEDTRRFVHHFVVTSTEDCSDGG DFDALGDTTLSAWAPGDTGTMFPDNVGVQMFGRGKSAVNLNIHYDNPDLVQGKKDSSGMR YYYVFNKREHNAGILQIGDPLVMTPGAISPGLTSYSYSCPGSCTEEVLDTPVTILVESLH MHTTGVRMTNEVKRNGRRFHLATSEVYDFDQQGSFAVQQQPYDLMPGDSFKTTCYYRDGV RFGLSSQEEMCIAFVLYYPEKTISGFGNEIPWMCAYTKGNIQLPTRCAEELVSADIPDET EFTCGDLSGFAVYLDGVSYTCGQISQFSSFLKAEDDGCGDIRMAEPLCCPSDDGGGTCSF CKGLAVDSEKPIPETEFTCGDLSGFAVYLDGGSEDCANIMLAEALCCQANEEEKPAGPCG FCSEGLTADADTPLPAASDGVSYTCGQLAEFVGLYNSDDDECKEMLLGEPFCCPKSDSCG FCSEGLTADADTPVPAGSNGISYTCGQLYEFVSIVGSGSDDCNEMLLAEPLCCPSSNSET STLDKDESADTTTDSSGPCGFCSKGLTVDAKTSLPVAPDGVTYTCGQLYGVVNFVKSGSD DCNDMLMAEPLCCPSSSSEEDTSEPINSTDEFDERADSADESDPGDQDVVQSDPANIPED PVDTNEDSDASSLSAASLASILCIAIACLLA >gnl|To_NUC_proteinmodels_ML|p233 MTSVGGGGLSLADACDYVAAEDFSGGQTCSPCSCQTSVPTNQPSTATPTIPEMLYCGGCE TCTESIWNSPATALDNQETYSCGSRISYLINSLSYEEDAACSAVGTANYAEGQPCAPCSC QSPQVLNDPDPAMLIFSDEFETDGAPDATKWTYDLGDGCSVGICGWGNDESQYYTSSPSN VHASNGVLSITARKENGFSRPYTSTRMVTRGLASFKYGRVQFRANIAKCKATGTWSALWM LPETWQYGGWPDSGEIDVMEAVGFETDLFHGSVHTRAWHGGNSKSGSMSAPDQDWHVFEI DWQETTIRFAVDGLVYYEFDRESTTDKTRWPFDQEFHLIMNIAVGGAWGGLRGIDDAAFE GDGQIMEIDYVRVYGSPSLF >gnl|To_NUC_proteinmodels_ML|p234 MMKYYALLLLAMLVRRAGAARRRRVLRADDISLRSIESSEFASSIGDALVLGREVHQDVL GGTERCSSPSKAWLLEKSSSIESEDLAYFFDNHLQSLPFAYKLYVADRDEDQNFGSNGKY TEQITQIHSKSQDFWRDSGAEDDIRLLGAHGEDLADRDKLIPTLKMIFGRSYHNGYTVEE HATEIQDLIERLPGGYSNPLLTFNAFFSNDAGAILIGDGYFEFQQSQGLESVGPSYAVTH EHAHHLQYALYALSDFDSDDISSSDSRKKELMADALSAFFLYHDSGGDLYGAEIDRIYRS AYSVGDCEIMDEGHHGTPNERRCASMWGAELAASSDASIIDLLELKSRFNLWYDGLDYHC TTIDSSSASTLSRTTVMTLIMAGTAATWLGL >gnl|To_NUC_proteinmodels_ML|p235 MKSSRSLLSSVALLAFSLVEADPTRSIETIMCDTQCDQVAESTDGPDGCMRYTTPLNECY NAARLFPGDESWSDVDIIDTIVMKSLNRRFFTSKDSTCQNDKQNTAIQPVDGGDSFVIPF DECVGPFGPPRPWGKFTLIESSEEKHSDEDAIVSVS >gnl|To_NUC_proteinmodels_ML|p236 MKLYKMAQMPLAMLLLTGSPMFSAAKSVADYVVPCVDDSVDFDVKLNFDDGSSYNSACSR SYSRCPDGSCLGGDDDWSTESNRSFDYLQTHTFAADGGASPAGVYVNDLTGRSACARALY MDQYRDFARRGRGARTWGTGGRTARGPADPLQITIVPFNSDQADCDKGNFDFEAERTVAV WVSAYRGQDTDPDDGSVYCDLSQNPEDCFKTDPPLSLNFAYWTVEHTNWGVDEIVLHGTL ATVLDTSPPAVNIYLPEGYDLNMPITNHQLRFQIADGSTQACQDYANGNGHSTCMVNKLY SAAEMPNTPLADSKILSKINVSGLGRISGWDAMEYQRSKGYYLNCGESNKYCFNNVGGGS GSGVALNDASTNWGDFSQWYMSGRLLDLSAKPPGYEYGSQMFAIDVSGIETSWGSKYGYS PIQLNVADEDTYNYPTKVFDFKYVGNFNDQGDGIDTMADGSFTAFAYTQTNDDNIKLAAK DNHRKHSTLLQGRAGCAINLGAYGWGNIDGSVIEGVYVHRILHASWDSWNGCASGMDSSN GNGYGGLICTRSCAMDDGGLVDTTITGLYVPELADANSVSRLFAIGVNGNGPFCPGTTST KYPIRNLVIKDSAVYPNPGCMSAMYDDLGIVEWGYNKWPSVRFFDEDASDTQKCDFQGTL NIHDDPQYFVCGFSDQTQAAKYCMTTDGVGGSPNIEYSIVSGDNPNVVFPTCGYSSSLLA VA >gnl|To_NUC_proteinmodels_ML|p237 MSNKSSSVIVKQAKESSRLRISPQSYELPLTWLLKQVNLDDMASQFKVNTSRESTKTPRR NDDVDEAIEFSGRLSSWIGQIHTGLYAGLARIEGQHNPALEKDRRRSNMGDLDYLEDPWE DPTFNPILPLMEDRSERSSETEDEVVPGCMIKIEAQSHEGNTDRNVLLSSSDLTKLTNEH ARSLQGVLETRAGAFDASEYIVTTIAEIELGAVLRNLRSLSIHYEETMNYVESMMERQLV AAIGKRLTQDDLQTYVRYHDARLLSPAPEPFSLSIRQPEHYPTGLISIETVDGKNDCMHS HSRQVNVGSTLNIALNSATLVRLSGNQYLHGWMNHRFGADKKNHQLVARARQFSAFILVI GTMTDGTTLDPKDAIIVQDKDELLIPLLLEEIPTAKEFKDAIKSLSPEQQRFAQAYRSMQ LSSSVFGMCIVQIKPQLEALLGLPADALDKEMKLTQDLMELFTEYQVPSDLLSYSGHSES AALEDKIANVKANVKAVMDVIESQKEKQLKDERAKADMSTVAFTSSRRKMTRIFVKTLDG NTISLHVKPSETVDSVKMKIQYEIGMPANQQRLIYLGTQLENGRTLSDYNIQEDSTLHIV LRLRSGPSSPSGDDYAAPPCQSSASRVKGGRARQSRQQTLPPVSIQSLAMNDAAPLNGAS HAEYHCDDAIDTLRAHGINGATIKKGTRHKGATGQSEKESSRNDIGGLGITSGRSDQSDS ITQEGVVDFTLIPKQLDYSVEEGKGVAVSLRSTTIKTSKSWTRNRQANLLSKPEKQGLDT DEMRREKSKAFDLLDALSRSGSLTIVHSELHVVVAMTHCFDKDLMGTVIRDNVNPIEKLE ISTLLLASAVHGVSARELVRDANELRRLRM >gnl|To_NUC_proteinmodels_ML|p238 MKTDNIRDSITIGVPSSYFSEAGSMTKDDSPVSDSRLARAKASLIGANFFVETNRFEPLM AVYLIEFKNWDVVWVGYVSIIMNILMLVLQTPAGDLMDKTNHKRLITAIAVLTASITTVS VAWKSDLWFILIMKALEGVAATIFLPALMSLLLGICESPEVPRMVSLTETSNKIGSILFT AGCGVLSYYAYPDVSFLFYLLGAGGLVATLFILAIPDDAIDDERARAGNTKSEDELIQDA AAQLSSGDRREEFANQSRATRRSVRGSVRSSIKVVQEKHSRYRDLLQDRNVAMFAVLTLL FHLFNAGVLPLLAQLIAQEDIRTGLAFTSGSMCITYIVQAPVSFIIGKNYKRFGYKNILM LGLAILPLRCLNLALVAMYWPNQYALAATQIWEGVGSGIYDTLLPLIVKALVDGSGRFGF TFGFIVTCWRVGHGLSILVGEAILKAAHHRYEVPFFFSMAGGIAVCALLAFGVHIPNPPD DDDADENDPNKKCDIELGGTTPLDLSDTTQALDLSDKSQALDLSDTTQALDLSDSTQGTP RKPSDCDGVLQALSGDELSWLDGVPPNKVNGPTKTIAVGSNI >gnl|To_NUC_proteinmodels_ML|p239 MGANGHPAERKFPPTAVVMSLEINSNLRGMADRAAAPRRTWTREHKPTSTSIKRWQRNMK ARFMPKKVHPMEGPCTSSEKGDPNQASSKSGSSRVNRYKHSDASVLRWHSKPSSNAPQAQ SSLQSSTARTSPQSSRLSGTSEPHQTHEISSRRSKSTNVDKKIERRRSSNARRPTTEGLT ESWHFSNWDDTAYRYSDSNSACSSQNSLYVDLDDFDDELELEPLEKKRKNLRTVLQADEE SPFENSLRSLNISERNPLDMSLSERQDDSYNKEEFSDLVLSFLSSYQEQP >gnl|To_NUC_proteinmodels_ML|p240 MHSHDYITPSIDESQDVQLIFAQQDDLTGETAWGVVLPKDSCDESDYHIDDKKRFMLWAY GQTHDFYFHSNNRGQFQANLLAPPPEPVSFDEYDKMALTMPNVPVVRGESGMDPTNPFIC SYFDLDVMGKEFGFSSNDKIHMVGYDVDAESGNEKYLHHMVLFECDGELMDEQVTSETRS GLGNSGLFHGKVEGSCTAMPNGCQNPVATWAVGAGANVMPEDVGISLGDGHRWLVLQMHY FNPQLDEGISDSSGLDVYFTKELRPVEGAVMFLNSGAATGQHPPIQGGLEDVTMETLYVE PECSKQWSEPITVMSVHHHSHFMGTRQEIIIERDGKNLGPLRDEMNYDYNHQTGVAPEAH LRVLYPGDRIATTCHFDTTSVAAYSMVEIGEESNREMCVPFFYYYPKQEAGAFQSYYPPE SYSKIVISDYTWCSTPPLEEMGSFGSRCAEKLYADVPAFYKQTFEAYGYDGPSFTFSEMC NGGEETKDLREAQPICPKDCSETQSCSANELVAAVKQACQATCGSLGLSLYPDNSRTELF ETANIGCPTPLFDPPTLAKPEKCVAKGALPISVALTEVSDNSNDQEPSLVDDQNKGNDLG ETGVVMPDDPTRNSSSRALGVVFSLGMSLMVVLSTVIE >gnl|To_NUC_proteinmodels_ML|p242 MSASLAPPPSEADCDELVKGLAATCLDDADVDMLLDGHPSHSVSQDGSDVATPEAGFFID GHTSPRSSRGGIDQQISMPHSLNRPQHVPSWPPSAPDKIIVQSGSTSYQSAAVVSPTQAL AQPPHQQIADDSCPASRAADARIEIGARNEWIIRDELQSGRKQLHPEEASQIDDELRPEP QTKRLDLESTQLQKLPPKSKEIFEAYPGQSTNTEAIPPLYRFTPNSLTGNLLSALSQKFV PNRSQTTHGGSALAGFSALLLVTCANYMLGPMRDAAALAVGVSHIPALTLASTVLALGSS VPVGWLFEAPDPRRRKTWKRMGLTRGETQGTSLALFYRVFAFLLLSYALGFKMVDMFGAK SGAMTSRGDEDITESTIAVVWTFFLRLGQRVGMPMDHFVVSLENLASRIPHIEQYTSSIH LLSTWQNFQDVSIPSHFVYGFKSVISQFGSIIYIMFFLVVHLMKLHSLSLIWGVTTEAMD YEENAELRQMLEKEKKSSGDLTRNTGSGGLVKLGGEQLDESSDRNNGGRKQSKSQLRLKR LAFVGFGGTLGGIIGSVIASFAAEVLHLSGLLVVAAILLELSANLSIELGRIMQRHWEEQ MKYQSCGDLTGLLSGDSSASLNSKNVDSSMRKSASMSSMKRVASGNFSCGNLAKLASVKS EEGNSDTNARKLHSRSVGNMAQAIEEISKSNETPEQPDQLSIDDNSFKQRLLRGITTILR SRLLMAIFTYNALYASTSVLLSFQRAELVANRSTTGTSAVSNTAFLAKINTASSVAVFAL QASGLGAFIANSCGQRGALALMPIIRLCGVILLGWWHVISNGKPPDLILFLAIDECTKVI NFAIAKPVRENLWRGLSFEAKYEAKPIVDTLANRWGSGSAAFLVSFLSRITDLTGLGVVN ESGERTIAGLPPTLILCVGISAWWAIVSADVGYIRSQIDHELKRQQ >gnl|To_NUC_proteinmodels_ML|p243 MNTLAIALLSLSSTALALNANSCLPKTTPLPSASTHSALFRGKFQKAVISPSHGSKSQAL YSAAAAAAAAAADYSPTNRDVLTKLPGSFWSGAKSKYVLDFDWAVSKGMGWRKRIAAAFK FLFLGGLWDTIVSTYALLMRRIPVTIYDARAKEAEEGLSKAEFFDKYGFVLLNSKSAMTA EDWVASERDVKGELKDINKISVDGGAAHRMRMDEFRNEDTPVKRIYAEEAKDMLKSILPR AKTIMPPAKGIRRDLSGGLLNGPAKQVHNDYGLVFDEVVERNPFFDFDKQRAIYEESKAD EYMLVNFWRPIKPMSTPLRSMPLCFLDSSTLGEDDFVTVDNASLGLATSLKDNPGHKFYY YPDMTVDEVVVFKQFHQFRNETKARMPVFHTAFPDPAADKDTEGRVSFEYRVGLMA >gnl|To_NUC_proteinmodels_ML|p244 MAGAKRAKQSREAMPVLLSIFGESSSTIGIAFGAAAAVTGCSPSHLQETKQKNKQNKQRL RSATVPNKQAASFKRQRLSMEASNDPPHCMPPLPSLSVSARGASTSNCRNDSNAETNFQM SRPSDSSDNFCQVASSPLRQFADCRPTCQIACRGPDGNDMNHHLLRQEPQSGVEQSWQPA SRQQMTQDEIMFDLLLQQQQQRRMQNNLQNQLQHQLQHQQNQLQAQFQAQQQRTQHDHQH QSFANQRSFSTEHCAADQLTQQRDLNQAREQLMFQQAMTDQHLHVSGSAQGSRSAHSHDL STTNNQPKLDLRPIKPSNTDSANVHFPLALPEDEEWLTPLHCFVRKYCVEAFVATNKDVA APCMGKRTPVSIGQVGIRCHYCSPDRLVNSDAARSRENGVVYPSTVSRIYNSSINLLQRH LRSCPFVPPEVLAKYEDLKASNARSGASKKFWSDSAQRLGLRDAPDGIRLDQAVHRTHRF NKSRAGPQGSASGNASSLANAEPSCAPVVTPSDKRSTTSFTYHLMLQTTPCVFTEADRLG RRRSLGIGFSGLACRHCFGVYGSGRFFPSSVKTMADASKTLDVLYKHVMRCTKCPDNVKS GLMNLREMHDAERSRMPFGSQRAFFVKVWNRLHSDLAIGSNAATVYPTNGTLPHAMSTLA YPSVSIPPRILEDSKPTAVKTDEQSGETLQRIYAGMDATPPTELRKLANEAA >gnl|To_NUC_proteinmodels_ML|p245 MILRLLGLLCATVEGVERMDLGDGTGGRRVTIRYEDVGYLIGGERGDDGGGSMLTIVERR ERDGAPALMSSILTLLGKKGTLRCVSNSAIYPDDGGGKRSLLVVHHRPLLRMLLRTAPYL DERACDRVPHEGSGQRSVTLKRTVTVIRSLRRFFDQGVQDSISLGGEAPGGAVALSDATA RSVWSSVESDVSRTHSNGCFRALIVVYLFHPSRCSASYYLSVLPLWMEAWRQIDRCPEWD HLWLVLFSRARKYLPPGSYDWTGLLRHLLTHCGYWLQIPTGGASSTDRSFPRAVSPAKRT CPARLKAFVGSEGKYEEGMDFVGRLTKMIMFCAGPGKGVAVAAAGGGDGGPASTAAVSEG SAVTLRFLSFVTPYFNPSNVGAWTFPLGAFLHYLAYELCHRVGNAAGLRTLRADHPETYD RLMADEPYLEGLELPGNEVVAFLDRLLPLCQQALYSKNPNVSHAGETAMLYLVQIDPRRA SPPLLDFGLRALDVSSVNLSHQAPAALSAISRLLQPTLRRDPAAVLRRLPDILRLTLAGI DSNDQNKSLRTLILYRNMAMWIPIGGVIVVPRRDSGGDEAAWDGTTTVGTDLMSARRALA ETESYRAALRALPEGSLLALPPGADAEDDSEDDACALDDLFEEAMAAMSDWSLSFLDRIF DLLRAAGEQEKLGRGHGGVGMRHTSADVAMTRNFQRIMKETLIYVFSGMDDETYGRALRR VVDFVSGETLPFAVKDASLLCQAVCSTRFASGCSSPYADASPGLDALVPVLVEDIDRRSG KSAAYRLRCLAGAVRYAGSAVLGHGESIKGAIEFALSKKDDRVLFKTDFCVEGCKLLRHT LASQVEEYIISQSYHPMRLESADSPRPALGASASLKGDRMAWHVPSGEQIDFAAGLIRQF TLTRLDGLSSESGSDGTGAVDLQRWRQSLRVLRYTLRGASGVLLDQDSAAIVSHDDDLCP KERATARLILAASDDTRVMLGGLRRRFCYGVLAIMSMIATDATANGQGAADRDQESQSRI GSPAKQISSDAKVCKETIELVELVATRRGAHYQSGTKKTIWRGQKELLIDFVVSSQSEFI ASVLRRSNDASLSDMNLSYKDSENGGKTVPSALVVNRISLTNEALAGNASTQVPRRLRKL RGGPGSVTPSSVFSVGMSLATVQEHLGPSREYAPGETTLEAYEGLVDGLSALTCHDNING RSCVVSFSCYRACLIRTSFGPVRGDALGILDFSLTRFGWVAKRRLPRLVAAMSLDDDALE GVDGIPSCSRLIDRFNSQNKRTRLAECVKGVTKIIALPRVMKHFMIGEGNRFDLMRIILG TQKILRLVPQEEVPKIVHYANEIFKQYRSKVLITPRITGKDQAAHAESLAFLLGVLREGN SSNEADESEEAVQLHWRDRLLAAWFILTSVDEGDLAVDDSEIVGQIWSACFMLIEEETGQ PLQRVALGLLGRLTSLVLVQRGSNLDGPGDDADVALLLRSAFTREKFLEHFAASLVFDHK ADTEVGGGHSAPFCTDTHNRKHEEWSSGIEEVIRDSTANLSRRTLFPFLRIGQKSRNFKL AHSQLTESILLAIGREEATAASRVLLAQATKLVDAPPSEDQRNSQMTAAELFSGVARASL LYCADDEERAKVWDEILLPFLNDAILKMPNMYISAFFDAVRYAIHSLPPSHFFPLLQWSV AKIEQTCWQHETGNIEEADEPAVSPAVADRFNVQSKWLFLIQAILAEIDIDRRDIKRPWY TGLLVSESRGNDETQSFTAEDGLGKSFDFVNQKLTPILLNALGHPYDKCRDHISSCLFRM CYCHQKLVRECGESPSGDGDPSIAIIDKLVSTRDSNEFSFREKVAAMGTVRKFISCCVHW GDTSRWYHQFLLPLLPITFLSLENIEGEVTQENRGLESDLAKGYRYTVADISSSCIIAYG VNEDRAMVLKVLREMSGQTHWQIRQAVAHFLRCFQGAHKFLLDNDQNEEALSITISMLAD ERREVSNAAMSTLTGILAASPDESLIELVAKYTRIANKSLKKKKRKAQQPAEQDLTTEEA ELRATKERNRAIRQQKSVFLLCAVVMANPYGLPSYVPDALVALSKHSFEQRAALNVREMV KQTFADFRKTHQDRWDEHRQQLTQEQLEALEDVVSTPHYYA >gnl|To_NUC_proteinmodels_ML|p246 XRRQPSFAAFNTLLSSVTLKLPDAEVSQSGLELTLTELECENVSLQNIGIDHSVLSSTDQ SVLLSVTGVSLECTLRWEYRWTIFRGRGSGYGRLDPESSASVRMDFASEDYALRPPTDVT VTQCEPRLEIGEMKFDGDGVGIVGGIIDLFEGLLRGTVEAELEKLMCDELRGLGDEAMDK LLDRIDGMLEGYVEPAEDGSGSAALEKESAAAVPVNGDGERVYLDFSRLDEYAGGWIDDA LGAIDGLLGGETAGATAAVGEETPTSAIGTAMLDMFQSAAGTEAPPRLGINEFIRDNLLD SAGALVLDTSLFTDDAEIFDGHDMLTETSLSIEEVSLKGLDSFTELDLLNAIGNYTLRNV LRLDHLTVQVNMRAVMAASSLSDAVIVGDAEPIVEHFSIEITLRDIEIDLSVFLGVNTAT LGSLQLGPLLHSSDVVTCLLSAVAELEATELLVSVGDVEPPTLTGFLDSGVDYVITSAAG SLFKMYEAVALRALPSFFDSFVKDRLNAYVDDTLRLFSSGCELSSDMEALDGHVDFRDLL YGAEEAASAGGNGDGRYGDLVPTVMDVIDDWLFGWDEDTGLLAINKAVIVPLTESRFGRR GAVVLEGDLVDLEHVSRYDHWKSFATDLRLTLSNLRVNGLDTLRTPYKVLEPSASGPHVV ENQLNLGPMSAGFEFGIGIGDETSPLAMANVVDIMFIMPEVELFVDLFATLQESRLMEFP LRDVLNGSCWLATLPQSDRHLTETTAGLALDYLELLFDQGMQITSSCIRCTNPWLDDLDR IIGFLSEHKFVDEVKSRALAIASNLANSPWVGGLIDEQIAKASRACPHDPQFGLALPETS LPGFVATRDMVDGILYTGMTVVQVIAIIMAQRHSELEVPEKPLEIELVVPDDSNLIDLTN LSAIVGFADMALDEGRKYVGSAAENGELGINNLLSMLLDEDGVLTIPLEEEANGFPGFEA GGVKLEVWEVQLVGMDSFTTFDVFQTIDSYTLKNKIALERLGCRVKMGLSVEDSSGEQAR MLQESGVDANSLETITASLIFKDVELDISLLMAMDRDLLGSMKLGSILNTDNIFYCLLST VHDVGLSEFIMDVGDIQEFAIEGFVSNATEASIRGMTNSIFDEYKSMVLDALPAFTAVTV RPILHDILGTLVELGRTGACPVPDPALDGLVDFRDLFLPAETARMMVGADGDSPYGNLLR MAYGFLDGMMSESDDSGMSDMNSLVASFTERQSGKTGDIHFLGKLFGRDLDIDFNGLNAA ISLAVSDVKVSNIDSLGSPIELLRPVIDRPNTLNNTASIGVGPEPLRVSFRLLIRGKGDE VEVDNDLVLGFSVNHMRMIAEILLQIEELSFLNFPLGDVLNIDCWLATVVTPVLNKYGLR DSESETGIVLRNLAMAVAEARLDIDCILCTSPLLLELADYFSSPQGVEDTTTVANDIFAY ISRLLGGDFIQNRLDKVLAEAQMKCPHSPPYEQNFAGLQFQAMETVEQPESLYGFLVAIA VVVGMLLVVISVLVLVTRFVSRRRHNRWCKTLSRTQLKELAKIERDEAAKAKDLNTRVTS LVRSKEVPLIARLMMPLVILANIGLFLSGHLSLGGTVNISGSFAGQDFDIDGFFEFSMLK STIEMWQAGAHELAILIALFSGLWPYTKQILTLIIWLSPTKYLSCQRRERMLIWLDCLGK WSMVDVFVLLMTLASFALSIESPEHLNFLPQGLYSINMLVVPLWGLYANMLAQFLAQISS HFIIHYHRKSVKAAEVEQCEEMGLDPPRNPDSRERLHQHSFRLDYEASNERAEVRRGTNW ALVASFILFTLLVIVGCSIPSFSIETLGLVGLAIESGNRNWSEAYTSYSVFGLARTIMDQ GRFLGTASAHVGMGTLAALLAVTAFLVPLFQAASLFASWFAPLSRKRRRKNEVLNEILSA WQYMEVYVLSIMIAAWQLGGVSEYMINEYCGSLDNTFSSLSYYGILDKEDAQCFRVNAGV ETGTWLLVAASVILYFINSFVSLSAKQKAVDDDVPVENRFTSDRWLHTKGSTLTVGISYS MEEDGGDEIASEGSTVVSPVRPRFTDYFALAVVRKVEVTSEIEQVETAVPPEEAWEGHKN >gnl|To_NUC_proteinmodels_ML|p247 MVSVTAKSYGLGPSGTLVELCVPPPSNASACRVQSTPVIILDQSGSMGQWSTRLPPIIAE ALLGAGFRSSQACHFVTFASGCAHSVTTVAGLGNRGQESRGGTVMAGAVAKLKDLLNSNP SLDFQLIVVSDGMVSDSARVIADAETLRENCQRRSGAIDASLLRLITSNYGSPDTRALSC LGSLSGTKCDVLDLDGRCGLSDEIRNELVLAIRDTLGGCEKSRVNFNMPLPRSLLGDGDR VSHMSFSEGRHYIMLPTDIDVSILKLEFEGRTYAVASHEKPFGEDDIQGFLRFLESRMRM LAVVGDASSRATLIHMSSWLSGVEGVLNSMKMAKDKEDDEKEEEDFTLASRKKVILSRLL KSKKGLINSLKQLANSDAVAGLNSQQKADFLRGASYNRSGRALAKRAAKADMTPDELAAE GIRNLATAVIPMDEEGSDTCTRSFYSHATAPECLELAVELNDQGASAVEVLQLIGLPGIA FNAPRGNYIDPWSFRVEKVFTGDSCILSQSDLWHYLSLSGGSSSLCPPGSTIPITGVDPV GAPNRKTYDVYLNEAAGINRLHASVSMRGMIAPVSDDDHAIKTAVALHLINSIAQGTSTE SIIHALQWEIADLSTLTINRSIQENMTRSEPGAYFTGDLGIGNVMRLYCTILGWTNLSAS LRKSQRQRVLRDIYSLAVYHKLKRIDCRDTSIQNLLGLDISSNATPLSDLFEPNDPHPEH CSLFQLNAASTILSKMVPSPDEVALVARARELTDVGASLRELEGVVDIETAYGVKADDKL LWTNAVAVQALKCSKASHRVDTNKRVMLTEQLVTHEQCSAFLSNSAREQFKADYEKRLRS KIAEEARVTMLRRVEDLVKADSMEAFITLLQQQVPSRSSETFVRVLNILVDSPDPPLRFR KMWVCLLGRNKRGEPVWNGGTCLVGDLTKYCEAFHNAGKGKLWKELKDMRAKYPVHRYRE GKENRHGHSNEFPSFYFWGYSSLECFKAGVDSDTFRTYCIAHSKKGCCGLG >gnl|To_NUC_proteinmodels_ML|p248 MRTWTDPSPHALCHGANTVEENAHVWLRSPKSEWGWLPGKIRRKALVPKKQHKNARREAY KERTASMGSYARSKMTNQPSFMASVQNIGGDKNPLVKKLEDSRRTELQTLMRDRTLPKDE KKAKMDAVKEKYNQLVAEVEVNAANGSTGGEGGPDAAPQEPEEEMIIELTIVDDFTGMED EGNFYSGLKSFNEIVYVDPAVQREEHPDIKLRNMGISGSANAIPFYGESNVMNRGVSGRN LVEKDNDKSKSRVKIDSVAGGVDDLIGLTHLHEPAILHALRLRYDADTIYTSTGPILLAI NPFKGEEAAHGSSAAGTGAASQKSLANGATKKTPLSKKASTRTWASKDGRPKCDDGQPIP RIFLHRSTGKLPPHVYQAADDAYRAMMRGIGMKSIMKGGGRGLSSRGGRGGRPGQVARLS EDEIPTNQSILVSGESGAGKTVTTKIVLNYFAMLSQKIQERARTDSTTTPAKNGGRRSSK SSAILDDTSIESKVLQSNPILEAFGNARTIRNDNSSRFGKYINIAFTDRGQLSRASIDTY LLEKVRLIHQTNGERNFHVFYQFLGAARDDERRDLLLNGYTVHDFHLTNQSATYDRRDNV DDVDMHVEMVEAMNIMNFGADTTKHLMRLVVAVLFAGNMTFSNMKTAQYGDTAVLDETEA SLAVAELLGVSFDNLAASLTSKVIFARGDMIHKGLDQGQAEKANEALIKSIYGAAFDFIA ELINASINKGTGRIPSSGSPPKSARRQGSSVGVDEPLNIVPPGGASIGVLDIFGFETFEV NAFEQQFNRFVFKLEQQEYEREGILWKFIPFPDNQDVLDLIDKPPTGILQILDEQCIVDW GTDKKFSLSLYSTCDQISNRFHVSPAQRVSNKFAVEHYAGLVEYSTENWLEKNKDQLPAA SAELLESSDFELIVRLKKYVRYEGAKIAMKSLGRQFSDSLKVLRSRINETMPHYVRCLKP NDALLPDNFDPKNIVEQLRYCGVLEAVRVSRAGYPTRYPHDIFMTRYYMICPNRDVDEDN LSPYHHEVSSNLTHEQKQLKRTVSRIATEIWKIESEIQKRQTSSGDAAHTKDNRHALAQP KTIEEFMRLDFSSRCAVAGLQLGKTKVFLRREAFQLMESIRNERFGKNVTSIAKNWRRFA AMKYLRDAKHAAVLIQSIIRMGLAAGKTSLLMKEFKIYLKKKNATMKIQKMYRNHYCKYF KEGAELKQRKAAVVTIQGALRGRLARKRVFGLVRSIIRFQSTIRANKQRKEYLSKIAAVT KMQSLARVLIACKVVEELRRQRAALRIQSMVRMMWVYHDFRRHVDAASLIKRAYREHLYQ ERRLYGTFLNNYYMLGETGDNKYVEEAKVLKRRAKLLTRHRQKIVNAKKLELNTLVEKLT LELWEPGKFESFAKPKVGFASADGIAAICQDFSPEPSIARSLPQAVPAPVGEPPSKSTKK KKFGFSVSRKKSSHNGSSKPRKGVRPNPAPLPSFAPIPQTEDEFMKRSPASRRALVGMQM ADGTIYLRPETYQLLEKMRNSIVGGSSAKIQALARGMLTRSRLKKERVAAIKVQSFVRMY MEKKHLLPKKREYAATKIQSVFRMSMTRKSVWSTYWSTQNRDLFGFIKEDNWYMVEKMLH KNPLLVEEADPDSGELPLHKIVERASAWTLLIDMILTLYPKAIVHKDFAGELPIHHAART DNLTALEIIYESYKNGAKDADGLGRLPIHVAAEHGSIEAIKFLTMNVPECAHTTTAGGGS LPVHIACKNYSSVGVITCLLRTSNKFGLVNRTDENGELPLHLLLRCGEGVDVVAVKTLLT CNLKTIGKRDKNGDIPLHIALKNKCKPAVIEALLSHFPGSSVVMDGQGHSPLHLALTNSA EDETNVSLIKYAPQMVTLKDDRTGLLPIEIATRNELSLFIVYRLLKQDMPIDLKERVQVR LIPHYFSWNHILLDVEDRYYQVVSKILQQCTQPQVLALAHVEDSEGKIALASATPICRHE IRVMLRLFNTLELVKQRPAFTNGASDTEIFYALRYEPPPEQSDLFSTDYEQKDDDRDFTE DWDDDMSHDSHADGPAGVTSEDASLSVAEKLNLIRNEQGQHVIAKITPRSDIVERELKVR KDYNLSRHYVPSVISVHHTVHHGAYQNASAEAAYCITMEGADATIEHLMLDYRRAGRAFP CNELKRIGMVLLHLHENGLVHSDFGPHSVGKFGPLFKLLGVGGCVPIGEEIDPKHGIYHS PEAITVETIEVDGAERKTARVVPVKASPAVDLWAFGHMIYESVAGTPLSAYSHRGKRVKS SNLAKIARWDDHSLERSLRFIDADDSLARDVISKLLHPDPEQRFLSIRDAIADPFFNTDA GDRKIKQVKGKVKS >gnl|To_NUC_proteinmodels_ML|p249 MKQNMSSVANFNHISFQRKVNNALTAVERTLELERSPRLAEEVDHTYGAKYELVDTTTNA AIIAYMTCLEKLGLNGLILGAIVEGAGNKPLTLRFDVSTTPTFVKEVRVKVPMEYSYEQT EETAGSKITKVLKAVRRVTKFHYDVELEWSLSIYSGTSVDERTVLRSNKSSSVIHKQTKE SSRLRITPQSYELPLTWLLKQVNLDDMASQFKVDTSRESTKTPRRNDDVDKAIEFSSRLN SWVGQIQTGLYESLARIEGQHDPGLVKDPVFSRSSYMDVLRSPVDPTFNPILPLMEDRPE LSSEPEDEVVPGSLIKINMQSHDDNADRKALLLHSDLTKLTNEHARSLQGVLETRAGAFV ASDEGIIATTTEIELAVVLRNLQGLSIQYEETMNYVESMMERQLIAAIGKRLTQDDLQTY VRYHDARLLSPAPKPFSLAIRRPEHYPTGLVSIETMGGENDCIHSHSRQVNVGSTITVAL NSATQVQLSGNQYLHGWMNHRFGNDKKNHQLVARARQFSAFILVIGTMVDGTTLGPDDAI IVQNKDELLIPLLLEEIPTAREFKDAIKSLSPEQQRFAQSYRSMQLSSSMFGLCVVQIKP QLEALLGLPADALDKEMKLTQDLMKLFIEYQVPSDLLSYNGHSESAALEDKIGNVKANVK AVVDVVELQKEQQLKDERAKSDMAMMMRCDGNMRCSGRAKSDMATMGMERGGSYTRCSGG ITSWKGKQRLPTENVTSQSRMRGCAPRQQLPVFLYSDGADDCDSFSLGDYDVASAGTISI GEPIEESQEYECAALGSESQRNVGGKVTTSSRSDRGDSVTQQGVDFTLIPKKLDQSVEKR GDAASLHSTTIKTSSNWTRNRQANLLSKPERHGLDADEIRKEKSKAFDLLDALSRSGSLA IAHSELHVVVAMTHCFEKDLMGTVICDNVNPIEKLEGSTLLLASAVHGLPLRELVRDVSE LQRLGMTMPCLLEG >gnl|To_NUC_proteinmodels_ML|p250 XMAETLSPTPLASRITRDSTVATMKAPTTSVAATVPSTAPVKARRKSMKKSDRSVPFAEM QRLMRVYGSIKCHRKRSSGPEENMKIDSVKRKFYRWFPDFDEKFEKDPEASAGGNVVYRP KAGHEAEMSYREEMRRLDGEVLCKKRAKCRRERHGTKLNVPSITTSGRISPPPRRVSSGS VNAKLAAPVSPVQSSVAPPTDVVVPEKVLSAPKPQAVVSRANPITLYRQVTPDLELPSQV SRDECDDGQQANERTESSGALSLDLSLNDALDTFIDQDGDMMTDVFDEVVFEEVETAFYG SLSGQIKCELETMESGLPGLLTLPPPPATKSVSLPDVSDSDSSTGDDSSYRCGSPEWERD WSTIDKMIGNDLEASLDRIADDVEGDEAVSSYLFDFLRHECQGNGWHPIIDKKRSGCSLT DL >gnl|To_NUC_proteinmodels_ML|p251 MPTATDRDSSAAAEKMSDAADAKSTTLPDETVPEEKMSDGAADENTATPEETTGNTPKAP EVAAAAIAADTAQTSQSKTVDDKQLARYFTVLRDFMSYHHNRKVLYPPSHTFTRDELLEV TPDAICKWMTVKLYETETPAERALPTYGSYHTLENYKKAISYFMPQKDGGRYDPDSHTGN PTRSTQVKDLIKRIRSIMDNNSSAKLSRKHAPKASDMIGNKKTKLNPLIANKKTKLNPPV ASKPAGQPATKLNPPVARKPAGQPAPRQAAAMATATILQGSSISSTAAQPPLSRDVQILA MQGILHRVHAQNLCFMELFGTLSTSLEQFRQTLAATNIGIMAEVQRLSLLNAPPPVVPSV TQTKRPATKVASTVAQATRVQPVSAVVNGSGAQMKHSSAAVVENNRPTSQDSQVQPVLYE YLYLHPDGVRRRANRITVSFFGNLLSIPRYNSSQDLQPMIKFAPSDMSFLPKRARQSYNE VKGLVRLVDDKAKSLGVNVKAVMTPAESASAFQKGSSSLVISSTTPSGRQRNVAKLKWAT LLKYANKSEKEDSVKKSQEASSVTTLNAEATCDNSDDPEKQDECKE >gnl|To_NUC_proteinmodels_ML|p252 MSGIIPDIASWALGGGVGRGEEDNSQTNAREESSSQTQPAAISGEDMRAKRMARLAAMEA QNSAASGNGSAANMDVDDSSSKASKPSVKETAQPMEVDSPTPSKPASDSTVDAAVEPAAK KKKAPPVDAKKKLRRTKVVMLRRVLQVTFGTEATDRAPSCVHLTLDDDEMYNPQKTPGGI EKRHVAEILAARLSLPPSSRSLETVPHQPSGLIAYLGGCHKRAGEEAKELRQTSSLMVPD LFELASEAPMQLAKCLASTATDPINSITFDVQGKNTSFYACLCEELHSQDGLDTTIDAVT KCITESLVKCKSLVDDIPDGGLGTKALYLTSALREMCSNKKAAVVLSNNPAFLLPPANTA QAVQVVKVDPPYMAQPQAAMSIRISGVKRRSGLGLESQTILGLVLRLGVPSASPTVVSAF QAPARRSPSDIRSTQAGFRRTLETYQSKCNELVKALLVSSGADSRQKVIQWITDALVINS NAAGSHPDPRKISTYEFLFNLSAVMLKMCDPFISDQKKAALVDPGFVCSPDALRGVYAIT GDDALPRLGENVSTDGVTYNPKNSFVPLCFFYCSRALALSIVPDANLYENTLRRLSSLHR HINARGGDVVADPRFNMFLASQHSQEIVMQSPGYVSDVFRYYNLAAGLFLNMPKEQLKTM PEHIIGDICSVLVYGAQFAEKLMAGLDFSNLFKLVVMLLSKDCASLVRNYNVRAELGDVL HDVFLPSNSSDRRRNVPDSVTCDPLQMGQPYLTSNKLALETLAPSLLLLYGEVEHTGYYE KMSYRVKIAALLKYLWECPAHKPAFKAIAGDEESFDTFANGIVNEMNTQYADAIKALVSI RSTQLLMANQQEWATRGEEEREQIEERYANDESQSRNMLALCTSVLKMLGFLSTDDDIRT MFTKPEMRQRLADMLLFVLQKIVGSRGLDLKVDNPESYGFRPKEMLQDLCAVFSSFASDD EFQKSCARSGYYSPELMQKALKTCRKQGLLVGESLDLFTLLAGKVEDAHKALADEEELYD GAPEEFMDPITQEWMESPVTLPSGNVTDLKTIKQHLLNDPHDPFNRSPLNLDQCVPAKEL REKMKAWLEEKKKTKGSS >gnl|To_NUC_proteinmodels_ML|p253 MAIEVSRFGLLPRKAMYPGGGCVFGKSECSDETAWRSSGEMIGAPYRAHGGYCLLAGTMR DTLLGRCGSGDCSPTSLGCGSEYSSFVELDKGCSIENTKFGYCDSIGASSGPGFCAWSPE DCEDDVSYVWKFPAEECTGDKVMVGGCLAEEETPYCAISADGCGGNARYLKPQQLSRSTD YECFVGMSKVKPLGIESTLEDLKEATPVDDQSSPGLVLDSSKYGANGLKNSEPAASLSQD SESGGTSPSLIIGIALGCVLLVALLAMIMLQNLRNKRMRSARDAEFLKQESVATTEQEDY PPSDLQIVNDYSDVVSDIGA >gnl|To_NUC_proteinmodels_ML|p254 XASFFTDDRGNLKAVNFIVRNPCKVFWMIIVLCFVISFLLQVLVFRKAEGSPFTTPQNEY DVSDERSIQYDSLRLAKDDVKAIRAATNKGAETVLKQSELSDIAYWVFEGEDDVGLFGSA ANIQGMKDAYDVFYEDKAFVDWCLLDYREEVAVNETRGCVPPLTPLTMYYASEWDSEMVA TAIEELKKTEKLQLFNDLSVCVVSGLYCGSQAIEDASAEDRMWVMELGNNITAITSKWDM KGPLVSNFTQVTELASLLIQVDLFKGTVDFGFDNDFSAENPVSQYSRGIVFWGGPLDERN TTGQEDDEEADELEESDGKLRKEFIKSEYLAGMEKQSEEDAHSGVNTYYFMTAIIGDVIL GIVTQDALLALFSLAFVFFWLRINTRSWFLAYIGLLEIFFSIPIAWFIFTVVFQIEYFAT LNSLALFVVAAIGADDIFIFMDAYKQSQYHVEILDDLETRMSWVYRRTGTAMAITSATTC AAFLCTLITPLSGLQSFGIFAAVVIFIDYVLVMTLFCTSVIIYHNRYENRAACGCCCPCG TIQPSNTEQAKLVLEQSDEEIQRDRVSEFFRNKVSGFIQTPLYRLAVGVVFLTWLSIAIW QASLLESTKESEQFLDENHPLQKSITILNKQFPTADDDLGLKVYYAWGLNEVDRKGVNRL LDPDFFGEPQYDSWFEFTPECQGEIIEFCDEMRADPKYKDLIKRKDGLGSVFCWVEELAA FNANPGAYSRKDRNQGDCNYVKQGTWKNGTDWQVDPANLSTIMPEFMAQKTCFGDDLTET ISGRYSNEIGWDGNHMKYVAVSLESKELDPFGLDAESLTRREYDQFIEIRDKYLDSTRQC LGNVITTDLDEKFVFMNNQAIYVKTAIQSSVLGVVIAFVVLLISTRVFHLAFFASLSITA VLLSVVGTMVMLGWSLGSIESTLIGIIAGFSVDYVVHLAHAYEIASGDTYSRVTEAFSDL GISVFNGMVTSVVASIPLFFCQLQFFAKFGTFLCLTIAYSWIFANFGFMSVLAQLKIPLK KGKRFQP >gnl|To_NUC_proteinmodels_ML|p255 MSDSSKSGPEADVPLQVFIPDERATGARRDLKRPRPGGERRERGTRDRRSNPAEKVTPAA VNKPSYRKLQSRDSFDSSDDEVVAYQARFTPSNDDRERDGKPRAQPRKFVSPPVLDSSDA DSDMEKAIAASLRDMRSDPRRMGGETKGGRRRGPEEEAAAESDLLVDSEGSDAERIPAAR RMKRRKKSSRIDARERRKWDIDSDEPSESDDDDLKVPGRKTCFSDNNVQSDSEDERPRKR SAGFAWDLDEDDSEAEDKATRHLGQIKNRKDAVLQLLLFDAGFTGFNLLPHQFRAVRYVA GLPDHFPYDEEWLEVEDEMSQADEAVEEMLRRDDAGKKARNGALKDAKMIDTMGGICADE MGLGKTIEALGGVVLRQSRRIYRGKKPKPSLIVCPQDGVMTQWVEALMKGGIHPSRIEII GNQSLLDGHPRSKASERRLGRFILLTRFRIMHTLKAKFEYDTLVTLQQSLDRCALFKNVK PELIRVMQNQYLARRGAEKDKYKVSGLKEKPCDRITRLLREHIDNLVDFTSTFETVIIDE AHFLRNVLSFWGLGACLLGSHSKRIVLLSGTPYNNGPSDMSALMTFIDPYLESAQLSFWE DATRDGSPKSVIKKVSEWSRTYMLRRTKSVLETELPPRTRTCTSVPIFPSQIGYYEHYEE IFLMILDMVETELEGDSPAVIREKKRMMDIMMACCACMRMVLIQPILPGGREMTKYFSPS RHRHVGREEQCDKCVFCQAEKDKSLTDENKNELEGKWTKRSTNTPQSGLDQFLAPGFDEE ESGDFEGNGWQQKQDKDCPSRDRMKNKDLGPLVPLCGDICRTAGTSCRHFAHEKCLKIRM KEEPNLTCPRCLDLSSRLHLGRGKVYCQNIKTPVSDQPGFKSTPKLDKLIKWFKGVPQDD KCIVLSFFKGSLDLVEGILHHEFGVNCVRYDGDIGKETRERELQQFKKNRSYRVLLATVQ SGGVGLNIVEANHICFIDRWFNPQTHDQAESRCHRLGQKKDVMITYLDCCYTVDMAMRLV NKLKENNSKVLLADGTDLGLAETDRYQDLQGIIGKSLKEIARMRKSAIETNEAMGFRDEP LICEDEEFKQALMSNMQSPRMKEENRDGELGDNKPWDHNMKIKDERKSLGKESEQIKSEP FKPEASGAEVIDLLDSDNEGETKGDSEQDLKRALEEATKQGDWEKVGKIASKLAQIK >gnl|To_NUC_proteinmodels_ML|p256 MMLNEGIIGIGGDEVFARCVADIANWDDEEEDAFDLDPPVSPASDHVRRACGGSSSSSST PRGTDLSASANSFDSKEFSTGEDNRSFPLLLYELVSDPNTNNSIRWLECGTRFVIADKNK FTAIVLTNQLGGRGRGSAKFTSFTRRLKRWNFKRVPSGKEMGAYYHKDFTRDDPDGAVKI TYPAATAKNPAKVATTKKVKARRASTGSLQHLKPIVMKLNASQGVSPTEELNISPTPIKN SLRDKYVECELPVINSDMSAWLSSSALLNEETIGEPRNLNHPLELPPSQSQQPPLSLSEI SQHYQHLIAPSTMRRHSSTDLQYHQQQQQRHQSHLIYRQQGLGINQTNVSTPMSTSQELL QGWGFRELPAKFPRGCPGPFGFQS >gnl|To_NUC_proteinmodels_ML|p257 MPFTALVRHRHRKHQGSLTRYPSADANKKTSQPHNSPVTTCPHHYDTPIPRDTMMNKAAL VCLQLIFASSSAWKVLRGSHRESRIIGGTESDPDHTYVASLQDDIGHFCGGSLIAPDVVL SAAHCGTDIPKVILGRHDLGKSSGGEVMTVKEAIKNPRYSTKNDNNDSMLIFLDRASSMK HTELVRLGRDFVGEGQHVTVRGWGDTNPSDYRENYPNELMEVEVKTLSNEDCEASKGRDF DYDGWITGNMICAEHRLRKDACQGDSGGPLVVESGPEPVQVGIVSWGYGCAEDDYPGVYT RVSTQYKWIREQVCSRSSSPPDYFNCNTASTLQQSQASGNGEGIEEDANEEASIVVEEEP AGPAMFEKGSFRVEEFDNGDFGMFTEHSESARHYAESHLQQGVVAVGDGLSIGTEYLSLG GFNSLAVSYRFLAENLHPGDEFCLEYILDEGESSSECTSRQGPFANGVWYIKTAKLDISN ASSVKLKFVLRAKSSSGGQEDSDVLLDKVIFRGSS >gnl|To_NUC_proteinmodels_ML|p258 MDMLADAWAGRCSRHKVFAADAWQTTIALRRIHIKHPGSIDAVSQNLALTTQSKLRSVDE GRRTLWKSTVNRILQDLCLTSVLCSSLSVSAPPGPRPRSEPKQQRSNSQTRSHNLKAFNT SFTPPYQTYPSIVNMKIVLVEHFVNDAEAWQKVMGGHFATFGQPGMSIDKAQKLPVFEGQ SCLLTCPTSTNMTLCVWQQGDLSEDEFQAFIDRFTQGLCTNKCYYISDSLGLSGLTPRGY MRDFINTASEGKSSAGYLDAPKLWFVHHSIVDPAKFQNRMGEFVGIAASSENGDDLKTKF AASFGVPLGTYACASLFTEGADAACMWSSGEDVSIEDFQKYVDSFTTSACKNTVYAVDTG MSCNVMGLSEDLFLTEQIAWAKAAVATPTEEALEKKKAHLIECGLPEAQAAISVATDLPG GFKETTPSSCFYTHVIKPDGNVDEALKWFAEYTKAAKNSKGHSLVSHTYNPETREIIAID VISGPNAMDNHIGNCFPAYAKMIGSGVEMKECVAIVPADEVEWWTNSLKVWGASRFIVKT DHTKVEAVKTVETGIDATKYAMDVSQ >gnl|To_NUC_proteinmodels_ML|p259 MTASPKDTAAGAKRSRTLRYDDGATQFRLRLAASVLSSRPLLIRNIRFDDIDAPGLREHE ASFLRLLDRMTNGTRIEINATGTQLRFKPGVLLGGTVEHDCPAGEGARGVGWFLEGILPL AAFGKEALDLTLRGPDYLQSSCVPVMLRAGIGSDGDDVPPSIRVTRRGAAPLGGGTVEFH CPIVKELRPMEMAEFGRVKRVRGNAVSCRIPPSSAARVAHSAKGVMHRLLPDVWVHTDVH SSGGRKRGRGGDDAGGCGASPGLSVMLTATTTEGVCLTAECSMRHGGASSDGEAVERGKM ELPEELGRRAATLLLHEIHAGGCIDTHCQAFALMLMCLTPEDVSRIRLGPLSGYSIVALR LFKEAFGTEFKLRVEQEAAGDDDIDSDEEEGRYRGERTVICSCLGIGYRNMARAST >gnl|To_NUC_proteinmodels_ML|p261 MDYENIATKRRPAFDGACFDSDGFCLAHGDVRIAKRLEDGGFKTLRTTCYRCSLTRSTAS TAVVPMGMVAQMTQRRGFRPVSSESSRVEVTRHERSLDRRTRFDGDFRKREDSFTAAERH MRQKSTRRRSTISSRSFAVTTLDSSVPSRRSRRSTVDVPQRIVEIGQDRRLPMVKASNWD NWKADKLRTEDKPKPSLRDQLKSLLPIPAVSASRNSCQIGPTQRQHASSLPFDCRGYCTV HKSVRLATKSRGGKWTTILASCPICHTLQTSFVSSTPINVYPPDKKNKIKRFSRDNESIT SSQYTRPLTPISSRSSSPENVPKQMTRDTLTVVGGRDNNMSALTRHFLNEGERELSKRVT RF >gnl|To_NUC_proteinmodels_ML|p263 MSKLARPLLLLPAAALSVGTAPRSHELDSTYGFGEYLQHFDKSYPDAAEYRRREERFSQN MEKILNHNKGRLSDTGAILDGGYVMGVNHLTDQYLDELPFGYDKTYHSSWSGQLMRGASK IERRLGESTTTSYSXXXXXXXXXXXXXXXXVDWSAQGHVNPTIPQQGACGSCWTFALTAT VESHLSIATDEPPMSLSEQSVLQCTPNPDHCGGTGGCKGATVEIGLNYIADLTAKKEGGM YNLSDVPYSPSTTLSCEDSTEGSTASVGVTGWTQLPTNDYKSVMNALVQKGPVAIAAAAS DWALYEKGVFSSDDATVNHAILLVGYGIDEDTGEKYYKIRNSWGPHFGEDGYIRVLRTDE DSTVCNMDNDPLVGLACALDDSGNQIDVQPVEVCGASGVLFDVSYPVGVHKIDVADQS >gnl|To_NUC_proteinmodels_ML|p264 MSSQSFDEEDFMMSHKSAMSHVSSALAISNFPVEVQGILTNAYDDDGDGHISTDELVEGA RIKMRTEKCNRILWRGIGLAVVLVVALIGLNAGLTYGIIDANKDTEVQGRALLVREKGMD AEVPVATSNNEITVTLATIPFLPSSAVSHIQNLAFSSEDGSVVYHRKVRSADVRPDQGLT LMTTQGDLVEWDLVDDAKKLSITLQDGTAWKTCMQCTECTAGNIYYTPDILDGLDSFEAA TGKEARRHLGTFSRRLQNGGRGDRDCNSC >gnl|To_NUC_proteinmodels_ML|p265 MGDSGSVTITLDLFTVSLLLATLLFGFTFILAKRKYSFLNKHYANFDLKKLLVLSLGTAA CLRMLSFLGVIAMDMGNVNAHYSLGELHDVLPRYMPTTYCAFPDGISPAPIHSSNADKHP IDINQSFYNSAMAVLFDLPNTIVVSTYALLILVWSECSLLSRFHTESQVQX >gnl|To_NUC_proteinmodels_ML|p266 MSSAISNGWSMSSGHSIRSKVSPNSHKQRHRASKDVKEYIKLILNYIENRNWDAFEKLVL SSVRTFRSTSHRISEIDELNGMSLIHCCVHYDPPLKLLKKMVKATSSSLSKQDVLGRTPL HIAVGSGASEESVRFLVASYPRACGCQDIDGRTPLHFACDSSSQLFEDDDPVRRDPPSYN LVQTLLSGCWDSAALEDNRGVNAIEYALVSKAHMRTIRLLQKATLRQVLREKSKQDTRRV HVHRSTIHGPTSLAARSA >gnl|To_NUC_proteinmodels_ML|p267 MVSRALQRSEPAYSGARFDAKGMCIFHSDVRLCKVKDDGKFQIVKKTCFKCGTAGLIVKD CHKAKTRVHGYKAKGVPKRQQLADKLQGLSSRNLDPKTDEQRPLESKDSKGSSKGRSRRQ KKTQTRGEQRPPTSRSRTLSPLRPKAQRSPKPTSTGRGRRREMYDPIEYPIPPLPLADTA RTGIAIAFTPTSRKHGNTGACVYAKSNKIQRDDRRTDLPGSTSREIKVEKISNGFGFGHC SIHPHVRLAIIRSTRGDGNWDIRKESCPLCPATRGKSETFGDNAPPSYLPTTSESSSKPS GTTFVRCSSGPSTTEPLALAPQSPNDVTDVGTCLALVPLRNSNTRIANRKKNKERAEREN APWPVPKHVLEALSLGDY >gnl|To_NUC_proteinmodels_ML|p268 MKKTVKRMDLLTELDTHNEFFDSLVNMIPVKLYVSGASGDDAYNPKYLKGQHKESKEARK ARNKLAKNAKFAPDKESTIEMKKRIQREQDEISDSSENEIDDIVMSDGDGEEEQAEVRST KKYPVAPPDASSEPSYASRIEALRAKLHKKMAEKRSAAGINPDDTEGADVDTSAPQLVSK RAARRAEKRKRKEAAMQRNKRKTTSTAESKKTEQKRVVNVGGSALNAPTNTKSTNHSPAD DLATIDYQSIAGLKPKLESGNLDNKSLGGGKKKSLEKLLADAERKQQRLKELKESAKEED REKARNIEWGETLKVAGGTHFAKTNDPKLIKKAMKRKAKKKAASAKAWGARLDQDKDAAS KKQQIRSHNLDSRKQGGTTGANLSSKRIVEQDDKEKKKERRRMGPHSGKNRAGFEGKKQA KPIMMAESPTGDEEAQQPAASRAVAVSQQSNGTFLRKKALLGSFLFIAAVLAIALGVAFG GAWQRRRYFVSTDGADDPDVVDSVTNLDAQPDVGDDGDEDNTGTDEPDVIIDSVTNLDVQ PDVGDDGDGDNTGADEPDDVIDSLENVDGTGGQPPPEVSEAVDPPPLDTEVPPVVQAKFL RSSGPLESRVRIASPDIANGYSSCDDLKDDIVEALKFYANSIIMSEQSNDWYEKCDPKNS NYWWGGEPEYALAAESTSALDSAAAPPALESISAPQAEVSGKVDSPESTESSYVPEPLPE EVVFENVRRRKTHMEHHRHLHPPRHSRQALSIAPGWHYNPCTKPRILSLLLHGKRLTTIV SENRYRGLDYTYEAKIIDDYSTVYARVYDVSDPQDAKPLTLLSETVIKGSYNSARSINSK GFVFSTSYINTWPFSEGIYRSHPDYCGMNSSQYVTVAAEKALNKTDDFAHQLLDELQLQL EGTCEDLFQIAALQSGDNGDDNIGGQVLENMVLISSFDMAADEVETHTSGAFSSGYLSSV YASQDFAAAMNVGSNYNALTNSWDQSTFILGFDLSGELPMPFAFAEVPGSPLNQYSSDYY DGNFRIVTTQTQWSWDSVDSESRTTNILYVLGVPGDQEGAEMPVIGKTTHLGKPNESVFA VRLIQDKAYVVTFERTDPFYIIDLSNATKPEMIGELQIPGFSSYLHPINISGVPLMLGIG EDVSEDGRRIGVKVSMFDVSNSSNPVENATFVDKGAYSAAGSDFYGFRYLPSSQKLIIPK SKYTTSSEGNFDGFVVYDIDLGDISEAYEIEHASSRAIWSGCWYDASMPARSLVFDGKIT TVLSHSVIGTDLTTGEKEWNVSLAEGVNKTQCHPYFM >gnl|To_NUC_proteinmodels_ML|p269 XARYECKVPACEVDAGQPLLSEACGSGSLGFASLNDETEALCEHQHCVGTLGGDAPCSCL LEQWEYNFGSKARGQSECCAARTGTSSEANFQDVQAASCECTLRFECNDLDIGEKCQAYS EYCCEDDECRCDFDTRACRLALENNDENAAEQCAKAEESCCTGGATFFSYGGCKCDFWEV SADWFVLLMFISPHTHLISIQPLCHENPIAATCDQTEAACCNNDHCTCDFLTYAADELNY IRQDKNVKCKLHILLNLRLCMWRSSSLTPGFTANSQVSDLDQELKALQGLYEGTGGESWS VKDGWLTDSDYCNWHGIKCNEEGFVSAVELQRNNLVGKFPSDSLSKLKNLETLDVSENKL TGSMDIVTIVDGTTTYIGGGYCRSADADYYTYFEHKDSVAANAGVVTAEFCDNLCRGDTI STAKSNLVGFEIANYGVDNFIQDFKEGSVAGTDCTCLYSYGEFPLPLKDGVTVSKDYTAY GPVKSSDDYRNAVCYGVTPQYTYVDSSAFLDLRKLTEVNISLNSLSGPMDLLFAPALISI DASRNNFTNVKPFKKFKPSHSSLRSFDLSSNVISQDLAVLFIDKPQNIEEIDLSGNAISG SIQTSEQFDFVRRLAIGSNRLSGALPNLQSKFPNLVDLDLSSNMITGSINGGLTNHLILK TLDLSGNMLTSFDETAVLSNLLQLNHMDLSNNELGPVIPREIGKLKDKLTVLDLSNNNFV STIPSEIGSLQSDATVRLAGNQFDSTFLAPLSVCSIGHLPDSVGYFDLSDQPKYCPPERV ALADFYNEAKGLEWTESTRKVLDIAAKQLLDSPESPTPYRTIAWLDEYESHCLWKGITCE DGYTVKIELNANGLSGRLSSSIGKLQNLRELDVSDNEIKDTIPSEIGLLEQLEYVKLSFN RFQREMPPEVENLKKLKLFQAHGNKITGTVTLSEGLVPDKFSESSFVTDCGSPSLFGDDP GPVDCASCTMCCNEFEECFPTEKNALLEWKDYEDFGFILLGIVLASMFAVNGVSSLWDHR KKSVRDRSARTLTTEYIRQKSQADAKYAIKSMGEGSVYSFFLTNNILAWFFALATVALQF WLLQPFIIAAEFDLSDDVKDLVYTWKCDRVSDKCEDLGDLTWQGWFLFGLLMTAYMGKDI VCGAKMIILSGKSRHGVSQRIRFFVGGSLLGCVSVYIFYASTIYNISIATTNTELIYNAV IILFIMEMDERVFDTIDAMNQKWVDRVTKRKELDEITMSGRPSIAGKSKARPAKDKLNDD DSENDKVYEEMYELKRKLFEMEKSMNSVKEHCGMSTSTSSSTGTTVDTNLHSSDSLSLSQ VTAAEINSSGTTVKEKLGESACSASPLGDAAEQSEVEDCGKTLPQDHGTDQKGPQVTELS QSEVTGSSESPPPKQPVEGILRDTSDTPDDQRNGSQVTFVLDPK >gnl|To_NUC_proteinmodels_ML|p270 MSNFEQSFTHYLSLFDGQPTAWSSEIEKAFDAWIHESYLEADGDEQLTKAHLRLIQANAL EEGTTSEIIHVKVDTRQESSADVKFRLKSSFFDLTVHLNVTLKDEKMFRAKSTAEGADEL HAMRRVMLNFYQVKKKLLKRAGLYDGNPKSFTPEIAQIFDDTYHDDFLFCIGDRKIRKEE IMATAKLLMAKSTKATVQICQPIDNTHMELKARVQLGEDLSFQEHLIVTVEGGKLIHSEP YDTDAQNELNKATVRIREYLGLAKMSEFSKELPVLEIAQ >gnl|To_NUC_proteinmodels_ML|p271 MCITSTARRRSLTLQLMPSHFLSSTAEAAEAAEAPPSSIDSACIQQPIAQHKPDLIIHHD YHDHANDATIDASTIKRPAKGGVVTPFPVKLHLMLSSVEKNGFDDIIGWQPHGRAFIVRD IKRFTSEIMPEYFKQTKIASFQRQLNLYGFQRITAGRDKGAYYHEYFLKGEKFLSFCYDM KRRRVKGTRIRLPTNPDEEPNFYLLPPITETHLSRLASCSPNSIDDLTFTFGKPFHYMNS AAALPSLSTLPQLVVAARPSFGSSSVVSDERSAPEFGLEAHTSSPEIQSAMRQDSFKTIN TESKSYDDASIDEFFDHLSMPIEEYHSKINRMYEGNDEQSYGHLIEQCIE >gnl|To_NUC_proteinmodels_ML|p272 MQRIKALADNSGLDRGGCLSLLEEFRTACGREECSDPNEFSLQISVNEVCNAVDTRTKLK METVGRIFEFRNFAVRSDDQNHMKSPDDHIHHITDKRTVASPLDVKHAVRCCRDDEVKDD DWIAPPKQGETCSRIKTLKQQWAGDTGLSAQGEVIGAHYIQRFTTYPGRLCNVTSTGTDW SEVRYFNQTPYIFLFRDAHPLLSSLAVLVAQVPGIHYCARLCREEKSCASFKYQDYGSYR RCYLLENKDCPSYTDTVPYPPGYITWAFNKHYRVPNGYKLYQKNLGCGTGNLIFREYHYR GGLISPQICANVCNERDDCASFAYQSNWGASRGNCFFYRHDVCGKVSQTIPTQDQNRDWY LKRDDTEGSKDQCLTMNYDDAVVGSLLLPCKHNSPCKHNSKHSFLTRFRCPKDYCEDLGG RLCTREELENECASYTGCGFDHFHVWSSTEASPEVCNDNSNEYQAVSYATCGEFVTEMEE LYRGKPPDDRFENQKNPLRKNGNLIARSNFENVHFPTSFNFEGPDERGMVRIVEQTPYLY EFAEILSSSSEDLFCSAALGPPSTLMQACDASMDFLLGPEASIGLLSSTAKAVNDFSNQK MLSPSVTIERIRIETGSAGFLRTLADIQVFDRRGVNVATASNQVSIVPIDDSRGSGQEVV FTTVGQKVDKVVVSLYDQSARERLSGATITLLSTKPPAKYHPETVRMGGACRRDGNMYRE VIHDLNGHSSDWDEIRDTCESICEPDIDDEYRGFSIKNVNNPRCACMFDRVPTAVGEVNS DHGNNQWRCYPYSSLFDPSGFYVVHQYNVGDLLLHKEEKVVVYDFLPHPRWVDEYLGMKH QPVIDFENEWQDFYINITDTRAGPDWLDDYRRIKDTAPFQFPKTRGNPSSERIYCRDPIV NPNDKHSPIATCPEHWLGSILNDSDPYIDEFWSDWNNLKPADTNCTIERCKDHWLEVPSN PFQDDNSCFDFFKDCIELAPVRYQCHKFWTHCRDELFKLESYSRDQPYPKEYCEEDQTLT CKLSWLIEEPTPNVASPVSPVCQDFKNYCRTTLEETYPGLGLDYHYPEKYCDGDRDLTCK EEWLDQYANEVPNPISSREDFGCHRFWNYCETQLKTTHVDFVWPLYDEPTVDDYMPGYCC LDTPADTAFKKEGEEGPDFWGQYYTPSRSKRGMACQNRGMWHMGMSEEACSEGNGRWYRT PCVTLYKCIESRPHPGDRGYRDSFEDWVIDNELVIYDPSDLDQCEETRAALGFDENHLDD NAVCDYFEGYMCDEEFFDDLEFLADGNKAAADKGFEQVTYIPIEYPPDAPLVFEKLDDRE STVTAIEKPTDFLDALKRSRHHLVWSTDKMETAQEYLGRFRVWQKLNFDTCKEVKENHAF LNNMLVNAFANQCNLIPDTVFGNSNPASMACDAKMDFLEDKVVLATRQIERTCDRVALAY NTAISLAMIFAESTIIGAQISHAVVDTIIFEIELGEKTTENVFNFTKKTHDNLISHDKWV TDSLGKINKNIMTQHTQMRGELQDRHQDITNDVNQHTTCMSNYLGIQIIKALGGKVKKLS PKCSRLLGVSGRRRLNEEDNEGEYPFVIDVLGWTEGSMIKQLENIIQEVDETEELSKEQI MQGTDISKELQEIKAVLGIQPPPRSPPPTMPPVFPPTAPTTQASCGNGICGFDESDLDCQ SDCASIQLDTTSSRSSSTTSRSIRFVAMAKRPVSINSISFYTRDDVESDITVKTLEGKYD MGIFSHWATMNWVEIYRKRTLTHGTDRMVRTNFDRPVKIASGKYQSFEIHSEPSQVLVHV DVLPEGVVAEQDYSIELFSGSTGSGSAAVFNGVLTYDGVGKIRAKASKAKTAKAKTAKIE SVTSMAVPDEPNAIEVLKNEIRDVNQKLNDATKKMDDSDLKLESKLGSMESKLGTMESKL GTMEGKLGSMENMMAHLVDLLEVSFNSKAGED >gnl|To_NUC_proteinmodels_ML|p273 MAVGGGPPPIFEGNTTLFAPSNDAFAALPSGTLEALLADPDTLNQVLMHHNVPGILNSSD LVPGETLTTAIGDELTVSVGTDTSMCPFTPCGEEEFCNYDLIESFCELCDDVYPNCQFVS EVQGQQECERKCGDSERRLSEPTKTFASTVLTTGIIHGINQVLIPDTHDMISTLEEQGEF TTLLSLIELAELGDAIMTTTPITLFAPTDSAFAKLPKDVVDAAVDPLNRELLVDVLLTHV IGTVVSSSTLGQIPMLPSLSGSQLFLDRDTITVVDVQSNTSGMVVAPFDTFATNGLIHAI DEVLVLSKPPPPSTNSTNNIMENLEEAGDYSTLILLLTFAGLDSVIAEHDGLTLFAPTDD ALQAMPGGLLAYLMFIEPDLLTDVLLYHAADGVVLSSDLEDGMLVPTLFEGEDGFEDVVI GVGGGEVTVNSASVVEADVICSNGVFHGIGTALNPLKWNTTLLGVLGPEYSTLVQALMAV GGGPPPIFEGNTTLFAPSNDAFAALPSGTLEALLADPDTLNQVLMHHNVPGILNSSNLVP GETLTTAIGDELTVSVGTDTSMCPFTPCGEEEFCNYDLIESFCELCDDVYPDCQFVSEVQ GQQECEIGVRDVRR >gnl|To_NUC_proteinmodels_ML|p274 MATEVTSHTRTCSRKRDVVKREFEAIVGGALLHTVGKAAESHALRKSRLVPKSALLPKCL YTFYYHEREAEPIDGKIVPKKDQPTLIFFHGISQKSEDFAPFINSLKVPPHVRILVPEQS AHGRDIQRARLDAQNYVQPTQRSMLESTVEFLDAVECGPNTNAFGISLGGGVCYYVAHAR PDIIKRSVLVSPAIVPCVSRDLVRSILEGTNNFFCYESREDVKLLMRDLSTGRDDKSRRK LDPVPTFFLETIYRYSQKIAPEGHNKAMLESLLVSAGYNESGEFDPDQNCSKSDEEVDPF SASADIDPQAQRLVIWPEKDQIINFEQGKKFFGVKEEGDRRFTSDGAETQFFSIPDCGHV FDANGKGIFDIIKPRVQEYILDFE >gnl|To_NUC_proteinmodels_ML|p275 MVITKLSPLDSSQPILCTNIMAPKNDNPLEKLVERRQSRPTRDLSLISSINAVYNGRANS KRVGFLPVEITIDADVDVDNDATRRKAAAARRKRDLRRTSTDVTDFTDVSEMSVPSRRNS QGTVESNFSASLPNLRDTFRRDESYLIGGSFRGGEEDFTASDGALFGEGRKEKQVRRRSD AMVRCQEVLEWVKAKTDKNRRKVHFC >gnl|To_NUC_proteinmodels_ML|p276 XMNHRFGEDKKNHRLVARARQFSAFILVIGTMADGTTLDPKDAIIVQDKDELLIPLLLEE IPTAREFKDAIKSLSPEQQRFAQAYRSMQLSSSVFGVCVVQIKPQLESLLGLPADALDKE MKLTRDLMELFIEYQVPSDLLSFNGHSESAALEDKIANVTANVKAVVDVIESQKEKQLKD ERAKADMAMEAMLQRREQSEPHLLERRLRRARESGHDDAYMDDLIQRREELTMSERVVQY ERGNDGSNRRGSARKRKTAEYQSIPLQHPIPYQSSDGGFGLGIATPPSMNSMDLGSETMA LSCMMEDCGDDLNFEFGCADEEDLCFSRDEFADATSTNEGGSPAGHEGAADRIENKSPRN DVDGQGTTSGRGDPGDSITQQEGVDFTLIPKQLDQSVERKVDAASLRSTTIKTSSNWTRN RQTNLLSKPERHSLDTDGIRREKSKAFDLLDALSRSGSLAIAHSELHVVVAMTHCFEKDL MGTVIRDNVNPIEKIECSTLLLASAVHGVSARELVCDANELRRLRMTTPRLLEG >gnl|To_NUC_proteinmodels_ML|p277 XMIDQIPELYQASSTEGARRGISEDDMMDLSYPTLRKSLEPVESPISMMLMEDPSASAES DDLLDGSRRLSAINRLCSSLAPLSPFGRRQSQKLAGPPVSSDVDCHLASVVGDSESTLPC PSSVSPTPSPSPNFSPSSPTSPQSLPTLPNLEFKISSSELDPSQFQLEIPDRESVLLCAR LCQFLRSNKNANPFDFDQNLRTCTGGIEIVKERFLTHSVGDTSIHVLVVSSEKLRQVFVC CSVDGDEQLAKKSLRYASSNELPGKGNIQALAAVVETSICSGLEERIFSELDRLDRPFFD IVFTGHSIGACISTLESARYADIHAELRISSQVFGCPRIGGDDLRAYVHALPNLRVFRVE SGHDCTVQLPAGREWMSVGHCIQIGSNSECKAYRFDKVVIPKAKIMLRTPLEISRGKAEH KIDSYVGNLIKVEKWAKDYSCTHGKGVVVDNEKREMA >gnl|To_NUC_proteinmodels_ML|p278 XMLSIMTSTTFADPDEHVLNAAAAAPVDVAPPPFAESVSGSDDSSNVSSLAGTAERGGHV VGAAPKKGPSTVSSVDRSSTESSRVESAKSRPHHDPERRRSHPLEKNKKSTDLVGSDHRT ASFIEFSRPGLGPRQGSSIRFHESVNLAPASSTNNFMDKLPLPNAVAVEDDGREEWLSVG TGELVANRKPRLPSLIKTPKYTSSLPPPPPAAKVDEDFDETLPATKRPSIRAHPIGGYNP RGTVSTRCLESWDHTGKGYLDEIEYNMRKRDVDGDGTLSKAEIKAIVIDLLADEKESRLY CKASCWLLLMVALLAISNLGTSLAAAVLAKDTQADVQSGAILSKATDKIMAMQVISYTYE LEELSDDEFEERRALVEREMESDPEHEDHLHRRLGRRNGRNRKIAYDQGKIRERDLHDIA VQCDGVNTVSIARRWQNSFGGGDYSLTRDVDALCGPGTVVRRKGKKVKKTKKNKSEVVRA KSKKKLRRVDDLDDGMVVVEEQVTFKQPDGREVSFKCGRGGSCYGSGRTLLQDEAHPCHI PRDYAGASECDEGLVCRDPLDHRARRGTGECTRLRSNALVGMACDVALGVDACESGYTCH ASDEAYAAAARFGRTKIVARKQKVGVVAGTCVGIEQRADLADTCDASLGDLACDAGYHCL GSNGVDLGGKGYGVCTNIPKNVRAGGVCNRSLGFDACEENYYCGEGGVGRSGRGLEVHVA DEEVESKEARDAGQQQRDLLGLFAGGGGGVMTSRSTGYGKCTRAVGVGGKCFDHESCGWN RNKGKPYEPYQCVGLNMMDFGAVTLDGPADGDRGKFGFCADISK >gnl|To_NUC_proteinmodels_ML|p279 MFKSLQLPAIACALVALSSAAAQSDEYTKIDNHPCLRDYNGITKSMFDLADANPTLATVT DIGDTYLKLHPGSAITEMSDIPEDGFDIWGFNVTNSDSSHTSDEKAKVLIISGVHSREYA TSELNMRFAEKLLDGFGDNSDINWILNHTEVHFIFTANPDGRYMAESDVELWWRKNANFE NGSTCDHPEDNPGVDMNRNYPFAWGREDGASADPCMDDFFGSAPGSEPEVQAVIAYAKSI FPEHQFLDDPENSIDMERGENTTGVFIDIHSSGRMIYYPWGFVDQIAPDDDALQAFGRKL AKADDPSRDHVLWAPEQPDFMYPASGDATDYMYGALGVASFGFEIGQNFDEDCPVFEDEV VADNIPALMYAVKAASLPLFYSQGPDILSLSVHVDEEYEESSNEIHISVKASDSELVNIE NYPTFHTGDQAVNGVTLFLDVHPENHNGAGHTWELAFVDTKDGIDTFGKVVRMPSDVSPG RHIVYAQAVDSDGYLGPVTSSWIDVPEITNPNQLLDGDGMINPCEVTIGRSTNSPLMVDC DKIGKQEEPRKEIKPLIEEIDNGTRSKALSLASLAATAIVGLVL >gnl|To_NUC_proteinmodels_ML|p280 MRSSHLCCLLAALRAMAVCTSLVHAFQAPMRSIPARNTLHASAASEGDSIDKNPRPTNYE DEFPLPDFEVPELRQLRWEREALAKGKYASGDALYELRRTIAYLRNELIDTRQQQQIVSI SNPNNKRMKHRISELEKELLELNGRDAEFVAAVSRELLAKAKSAGDEGLAKKYSAQLDEA RSQIPQFNLHGLWVGKYGENGYEMINVTYSDDTLIATKLTGDNNVPKGEISFTVDLAHST AALDPIELNSKAAKQWGKSFLPRFVGKGQVAAEGFVNPTWLEGQLILVGRFFSFAWLPLG HQVFFGRPSSELTLKMLRESKSNELRSDHVAVMRDAMEEAWEESYWIEREADNFFDEDGS FE >gnl|To_NUC_proteinmodels_ML|p281 MKITILPFFISLFINCQSSVVNFNHLSFQRKVNDALTAVGRTLELERSPRLAEDVDHTYG AKYELVDITTNAAIIAFMTCLEKLGLNGLILGSIVEGAGNKPLTLRFDVSTTPTFMKEVK VKVPMDRSYEETEETAGRTVTNVVKTKTLKAVRRVTEFHYDVKIEWSLSVYSGTSVDERT VLMSNKSSSVIVKQAKESSSLRISPQSYELPLTWLLKQVNLDDMASQFKVDTSRESTKTP RRNDDVDEAIEFSDRLSSWIGQIHTGLYGSLAQIEGQHDPALEKDRRGSNMSDVRSLEDP VFNPILPLMEDRPERSSETEDEVVPGCMIKIEAQSHDDSTDRNVLLSSSDLTKLTNEHAR SLQGVLETCAGAFDASEYIVTTTAEIELGAVLRNLRSLSIQYEETMNYVESMMERQLVAA IGKRLTQDDLQTYVRYHDARLLSPAPKPFSLSIRRPEHYTPPVNVGSTLNIALNSATLVQ LSGNQYLHGWMNHRFGADKKNHQLVARARQFSAFILVIGTMTDGTTLNPEDAIIVQDKDE LLIPLLLEEIPTAREFKDAIKSLSPEQQRFAQAYRSMQLSSSVFGVCVVQIKPQLEALLG LPADALDKEMKLTQDLMELFIEYQVPSDLLSYNGHSESAALEDKIANVKVNVKAVMDVIE YQKEKQLKDERAKADMATEAMLQERQRRAQIDGIGMVMMQQRAQIDPIDIHQFTRPLKQG AQIDPVNVQYHRRRGSASCGSFSLSNRRIAREQTSAHQSLYTPTGSFIGTVQSPQLHDEY SYMAAREAPPTAALRSAPVVASIHRSSSCDWSVGVYSSEVCGADGDDLCFSGLVADNEVA SAESTDEGGRPAGRKGAADQTENESPNDVDGQGATPGRSDLGDSITQEGIDFTLIPKQLD QSVEGSGDAASLRSTTIKTSKSWTRNRQANLLSKPEKQGLVADDIRKEKGKAFDLLDALS RSGSLAVAHSELHVVVAMTHCFERDLMGTVIRDNVNPIEKLECSTLLLASAVHGVSAGEL VPDANELRRLRTKKPRLMEG >gnl|To_NUC_proteinmodels_ML|p282 MGYLSKALLLAAATAAETRSLRGRPQHHHHQSAPTVQREAIPSRNRIIGGDVSVEGRYPY AVSIQDDISHYCGGSLVAPDIVLSAAHCKTDDATEGVKVVVGRHNLEDEGDGEVIGASLE LSHPDYDFQTTDNDYMIVVLERAVEADVDLVQISPDVVPVDAAVTVMGWGDTNIDLDISE LAIELMETEVRTISNEECDASNGTLTDSLGSYDENYHQQITGNMLCAENEHRATDSCQGD SGGPLVIRTAVGEPDLQVGVVSWGVGCAHDAFPGVYARVSEKYEWIRAEVCNRSADPPAY FNCGELRTQETSPSASAWEEIVDEEFLTGLGIFNSHGNGDSSRHYPAAHNRAGVVRVASG GGTASLASRSVSLADTSYGKLRVSLSFYAMGTEHADDLCLDYELDNGAITGERCWSSLHA FQNNQWTDLDMEFNALPGARNISLRFRLKGDDSDDGVLVDRVTIEGRA >gnl|To_NUC_proteinmodels_ML|p283 MFTNQDVDYSLNAPCRCLGSVAPPQKSSAKIDADDGLGDEGGGEADIADGWGNDYDDDDL ASSQANQRFLVGTAIPSVSKISDGDELVEEADEDRVNRLHLLKYHEDSHELLLESSWAHP TGEVWAMSCHPTRPDWVVTCGGGSVFASSAIEEGCGDENDDAAASSDPAAAALTQFKTTL WRIPDAGGEESGGGLGFDDEDKSASDTLEAVVSIPHGTTESASASGWEQRVSQVLWNPLL LSASSGDALYDLAETSSDDGGGNLITVGWGKDSPIMLWDISALSEVKEVWSARGGLMVPR KRSHLSALSSALPRKVSWDPHDVCNILATSGVDVVAYDMRTASSGGRLAIRSAHRHGVAD VCHNPLLMGVVVTSGLDGIIKVWDIRGISGEVQSQKSARTNPPVLKVLRGGHTHFASNVR YNPFHDQLILSSGTEGNANLWRVSSCSSAPLLDMAMDEDDGDGDGAERSLYGDEGEEDET TQEVADENRDANAPSENEETSQKEMKGASSSQDIRVSRLECSDAVSRVVWSSNDPWMYMT LSCDGTVTAQFVPSKEKYRILL >gnl|To_NUC_proteinmodels_ML|p284 MFFFSSPSTASRIEAFRAAEQQLLGYAKTRFGHRNQDPSRHEFELFDTPISKPEVIKKSS AKCQIHESSDTDEHSIHGVKVTNTQLGDGGEGGYPAPLVLLHGYANGSLYFYRNLMGLSQ HFGRVYALDMMGWGLSSRPKFDLVTDESVFEGSVHETADDVDDDLKKSTRQKVAAAELFF VESLESWRKKHNLSKMTLAGHSMGGYMSVAYAERYPEHVECLILMSPVGVPVKRPEDDKR LKSLPFYLRGMVSTARYLFNSGITPGAFLRSLPLSRSKAMVDGYIENRLPAITCEDEREH LSEYLYQNSMMPSSGESCLSAVLEAGAFARIPLVNRIPNIKPGLEVHFIYGENDWMGYQG GLDTQRLCLKKPDDSPTVFVHGVRNASHLLMLDNYEELNAALIIAANGEHRLTRDTPRPI EFACNELVGYRSMHKSYRGAIGETHAAAFFRGGRFNIREQNEVDEEEKAEENLS >gnl|To_NUC_proteinmodels_ML|p285 MDRCLVASPSPRETATPPSSSEYTIMLPRLPTYEIVRISEEAGDEEENVRLWAQPQTMDQ RDDMNGRTTARNQMDASVGTGPTGGEPDGWPYAMAPASAELPATQRVPSTVVGNTLPLPV LSVPEGDRVQQTSGKFISPSPGNTPQMVQTLAALYQSSRRSRNNTPVTGVKPKAPIVLPP PPPLYQKNGSDDDNRQTMITSTAGKGSNDSSIMTGSIGSPAFGLDYSTRSMMSTANAIMT LSMQTQRPEPPLTTHVAAQPSLSEDCIIEDIEKSRVGEYGHIHRDAALTVPSLSSGSAAT DLSEILTRNKLEGANGASLKRSPPLELNTEMNSEPPRKKPSILPGDTTPSMSIKSGVSFD VIAYRSSRLSALTTESGSQIPEDLRDENHEGTSQDIEFQMPLKGEDTLDETEREREERII RRKIQRLLLIKHCSTCRHQGTPVNEIHIPQSSAPSICKPCDETEPDYTSSMLDVCPVTSR CAEGKALCAHIRTCKLPDCKYKWCITSREVLGHYMSCKDPTCRICEPVRHKGRRRKIDPP ENVDIMRRSSSLETNDLWIENEQTGDFSVGQLQLTRI >gnl|To_NUC_proteinmodels_ML|p286 MKSIAQLFHRGTGRNQLNLFLLDEDYDAAIAEIDSHPNEVTLWSTREGFFDGSTSAQVLP LHVAVSLRAPLYAIRAIVEAYPTALGVKETSFKRLPLHISTQFGCCFDVVQYLVEQYPAA ALEADTLGRLPIHYSCSNGAPLPIIRTLLQANSAATLYSDYRGWLPLHVAIHFGAETDVI RQFLVSCPAAACLKTKKGSTALSLAEKVSTKNNAEVLALIRSANQTAAKPSHDRRSPPPK RSMPAAA >gnl|To_NUC_proteinmodels_ML|p287 MVVDRKEAFATQSSDVKLSKIGALYDIDGDGKLDKEEQAMRDMDKSGRGFLTNDKVYTLM QEHFAMERKFMRSKRVIFGLVALVILLCLSNLGTSIAAAYLSKDTKTSAESNLIDKRTGE AVGTETIAESLELRRVTRPDSEGRRRLCTTDGGELDCDVEESTLLLDTRACRRVLRRCKK GQGVNLVLKLSSGMEDSTSICPLSNGQVSATRRSSFKNSNNDEIVITPVDDGCEIGGLVR DEGKFCDVSAECRGDLLCSSDSEEDVKMCQGRCGRLRFASFMRDVCQDECAQKKCTKATD EVGALIR >gnl|To_NUC_proteinmodels_ML|p288 MKFVTLFAALCTTAHAFAPVNDVSARLPRQNTKLQSKIAENVLDLIGGTPLVKLNKVTAG CGAEIVAKLESSNPANSVKDRIALSMINEAEARGDITPGKSTLVEPTSGNTGIGLAMVAA SKGYKLILTMPESMSMERRVLLKAFGAEVKLTPAAKGMGGAISKAEEVVDSLGSDGFLLQ QFNNPDNPKIHRETTGPEIWADTDGKVDILLGGVGTGGTITGCGQYLKPRNPDLKIVAVE PTESAVLSGGKPGPHKIQGIGAGFIPGNADTSLIDEVIQISGEDAMAMARRLATEEGIFC GISSGAAVCAAIEVGKRPESAGKRIVLVIPSFGERYLSTALFQNLWDEAAALKAE >gnl|To_NUC_proteinmodels_ML|p289 XFLCAAGSCIVRIWRGGDSGDRHGGITGIDTVVQSGDCVSKRRSSRTRESLKGMSHEEFP TLGTSRKRERSESPVAARQVSEEAIGSASGSGDESGGDERKGDEIIPTRNKTAMRKNAEQ RSSTQNPLAKDDSGRSSRRGGRSRKLNIDYRDRIYSKSVEPVNSQDTKSRETRRPSTPFE RPPGPYDAGGKNLKSDEMSDQDTPPGSEIHPSKLKVSTRILILYKSATLYKGTILNRRFT DKGDEYLVHYDGNKKSNQKWVAGTNIRGLIEDESGDEDGTGATGSSAPQIERSSAGSTKL TKVDSPNAKPVGTRGRGGVRPGAGRKYVNGKAVNTGYVPDEVSSSPSARRTRQMNSSVKS EVKSKKKSPRLKKNNPEFTDADGVAGTLRRKRSPGRSSREMSVPDAVRNLSKGEECDEDG NSAHPKKRRRLFQRRSHVEMLMADEFKESTRRPTRSESVDFSREMALAKRSASQIKRRRN NIDMLLADDAARRVSELSFSSSSKKDPKDGKERARASVAAKTKRKPKKCEGRPKMCDGEL PPLDEMPGSEVDTTGLSEDSRVLVLYRGVSLYRATVQGRRRRLINDSNADEEDEFLVHYD GNKKTNLKWVRAVLIRGLLDDDDKLIQEKPIRRISRATRAKPKANSPRKRSTERPLKSDD LVKKSPSSRGITQRTSGKWQVQVYYAGKSRYIGVFNSEHNAATAYKVVHDLCSTYEEFPS SVQVEQNVKAMRGAACAATNTDMPKYRSRLRVGLNDGPISLKGGGRERRARKIDVASSSD GSTSSPLLSDDDESIYEEQRIFFSPVKHQLERLEKMNDKKRKLREKDLFLDDDPPKKRGL GRVNGIHQSRKIRAPSDLDPRPKRSGPMIHQRKVAAPSDLDARSTNSAGPNLTISAPKKR GPGRPKKSLMAVETMGSKKRGPGRPRKSDQVHSRTEDDSDVQFEKVGVTERPSRQLLHFI RALSPSKQRSSIAKGLRVKVRFADNNWYGGIVASSSQQGSLINIHFDDGTMEECDFPDRD VIVDDVGNGRHRQRCIMNASTFAPPDESERQGDLKTPPETQKL >gnl|To_NUC_proteinmodels_ML|p290 MGDKNDTSGHNDASASVGRILDEEDEGEEQLSTVSLSDDFESDVRSPTSVAAELQPNLPA EQHQRPSSSAPAPPRWRFVAEAKKSAGYSSDAAGKYVRGLSSNELGAKYTAPDIPDVQLY RRSVDGVGPVKYTLPSKQQSNGLADVGKLMMSVVVEKDKEGQKDVGRLDINEDGLTVEMR PSGSAPKLPRPPKDTPFYRDPFKWFDSKYDVTPKGDRDTKWMGGTLIDKWRWNSDEKTGR TVRGVGRLRFKGTNLAEEEFIGEKTQVHKLQRETDGENAGREGVLLQGSSESESSIASGA VFLDMKSLDDERKKKARMACIVLLLLLGIIAAMAALMGSGRQDEAGPTKVAAVIVPPPIL ITNTSVPSLSPSASNYAGPVSLSPSATLTLRPSNAKETATPTSVPSLSPSSRPSSRPTST PSPGPTPSPTARCSEGKDFDLCLAIDMSGSVCNDGNGSDCGECRADFLPLLFNSQCRDSG LSEDICCRNFERIQNFAKLIVGRLDEFDAKKSFSTVVFASTAKTVGSLGSTSQTIDVLNS LDYSGGTTNHRDAINRCRQTLNSGNPSRKKFILVVTDGVSTAPDGVDPESAAEEAAMQAQ FLDDAFIIPVFISPFNDFDALSFMSRLSSDGQVFDVTDFESLASLEERLVEQVSCS >gnl|To_NUC_proteinmodels_ML|p291 MLFGSPTRLVRSSTHAFGRTPCRGRGRGVVGGGAASSTAPFQLHNPIHDCGERIRHQQRV CDAGSLDASQLGYGKVVALSSRSPASATVQTYLTDLRHSSPMRLVPSRRHRRNLQSNHGQ AALLHLSSYGGGLVPGDSLVLDVDVGPDASLVILTQGGQRIYRPGPGRTFKDYSYSTSND KSSFDEAPNRKDKLCRSAINCRVQRGGSLYYLPDPTVPYNESAFREERAFSCMEHGSIIA VDWYSSGRRFSTGMGNERWAFDYLATKTELSVDGNGDTRPLFVEALEFDGQDPRSNEAAF GENYESMASLLLHGPASSEVLIRAEQVSLKLASMKARVRLSDIDHADQEDGEITRLISSM GEHVLMSVTAIDSIRDGGTCSVVRILAGSNEDVYRLLHFCLKPCSSRLGGLEPYKERIHS SKTVRQKVGVDVQAIKNIPASEGRKDGLEPIVREHKHNVHQRMLAEELANRLIFGEKIPV FSRDAWFRLCTLSDSALPVGSFAHSQGIEAASQMGLFESDEDSPLALASFLRSLSRSSAQ FATPMILAGYSLLSSSARDNVDLSQIKNLWVELDARADTRLRSNGPSRRASMDQGLGLIR VSPSFNHQHCRWSELFEMIRSSIDTPSGNDASSFGHSALIYGILCASLRVSPIDSCRVYS FGATRDAMSAAVRLNLTGPLEGLAVMDEVGHKAVEEGIMHGMRGIEKYIGNGIGPKEISW VDCATTCAPMVDAAQPIHDELSMRLFRT >gnl|To_NUC_proteinmodels_ML|p292 MKKLRKATTALLAVPSRLATAGKRAPSPTSFETAITVEPVETPDGIEAIPVEMTGLLLSS PSGQSMGRTVRAVGRRTPTSGIKHAAPPGGAVVDSRDDDEGGRPTSPRGLGLAGRVRTLS RHRTFLVSLEDELTGGAAALSQESRTDPERSRTLSAHGDFLASLEEELTRDAAAMIRGAE LAAAGDVPSSTAVRGAVPEGGGVNEAVKCPSPIDAEDSGSLRLSSSWLNITREDSFVSAL TKRDGVFDSTIGFETIADPIDIIFDAAARALQCRNGAYDSTGDYSILFRHISMSSTE >gnl|To_NUC_proteinmodels_ML|p293 MAVPNLDTNEDDAAEPPSDLLPDNTEDKLLKKKILLDTILDKLRGSNDEVLKQLSQAIDS GDNLNADVLSGGYTNYSYKVYVEGNPDLCIFAKLCFEYALWNPDRNAFYDLKRTENEYEL MVKVSKVSPECVVAPLGCWDLVYEGEKMKILASVWSKADEQLANQFVDGSIDPRLAPKIA KTLALLNLESFDENFNEQVNGSILNLLDSSAAHIQDLCATTSPADRVEEYCANVGAELDA AYKEIRTNYLTKECLMHNDSHVFNILAEAKPSIEQLEEFGPEGQVCLVDWEMCTPGPIGR DIGLAMSFPFVCALTHALNGHLAPVDSIVSYINTLLDVYLKQMKDSGKSDDELASIIRTI FGQSGVFLYLCFYFLKLQDMFPSQSDDDKVMIQETAGVLGLKFIHFAFGDTSNATSGELR DKFMAFYEEEMARMNDLFKSRKQKMRTRKSSLFRAESRRFSDASMMVLGAESVRRLSISA SDRTIDRRSSMAKLAKDMSTRFEV >gnl|To_NUC_proteinmodels_ML|p294 MNKLIASLLLLTRASCIESGPTIATDKSEYTGGETIVVTITNYDDPTFLDWVGIWPAVYD AQQLPSPSSEWDWLTSSNTVTFVGDLCDGEYEVYLLQDISPPYSSLAAAAFTVSGSTRKD CGDGVCTIATSPYDDENVHMAPVDGTAVSKIAFSSCYAPSYQVSNALWKHMRNTFEADLW LWLGDNQYSDGRSLETKRERYNAARNDQFYREYGPVAEPKIPVTGTWDDHDYAANNQGNE YECKKLSQDEFVYHFNVPSDDPRHPDQGVNQQEGIYSAYMFSKANGEDTDGIHQINLDAR YHRSPTFENYGTCQGAATDMLGETQWAWLEQELSRPSEIKIVATGIQVLPPLNLGRDLGD YCAYDWNGNSFETAVAEVGESDFASGTSYETWAEIPSSRTRLLKLCQKSINDGNAKAIVF VSGDQHWAEIQAKKMPFDENAGAEQILYEVTASGIDQRWKVDVDNPNRVRVRSADSRGSG DFVYECNFPFEFDGVVYNDCTTVDHDRPWCSTGTYADDKHIHGSWGNCLPEEEELVPRSM QSYGKSRKCTDQYHHTCTAQANYGGIEVDWSTNRMKLAVYTPHHEGEVEASSIFVDFSAA PEPPVNPFE >gnl|To_NUC_proteinmodels_ML|p295 XEPSLLCSVGGLPSRAEPSPLLCRWPPSRAEPSPLLCRWPPSRAEPSPLLRRQPPARAEP SGQQPSSTVRTNLLFDQRRPANVLQX >gnl|To_NUC_proteinmodels_ML|p296 XMTASHKEELERVKSELESSSAEALETRINELTAQHASALDDLRAELKADAFANLESSLT VHRQKHQDEVASMKEQHSADLEHVRSDLAKTHREELERVKSELESAAAATLDRRLAEASQ QNEDEAVALRREMTAASEAKVDAGIARVTAELKQVHSSEVESIRASHADEIDKVKSDVMA TATASHSEQTARMAAMHKEELARLKSELESSLEELTQMTASHKEELERVKSELESSSAEA LETRINELTAQHASALDDLRAELKADAFANLESSLTVHRQKHQDEVASMKEQHSADLEHV >gnl|To_NUC_proteinmodels_ML|p297 MKFVSIIFGSRRTFFFFSAVAIAGAAAQESCNEDDHHPDNGGYFFVGVGGESDTTNLDSA HGQKLPNATWAPGYYPYGQPVVCDSEQDRYCRAPCDTEPGCEQYLSTHTCSVLLFCPPEG MMDLQAIYQLPDFAAIDSCDFSNADELGIMNGPGTEGDDGCWAYTFELDHELTKYFFASK EGCEQGQKLAVEVNDFDMTADSCITIGLTTPRIRNCDCNLQIKPSTLGEPCRTAFSDSCN DSLLIEGDCCEQENCLSIYESYDHPEGKAKEDERQSNCNDSMPGLCYNEDGMGTDTNMMG STNCCTQVCTGCGIIDSATAQWKECTSLDAEAGTASCGFLSRYAQEAHQCDYSLCKEGDH WHMDGDAYKTAFLGETEEDVKDPEQAEPSSGDTPLQPSVPEPSVDNIPDEPSASDPSSAI FLDGANVFVALLLMMSLVADV >gnl|To_NUC_proteinmodels_ML|p298 MSCDIGLYGLAVMGQNFALNMASKGFKVCVGNRSPSKVELTVNRAKEEGSLPLVGSSGPE DFVKQLSKPRKIVILVMAGKPVDETIANLSQYLDAGDVIVDGGNEWFHNTLRRAKELEPK GIHFVGMGISGGEEGARNGPSLMPGGPKAAYDLIEPIISKCAAQVNDGPCTGYLGPVGSG NYVKMIHNGVEYGDMQLIAEVYDVMKTVLKLSNDEMAKIFEEWNSTELESYLIEITSKIL AKKDDVTDTGYVVDYVLDKTGMKGTGRWTIQEAAERSIAAPTMAASLDTRYISARKDERV KASAILKGPEMTSAEIPVHKGQLLDDLKSALYAAKVCSYAQGLGIIRAASDDMKWNVDLS ECARLWKGGCIIRAALLQKIQDALARDKNLPNLILDPVVAGELNDRTAAWRRVVVTAVGY GVATPALAASLNYFDAYRRERLPANLTQSQRDFFGGHTYERTDREGVFHCLWDDSHKSIG DIKERTAGEV >gnl|To_NUC_proteinmodels_ML|p299 MVLDAGQVAARDKRAKKARKAEAAGEDAEASPPASVGRIESGGSTMSLALLHGPTLLPLS ELGRLEILRSDPRAESGIVCNKRKCEVSIKFAVYPAGDGGGAAVGTSPVPDSVDTRAAMS KEIRAGEGLALQEAAVARAMARVSTSDPPESAAAKTAAAARKDEEMVVTAFEVSGAIDYG KLIEKFGSRPLTPHLLRRLENVTVARGTVPRLHRFLRREIFFSHRDIERICELLEGWYGV SPPDEGEVAGNVVAPSEMAVPSGPCPIYLYTGRGPSSAAMHLGHLVPFLFTAWLQRAFQC PLVIQMTDDEKFIFKGEYDPERSGDGSTGPDPCRTGDNLDHFANLTMENARDIIACGFIK DKTFLFSDLDYVGRMYPNIVRIWKAVTVNQVNGVFGFDGMSNIGKAAFPAVQAAPSFGSS FPNVLGSEDDDAGREANLACLIPCAIDQDPYFRLTRDVAHKLVPRNHPLGGKPALIHSKF FPPLQGATGKMSSSDANSAIFLTDTPEDIERKIKVHAFSGGRETKKLQEELGADLDVDVS YQWLRFFLEDDDELARIGREYGSGKGEYWSTGAVKARLVEVLKEIVSEHLGRRSGVDDGV VREWMTERFLK >gnl|To_NUC_proteinmodels_ML|p300a1 XNLDLSCTNCGVEGLRAALRSSNISTLRLFGNKlgsdgldaaalllrgGHPSIVNLDLGG QNSEEKSVVALLESIAQKTGCGFESKLAVLEIGGNEFGDAAMEALDRLKKAHPGLDVAHD KPVRDGEEX >gnl|To_NUC_proteinmodels_ML|p300a2 XNDSERHRSSTRQRTRGLRLCSFLSLTGWLFTWPCRVIRVLHENVNFFLNFFLYRKRRHQ NLIYYVQHALNDX >gnl|To_NUC_proteinmodels_ML|p300b MPPLPIEEEVEEEVDVFVEDPDDATGPSEKPASQTEKAAQSQPSSALSRAASMALGIIVS NTIIR >gnl|To_NUC_proteinmodels_ML|p301 MSTSDIERLLQAIAISSGANGIIVAHQQRLEAFQSLEQFKSYPGRVTSCIELIGRRSILS VIGSDNVDVTVSAKLYALGIIEEFLKIGYSGLNESDRGQLRTSILLAARQLSVPSAGALP GDGSRRILAIKVASLLSELALREFPQRWQSFVSDMLSPVSNGGLWCEKGADAGDATIGAK ICLECMKLITEDCTDSDFNSKISTTRRNDILLGMNEMSSQILPPIFELLSKQFGDVVSSK ATLQQMNQYLASNGRTVAQMTQDEQVQYQHQLDRREAAGSLVVDILGTIEKFCGSMPLDW MFKVEDGKDFVAALLHLLQENVANIQVLAVACLQQLSMRKLDENQFFRLVSSLPPALFEA SNAAALRASERGVDPNSIDMLVEQLKFHRSLSKMGSTLVTAHLAHITADKNIASGKGPKF DAVSNYLRLLSEMMSHQSGVICGEQINTWVGLLRDPAIVRTKVLSPHLGRVLTAYMNHIV KVNWDDIYEQEHPYSALIEESWDDNDDYNEWLGNMRSKASQLFRAIANMEPEISVTIVHS KLRTMLNAHYNGEPRDRLNLANNELTVKSTACIQIEGATQPLDNILSGMPSWVIDNGSYD EKRMKIRSIVQPLLSELAKMIVSWTPSDLWLKFRRTTLLEALKHYWKYEPSTLPQGVDSI LTYLSTKDNPPREELSDDVISLRKKCGVSLVAVSKEVPHLLVPWLAQLSDSAKTLLSSGD LSPTNEMHLYEFLSCVATAVENPVDRSNFVADVVSNSIQKIASQSIQRSIQSPENFLAFL GIAQAGTDPSCVANPEFVRKVTADFSSLFSSLNQLLSVGKRCHEAARKRANGGLPVERLT DIDESAAQNFPDEGPVSISDLAMNDPFVPLWPKILPTLLQVLDVAFQVWHPECQAVLLRN STQRYVLAISDDEAFLATKQDSTVKGVFGKGGTAGSIVSGIDRRDLNLKPRWSGWFNELR NTCFQLLGLLCVQRVLYAPEMSSLYPRFAAVMTNPNHLQSLEHRHLTQYLKQFIEMMMMS CPATMYQTHLTAILGPVFEHLQLRFQYTWGPIIATGQVEQSRPITSDNCAQVADQLAGIG AKSWLKAYYQRSGLFVGDLDSVTAEAASEKARVELTRTFADMMQTTLALKGVWALVLANK AKEEASKKNVAKSTRGPKSRVVNDNKGPSNADGTKRTGRRRAFFLWTTIIRTDIHFSVAQ RHIDSRFLNRIDKLCHFLLLENEQIAGYLVLTVIQCLEYPDAYTCRRCTRVVHRILEAVA WVERYTEILGSRLFSVAIKTIVTEPKWAVGTEWDVINVIRDIYDRLVLGQYLLPGRFRFD VSHCDLVYNFLTTKLSGGQGPGLQQNKDPSNPTRFEQTKVVNNPLQGGGISVAPSDAPRR VLFEIGMSEQAIIELEQKLTTKRSAKDQKDALKDVLRVVAERIKHSVGSSLERASENEGL LHNTKSGVIEALPEKLVTHSMVKKREGALSSQPDKNEIAASHLFG >gnl|To_NUC_proteinmodels_ML|p302 MCRIYNNNKNNNMKAIPSTSIDYERTTQTNTLGYQAAVLRSLRAANVKFLRYAILDSFNT VRSKTVPIDHALRILKRGDDPFANPVSIAEVCYAGLPPSADVPVEGAGLTAKNVLALRPD FATLRVLPYSPSTAMIMCTSHDQRTGELSPLCNRGLLEKAMQRGRDELGVEFTVGAELEF QLYRLGSDGDELLPVDSSTFANSATLNEQEEFIADVYEKITQQDINVELIHSESAPGQLE VVLGYENALQIADNIFYAKETISAVAKQHGMKALFLPKTSMMTAGNGLHFHYGFREVGSS SNAFGDESTLTGISRRGGAFMEGILQSLPALLSFTLPTNNSFRRVGPGCWTGSTVSWSTE DKESPLRVCLDLNSGGVSNVELKLSDATANPYLALAMVLAAGSEGMRNDLTLRPMNEGGS HDALPESLTASLDALKSNNSLLDTLGGSLSKAYIAVRESEAATEQSLEEEVRNALMKS >gnl|To_NUC_proteinmodels_ML|p303 XFGVCRQCGLLSNRLVALSVVGEEAPRRGAGRKPFTVPLHCSDAEAGGCVARAGIQIRSG HYVSPLPNRGRVCLVILSISXLSVVGEEAPRRGAGRKPFTVPLHCSDAEAGGCVARAGIQ IRSGHYVSPLPNRGRVCL >gnl|To_NUC_proteinmodels_ML|p304 MTKSNNVSSGRRSANKKKRSPLFDRPQNRYEPTDTSSMTRDELSAWRKDQRKKRNRESAA RSRIVHRQKVQELEQEVVEYRIRYEQMEERMSQMQERIDTLTRSLSGRAFASSIESNPSS SVRVTVSPPPSPSCPPLPSIASPDHDAVKTGALSKPGESGFKRFPMNGSLERSEKEATDW NESSGSSEEEDSSDLLHEFASSDSASSDSSYSDEDCLDDDVLDALMELDHDDNFDVINLL >gnl|To_NUC_proteinmodels_ML|p305 MLEMMDSSYNDTDAGKQVDTVRMTMMRVIKQHENWSRDTKDSTNSSTRNTSLGVDNKPVL RSSVNSESHLSVESSGFGENLDAFLRLQLMHLSLASKKFTTSLERDKAYQQFSVLSTCGT SIVQGKADILGCGDRFLDPFVTKKSNLIDQFPGGLSESDIDADALAAFCFPNGIRLRLVP RRALDGARRDLAGWARKATATSSRIIMGNGIKWGRKASGDDALRHSTTGIKAAMDTVMTN AQRRLKSAGDIAFEGEDDSCGFDPLPQHVRNLGIEAYQAMIEAEEEGDVCIVEKSYVITG TNLQDQCIFFYALHNLIQMEREYRERQRASPNNQNGISDLSHDRHKILAALQSKLSLTPL QRRVAHPRDEMAHVMNPQRRRFVVDLTSMGFSKISVPLPLPEVSGQWGVSTLFLRIKDSG LIILLKLLLLERSVLVVGETSEEVTACTTALLELLDPYKWASAFMPLLPRDMLDFVSSPV PFIAGMIVENKQHLNSIIQDHGVKDAMLHGLSVVNLVSGKLIVTREQGTSDMLRRSFQTI PELSLYQKRLEEYCSSPTSNLRSFDAFFKHGASRKESLTILRIKRVIKKHMSQFTLGLTD KPDAWRQYGEFNEASGAFDFCPDKFIQPLKDRMIFQIQFQEMMAHTQLFVGYVEELQIAR EKRGKLLSGPAAKFIASWVELHWNAKSNGQSVEARISRFVRDSAQK >gnl|To_NUC_proteinmodels_ML|p306 MIVYTYLATMILSLLGSASSAYAFRTTERLPVRIESIPKSEIRHRLGLKETDSQQLVIGE ATHQRYGLFNEHEVLCPVSGTHRGVLCTARLDEGRIDLGNAESLANHGELVSQSILAASL ATSVRNAASSGSGMYYLPYIVTLVESFVADEVSDLNLERMTDGRLFNSHGLETNEILMDF DEDYSADMSRLNHDEVLLFSCNTAFDSVSASSDVYILPQGDSGIDKVWKFGVEFKVATIN LAGIQCFGRRDSTTPLRPDVWLKDDYY >gnl|To_NUC_proteinmodels_ML|p307 XRGDRSPSSRDGWAEMKAEKKAAFQERRERMRMCVCCTDATLEELLPSSSSEDGVDSSSE DGFGGFVGPDEDEEVEVAVLLSQANSQEASVQTQEAASSAGLAVVSLSLAAALLAIAAAM M >gnl|To_NUC_proteinmodels_ML|p308 MSSKVNFNHISFQRKVNDALTAVERTLELERSPRLAEEVDHTYGAKYELVDLTTNAAIIA FMTCLEKLGLNGLILGSIVEGAGNKPLTLRFDVSTTPTFMKEVKVKVPMDRSYEETEETA GGAATNVVKTKTLKAVRHVTEFHYNVKLEWSLSVYSGTSVDERTVLMSNKSSSVIVKQAK DSSGLRISPQSHELPLTWLLKQVNLDDMASQFKVDTSRESTKTPRRNDDVDKAIEFSDQL SSWIGQIRAGFYRNLAVIEGQHDPALEQDRRGGYLDFRYMSDSSNREDPVFNPILPLMED RPERSSETEDEVVPGCMIKIEAQSHEDSTDRNVLLSSSDLTKLTNEHARSLQGVLETRAG AFDASEYTVTTTAEIELGVVFRNLLSLSIQYEATMDYVESMMERQLVAAIGKRLTQDDLQ TYVRYHDARLLSPAPEPFSLSIRQPEHYPTGLISIETVDGKNDCIHTHSRQVNVGSTLNV ALNSATQVQLSGNQYLHGWMNHRFGVDKKNHQLVARARQFSAFILVIGTMTDGTTLDPKD AIIVQDKEIPTAKEFKDAIKSLSPEQQRFAQAYRSMQLSSSVFGMCVVQIKPQLEALLGI PANALDKEMKLTQDLMELFIEYQVPSDLLSYNGHSESAALEDKIGNVKANVKAVMDVIEY QKEKQLKDERARTDMDEMESMQRKEEIDSALFETHSDTDRQRKMEDYSSQPSLMPSRAPS YSPSRISPRPSRPIFSSYAESIPIEEGPGYKGATAQSVNKSPMNDIGGKQCDQGDSITPE GVDFTLIPKQLDQSVEREGDTASLRSTTIKTSSRWARNRQANLLSEPKRHVLNADEIRKE KSKAFDLLDALSRSGSLAIAYSELHVVVAMTHGFEKDLMGTVILDNVNPIEKLEGSTLLL ASAVHGVPAEELVLSE >gnl|To_NUC_proteinmodels_ML|p309 XKTKGWLEHRVRIGRRNRRINVSNEEAAREEYLLYVFSISLLYLCLFIDHSSGHAYDAEQ YFRFPPAX >gnl|To_NUC_proteinmodels_ML|p310 MEVQVYSGGTNVALQGNATMSSTRTPFNPAYSLDASTAIDGLTSNSYKFAHSNLEDGAYL EIDLGQVYNIESAKVFNRLDYRYRYRLSDSTVTLRSPSDEVLHTYEIGDASSLSTIDISA SDFDPFNRCGEIDICLNRCLNVYNSGLEIVDPNDIPALKQFRFPGHWDTNEARGGYCSLA CIGGIAHDAETGEDGRITSYNNGFCSASDPFSSCIDTCNSIETTVDSKRAICRSGCEFWR RTEFDPFSPFSLPTPEGDDTSFDVRGASFGTSDFRVAMDLTGMGRSISQDGRNYGALLTR TAGATAWDGWAGPEIFIYESGNIIFRVRSDEDMTFASVLPDPDSQITRHIVLSRTGNKLT IQVDDNAPMEKMITKFIPDDTKAIIEKHKIRFRGNSGNPRVQSLYMEVSNVMFQTCIGSE LEQQAINDYILCNDDTYCKASLGYGYPIGNWCIEQIAKPKPFNEHVENTNFGALFVRGGD FEHSHQGPSAFIFTNGNVLFRVHEDEPFIFENALPDLANSAVTHHLKFSRRGSVLLAEID GEKYQKNITKDIGAATAEQLKKAPLRFRGSHVNKEKLSLYVDVKNITLGVYQEGPAEGDE LAQGQRKLQTYLDVGQIQCARLKEMADPLPILAFHVVASTDYITELFNITDSWSRKAIKG RLFEFKPGQRLEPSTTLGSTRSSQRPRPRRSIRSPCDIQKLEELGSSSRGAPSLQVPSLH RSPIMQSRPSFVDQTPSTQSRLTLVEQFDECVDEIVSSGSSARRMGSLARAASHIQRLKV MAKARSDVSDMAERIHVSWAVYFNLFNGALKNCRERGYGQNSHRNWNFPQYEIVLNWQKK FWKDSLECDMVREAGKMLFPSRHEGENLPISRIPIKATLDQKLSNDKLVMAYCFSISNML SVSCPPVEFEQNGQLERDCRMKGTVYQYRDEPAILTVHETNMFTMKKVMSECEAIWAKIL DSVRLQFDIITSPFMIRQSRDGMLKYWEGKLEDLESEYVGSADPSGLDKGCLTKHSNHLL FCEDCFNANILLGYMELYGTYEVYEVEKTVEKWKTDMKLVQRGMLPFSQIMERAASDICI DFWVKYINSVMNKAGVTCGPSCPAEPQSDIGENLKEATARHREIGLGKRQGSPTVSTTIL TRVVNEFTPGRQQLRSRFPPAVVVTPNIACPCPGGPEVECDRNCHDCFAESNTINPFKPP GDWGLGFKIDLLDQLFPRRNIGDLTPFHKNKANDWNWRPAQNGSVWGFEIYVRYSFRIFP NKYAVWTSPFPHDGCGTLEGNCKERCHLYYRQEANRNSCSKACKEDICKKTNAKRYEDCT EMCRNDHMPCGLETPDCGEKRDSCYQGCQFWKEKQCIGTKNERKCDGAFSFVGHGDCLDD WGKRYGRITAQKESGNFFYLNECRDICLDKVGPGKLAGFEWTSFKVDSKLVGTCQCLCDG GTTTDDDGCISPENARYDYDGKGEIKSSVFFGNETRLAAHSAIEKDLAWEERSCYKLREP PPITYPESPKSRQVTCPPVPPPAVLPSTPSTAPLCMFDQDCCSKDTCIEGGCDKLGAGGS KENVPCGSCNGNMACLGVDPAFLSEGSCNGNYACQDATFLSVGKGSCIGIYACLFVTSKY IHEESCNGKSACTFAFQSIAEGSCNGERACGRLTRTKSNPNWKWEEVLGVSEYNIGIGSC NGPAACLETSFLSIGSGSCNGDKACKTTLFEILEDVAGDSGESLSMCQGDCDNDSDCEGD LICLERDSLLGCVGLPTNETNDLDYCDSTDAAIRDFLGANVTLGANVTFAHLGANVGAGS CVGYGACAGVGDTNGEDSLVTIGNCSCNYSAENKQESACYRLGEYKTLPTSVEDMAWPTS VGDMSCNAQTACRSQESYKFEVGDNICNEADSCIGRLGFPFDLPDPTKCSTTSLPDCVRI GVDDPSHVVPANVAGDFGVGDFSIEMDISGKDIPIVEVRTCLFLCHRKYYGSLFIRGGDF EHSHQGPSASIFTNGNVLFRVHKDEPLTIEKVLPDLANSTVTHHLKFSRRGSVLLAEIDG EKYEMNITKDISAATAEKLKKAPLRFRWKHDKHDNQERSLYMDVKNIIF >gnl|To_NUC_proteinmodels_ML|p311 MGCGGSAPAEAVSAPASNGAGNAIKAGSAYESRNIDQEMERAKAEEEGKIKMLLLGAGES GKSTIFKQMRLLYGTERSDDDLRMYGVVARSNIVVAVRKLCSHLRNLGLEEELDRESREN EELEEGDHSSMTCRQAYDELMAYLVDNTATASAQDPMADMGGKKDWVGQSPRAGLAANND AKQFLAHHESIRILWQSNTMKQVWAKRSAVNIIDSHKDYLNDISRIASPDYRPTTQDVLI ARVRTTQVVMEKYRIDNIDFEMYDVGGQRSERRKWIDCFDQVTAVIFVAALSEYDQTLAE AKRTNRMVEALELFRSVCNNRAFSNTSIMLFLNKKDIFAEKILYSDIAAQRPFCDYAGPT KDFDHGILYFIQKFKDCLIDDDFNDSFIHVTCATDTNNMEFVLDSTRTIIMTDX >gnl|To_NUC_proteinmodels_ML|p313 MSDVMRFGRVATLRTLLELVLQLWPEFVTSLKTEPNRRGVRLTQQKINRRLSLRFNWLVT FSSLALLWGISIYCMTSPEAAMVTLGEWYSDCILYFTTFFTFYIAWRYGHIKLGMSPPMF VTLWLWTLAHITTGPKNAEPEFSDMTYFAMLFSAGVGVGLFFFGVSEPLFHLTGNRYDNP GYHSEDEMAAWSLTISLYHWGIAAWSPYLVLAMSAGLASYSFGLPLTVRSSFYPLIGNYC WGWIGDVIDSWAIVMTVAGICTSLGLGAIQILYVTIVWIVTAFATLSVISGLGVGIKYLS QAGFLFGCLILFLCFTMEKSYYLLNVLVQSTGDYLQWCIFQVPFYTDAFAGLKEGEGRAS DGKSAPAAWSESPRLCVCTSFSALTRIIENLQWVGGRARISKNRKLRSVIVGCVICPTIY AILWFGTFGGIGIRQARQAAELQALGETHYGSSDYFQSSGSEKTCYDVPQSDVVVNGTTV FTNTLPGVTPVCLFDSNNSESAWFNVMNSFSFPNGDSSFAGFGPFLSGISIIALAIYFVT SSDSGSLVVDTLASNGSLKHHWSQRVFWAFTEGAVATGLLMAGGNDALTSLQAASIVFGL PFNFFLIFMCYGIIKMCRVLEARGSDEGMTDARVLLPDKDWTLHLCGGVFNVIEWVLSFG HISKELQGFGVVPDRREIAGFFFNLFLPFVSLHAIYAALDPGNNRHIYRVLVTSVYTLLF LAWIALFICGTINKGKYELDSFYSFYSLLI >gnl|To_NUC_proteinmodels_ML|p314 MIARRSEHGQNRRTTTLEVKEDKANLQRMRIAILTCGSEGDIRPYIALGRELQSPRHSVP PHEVHIVTHPNASSRVLEAGLQFADIGSDFVSALADSELGRAVSDAGVFGKMSATKAFFA ALIEDWFESGQRALKSIEPDLCVLGTFPMNIHSVLCSQVLKIPYCQIHLVPIVATSEHAP PVGYGDGQTSFNFMARIKWNLFFKIGYKLLYASTLSRLYKPLGVSISHSKVMTEWAGAPA VLGYSTTLSPRPSDYDETKVKIVGAILEGHSDEDVARFLETEDGRRIAEHLGAASKPFVY VGFGSMWDMLTKREKQRVLGEIIESTKVLKDVGRFIIQIDEESADEARKSNTFDSSDSIL ISKAPHSWLFPKMDVVVCHGGSGTTHRAILHCCATVVCPCKADSDQPFWAGCVERAKLGV RGPNMRNLSAARLAECVRQSLSPSVKAAVKTASEEMKKQDGARDGAKVILRDFSQ >gnl|To_NUC_proteinmodels_ML|p315 XRFYKAGESDPLTVLRLPPYLERLGVKKAYLGLDECNLETERSKIINALVESLPPSTTVV NDTVISEATQDSEGKMELLGRDGSSLGKFDVVVDASGVSSRLRRTRFTEDADSYYTGKCL LRSTLDNPRKTCSAELSSKLSGTTGFYGPSSDGLGTEEVMLQRFGVHSPKYTLSLTISIQ DPQRLVDQLKFEGIHGSTNDPEALQRVKDYVDKRLAAFPSGYREMFDGISSAHVTRVKMH PSYEEAMKSTTPGSDELPFLSIGDALHALPPWSGMSGNYSLQDANDLATALIEEHQKGDW SSESIAIKFRELEKDFSLRTEDRRKEVAWTMDDKDYMSSTPIKDFAMVGRITNKPWSWKD PLQVIAGFMRFVTFLNRFDNYGVGVL >gnl|To_NUC_proteinmodels_ML|p316 XVVAAKWGFKQLLWLAHYGAKVAHSVVSGKLYLQTRRGTSAARTFDGEQYTKATYNTLIQ HDQWNHDALGIINKNMINQHTQMRKELQNRHQDITNDVNQYTTCIANYLGVEIISRLPGT TVGPLSSLCAQLLGLNGRRLEDGAEAQNDRPFNIEVAWDEGSVVKQQQASMEMERDILEK EDSIMEAQRRIENRLGINCGNSRCEQEEANINCPQDCAEAEFIALSTNESMATGFTKLKF TVHAKRKVAISALSFFTKDALEESNVKIRTMQGIYPQTMFSLDHEWQDVLSTKVPTTEYN GGTVAPVKFEFNHGIQITVDGGKCQSFEVYSDSGIMVHVSGNMTETDMVYEDDALQIFTG RGIDDHSPKPIQFNGVIGYDGLARVPEISTSNSHGGIAADQTSAAELVLEQEMKSAKRKI TAIQRKVGQIDVLSTKLSALDDKISAKLSAQDDKVSAKLSALDVKMSAIEGMLAQLLEKN ER >gnl|To_NUC_proteinmodels_ML|p318 MMVIGTTLLYVTTLLLSAAAGVYSAEHYPVELGAKYIHDVAVSTREMSAFAPTLFAEVCA DEGTGNVLLSPLSMYQLLALLDLGTTPDSDTYKELGELLGSEEDRANVNSLRNVTAEGIK FDLATSVWADELKRSFIRKARKRQGAESFPLPSTYEPINEWVSDQTNGLVPELFGDEDMG DGTEALLIQTVYFKGIWTKQFNPELTRMGETFTKSDGTETSANLMYAKQEMNIILNSDAL GGASAVILDYGTSGSAGEFAGIFILPGGHDTWEEYGSADMESLLSGLATHPLTDLLDESV TTTARVSLPRFKLEFGLNEPYSMKQVLKRMGMTSAFDEAKSWQFEKMSNQDLYVDDIVHG AAMVVNEQGSEAAAASGARMRGRSADRSPYLIFDRPFVVIIMHRPTGTPLFMGKLEDPEF I >gnl|To_NUC_proteinmodels_ML|p319 XTTSVATQRKTCNVCISPSSSFDAKHATFISSSGLWIEELATPYYIFKEAGHEVVVASPA GGAVPIDASSMSDGFFTEEAKKFMHDGEAFGQLSHSVKLDSIDVSTVDAIFLCGGHGVCT DVVDCPSLKSVIETLYASDKVVAAVCHGPMGLPQCNKPDGSPLVAGKVVTGFTNSEEKAV QLETKVPFMLETKLKEQGSKYESAADWNSKVCVDGKLVTGQNPQSSEECAKAVVALL >gnl|To_NUC_proteinmodels_ML|p320 XATATFLTMAYILAVNPRILADSGGPCVADPEDGGIFGEAYEECLEELTRQYVTATAIGS TFACLLMGILGNLPVALAPGMGMNAYFTYSVVGFRGTGNVSWQAATTAVMIEGAIFFILA LTGLRYRLIKIIPEPVRIATPAGIGAFLAHLGLQTAEGLGVVVSDIATIVTLGGCPPENR TPIVAYDDDCMNNGICIPSDAYTCDNLGGKMTSARMWLGIVGMMIIAVASAYKSKMAFIY GIAFVTIISWFRNTAVTFFPDDPVGDFKFTYFSKVVDITGLDLLMVPFTSDLSNVALALI TMLYVDFLDTSGTLLGLADTMGIIDEDGNFPGATRAFSVDACATMFGSLFGLSPITSYIE SGAGVQVGAKTGMSAVICGFYFFVSIFFAPILASIPAWAVGGALIIVGSIMMKSLTRLKF ERISHALSAFLTVMLMPLTYSIAYGLIAGIGSYIVMEGFFRFFLFFGIDLPGDEDGMSVN EETVKSEPDGVEPVDDKKGDIEEPKEQALNETVAETDHVEKVAD >gnl|To_NUC_proteinmodels_ML|p321 MTTTATNGELRCEYGTVLQSVLIRAYNLRITYPDRDICIHANDVAGAFRQLKSHPDCMPA FACIVADFLFMQCGLAFGTDFAPSNWEVVRRIAEILATRYFEDPSVRTKHRQYLDRLKWH TMLDNPRASFTRVKADSINKGVLDNQGSPTPTPHRFFVDDDIYLDVYDVARIEQAEQAIA ASIEAIFVLLGESDLSKRRDPISWSKLEEMFIAPINVILGQRVNLRAMTIEAAPKLVAET KREIKNFATHRKTFVVRDLATLAGKLTHIAQTAPWLRHLMAQFYLTATKCLRINLRHLCV TYASFRKALKLSKQQAETDNERRQVTFAQSTTARKVHNSKFQHFISTELRETIDLIRSAL NASHIHTRSHIGHMIPREPSGRAWSDSSLKAAGGYSVDMKFWWYIEWPAEIRTKTLSYVK NNKDGRLISINALEYAGIIITYVASTHYFLHNPRVDDPHPVALLTADNTTAESWVVKACM NSTIGRALGRVQSALMINNPVGLNSAHVCSEDNVVADGISRISGASWMSAFSPEQRANLD DYGGHLSTELRRSGRGKSATTERSRQNHFLNWAYDTIGLSDPCMGGRVHPQARNYLLACY AVSLVKGETIRGHLLRHKTITRYISAACKLFEKRKIPDPCKAADTDLVKIILDAVKDYEA VENRRNMISDEMMVDILRRAAKESPTSLVAALADWIILGQYTGFRLSEWAQETCRAYARI GHTSDAVAMTRDDFVFKGPHGLTLDQHNLPAYADIESVDVHWRHQKNGDHGEQIEYAKNR QHPQMCGVAAALRIVERADQLNNPSHHPLGVYDDHGTKFITGADTASFLRTVAAAVYKLD PEKKRDQEILSRYSAHSIRVRAANVLHRQGMSDSYIQKRLRWKSTTFLNYLRNTIYAADQ HNALMGIPTNNLPDLSDRAGRPLNRFRLPQPTDSFRALAAAAAA >gnl|To_NUC_proteinmodels_ML|p322 MIAGARRQLISRAAGSRSGPFTFAHSAAQAARPSVGSASAFHHHAHQTERICRRRRIGLP VQERWSLHNSRQRGQRLSTATAASIDAQFSPLPFKRRVTADGRPRRARKLDVRSEAKLDA LEDAKHHVERLRADLLSATENRSEGLREQLSVAEEAMNAAYSQAIKYTSRQSQDAHAIQT AVSLINDWTGRFIGGDIRNNDKGLNKKKMTRRVHEILRSLNGNNCTPDMNAATDSKDEMR TGMSPPSRKDYINILRAHSTSKALRKGEQCEALISRMMDVAVYSAHRYAESEDEDEREKL KRIVEESLPDSKVFALAIKCYAGSTLRTYRPLRLQSLLLIELNVSALKDSSSAARIDLLK SIHDELCQATDGQIGSVDDPYVLFHCIKSIKTFNIAAMMTKGYDWLGRLHEFVVDPKNAG YFGNGGDTGESTPQSIDVTSAYVHVIRLSARLRDSSPGAASKARDVLDKMHEVYRLSNSG HERLGEASALPAVATADIRYNAYNLVLGLYRDSRNADDSFKALDLIERMIDPRASDSIGV PLPTIDSFVYTVLALGKMQDSAKAIEEAERLIDLLEVNERLGLGASVAVYNAYITLLNKV YHGQEILFDKAMATLATMDDKSKTYSSVKPDSETIALVMKACSISRHNDRGRILSVVSQL FNKLLDQEPNDNSHESLTDLIYFNKMKSIELYEPDADVKRERIETLFSEACQRGLVSAAV LSVLRGAVSAKDFELTVAGSVLWRTEKTISANFRGMYLIKSFADSPALRRKSFALCKAPS RILCLLFWNSMGYGIKVYLNQSGKWKYSKGNSSAIKKDVVQTPENREVRRNPCAIAI >gnl|To_NUC_proteinmodels_ML|p323 XECFDAHPLEFVKDNYYDATEDPNYPERIYIPAAFSVDPNRPDGDVVGRVDDLSGFPGNK QEFSYQMNLPCGLTGDLVLLQWYFVTAQDCYHEGYPEYDFPASWGDVFEEKIEKCPSPLP PDGNGLPLQYWNCAEVQVLNTGPTDRCAPTLPPSLSPTEKVTQRPTKEPTKKPTARPVLI TDEPTVQPSDEPTSSPIEIQPLGTCDDLCLIPITSTECLGVSPGNSIESLSLLKQCLSSN GKTVEVGAVCEGSGECDTTDGLDNCAEFDVYRRAPSCARDDRAKTPEDTPITIPVLDNLT AINSDVDAVDGGLTIVEVSEPQFGVIEVVDNEILYTPDENYNGPDTFTYEIIDSNGFISQ ADVFVEVSPVNDPPLAIDDTATTPFNEQVIIPVLENDVDADGDILSIESVSQPSNGVAGI RPDGDVIYKPNPNFAGRDEFTYRVCDSNGDCDEATVSVEVLAPPNEPPVAEPDFVTLSEG LNAYPFIPVLANDRDPESEPLVVVEVTEPKNGSVEVSADQQGIRYTPDEGFSGEDTFLYT TCDPAGLCDSTFVLVTIDPASTDPFVFDDRESTKEGEIAVVNVTANDLPPNLVIVDVTQP NNGGTCAITDDNQIIFAPDENFYGQIDCTYTACVPGTTECDDGTLIMDVIESPDIPPIAV DDPATTLEDVPVTLVPMENDVEGSAPMQLVDVTDGRNGTCTIGENQTVTYTPATGFYGID ICLYTVCDQNDLCDQAGIVISVTNVNEPPTAEDDVVSTPHDSPAIVPVLQNDLDPDEDAL TITDVTDGLHGMCSIEGDTIIYQPNQGYAGPDICPYVICDTSNECATANVFIDVEPPAPE AIDDQTVTEMNRPISYDVLNNDISDEELQIVGIITTAENGDCRINKDNDQVVYTPNEGFT GRDSCSYSACIITYPMLCDEAELTVNIEAVNARNDLVEVPQDSPQAVDVLSNDQSSGVLT VTDVTVPSNGECLITEDNQIQYIPSPGFVGRDTCTYTACTDSLTCDEANLLVEVIPKPDA EDDAAVTPMNESVLVDVIANDKSKNGPISISDVGQPSNGRCVIIAGELLYSPDKDFVGDD SCIYEICTDDGVCDGATVMIKVLTEAPTKQPSPNPTATPLVVAANDDKETTIINTDVTSN VLENDATSPDGKGLEVSGIAGQPSNGSCSSMTSNSVVYSPRDGFIGTDRCTYQACITGTD VCDTAELVVDVVEVKIAEDDRDETPQGESVLVDVLANDIPPTPGTPVIIDNVGQANNGQC QVVDGQVLYSPDASFTGVDRCSYEVCAENTEVCDSGELTIDVIPKPDAEDDFAETLVNEP ILVDVLANDDSDEPIRVTDVNRPSNGRCEIVGGQVLYVPDEDHVGSDSCLYEICTADGLC DEGTLTVSIVSQAPTRRPSSSPVAVDLTVDAKNDKATSLVNTDVNVDVLENDTVNPSSSS PPLEVTGVDQADNGSCTLLQDETITYSPNVNFLGTDSCVYKTCLEGDEDNSCDTALLIVD IVEVKIAEDDRSETPQEMPVLVDVLSNDTPPSRGDYLVVVDVDQANNGECEVFNNSILYT PSNAFAGRDSCTYTVCVDSTSICDEGEVFIDVFPKPEANPDEAETPVNESITVDVTSNDD PNGSSITITEVGDASHGSCEVQNGVVVYLPIKDYVGSDNCQYTICSEEELCDTGMLTINV IPKAPTRQPSDRPTSLPTRQPSDGPTTTPTPKPTMGQRPYAFGIPSNGSVKINSDSTVTY TPDLNFAGKNSHCLSAFTYTISDPAGNVASAIVIVSVVEKAQSIPVVGYQPTSRPTRQPS MRPAMPVTLRPTVPLTRRPTVRPSKRPSPDDVATPGLCAELCFDPVEPDRCPLCDPLTLP SCSDPSLQLGDVCESDGDCGLNDQLNNCEGTFDVYRRVACVDRPSECIPREGEVIVIHAH GDTSVTSDQPESVLGGLDFFAVDESPKTDGLVQFKLSDSVCECVTIKRATLKLYVINQSR SGGFVHVMNPYWKESETSWSNAPDASGPPLVQIGYALENDWIEVDVTELVNNSDEYVAMR IEACVKNRVEYASKEYLQGQYAPKLVLELGPPPMGAQYLAHSDIISRSDYGRQRELGRSS SRKTKVTAHTSGSRDTQDDDFAEDILSLPQTPICPVPISKESIVGFSSDDDATIVYSEQD RSFGSDAVLEVSPRQCGEMHAMIRFDVSGFVNLRSIQYAGILVHAVDGSHLNGATFLQTP VGDWDENDITWKNSPEYDKVLGSLSTIEGGIWYELDVSRAVRELDGNYLSIRVVAASVNG EPCNEMRLVFSSRELGDFGPRLIIAANDDDGPPIESSMKVDPVSTPSVLFVETQSRCGLG VSPPEKNLVLTPTDDAYLSARRKRMKHGSRTELRVDGQGNTALLKFELSCLESKEITSAI LKLYVMEGSSSGGVFHVLTDEQDTEWVESEVNWNKANIAPSDFFQYRLQRVRNNTFVEID ITDAVASRASDFVTVMIKSTKKNGVVYSSKEGQHPPELILSHRDTQPIGHLFDQKITCSR PALTLLDTFEADNACVIKEREAETNFEVESRSRLIVKDGFGSRVDALVMFKLDCLFSKDI VSQATLRLFVVAGSPQGGRVVFMSSDWNKSQVTWRNAPTGSSNVLDYTIGRAIKGRWIEV DVTEALTSTSEYYISFRIEGVHRNPAIYSSGEENAPQLIVAYK >gnl|To_NUC_proteinmodels_ML|p324 MMNISSQPNAAAYTTNQMCLSPPLSRVQLIDSLSPPPTPGRERFPVHREPTPLFFPASAS LIHNAPGKRTRTTCEDQEALERMKSRPVPVPTFKLSRRDRRRVDTVRQRIDFDIDMLPPL PFDFTASSPQAIISRLPSLSEKAGRDCSCNSHSTTPSYDSKVLERGLKPLIIKETFLTGG HAKLVPLRRSYVSKGTVQRRNSFTAKAA >gnl|To_NUC_proteinmodels_ML|p325 XFIPNRPVFVPLFVNIARAGVRPESGLGRQRINILSTLCTFAHTFRFACFVQSVIFVVLQ QIILIDIAYNWNESWLENSEKAERDEGAGSGKKWLAAILVSCGVLYGASLAGIVVMYIQF RGCPTNDAFISITLAMSLICTAAQMLNRTETGSLLTSACMAIYSTYLCGAAVSKNPDAEC NPHLGDESIWSVVIGLLFAFVSLLWAGWSYTADSRLGGGDGSEAEDNDGEQQIEKPVGGL VVGNGDADDSSPNSETALVDKTGEAPTSFGNTWKLNAVMMLVCCWITMTLTGWGSIENRG SISNPAAGQVSMWLLVASQWLALLLYLWTLLAPTLMPGRDFS >gnl|To_NUC_proteinmodels_ML|p326 MRPSLSSIVSIFILSNIATVVCETRGSKLRRAEMSENRPNANGGRILFSDVDGTLVHYPA RLRGSGDGAGVHGEMMHLPPSSTGSRGVLSVETLRLCHRLRREGGVPVVLVSGMRATTLS GRLPYLPRADAYCCESGGRIFYPRTQEDEGVDDGGEEGGTVAVPHPYPGLADGKPFSLVE DMEWRARIEEVAGPFDSKPIGERDGKLWEFARELQKQGYKIDDRGYSAAFRVNRRHQPAE LAETFDDFIARYSRKEGVPNDLDCSTNLGCVDFYPSMSGKKRCAEYLVDKILGAGRGDAE VSLETSGYCMCDDDNDIELALSCREAYLPSLTSDSMKDLVGSGKWPLFLTEDVENGIVET RATEVALERIFSRIAEDWGE >gnl|To_NUC_proteinmodels_ML|p327 XVERAKERHGQLFKLISNCNDGDTIKSDGEGCVEVALSLLANDDAWTAKDNSRHVFRREL SGKSSSATPRPSRKKRTQPSSSSSTALSSVLSIDIDDLRTAVSIARSLLEVASFSRRVHD GTDGAGVTRKQARSLKSLLRMEQTTCLKNAIEILSVAARTLCYITSGALDRLDEDDLDDS VNTSGLTPALLLRRFAANRESSKSLLSVTGVLCADCWFSLGKMDVSAHTLLFSCERALVI LNHSRSASLNKQASDLLSGSLMAPLEQYRSFLLANINHQMGVYLYEQGDFERASAQLGTA SRLRRQRLDCMRDRAEDGEADVPSLLSRLYGEVMGGLSRAAPASMPEKDFTDLVAYSVRY CTRQLPKDSIELGELELCLSLTLEYSALVQHAVQNYQEAMGYFQEALILRTMHVGKNSLD VGSLHFNMGVVYDDLQRFDQAVSRYHESLRIRLDHLNNSSGSGNDDLEESVLLTLKCLGH VYKQSSDYDNSVACYVKVLQIMNQRFHQLRGEADRFDQQGLRLPIAVPVPTYIFEEMKGS KESPGNNSWKSHFQSMNKKELCHHFEPRTQPKKGMSKLKKELIKIHSHIVDLIHEKKQHS WDNQSVSSTRSTRSRLASSSLLLSSLKCSIKSGDVFEAALMRSSFALGKMRLEQMRYDDA AGSLETALRAKWVLDPASSSDSDSEASYRSLTSRKPPMRSTDEDDPGEGQIYYGLGVTNA ALDDHERAVRCFLTALRYLRRTSVDSLEIARTLFDAATSYYYLCNFEQAISLWEECLRIL SKYETPTKNDGVDGDTSAAKEALSPKDSSTKSSNYRRGVVLYCLVLAKSAIGFDNETSSL LNNAQTLLSSCNDKVILAYMEFMTGMFLSHAASQVPTRLRSITKISPAGVTLNEGLSWKD MCKSSLTLFEQVKNECWFDPMESLEDTEEVRHLPLSGHLCFKTGEVYELTGSAESALNSY MDAANFYRIACGDDNMYVASVLHRMGLVCSGTEDSEYHALGYFNESLSIRKSLLGGNDLL VADTLYASAVVLSRLNRYEASMERYHEALRIQMAASQDSNEVARTLSGSVKIRKYRVTKL SDSGDVIDLYSEEVTLATELFNLGNVQMQLGDFSQAMGNMIQSRDLRWRHVGGGSIDKII EQFSSDTTVDEDELLGLEGEDLSTADKLNPSESHKFVLQSLTKDVKNPRAVNKATLSASV THQKIAFVYMKENNYEKALFHYAHALRLQRLVLGKDHFRTGYLLSSMGNAMRKSSKHSES AIICYNESIRISKLRFGQNHAVVARALYDIGSLHDSKRNFSKAMHYYHRALSIYKEKYAQ NLRQRFCFGFDRASTVRELIGTSSDEMEILSSGDEIIVSSSAESPESKIREQYSLVTTAL QNARRQDLLRKGKRVNCDRDDWWIAFEVFIFRLVEMVSTYVVDPVQNAARRTVEHSRRRI DSAAHHAIITAADAIDYQFLLLLQE >gnl|To_NUC_proteinmodels_ML|p328 XMVWSSMLHTSLYTVAVATFVTVACIPNGGLRPRKMLPPRGNGSTTDRLNVALSGAAVNI TQSSWKSSLALAVVLTTFGLFHMTLQMGYPHIMWNPFMWGWYTVYLPGSIPKALRGACLD MQKHSTANQPLCLSEDQWQELSSGQLSSYNPDDVYSVQRGLDYLQNQSGGVVINALARNV ADAIPLLKQNMEGLAPFFQDSSRNKLSLVVFENDSNDGTRALFKSWASEESRKESPRYVV DIMGCGDANPDCILGIQDRYDKDLFKDPNASGVGKLGEFRNVLLEYILSKDEYKSFSHMV VLDVDLGVSISPLGLIHTLGLENGLGQDYAVASSSSQVWPGTMGSIIPPYDLSAFRPKES SGNEKLRGMHQWFCHLMPAGDRWRNLCEAASPMQLFMIQSANDPSNNHNEPYEVGSAFNG VTIYPMRVVRERGEQARYDSGDDNQQCEHVGFHLSLDRRMYVNPKWSMNLRPNKPGGPTG IRAVKTLFEAVIGRPNVVVVFLIGNNTFFFVAVFALWMIGLSLKRLMMVLYDQEKESRRR PWSADSTSGINTREM >gnl|To_NUC_proteinmodels_ML|p329 XLELVVHEFGHTLGFHHSGLPDEGDYDDNLCMMGCCAGAQQMCFNAAKSYFSGWYSEPGK DGHMDFGGADTAGEWWLGNLVGIDDYLNDLFDEDQYRMVLRTKGGLYAGFNRAKGVNAGV KAYKDEVVIVQQWGEGGKSKVRARLNAETTSYKFTADYYDFEVNFLLCDLVIQPTITGPM IEIEGTASQSSTVSDAEASRAIDGNLDQTWIGKSVTHTSGDDDDLDPWWQLELTDETSVE HVRIFNRNDCCGSRLNNAILELYDASGEVIFTRNLGQAENIKEVFLGGAYTIQMIRIKLY GYKVLSIAEVQIFAPLPSPRDRQPDYARVLTYISVGDEPRLTCETTAAPTASLEPSLSPT VTYSPTIFRAVPNDVVLLRDNTPCIFQVNPANHPDRYKCTLEPTASNSEPGGSNNTPVLI AGFNECVIEIQPMTWSWAGTMFIFESDASLEDVNACFGTGQLLREMRPITSSPKPTLPQL SDKPTVQPTESPSTAEPTLGPSTLSPTSNPTDTPTLEPTDKPTNTCHVVEVCSPIPNDVV LKRDSNPCVFQHNPSNHVDRYKCTLEPTVSNGEPGGSNNAPVAISGFEGCILVVKPMTWS WAGTMFIFESDDSIETINGCFAAGELLSWADTGSGYSLFG >gnl|To_NUC_proteinmodels_ML|p330 MMKQLAVILVAVFPPTVAGGLRTRHVSRVGDPDHQRGRGQPETAAIGSTHQRPSPSSYSR GLVGADYCVWSPNLACWPASRGFPPCCIDANVTCPEVVPDCDSNVESYCVADPDTTCYKE GWPGCCSAGRCPLQQPPCDVETGAASTESSYCSMSPEYDCFSGGRPACCQLGENRCPIER PPCDVAATTSTTSTEAATSQQTTTVPSIPEMTFPPKEEAGAADPSTAFPPRPDLGTKPPN EGLATAGNPAEYCSLDFNYTCYSAGVPSCCIFDSCTSQTQPPCEVIETYVVSTPNPPGEE PSPGLFGELSPPEEGLRACNEVIPSAYPDNVIPREVVLATLEGMIDSFGETFAPFKFVLS TMAEYDVRAHKVCTSCEEVYSMWGGGEDASDAMPYCKPGSFASGRTFSALVLEPIDPATK APIAGKVAATVWNWATQPDPFYAPSVIWPNDIVSSPVEYIPALSAATAGTYTIIPDVLGN AEDWSSARSYVVKDVYSASAVPILLKLKSVIDGGGCTELDKRVSIVGYSEGGYAAVAVAN AVDILNDGWVHTHTAVAGAPIKLATVQLEMIFDMSLNRYPFPAFGARLGNAYSSTNQDMA NTDTDVQQPFASLEYLDEDDPTKNVVEWARAGISYEEMFPLLPGPVTSGTQSDILNSEFV NMVVSAFGDGITDPCSSKYASDAVDHLCQAAFDNDLESVLQQFDYPVVLCHSNADELVPY GNVVGPSELLVLQQNVPGMELLNPDGLTHLQAGAICFLGFMVPYAFPPEDGEIRAIFPLE DADKSCTESSESSLAQETVQEVQGEDEVSSQTGPTTSSPDSANATELGADTERPDGEPPA SSSSLNSCAYFSVISMLVLIILS >gnl|To_NUC_proteinmodels_ML|p331 XPCYNSDFHNQKYFQGCADVAASDAGEKDDTDDHDHSEHDHGDEEKETDEPALEADEPTV KPDESDSSASRSSALAALAVAALTMVYGM >gnl|To_NUC_proteinmodels_ML|p332 MFGRTYTLELSNVQDGAGNIMETYSYDSNFECHPPNPRINVTHSGRSDQHYLPGEAIVFD VSLTNDRPADEWTRARLELGVDLTSNTGDLSVLSNGTDSYDPAIAHDIAWNREIWDEGLA EYIQWIVFTPPCPQIRWAGPFAEEREMVYSNDALVAVSDLVEVSIFNVDYTNGKLFDRSV LNEEAPESEQRLQKVVLVYRRFGEYDSPWVEAHLEDPVTRLSTPDIIEFANPGVEDDMGY ATLQWNVASIPDSRYLVKVMSNCTALENSHSTHNYFSTDAVEVVLDKTPAALFGAPFIQL RGSVDSVKQEVYRFIFTESLYCEEPHVFHLAVTLSIGQNSRIFTHGSHGSGIKTICEGRE IRYRFDTIELDEWYTAQTSSESTDLVIDVNIVLEGVPDLALNRADDFTFTSSWAQQTESP TMSPSPRPTNSPFQSSASNDSSDPSPASDGSSDPSLTEGQPVSVQCVATQNAERPNLGCD GVDNDCDGDIDECDEDQIPPKISFKDGLALDASENGDGLVTISTPTFSSIVDAQSYLTSI LVAEDDCMVDLPLSVMPPNLGASCQSTIFEVTAASSQCPDQSVTRQYQMTVDTDIPVVTV EFDLGQAHVNDFSSVGDGVYLGIASSNATVYEYVGFSYTVTDDCPQTLQTRLRVSSNEFE YDSSRAGDSMVLIRDTMDPRQSHSLMVFVEPGQCGVISDRLCVRNSNVGFRYYQFALEAT DLAGNAGSATAYVVIIPEAEDLPASRDSVGALDLDYFVSSMTPETSPASNVIQTKDMLWD TTREDPPDVPGVSVSVVETVSAEMTISGITVPESQEELDALILALEATLNGLNEVNANSD QRRLVEEECICTSVIKILSIGGMTPERRLQSEVPVVYEIIQSCEGNCDSDTSDNSGSEEG TAKSSVKDLANNLSSQLESDAFQDKLKEEAKDAGVEDVVATIQIESIAVAEEVQVATQTN AAVSFNITVSIPVQNFEELDPDVIHLVVGALNDVLQSVACPQDPNIISCVSSNVDTQTTS ISASRKLIHVSRRLVSEIISLTHKFDIVLECPSSGCNVNSITTALEDAASTDLNDAIENG QLLSELQSNTNGTVSLITGETTLDSDEGQSSGCSCDNSTQVSSGTTTDSSTSNNSSVWYP AWGATDKCSNAPGMPTYMQGSSHYTSTSLEACCQAHFHWDVTGCIIASGGTVSNSGTQEW YIRWETFRCVQSCISDSSSQAQGLNCGGLASTKMLFQTAEDCCTQMVPWVATATCVAESI PPYQFDGTDRWYVSYSLNKCVKDCPVGGAHCGGVVTESHLTMYDDASTCCSQKLWWLDNC VSSSLAG >gnl|To_NUC_proteinmodels_ML|p333 XDHKRTSSSKRPRDDVNGELRASPDRGEAMEIDGESTASPDRGEAMDIDGESTASPNRGK AAGDRAKAKNKYEFEEFAKGADEQIRKAERDAKKFDQRFRLGESNKFKVPVCAVCDRLII GTAKYCFIAKEDIKSKADRIGCASYESFYGDGSLPDLLKNQYEVKGYEGLLLSRRARKDD KGVYACESCKISLNRDYEAPPKYAIANGFVIGHLPDEIEYEKGDAIVTVDVPEALTSDVL CAILSPKRPFAFCISYFGGAHNSVRGHVTFFETDLSHVGEAASLFNRPGANPHIYCVLCG RMTKDQKELARKKAALDTEKLMALLTWYKEKSGHPAYANIVLPTTEFAAPTVIEDPENEH NTDKSGEGDVAKLEETFLGTTHSFPSNSPKQDTGIFGTNKEFTLAMLNGTAPTMIVSGGN HVNVKDIKLEDVCPIQFPFGVGGPSMERRNRVSATGCYQHYTKLSLSQFMKGDFLLILNS LFNKSKTFESALTKLSYGGNDSLAENFSTIDLEELKRAASEIAEGRHAGGDEGKYLRTVS SSCQAIGSSAEAAAYNRRNLFALDDYFGGCAIFLTITKCDECAYQVRLFSEAGGIDIELP VLYDPVTPDTDAREKVVQDLKERQDRRMLYPGACSLVFQHVMEIFIDVLVAWDPVTQTSK KEKGILGPVEAWARTDEEQCRKTLHAHFLFWITGMNRCRRDFMYGSKDEREQAKEAFAKY ADKVMKASYPDFRIKLPCCGIGEAKVPREVVSEVNLQPQPRQTLRDARHQDECLKINGKI LKCKDCSEPQLFTTQEIVMLSLDYWRRANDDVGQGSLTRDNIEPSWLDVAAYTSPFDDLP LPDGCDWLKKEEVRETLLHLRFDEHDPRHRKGCFKKGCECRFILPQTAQEETEICVDTEM KLFADWLRIEGGSVRSDQSPYMVLPKRGIGSEYLNQHSPAIARVFACNSNVSIGDRSHTY YVTLYSSKDTNPEDKKQHQRVAAAVGRRVFYQQMAHRQAGNTDELAREPDFGEGLGVLLT GINAFSSHDVVASTMSHNLVYNLGKRFHFSHDFKPLLLSMAEAVLEGIEVDLVLRQTRGK GAENASWADCLFHDYLYRPPELQKVCFYEFVMWYDRHRLSGNNLKSEDSDGDAKVAPDNM FSVQAGHPGEKYIRLKRRDKLAIPRISAGQGKLCNLSELKWTDPSGVYDSIVTKKREDYA KHALMLFKPFADLDELRCGGSYWKKFLNELRAHNESPAPASYELPELREDMSHRDLSSDQ FWKRGFQILQNMQDRETARSGSGRARDPVVKLTTCNVPTGERKKQGIDGEDEEAFDVDIN EFDRDAVNGANDTTFQSLQPGDRRSHDGLIRRGNINPDKYAAPTVDATSSLLPSAGDDSG SGEVESAHEAPAAGGAAPQVTRANYATILTFIRGTLLGDSVEGPAGDNTEEGAPLPVPMP EARVAGQGPPTLRGFVSASGKKFDRKQYAAYKILCCSFLLGLVREGTIGNSPNERLGDLL GDLGNDPDLAANRNEIEALLVSHGAMDQLIMLVTGQGGSGKSTCVSLAQRFCHSFCTRLA ILFDDKTFTFTSTTGSSAAIFGGTTVHSAAYLNRVKITDPMRKEWRNIKVLVLDEVSFFR TSDIERLDKVLRRLTTRHDKKFGGVHIVFSGDFYQLQPVMAKPEELLYSHSDAAMNFRLA INCAIVLEKSHRFKDDPEWGAILERLRTGSHTSDDVATINTRLVGESTGVNPPEGNDVFR VVPTNKERCSILCLGFQKHLQATHPRIDSNDTPPNHTVIVEAVFKKNNRNLSNALTDFIA TSLGDSDIKVTSPWTSKDAKIDPALRLFVGAMLMVNSNEDLNDNRANGTVCKCVGIRLKK GAQRQIKNWDGYKVWTISVDKVEGIVCEHYPNPPPNTEKTFVIDPKNYTATMTLPLLNDS RSTNLRVGKVKVDQFGVNSNYATTCHKMQGMSAEKLMVGSWSYTFKNWVYVVLSRVTKLS GLYLSKPLDATKEFPVDRDLVEFERTMTALQEHVLAQINLSDAPNE >gnl|To_NUC_proteinmodels_ML|p334 MVAGSLTKGFRMMLDVYILDALADKDHGYVFLFILFMAGLVGLIEKSGGLKGITLALQRY VKTAKTAQGAAFFAGVVIFFDDYANTLVAGASMRPLTDACVVSREKLAFIVDATAAPIAS IVPISSWVGFEISLIQTELNRILEQYPDTLSIKTTGFAIFLETIRYRYYCIFMLFFIPMV VISGRDFGPMLLAERLTRVYGRTDGGPGAALAVDGGELVSHNAPKPETPCRWWNMAFPIL MLVFYIFYLLWWTGKQSAAPGASFLEIIEQSNSYQGLLWGTMAASLTGLAFYFIQDHKDD RIIWLNCKGYLSRAKRIVSRMKAFCRRGQQEEEEDDEGDHAQILMDYRTAMAAFLVGMEK IFGAMVVLTLAWATGAIMQAVGLNRFFGDVLTNEALDYRMLPTLTFIVSVLIAFATGTSW GTMTIMFPLVLVPAYEASNGDANIFYGVTAAILAGAVAGDHCSPISDTTILSSMASECQL MCHVKTQGPYALMVALWSILVGTIPSGYESFSNGISLLLGFVFMAFHVVFTSEFAINKTG RFDIFTEIYLRCFDRKNEFLLQLKADVVKAYETGEPVEMSEEQTPFEPKIIDDENESVEH EGKEVSVKEAMPEQAVVKDVEVGDAPEKEELA >gnl|To_NUC_proteinmodels_ML|p335 XRRRRRDMVQEDRRRRGRRENYFRSNKRSKLMPLPTIDEHLSASWDGVGTCDDVEERGNS SSVCDGDDVLSRLVAEFQSKTSSSYNAPPHQIKIDDGGEHDNETSPCESSEGSSGNTLEN DETPDVLSQLVTKFQAEAGNGRPDTNLKTSQARISSRDDRPTEEERAKLLLKCTRAAEKV FAATSLTLDQLKEYESICSSACSRLLDKERSNDSKS >gnl|To_NUC_proteinmodels_ML|p336 MVNERLASNREEAVQLGQSIMEELSLFEHVTRDHPFADEYLFYHFIDRGDVSINETTGKK FSWDDYLDPVSSNGVGSLQPSLPQPDLECIPEEDVHVASHVWPLDEHNVTLLNHVHPTGW QDPDPQNKFDLVVIGGGAAGLVTSSGAAGVGARVALIEANLLGGDCLNVGCVPSKSLIHA AHLAHTTKNTSALADSGISVGQVRIDFPKIMERIRRVRAEISHHDSADRFTKELGVEVYL GRATFVDPHSVEVNGKILRFNKAVIATGGYPSLIPMPGLKELHALNTNPGDKPRPYVMTN ETFFNMTAQPKNLVVIGPGVIGLEMAQSMARLGTNVTVMGRSGRVLPKEDEDHALIIQQS LESDGCNFKLNVSEYVSVETTGTVLDNGLPELSFKIAEEIDGETVVSELFVDAVLVATGR RPNVTGMGLEAAGVEYDTRVGVIVNDRLQSTSNSNVWAAGDCASSFKFTHAADFMARTVI RNSLFFGKDKMSSLLIPYATFTEPEIASVGLYGTDLDEKGIKYRTFEKPFDQNDRSICDG TTTGGVRIRVEEKTDKILGATIIGQGAGNMISEITLAMQSGTGLGALAAVVHPYPTNAEA IRQAGDLYNRTKLTPTVKAILRGLIKVQRPGVAVPKLSH >gnl|To_NUC_proteinmodels_ML|p338 MMKKQIRFASHAQVCVVERLDEHCPRNRLFYSREEYDCFRRGIVKDVLVCRSRRQKGLPM DDAATETTFGIEQHLGERVDGPKMARSAHCGAILGAQEHAAKDELVCSEKILRDVSLSRS TDSMARARFIGIRQSRTT >gnl|To_NUC_proteinmodels_ML|p339 MDVSKQATLSTPLDGGAVSSPSSRTSGTETPEPFFSPSPTDDLEPALNEHRAVGGTASIV GAVPLVDDKDGNNEAAPSELHEPKATSLVRADFDLHESAGQHNPTKPGGAKEASPSRPAP NRKKWWLAGLGAVGGVVLIVIVLILVLGNDPPSDPITGLYADFSTGVSLVTEYHADCRVR NRLAQTTIKLEVANALNCSSIHSLSLQLPLNTRIASLETNADEGECTTARASSTRTIQVS IPPFGKTRVTLVVEQLLQQRLGEVEFELPLAPNEDVDDIKFDLNVLDTDGEPVLFELDLN VPGVYVVDENATQVLDPISLSIPDARQYKLPKVLSGKFKPSAIEESGYLYTDGRCFEHYF RPATVDSMRRNLVFLIDTSNSMRWHDKLAAAKIALAQFVDTLRPDETFTIQAFGSKGTED LWGSAYGTDEEKSEAKKFIGNLSAGGWETNLHEAFLEALLRAKHDVENSDDDVVTILVTL SDAYATRGVTNRRKIAKHIFDLNSDSSVKIFNMGFRDSADMQLLDAIALMNGGLSSPIVQ GRDFSDQIGNFLQSELGEILLSDVSLSYETISGARVYGETQSKLPVLADGYEVVTRGLIE SDESFENQLEALTQAYTTNGSMNWKVSAAPTVDESASSSLCFQSYASDRITQLMRLHEAA DFIGDDLSKNFVILKNDCKSEKFRDCVKQEALDLAIEANLVVKGLTAMVTVDEKKCAAPD EDAEICVDGMSPDGVTFRPSYSDQSDEEADLEYHNDAVATAESAPMSPYSSGSPRGTYYY SASWRLQSYTALALAAIVSSFLLMHI >gnl|To_NUC_proteinmodels_ML|p340 MGPSALPSIVVCDNGSKSVKTSPTPTPTRRVDLCHEDGSTESDSAGGVNDNDIEASPMKK SGSDTVGTAAETEEESSIVAPRPPRKSLPDELVVQERDELAAKFVGRVIGKGGEMIRDLQ ARTATRIDVNQNVAEGEPRIITYRGRPDDIEFAKRCVDKICREEGKHTQLPLGHAIKKDI MVPSPAIGKIIGKNGEMIRELQSKSQARIQIDHSRDGVAAHNRRVSVTGDYQSVASAVEM ITHLCHNPTMDSMEVLRRMLAREKNGVEPLVTPRSYNLATFAHPPASLPPAFTPASPAIS VRSHGMGLLPSDLFNTPSDMPLYNEGSTGFANRQHRQLADETGEAETVTISFPQTKLERI VDPHGIIVNDIQKRSQCDISVKEDRQEGQDCDIIITGPREGVQMARQMIKEIEMGVHYSY NAAGGGYANHGYYSFNPYHYVDHTYTQHYDFYACSGLPGYSYRPDSYYQRGATYYHSDNQ VTPEK >gnl|To_NUC_proteinmodels_ML|p341 MRELILSTVALVVLAAESPHIPQYEKQSVAADTAVLGHSDIISATGTLRTHTHDTKTTTL RRLLNSTITTSPVESRPAEMADAPDAKTTKRRVLNGAVIFPAEVGGEGADSQPSMDIHSP SDHDRLIPLQANDYIGLTLAMIGLILAAGGGIGGGGILVPIYILILNFLPKHAIPLSNVT VFGGSIANTLLNWRKRHPVADRPLIDWDLIVVMEPPTLLGALIGANLNKLLPETAIAILL VVLLVYTSFNTLKKAHSMYQKETSEIKHRNINHRSVEPTAGLFKTESENGLEEYFLLNDH LRDGRDSDSDTSDTGNIGLSLNEIRYPGDVERDGDLRNVRLDEQSDEGVDEASQGSSTVE SYTEFGHTLELQNILDEEKRPKKKNIALIATLFMVVLTINILKGGGAFESPLGIECGSAS FWVAQILLLIWICVVSWIGRKMLLKSTAKKTDAGFAYLDEDIRWNGKKTIIYPMISTLAG VAAGLFGIGGGIIKGPLMIALGVHPAVASATSACMILFTSFTATTTFSVYGLMVRDYAIA CSILGFVSTLVGQKVMNSILRKTNRSSYIAYSIGFVVLLSAILMALQSVLHLLSAEGGMS QFGGICDSHLTPH >gnl|To_NUC_proteinmodels_ML|p342 MANDDNGDINRLSSLLPTGRGGLITQKKNPDELGPDRKRNTNRRANTRRAYEETPSHPGG VDYAAVRRMDERRDRDRQWRSVSSGPGGGASSAAKRGRYDYGHGRGPSNNATNVTPSTHG SADDNRKSWNATPSRATPSQRSSWEQDTPMSARSASLSRSRAPTSRSSYRSSNETPRSGG YRHGDGRRTGKDDDDDPFDELHRDLPNPAPPSFNPDKPADDADGDEFDRQFYLADDDEYL PDDSGQGTGRFIYESDRTKKREEEMARKRTAGAAVSLRQAKQSALDKDQQTWEENRLLSS GAAMRSNVDLDFSQENDTRVQLLVHQIQPPFLNKDGNKATFSVIREAVPTVRDATSDFAR MAREGSVTLMRLREKKDRNTMRQKFWELGGSRMGSAMRVKEVAQGDDAKTKDGRERDATQ NASYATTDSQACGDEEEVDYKKSSGFAQHVKEKEQEKKSEFSRTKTIREQREYLPAFSVR DSLMQTIRENNIVIVVGETGSGKTTQLTQYLHEEGYTDYGIVGCTQPRRVAAMSVAKRVS EEAAAMVKDEGKRDIIPEVDGLGGTVGYSIRFEDQTNEHTVIKYMTDGVLLRESLRDPDL NKYSAVIMDEVRAHMRCIDSPCQFGLTKAIPNLSLKAHERSLNTDVLFGVLRKVAARRSD LKLIVTSATLSADVFSNFFGGVPIFRIPGRTFPVETYFSKSVQEDYVMAAVKQTLQIHFN SPPGDILIFMTGQEDIEGTCTVLAEKMEALEDEHNSKPLLVLPMYSQLPADLQAKIFDAA PDGVRKCIVSTNGIKYVIDCGYCKLKVYNPKIGMDALNVTPVSRANANQRSGRAGRTGPG FCFRLYTDRQFREELMETSVPEIQRTNLSNVVLLLKSLGIKNLMEFNFMDPPPEDNIMTS LYQLWILGAIDNTGDLTTLGRRMVEFPLDPPLSKMLLFAHEHGKCSSEVLIVVSMLSVPS VFFRPKNREEESDAVREKFFVPESDHLTLLNVYLRAKQYKFDNDWCTRHFIHSKGIRKAR EVHAQLVDLMKQQKLEPISCGIGEYINMLSGIPSALHPSSALFGLGYTPDYVCYHELIST TKEYMSCVTAVEGEWLAELGPMFFSIKESYESALKRRQRERADALKMEQEMKNKKAEEER EKKEIQARTDSTISRRSEYATPGRQSSATPKFGRKKKRGRLGF >gnl|To_NUC_proteinmodels_ML|p343 MRENSGGAHADSSPHIRVLNAGEKVPSPRRIAETPIGGMSRETETGMVGLLAASATSFLT AEGAGRNAGGFRPLLSGSGSDTRQPRLDNSVFLENRPPVDSSWTDVVNSIPPRNAETVLN VLTPNTTSRPSPSQSRDNPSTGNNHFNRWGLPPSHPGVTRRGVQGPSTPVHSGMIYPGNI PHDFKTTRSTYSPLSMGDATDQTSPMSSPSLSPHVSPSMRGRYAGEGLPSWHLSDGHPPQ GSLSEHNPYLHHVNSPRSGPTSPPRHVQHRGTGTRPSQKSHSIQLATSHHRSSIEVLKTL LRKKACLYEPETSNAIALVTWLVGHKLALSQGFFSRQQLQAGVHASVASKIEAGHVTRTK VNRCMQVILNSCFHYIIPRPDGIEDGQVFRLTFEEEAVSQDHLLGGLAPPWNNLRIDTDA HREGVYQIDYNGKESSTSSRCDESADSSNKRSVLLCFNENVRCAADVFRCHNEFIRDVAV GKLCLSNEEWRHFFMGKNCRRNKPQSFEDLKLWDFHERVDLSKFRTTQCAKRYDHNHLVC AFAHIDVNSGWLRRDPSLFDYEPIMCKHVKPLRGSDCHFVNSCPLGKMCKHAHSREELMY HPQSYKLKPCTSGAQCRLSDVCPDIHSDTPTARGKRHSGSKMMRNNCISNSNSGFEKDPG DMPRTLYVDPAPLSFYEETLQLPGLQALFRDRSASIVYKEDPSYEYGVFGWKKV >gnl|To_NUC_proteinmodels_ML|p344 XWYERNDDGDIKIVDGEKVVRYDVPPQLKNLTIAEKLLIRRACPFIPSVHIKNGVYGIKG HCVAFPQDISEMCTELPQRKETIVTFIRQMGNRNNSAPMIRHLKVNKARVLGALRWLKIH HSGYHDIKIKEENLDWMAGKDEANLSSFGKEVNVKPSRNSAKEQKEYVSKLQCQGDPDEC DDLEFMTMDAQEPPCQARGEQQCQPLNELAKELKPDEQDKMMYFPPHSNEPINEFGSTAQ VFVNAYPWLFPGGIGDLYDPVRGEVSNVTEWGRHLIRYYDGRFLSDQTFSLYLYNVMQRH QNNKQGGFFFHSPNFLGKNPPTLRELKMQIRNGDYHFINMLRYFSKCIPGSDNYWRAKTS ELHSWIDFHVSRNHGPPTHFVTLSCAENWWPDLRRIMRSLELNAGNTQHAQRLLDKNDFA AMCQSAKKYSLYVNEFFMRRGKEFMDNYAREVLGIEYYWGRVEFAPGRGQIHLHILAIAK DKAYLHDFYRARDDESKQIEVLNGYAANVLDMTANTDVDESRQRLLPTSSPLSHRYSECT DEEEDARNLAHDCMFHECNDYCLDLDHAKSSNRFRRCRVGYGREETEGKGDTPGKDLRSE PTIVKDRRGVEHLTMKREHSKRVVQHSKSCLLAWRANVDCQLIIYRSDPNCPDIAEIEGV CRYVVAYAGKRYKTLKQEKEAIQNIILDTSENDIGAEVRSITKKSLNALSGHRMIPIQEA VHQIGGFPLTISSDYITNFSLSSALKLKTGSESAGSNKAGAKYNKELADSYRNRLKKNPM LSNMSLERYFYERFRKSAFYKDTESGREKHRILLPTGLNCKPKFPVSYEYARGMLIMHKP WSVEHPLEPILKDKERTIATFLDMIQRNKVPYNVKSEYYRAMKHSQEHQKEAIAKEGTAA EKPPDLSKMTEDEREQYIESEQCNYVTEQTGIDGIGFQSTVRGEEVDIGLHHDWSKSFFN EERDITMKGEDYIETIREKYYEAEKDANSALHVPKKKDGSDFTLDDLSEEQRVIVLGAVD AIVKFLNNDEDYVPFRATILGCGGAGKSFIVNNLISIILNMTHCNDSVKVAAPSGGAAFN VRGCTIHRSFSVDVSEKKMAEGLSVDRETELTERLKRLLLLVIDERSQVSSKVLAATERN IRQCIFGQQNFEEKWGGLPAILLFGDDYQLPPVIDEGAIQGYSKYSNIVRAVNLPDENDT PQRLEEIFSPCEGFKSVGMSKKRNSISAAIKFQTAANASAAFEALKGTNHSALGTLNLRY HNDKPKDDYKETNIQCLVDRGSRMFIEDMTENVFLLTRNYRVKDEECKGILSRLRTGDST EADAERLMGLHDFFHKGGRNSEWYETIQNDPKTMWLYARNENKDRKNVERLVKLSREKKL PVARLECQWICNRVQYKGKSTVNKKHFNTSNLVLSTDLCVHAKVAIGGINYVPELGLYNG ARGEIVDIVYKTMAGPNNKHEDHLPAYVVVDFPHLRLPPYIEPWDRLHPTHVPVPTRRVD CTYNRGDPCCTVRYCPLVLAWATTIHKFQGYEAGFEETDTVNRLIINPGDHSTEMDNPGM FYVAASRGKTFGKPTAEEPYPKDSAVYWNGPVSTQSIQTVKYKQNGDVCVLVQKRDEWVE YLYDRAEETKQRCNEEWKRQATARVKTAIEEEPLGQHNLDNKIIGMMRQRNEKWSDRRKA EYMVPAGYFNS >gnl|To_NUC_proteinmodels_ML|p345 MVDSSLELHRRAIADRLGAGAVVALIKNRNQPTNEPKNIHSIQNTMKLFQVASLLAFISV SSLTVSSATHAADELNLNLQTSSNIITLQEQQRKSNKCYRQCDRKKNRSDRIAGRCKRDC DAQDIDPEAQCNSLGEPCHRDSDCQQGGFDPCMKCGDERGTEYFRRCYGGIKRETPAPVP PPPPQCNTYDEPCERDSDCAQGGFNPCTKCGKSRGTSYYKRCYDGSEQEDSLEFIQSDEL VEDNPTWDDMKNGTQRAAEDAADWGKKAADETAEGAKDAANKIDEWGKKTFDDNSGDPLS SRLAALTTVTVLAGMQLTGFDLW >gnl|To_NUC_proteinmodels_ML|p346 MKHSSLALLLATSLSASTSPATAFVLSKPASNSRSLVARSLFNMKKEFAAPCVMGDEEIM SKKAHGTSDVPVQKNLRWNCDYDTADRICNFNRHYAEYAGYWTSTSFVEEARKEYAEKGE ITFYDSNTGKPLFVAPKGRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSFR DEEVIWDNVRCLPDGESVSLAGTHLGHNLPDRAGNRYCINLVSVAGRPAEESKE >gnl|To_NUC_proteinmodels_ML|p347 XMSVKDYIETKLKDKVWDVRLSITANASVEESIHRGRLNLPTQDEVAILFPDDITAQHQR NVILNYRAPVGSSGLRYIPDYHRMYDATQYPTLFVKGQDGWHLDLDETCLEHTNFMIVDR MNDDKEVEVVTGLDGNISRVDGLRTNPILLGRKLGQQYIVDQYAKSEFARLRYVETHQKE LCCEMYSGLQDAMGRDAQGSVGTRVILPSSFTGGDRYMHQQYLDSMALFQRFGRPHFFIT FTFNPEWREMKEALAMFGTGDQTVLDRPDLVARVFKLKKAQLIRDLTSECIFGRLKARTH SIEFQKRGFPHAHIIVWLDLDRHLTPGEMDKVICAEIPDEKYSDKFGELHDNPIYEAVTK FMLHGPCGARNSKLSCMQDGHCRFNFPKKYQFETELSEDGYPVYRRRSPVDGGNTLIKFR NGERVQYTNADVVPYNKYLTWKYNCHINIEYCYSILAIKYHIKYINKGSDLASMTVGSAD DDSEGANNEARNEVDEHRTKRYISGAEGAWRLRGNELAERKPTVSRLPLHLYNQQAVYFV PNDKKGTIERMEKSSRTKLTAFFELNEEDEFARTLLYREIPEHFTWDCKKRRWKRRARGW DEGIPEAIGRMYSIHPTKVELYSLRLLLNSVRGATGYSNINYYNGIQYETHQEAAIARGL VKSDIMWIACMKEAHETETYIPRLRKLFVSILTKCEVGDHRKFFEASKDLMNEDFLRRYK RQYERNEMGVEFEDGWTLEDFATNSCLAHLEKLLELEGHSLDDFHLPLANLDKEQVIQNS MLEEIIGEEGNLPPNKAKEFYEENFPKLNPDQLHAFNTIKQLILEDNRDGLLIFLDAPGG TGKTFTLNVLISWIRMEDREVASSATSGIAANLLHLGRTAHNRYKLPINPTKDSTCNIPK QSDLAQLLRKMSLGIIDEGPMLDKLAYEALDRTLKGLAEPQDCNKKFGGKIILVSGDFRQ LLPVIPKANPAKVVSHTLKNSVRLWDEEVMCLHLRENMRVKNEIAKRPNDSEFRDQLEKH EQWLLDLGEGRLPCHGYDESDIIDIPPSMSKDSKEDVIDSVFEDFTEHIGDGEYYKSRVI LAATNEVVNEVNNEMVRRIPGVLHTLESVDTVGDMDSQTSFPTEFLNSLSLSGLPEHELH LRVDSVVILLRNMDIKGGHCNGTRYLVKHIGEYRLVLHKLEAGPDDKDKILILPRIPLRY NGVDLPFEICRLQFPVKLAFALTINRSQGQSVSKCGILLPKNVWTHGQIYVAFSRCGNPN NIHVWAEQEQFKRLFGGKLPEGKILVKNVVYKEVVR >gnl|To_NUC_proteinmodels_ML|p348 MPCSAALLALSLSLSALLSVARAQQSTHKYKFFCGSSFADIQADTCAQRQWCPSSSDDEC LIPGHTCFANTPCDARLIEGVSMPTYSLSQHADYRDPSDKMFCGTSYQEALSTCEAGGEK AIARHCPNTEGCPPGMFCFIDMPCSHFVLTSPDASPLMDVGSIALSPEEMELPDPGDMTS HTFCGPTFAQAAAACSSQTWCRTGTSQECPNGETCFVDVHTVNPDCEINKIAKKEYEAAK AAAAKTPGPHGQAHEPPPLPHGSKNNNFCGTDWTDASTNCDLRRHCPNGNSDCEDGMECQ TYTSCNASRLTGTPTDRPVSMTPTESPERTEPTGRPTTAKPSREPTHNPLAYDDLRRQFW CGLGYWDAMTNCPQKCPSGSDDECPTAPGGERLKCLSGVTDCKNEMGMRAYGAEEEEGDE KPDKVESRPVSDSKPVVTASDADATIQDDGSVSANAVAEPAEPEYELMGPYDEDMVRVVL YGIDSLTSANLNRWKKLTTNYLEEFYNNYPTQNMLASGEEPDDEIRAGIFDVEFTLSDVK ADPVPEHSFDSLVVGEVDKRGLPGGRRNLRSRRGLESKAGARGASGLLRSRALQEDEDVM VMITYTQSTTYRSSLDVINEDVRFVNERPLATAEYRAAYVNYLRGAAFGIFSELEYASRF LYTDFPTASPSGTPSSEPTMSPVVPGQPSQSPVTDAPVTPAPTGGPPTSFPTSNFGCNLC RPGQYGVDAELIFNGEVQRCVQVYNWFLVNDTQGSGSCRAATEEMNSLCCRDGLPTPPQT ASEPAPQPTSNSQAEPSNSVQQSQVQTAYPTQDFGCNLCQPGQYGVNADVVLNGVQQSCQ EAYQWFLSNHYQGSGDCRGGQAAFSSTCCAGEITEPAEPAPRPAPVASEPVPEVQPRPAP VVQSKPTVTAKPTAILEIPVDENGLPDAASLAETYYCGDSWDSVDRNCDNAQACPSGTAE ECPGSQQCIAFTNCGGKFAFVSNPSIEGGGPDADLVRSTFYCGTSMQFLENVCDGATPCP NGPNDCEGDGEKYGCFAFTGCNANVDPGMFVGFLRPPDEDDVNVPLPVVLSDAADTFYCA EDWSTLNGQCVDGIPVGARPCPSGNMLECDDGQGCFAFACNNIVGITPGSSPTSMTGDYT VEDLDVLKNTFFCGVTLDEIDNDCDSAIPCPTGEGCPEGTGCFAFSQCGGVDIGGLVDTF GQTDRPTRAPTVPIEQVCDEANKMSLNVGYWQSWSIYRDEDCQRMNTASFDAAPYTHVIY SFASIDSSYRLEAWDGTYDNEVPLYKEFNTVKQRQPGVKTMIAVGGWTHNDPGPMQKRFS QMASSKTNRQTFAKSVVNFLRTYGFDGLDLDWEYPAIKDRGGKRADYDNYVLMTKEIRNA FNAAPEPFELSVAITLNITKLEMGFDLAGLSEYVDFFNLMAYDLWGSWDPKQTAMAHTDI RMIDEAVEYMAHFIQKSQLVLGLGSYARTYTLADDDCLELGCPFDGPGKSGCEGTDGFLP YFEIADLVTTRQYDTVLYDEDTMSMVMVTDGNRLISYDNTVSFNRKLDYAEENCFLGGMV WAIDMLKDSSNPLSSNNGNSALTGDPSDQSFCGFSYSDVLDGCKQPCPSGDSSQCSPGQQ CFANTGCSIDSIGDPPPTKCRLCPDPSTQGIKDWLEIDYSGESTTCGDADMAVVGEFAKG SEECDAAKQALSDQCCYNYPDNPCMLCRTETVFMDLRALAEIDFNGETMTCFDLSKRLSP EENDSGMCMSAQKEHFDACCYNQCTLCEGQGIKWWNEVMFSGGEKEEGEEEAEEEPLNCG ELDAKLYADETEAEEDTCREVLSEYKGQCCYDYPDDPCDVCTKGGEKLTLMPNEEIDYEG STFTCAEVNNFLSPFESSSKQCEEVQGMAIDTCCFDRCSLCGEGARLDRDVLVELEDGEG TCADIESSLFTEKVLESSEECTETRTLNYDACCFEIPSDPCQLCATDQYMHHTTTIDFND DEMTCKSANNYLMERFDTSSNTCSEAKAVLGETCCYEICNICGEFDLDWDVFVNFEGEDM SCGDFNEFFREEAIVDGSDQCSAVQGEFFDTCCYTSPSTSCQLCKRGDEYFEVNDNVQVD FNGPTTCYEVASFLSRRTEDSEQACSVVQTSLFDECCYEKCNLSEAEGTYPDWGAEVEMD GNTATCLELDTAIKEAAIANESPECQSLKDAFSPTCSYQIPKNACDLCPSNAVSISAKAG WNGKEMMCSDIKSRISSREESDSQVCLSAQQQVGAACCIDQCEVCEKPQKTDPVLTVYHE GATKQCTEVDNYFYEKSILRSSEECSSTRDRLQETCCFETPASPCNLCQKGSEFFDVMGA NSVDFMGEKISCSDVSDLMFRREEQDGETCAANRDAYFDACCDSKCSLCEEMGLEAGVKV SFEGRMQTCLELDLSLGPASIEKGSDECSEVVEQFKDDCCYKKPDEPCQICAGENFSTKK DTSVLYLGTETKCDQLSNYLGSREEQQGQACQIATTQHSDTCCYEHCSLCGDGKADWETF VNYNGRAMSCGDFEWVLRGDGVAADDEECGAVKGQFYDTVSPLETLCVFFSQKKQPDNEF LSQCCYEPPEKTCNLCHVDGEWLDVNADLEINVRGSATTCFSLYNSLIVRESASSDKCQD TKDNHAEECCFEKCNMCQFGILDVKASVDMDDGSSVSCVALDQSFSRNAIVEGSQECSAA KNEYAEECCFQIPDDPCRLCSGGAQVSVDGTVDFYGDRMTCGEVGNMLSISEEATSDSCA STRQDFSDQCCFDSCSICPDGYNLNWEVNVEYNRANIACGEFDQIIRANAVQKNTQECGS LQSTYSSACCYNYATAGGGLSPTTTLATITTTGYLSTNLDTSDLSPSQKNDAMEVFENLI KSTLQSQGVLPNDAKVTVVDIDDNGVVAYEIEMQVNDSSIAETARPLQTVSAPLREDTSS LEEVATTAMAEIDNTLSDSDSLKGIAEEIRTEAAGVADSEVAEELANINLTGLSSQGTSL ENTGDSTEVSATGTLDTNIDAPDLTIAQKQEAQDVVGNAIQSTLENQGVLPQDSQVTVTD ISETGEVSYEVGVAVDSDTIANSVVNAIDGTLSNAATLQTISTEIQQESASSTVTQELAN ADVSGFELGESTVDTKEGATQVNTDATVNTGIDTSNLSDAEESLAQGVVENTIQATLQQA GVLPEGATVTVLDIDDTTGVAQLQIGSNIENDLVADSVVSALESAVTNPATLQAIEEEIK NESLDATAAAGYLADVSITDITLGDSTVKNTEDATQVTTSATVATSLDTSNLSQEQIGIA EKAIGSAITSNLQSAGVLPDDASVEVNSINPAGQAEVGIIVNVDPETIADSIVGALLAAL GNQGVQGTINNAIIQGADDTSVPNDALSAATISDFEIGDTTTEMGEDATQVSTSGTANSN LDLSNLNDAQKAEAQEVISNSIESALESAGVLPQEAEVTVTDIDEQTGEVSYDIEVAIES DMVADSVVGAISAATSGSDALEEISAGIRQESTDTTSVAQQLSTTTVTDFSVGEATVTNN EDGATQVNTQATVATNLNAATLSDSQLETAESVVEASIGNALESSGVIPQGSSVNVTNIG DNGEVSVDIEVDIDPVYEETIATSVVSAIDNAVSDSETLDSITDAIKDESSSSQVAENLG NAEVTSFNLGDTEVETVGDSTQITTTGTVATNIDASKLSDSETGYAEEVVATSISNTLQS AGVIPQDATVNVTSIDEATGEAQVEIGVNIDNELVAESVVNSIDNTLSDTQTLNDISNEI NQEAKGTSVEEPLSAVSINESTPGETIVRPAESTDVVSSGTIETNLDTSGLSAEQKDEVE TVIENSIEQALESEGVLPDDAKVTVTNIDDETGEVSYEININVVDSEIGSELEAIASAIM AEIESILSNAATLKALEGEVKKKSKGTSLEPTLSGINIDSFELGDTSLSVAEGSTIPRGR RALSTVLLTTSGVLNTNIDPSKLSYKQKAEASDMIEDSISKTLESKDVLPSGSKVTVTDL NNKGEVFFTMEVVVGSVPEKEQEAVEPETPKISTLETLTEDIVNLIDETLSSEETQSQIA AGAKKEAEGTSVADELSDVSVENFLQGDTTVEQESSSATGAGPCNLCKAGEISLDTDILF NGVETSCPEIYKFLSTQTEAGSDECNAGKEALQSTCCMKKCDLCSGGGLPDWYAMVNVND NTMTCLELDGIIVESQIESGSAQCSEVISAAAPSCCYEPPTTPCNMCKTANGFGDVMTSV TVEYGGTTATCGQIFNTLFSREEHESETCSILRQDLASQCCYSKCSLCGDLQVNAAMSVM HDETRLGCSEFDSYIFASNFITEDTDECSQFQQEHGGSCCYDVQCNLCARGNDIWTTKED AIVPYGGSDVTCGEVANFLYQKAMSQDNVCIAAQENIFNNCCFKQCEMCGEAGATINWAA NTIFKGQAMSCTDVYWSLMSDAIEVGHPTCSAIGQVAGDCCYQIPQSQCTLCKDDNAVTY NTRWNKDVTVNGVTKTCGDFNTLLSTQEDDSQTCSLAKDEIFSECCFAGSDTLVAIANDA ESDAMCQLCPSDQVGVNQRIIFNNGPTTCKEVYDFLVDNHKESSTTCKSAQVKLREECCM YPDDIDSSIEQFGMNAKNQESGSKTEGEGSVTSDSGLKESDETKKKENPWGTSLDSWTVG INSSTRMCSTLSLSLFSLIGGLMMYY >gnl|To_NUC_proteinmodels_ML|p349 XMDSGADRESVPPQQDADAAATAAVEDDGGEPAEEDFFPNIEPEASDEQEDRPAGFRSNL PSTFSGDRGGAAEDADLLAELRAISNKSSTDRFASSEDDAESDPSEKDTVVDKEPRPKKR ADKAKGGDRPLPPWKRKAGRTKKAEVESKRWSHRRHRSIQELTERAFRRNRMPVPLQRQQ STTAKWPRRISSRTCRRSKLCRAELHRMERAASRSDLPCTFSGDRGGAAEDADLLAELRA ISNKSSAGRFAGSEGDAGSGPPENDTADGRPGHQATASEPGKVSKAKIDDRPLPPWKQSN GANQNSAVELDVVVAATAPAGDDEPTEEDFFPNMDTETAGSLPGQQEDEQILGGFESDLP NTFTGDRGGSAEDAALLAELRAISNKTSTDRFASENDAGANPPENDTLVCEEPQRAINRL KKKAGKVEGDGRPLPPWKRKASKKSNSELDVIVAAPGGQSIGPTVPKPNESANEPNLAKT FSGDRGGAAEDAELLAELMAISSKSSADRFANDESESSQDGHSGSTETGNAKGGNRAERS LPSGKREGAGTNLDSANVDVAVPAQSQEERNKYISGGSRGGVAVNEASASDEMGIKSNLP NTFSGDRGGAAEDADLLAELRAISSKSHTDRFAEAKSDNTTNAQTEDDKPLPPWKRKGAK NKPVTVDSVVAFPEQCKAEDSRIDGRRSEVAVSSGKGITSDLPNTFRGDRGGSAEDAALL AELRAVSNKSSADRFADEGDVAPPGDPETALTSGAPALASDRPVPKRKTDRPRPPWKRKD TKRATSAGFDVVVAAPPATSSDAPTDGSSATAGQSEATAKSSSHPTAIKSTAPKTFSGDR GGAAEDEELLAELRAISNSASNRFDASDGADTQASRDQPSMTDEYAVINENSEMTREKGD QLPPWKRKSARKKVTPSVATDVALPQNQMQIADATSLHGIKSSLPNTFQGGAVEDADLLA ELRAVSMNSSSGRFNGEKEKPAESSEKATYQAVESKARPSTAPTLPKRDLPNAPSTSGGI TSAPPDTDIKITDETLEESLKSSKWQVRKASYSFLRDRITIFLSGNEPASQLISCEIHSA LDASVPTALMDKIAGSLDEALLLSVLYVDCCQGARLEENARKIMASLVKGNAFASSKTST LNSSEELVLKLIEVSDDGSATIETIFELIQDNGLKSRKPKAVLFSAKLILKAVESFGVAV LPISKLTTQSEILIAHSNAQAREVGMKILAELCRALGSKSPLQSSIDKLKPSQISQLDSL LKSQPSATRITRRLRCKMGEPESAQSPEETLAALKMSQAEDEAKRQAARPAIDLFRVLPK TCYREKIKLDKWSEKVAALNALIDAGGEQPYKLVAPSGSVDYAPLIRELRQLWVCLPRVS GLNSSRIFDLFLPPFWLCSRIKKVCNAAGSSLDKMFANVFSFEHLLEANDSIPSSLDEKK QKNALVRKSALEYLARCVKSNGTYGTRGQLRSRDAEELAKLSCQKLNDSDASTRKAANGV LVALLSSNDSQVVEATKGVTESSLKTSNPRAYKSLQLASGSANGMSSNPSSRSKPGARPK TAPAPTSKPDNSASHRAKPIAVSNRQRSVTKNKTENSSDLSDDSNESTLPAFDDAVSCLS ALRMPNWDDEADEDGNTNLGILRGIASSNWKERVKAISHLTSFYKSEGGKHVTTFPSLFV LVRDSTKSFKESNFNVARALLEMFTAIFDVHATLVKVPEPCICVSATKLAVEKVGDRKLN QASAACLNSLCAVKQPSKVLAVATRTVDGIKSPLVHEALLGWFKSFCSGFGVSSLSTETQ HCLIWILKEVGSNNMKVRKSALDLIGEVHAQLGPALQSFVKTRDLQANVISLVDKSCAAN PYNPGAQQVDRPLKCLTKECAAIDSTGGESKTSSSLLSAPSLDLVASLKCDCIAQISSTE GKNSWKMRKEGMDNVSKALERCGGRLSTEGKAGVSLKQLVLALRSSLSDSQSNLKPVAAS LIGNLLSHMDDEAQAKFGKTVFPALCTASMNDIKLSMRNASLSALSLGTERSQQDGGGAN QTAVETLIMSLESVLTDAALKSSGLGDLLSMMDGRLKAVASKISPQKQLSKVIVLSLLSS KSGSRTAAEKLLNTCSEGGVLSSDSLDEQISKLLPAQQRTLRAVIPKHSTKDKELIAAFK KSTTRTRNRPMTTSTQQSTRREAITRPSSASLPPTTATTKKSSPTNELPYNPLESSSTRS LKSQRLGRNASWPDYPEEPSGISFQSLQKVWSQLVPSTASFLFPKGGIKSHQDSVSGCVM LTQALEYSKNHGNNSFIDQLDLIFRWCTIALLARDHTAGLRSLLSLLRLLFERLSEVSYV MLNEEASILLPHLLDKCGVAKSQFKDQFLNVISYVRSSGVCQTKEYYGPVLCMTVVDKSK FASARALAANECRLCVESHGVTAISKKGIRITAKALSSEYQLLDMRNAYLSLLEAAFMKF NDNPERLLQYIDKDLNDKTRELVLGRCVRPNPPGQSFHSVQTPEKRRPSRPSNSQQIQSD PPPTQLASSAAKENLRQRLQNLKDDKHQSGNAAEPKLMASRVCPERSTESSDIFSRTLNE IEQLLDAENLTDDPSVVRGATAINYLLSAMISDPESRPIKTLSDMEVRLLGEGISFDFNN VVETVTGALKFSFHASNDESLPVQLINSIITLLSYLFKLAQNADAISQSTLEYVLRESVQ VLLDPRLSGPKYEMAIMRPTNKFAMRAAMAPSRDTALSSLIMLQKETITSSEGKYLAKQS RVYTKLYKKVIADEYDKNDAGPLAGVKLDSMLRSLDWLLQSSQDVRSRDPNSSELLKSSS EMSRSIMLELIKCRGSSVRNAASQLILPGNDLIGNLLAECESELGVTVSQSASSKGTACL DTHKESHFSNLINAFAQAANNGDAHQQQVALSEIIDFRHTHQDVDFDSHLEYLSPQFRSY IREQLRKETKENVQESQQEQPSLDREKINSMRLNLSASQERARDEAGRPSSAQPVADKAA SLRARLEALRAKD >gnl|To_NUC_proteinmodels_ML|p350 MKPFKRRTLRTLGRAIVVTVLVLIGLLASRFNEVGRLLLPPNYARNATESSPIFNGTDAG DCTCSFSPHRSTAAQLGLQGKVSDCCCSFDTIEKTNEELYPLLRRIVATPFFSHFKIDLC SECQLWKDSPMCVLRDCSVCECESPPIWASEVDWMPDELECQHIEDEVVTSVDAHVTDTF GSLVQTDTFFGEDTTAELEDEDAAVVVDLRKNPEQYTGYTGRSAEKVWKAVHEENCFQQE DESDGTCGLSAEQRVYGRVLSGMHSSISLHVASSYCLELDTDRIAECKTWGANRTLAYER VLSRKDRLENLYVVFAVMLRAVQKASAAIAAAVPEQDSYFAESLIEWKEHLQPELLKITR SCPLTFDESELFNAYAQEDAEYKRKELKRRISHLFEVMECVGCDRCKLWGTLQSLGVGTA LRIIIDDDDLQVTNLSRQEAVALVHTLERFSSALVYADQLR >gnl|To_NUC_proteinmodels_ML|p351 MKQQSLDSEQAKRVSDADKEMEMRANRAKALLAERHVGLRQNQAARQQRKIQLETGMTKA GLSNEEKKARRKALEKEEARVQKESRRNVTSADFESLSVIGRGAFGEVRLVRRKPGNARG ETGIFALKSMKKEMMVVKNQVGHVRAERDALAAADDENRWLTLLHYSFHDNENLYMVMEF LPGGDLMSLLMKEDTFSEDATKFFMAEAAHAISCVHALGYIHRDIKPDNMLLDARGHLRL TDLGLCKKVGDVSPGDHPEVVLEMMKSKQKGTGLGSSSIPEGESSGDGGHLRDMSIDGVG GKSTIPTGGALRADLRTGQARRENAYSTVGTPDYIAPEVLAAQNGASGYSYTSAVDWWSL GVIMYECLVGYTPFYADDPVTTCRKILKWRQTLEIPGEVRSKLSTECIDFLSCLLAGPES RIGSRADGGPEFENGFAQVVRHHWFDGFDWENLANVEGPLLPAGAKDFPRVLEYLKACPK TDPNFKQLVAFATQNFDTFEDHGTTLDSGGRRRVDRSNLDQFYDYHYRRTRKPKVPLPDF E >gnl|To_NUC_proteinmodels_ML|p352 MGLFTSLLCLFGAHCVRGGNVEVSSVYDKAKSNHDFSTLGTQREENRIVSFMSFTHSIYF PPVAAIDAAGLAKTLDNPSGTFTVFAPPNAAFDKLPNELLTKLLDPTWSPQLLDVLLYHI LASEVFSTDLVEGLMVPTHNFQGDEITVSLDPPSIDDSEIYLSFADIAASNGVVHAIDSV LTPPSISNNIVDVAVGNEDFSTLVAALSAAGLVDTLSGEGPFTVFAPTNAAFDALPEGTL DSLLLEENVDALSGILTYHVVAANALSSSLATGDVETLNGATVAVTVDDGVMVNDSTVII ADIITSNGIIHVIDAVLLPPSDDAESEESTTESTESVEVAEEPVEETSDPASVYEIASTT DGFSTLAAAVDAAGLADTLDGDGTFTVFAPPDSAFAKLPEELLTKLLDPTWSPQLKDVLL YHALGSEVFSTDLAEGLMAPTLNFQGDKITVSLDPPSIDESVILINDGLVDIAASNGVVH AIDSVLTPPSISNNIVDVAVGNEDFSTLVAALSAAGLVDTLSGEGPFTVFAPTNAAFDAL PEGTLDSLLLEENVDALSGILTYHVVAANALSSSLATGDVETLNGATVAVTVDDGVMVND STVIIADIITSNGIIHVIDAVLLPPSDDAESEESTTESTESEEIIDAETDEEEPVEADVS SSSLPPLGAFSGLVSFLFVNLVL >gnl|To_NUC_proteinmodels_ML|p353 MSRRLARIAIISGCCCSAVAFVPAGGDDMGQESEAPPVRTAMATTSCLVQPNYTRFEPPF ADLVVFGDELTDVGNSFMYGSGGRRAGYMGRFTDGLTWIEYVQRHLGLPELYASMNGGTN YAYGGATTNSDFIKSLDGVPSVSEQVDEYLQSKDELSPDTLHVIMAGRSDYLRYVETGLA ATDEGTGAGSSANQTHVYFTVASEVIRQVQRLYDAGGRHFLVGNVPSSISRGEGDGVYDT LLFGHNSVLSTELEVSSAELDSFQQTHSGTTIYYVDVDHAFGCVDEHADELGFTNVVDAC RNSETEKECANPFNYKYWDDGGNPTTHVHHYLSQIVLQAMYEREELAGCDKKKKRQAQTQ HLRRRFNRLESKYS >gnl|To_NUC_proteinmodels_ML|p354 XLEIGDERAWKKSGQALRESAPEIRAERQAQLQXXXXXXXXXXXXXXXXXXXXXXXXXGG KKKAGRKSRGGADPPGPRHRPNPPEREGLATNIVRSNFGVDMIGRNQLDEDVELNRMRQE YLQMQKLQQLQQRRMEQYAHMLEEANGQGNYSRDNVYDEYNEMLRQHQMQQQMMMRQQDV AAATIMDDGLLPMDAGFDIPSLREQQQNSSIQQLQQQTHQDRDSSNYQSCEQGKDSQVEE AFDRQFNSCDKTVSTMSSFDIQSMDMSSLGGFSFNQSGYQSNSNYNMSLISTGSAMSRGG GASRKSALERKLEKVNDAHRREVSRKQGLSQNRPKKQGPPIQHDPNYQPPVAASGVKSIS VPTHRPSTNRRKQTAQEVYNSSNMNMASLSSFGFEAIEEDDITEAASYKMSNLGLGLSEM DMTFSSDILSIRSKSVPKMRDSSEEGDMKPTTTNQKNGKPKRRSSTDDGRLQPSSDGPSK NPIDMDEFNESFKSMEFSGSPSARAPVDPDGHNSADNASPRRSNKDPSGGRLPTIQSTRA TNPSSSGHHTSGSRLPRQDHHESDKMGISDPDMLLSSNADFGVSFNSIKSFKSQASDASS WLNQYNTMEKVAATGEERRWDAEGEDSSGKSSLSEIPSPHMVHT >gnl|To_NUC_proteinmodels_ML|p355 XMAEDAATDPEVGTTVASPRASAWGTGTAVASDSATFADIMRLQEQESKSKQSSRGVSFS NQVVETEEERMLRLAIEASLKHSQVQEEPKKMPPSALRNVDSSNAVESESEEERMVRLAI EASLQDSCERKPSSLPTPSDATGSSSDGTIAAEHYDAKPPASSSDYAAPAASDEDESERL ARELHEQEIAQLNETQTKNDSSVAASLELAMRLQREENALHSRSQLADASRLKREEMCHG RGGNVSVSMVSRSEFERMKSGIDGCSSIVDQRKHEEVGMGRLLKGDIPDGGVPYSSATAD DDLDQYYYYDYDKAGGAAALEADDMLDDEVDGFRMNSQSSSKWSRFDKDRFIGPDGELRT KHDVEVKHRANAANLLGSHGSKLEHAGSSKAKATLSDRAYNAYKRAETRQQGMKKGVARG GTGRAEDPSSKTRGGGMDGNVRLEIAAAINSGIINHCNGAVKEGKEAVVYHADAGDLNVG SSHDVAVKVFKRIAEFKGRGAYVDSDPRYHKQAFKTQDGRHQVVLWAEKEXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEYNIL VCPTYQTAKGEPKPISDRTDDDEALQTVIIDFGQAVEVNHSQADAWLRRDIETIHKFFTK KGIKVFASEEAEEYILEEVNELPDDNNDDSNQIESPKGDDEPAKEWRHLKAGLIDEEEME LLISKLGRC >gnl|To_NUC_proteinmodels_ML|p356 XIRFFLLQNRQGKTRLSKWYVPPPTGGNKDNGAPSYATADSTATDAEKARIESEVHRLVT SRDKKYTNFIEFNNYKLIYRRYAGLFFTIAVDMHDNELSYLESIHLFVELLDSYFNNVCE LDIVFNFNKVYMILDEYMLAGEIEETSKREILDRVKFLDKLD >gnl|To_NUC_proteinmodels_ML|p357 MKSVAALALIGSAAAFAPAQTGKASTKLNAFEDELGAQPPLGFYDPLGMLNGDCSQERFD RLRYVEIKHGRIAQLAFLGQIVTRGGLHLGGNIDYSGDSFDSFPNGIAALIGPDSIPTAG LVQIITFIGLLECGFMRDVPGTGNEFVGDFRNGYIDFGWDTFDEETKLSKRAIELNNGRA AMMGILGLMVHEEIVPLGYDPDLPIIGHLS >gnl|To_NUC_proteinmodels_ML|p358 MKSVAALALIGSAAAFAPAQTGKASTKLNAFEDELGAQPPLGFYDPLGMLNGDCSQERFD RLRYVEIKHGRIAMLAFLGQITTRGGLHLGGNIDYSGDSFDSFPNGIAALIGPDSIPTAG LVQIIALIGLLECGFMRDVPGTGNEFVGDFRNGYIDFGWDQFDEETKLSKRAIELNNGRA AMFGILGLMVHEEIVPLGYDPDLPIIGHLA >gnl|To_NUC_proteinmodels_ML|p359 MKSVAALALIGSAAAFAPAQTGKASTKLNAFEDELGAQPPLGFYDPLGMLNGDCSQERFD RLRYVEIKHGRIAMLAFLGQITTRGGLHLGGNIDYSGDSFDSFPNGIAALIGPDSIPTAG LVQIIGLIGLLECGFMRDVPGTGNEFVGDFRNGYIDFGWDQFDEETKLSKRAIELNNGRA AMFGILGLMVHEEIVPLGYDADLPIIGHLS >gnl|To_NUC_proteinmodels_ML|p360 MSGKGKGGRGEKKSTTSSAKAGLQFPVGRIGRYLRQGKYATRMGAGAPVYLAAVLEYLCA EILELAGNAARDNKKARIIPRHITLAVKNDEELNKLLGGVTIASGGVLPNIHAVLLPKKS GK >gnl|To_NUC_proteinmodels_ML|p361 XSGKGGKGGKGGKGGKAPTIKKAPQSRSSKAGLQFPVGRIHRFLKNSVQQSQRVGATAAV YTSAILEYLTAEVLELAGNACKDLKVKRITPRHLQLAIRGDEELDTLIKATIAGGGVIPH IHKSLINKSQGKTKKAKF >gnl|To_NUC_proteinmodels_ML|p362 MFIPKSNRIAVFSYLFREGVLVVKKDTVSPTHPHIEGCTNLECLALMKSLASRGYIRITF SWQHNYCYLTPEGIEYLRGYLALPAEIVPATHKKAASRPEGRERDEDKFGGGEGKPAFRG GDREYRSREGGFGRGGGM >gnl|To_NUC_proteinmodels_ML|p363 MLFTQSSATAIIITAAILGMVWALAQFLLISRIPVKSEGISDSTGLVTGSNDEATTRRLT EIYNAIYEGAESFLRAEYRICAWFICVFGAIIFILVAWGTGWDFARGLFTALSFVLGACT SILSGYLGMKVAVYSNVRTTVSAQKSGWTLCFNTAFRAGAVMGFALCGLGIFMLYISLLA FRIHYPQAEDWIYLTECLTGYGLGGSSIAMFGRVGGGIYTKAADVGADLVGKVVHGIPED DPRNPATIADNVGDNVGDVAGMGSDLFGSFAEATCAALVLGSSIGLSGGWDAMVFPVAVS AVGIFVCLLCSFIATDISTVKKEADVEKALKIQLISTTILMVPAVYFASETFLPGEFELR ATVGLDVITLHPWQACMCVIMGAFGGLIIGLITEYYTSHSYKPVRELADSCKTGAATNMI YGIALGYKSAIIPVLVLAVVVYGSFALADMYGVSLAAIGFLSNLATGLTIDVYGPVCDNA GGIAEMAELEPYVREKTDALDAAGNTTAAIGKGFAIGSAALVSLALFGAFVTRIRHSSAD ELFQDGVNMLEPVTFSFLIIGGMIPFAFAAMTMKSVGVAAMEMVLEVQRQFDEKPHLLDA NPTERPDYDACIAISTKASLKEMVPPGAMVILTPLLTGIFFGVYAVSGLLVGSLVASVQL AISMSNSGGAWDNAKKYIEKADADSDLKGKGSDIHKAAVVGDTVGDPFKDTSGPALNIVM KLMAVLSLVFADTFYAVNSGNGVFQL >gnl|To_NUC_proteinmodels_ML|p364 MKLAIVSTLLAGAAAFAPSSVSRSGTELAATRAVKDGPPKSGKKNLTFNKGFDDPVPPQA EGLIGVLPPVGYWDPLGLAEKRSAEAVQRSREVEIIHGRFAQLAIVGFLVPESYAASGAY GDDFLAPTGTALEAFNTDPIWLALTFGIIAALETVRILQTEPGNRVNAGILENGLYSMPT PERLEELKLKELQNGRLAMLAFAGAVAQELVNETPLLVNLQG >gnl|To_NUC_proteinmodels_ML|p365 MQVDRRTPDRQEVPDLVQNVSPPSSHDQLKKLTTVVADSGDFNAIKQYSPQDATTNPSLI YKAALMPEYSSLVNEAVEYGKGDLLVTMDKLAVSFGAEISKIVPGYVSTEVDARLSFDTE GSIAKARELIQMYSEVGIDKSRILIKLAATWEGIQAAKVLEAEGISCNLTLIFSIAQAIA CAEVGATLISPFVGRIMDWYKKRDNVEGYAPAEDPGVKSVTSIYNYFKKFGYSTIVMGAS FRNKDEIAELAGCDRLTISPKLLEELSSSASDLPKKLDAEKAKDMEIERVDMNEKTFRWM MNEDAMATEKLAEGLRGFARDIEKLEKIVQEKLSNKRQKKGE >gnl|To_NUC_proteinmodels_ML|p366 MKTSIAFALIGSAAAFAPVQQGRVNTAIAGDARSPDGLGVDPGPLDLFDPLGLVEDADSF PRRRAVEIKHGRIAMAAFIGMMVQELGITFPGSLDLAGDVPFSSVLDDGMGFAALAKVPT FGLAQIALFGFLAETVAMPAGEYTGGPQNLPGGYDGSPPFIPGGYPGQIEDVDARDRALN VEIQNGRGAMLGVFGCMCHSMLDSCDHHFFYPITHN >gnl|To_NUC_proteinmodels_ML|p367 MRSFALAAALAVPTEAFSPVNLSQSLVGRRTAATAPLNAEVPTQSNFLTPELAQTCSDAA GGTPLYAYSLDALAKSADACLAFPNAYGLTVRYAMKSSPNGSILKYFLSRGISIDASSGY EVRRAIDMGVPAEKISLSSQELPGDFDELIRLGVKINACSLSQLERIGAAFPNQSQKVGI RINPGVGSGGFSSSTTGFSKTNVGGPSSSFGIWHELISDGSASAIVEKYGLVPERIHTHI GSGSDPAIWQSVAKKSLSFCATWDTVTALNLGGGYKVGRNPDEQTTDLGEIGAPVVEEFK QFASEHGRELKLEIEPGTYLVANAGALVTTIQDKVTTSDYTFLKMDAGMTDVLRPSLYGA IHPITILPGSGXXXXXXXXXESVVVVGHCCESGDLMTPKPGEPEALEERSLRAAEIGDYA IMDGSGAYCSGMSTKNYNSFPEAPEVLLDLEGNAHLIRKRQTLRQIYENECDVPDGVF >gnl|To_NUC_proteinmodels_ML|p368 MGQVLRKLLDVFFTKKLDMVVIGLENSGKTTLLSALAHGEPVDTVPTIGLNVKVFQKGRV QMKCWDIGGQAQYRTEWSRYCKGCDVVLFVVDAAAPQKLATAKKELHKLLDDGSIGSTPL LVLANKIDIQPHVGESELIEKLQLNYVMETPWMVLPISALHLTNLDQVVEWLTAQGKS >gnl|To_NUC_proteinmodels_ML|p369 XVVHIDPTEFQLLDKPHFGYAHGKAPEKLEAAQTVLKNADAYVCVTPEYNHSPSPALLNI LNHFGSSTFSFKPSAIVTYSAGQWGGTRAGVSLRTTLSELGCLPVSAMIHLPKAQQVFDR DGKVNVSAGEDPDQWEKYFARTWFQLEWWAEAAKRHKSIVDPFGESPAFASSPDERNAP >gnl|To_NUC_proteinmodels_ML|p370 MKSAIAFALIGSAAAFAPVQQGRVNTAIAGDARSPDGLGVDPGPLDLFDPLGLVEDADSF PRRRAVEIKHGRIAMAAFIGMMVQELGITFPGSLDLAGDVPFSSVLDDGMGFAALAKVPT FGLVQIALFGFLAETVAMPAGEYTGGPQNLPGGYDGSPPFIPGGYPGQIEDVDARDRALN VEIQNGRGAMLGVFGCMCHSMLDSCDHHFFYPITHN >gnl|To_NUC_proteinmodels_ML|p371 MNFRTTSSLLVLTSLASVDGFSTVSSFTRRPLLVESQNYALRRPSSTVLSMNLFDRFSRV AKSNINNVLKSLEDPEKIMNQAVEDMQTDLVKVRQSYAEITATQRRLMKQKEQADALGED WYKRAQLALQKGEEGLAKEALSRRQQQMETSEGLQTQIDMQAQAVDKLYEGMQALEAKIL ESKAKKEQMIARARTAQSTQQVNDMLGGITGKTSMDAFQRMEDKVEALEAAAEVSAEMGS FDGKALPGSSAGSIESQFKMLEASSSVDKELEDMKKMLGSTSSASKNDGVDDELERLKRE AGL >gnl|To_NUC_proteinmodels_ML|p372 MARTKQTARKSTGGKAPRKQLATKAARKSAPTAGGVKKPHRYRPGTVALREIRKYQKSTE LLIRKAPFQRLVREIAQDFKTDLRFQSTAVLALQEASEAYLVGLFEDTNLCAIHAKRVTI MPKDIQLARRIRGERA >gnl|To_NUC_proteinmodels_ML|p373 MAIPKTMNTAALFAAALVELKFGKDPQVEETGVITGKEMLTNNPGPLVRGDAKAITAKIG EGEFDGLSVKAIVKETGVDDEGLNDFHYEFFPHDTYKDGELKFYHSLGSGKMNLGFNPMA IARLIMRSTKEKGNVKGEGFLQGGWILFKPDGEPVAAFQENAKTRVPIDEIVAAIKKMRE EN >gnl|To_NUC_proteinmodels_ML|p374 MACLTFVRRKNLSISAKTPLPIARRVQLGMTCSTGRQPSWDLMNLPTQEGCSSLIFTSQP TIHSKPPKVHFTTRIYHCNINSNGGICLDILKDQWSPALTISKVLLSICSLLTDPNPDDP LVPDIAQLLKSDRARHDSTAREWTSKYAM >gnl|To_NUC_proteinmodels_ML|p375 MFRSFFVALLLAVSAHAFAPLKGANSSSRPSTELYEIKRASKVRIKRPESYWYNKVGTVA AAAKEGSERYPVIVRFDSVNYAGTNSNNFSYEEIEEVE >gnl|To_NUC_proteinmodels_ML|p376 MLLGISATCRQGRLLSKRLVAGGRGHASPLSAVVRQLSSTAPDSRTVVSSSTSTPSSSNW HARTRPALFPPTAPPINIAHQYFHTTTGRLVDAAKEVTSETTYRALSPEIRSAIIGDLNS VDFDKNGKIDAEELKALLRRHNDSFTEAEIIELSQLFYSSLGAKGVEISRFVEALDAVAA LQSTSGGGEMGEGETVPLVKEGAFKTHPLGIGTCASEYMYAKTHGQYTPEDLDINLTHVK PKTFLDRSALFAVKCVRTVFDIGTGWNRGEITTDKILNRAIFLETIAAVPGMVAAVIRHF RSLRNMTRDGGMLNMFLEEANNERMHLLTFVRMKDPGTLIRVAVVGSQFGFGCFFLTAYI ISPAWCHRFVGYIEEEACHTYTRIVEEIQKAPEGTPLAEWRTQAAPKIAKGYWHLGEEGT VYDVMMAVRADEAEHRDVNHAVSGVAEDTVNPLYDPRVKLDNMLRQYVKDIMQREKTEAK PAGVTD >gnl|To_NUC_proteinmodels_ML|p377 XSSCRRGIGKAAESSCRATSTNLPKSFFITSFLSSPCGRISQILPLTSPARTNPRSETTT LRAQRRKKSNLQIIVEGREISDQDIQSAIQTQVESAAQSEYAVAEGLQREAPDKAHAAHD AHHNYPRVLLHADRLGRDATKRILNVLPKQKALVSTTAEWKRLQRHASETISQTHLRDLL KDEERCDSMFATHDGVYYDYSRQQATSETMDLLFNLAEKQELTSRIDQMFSGEKINFTED RAVLHTALRAPRDEAGTVFVDGLDAIKEVHEVLDQIKAFTEAFRSGQITGYTGKRMRNIV SVGIGGSYLGPEFLHEVLKTEAEGVTSSLGYSLRFLANVDPVDVERTCADLDPEETLIIV VSKTFTTAETMLNARTMRQWLWDFMGDDIDVVRKHVVACSSVSATDKVAEFGIDTENYFF RFWDWVGGRYSVCGAAGAVPISLLYGFDIFRKFLDGARSMDQHFRNAPMRENIPIIMGLL GVWNSSFMRYNSRALIPYAQALLRLPAHIQQLDMEXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXXXXGQTIPVDFLGFVQSQHDLLMDNEKLSSHDELMANFFAQPDALANGK TVEEVRAEGCPEDLILHKVFDGNRPSSSLLFPQLSAYVTGQILSLYEHRTAVQGFIWDLN SFDQWGVELGKKLALDIKEHLMEARNNEIGDYEIAADNAATSRILNYYVENSADAVRGDS SSNPVSSVATSVTRKTHMDHFPPQHHDLGGHGGRLS >gnl|To_NUC_proteinmodels_ML|p378 MQHQRLAHATLLGILLQHASPGLTFSVLPSPQSVRPCKRVSSHRLGESRSSRSELWKIER IDRTNDWTDSAKTYPSPLALKPNIPKGWFIDPLRDAVAAKVQVDALESWEDSACFLSDEE EEEEGDCEFDRGQNAEAFVLAGPRAEIAYDTEKCKAAVVTCGGLCPGLNTVIREIVVCLR RQYGVNEVYGIPAGYRGFKSPETWRLLDEEAVRNIHNQGGSVLGSSRGGHDTFAIVDSLV AKEINLLFVVGGDGTVRGAARIAEEVKNRGLEISVAVVPKTIDNDVPLLDRTFGFETAVS SAREAIDVANTEAEGFPNGLGVVKVMGRNSGFIAMHSALGSCVVDLCLVPEVDFFLDGPG GIVDHLYERIRANDKAVVVVAEGAGQKLMAAMGAAGETVTDASGNILLDDIGPWLCRQLK TRLDPRLEEASDHGDKLTLKYLDPSYSVRGIPPLTADNLYCLQLAHNAVHGAMAGMTNFL VGAINTRECYIPIPLVADKRNVIDTRHQSLWETLVFATGQPSFQREGDDAEDFDALTTAS GGVVLPD >gnl|To_NUC_proteinmodels_ML|p379 MANYKFETTDDTKLSFWRESMDSDAPTLSRYVMSQTKDSELVLLLNALATSFKLIASAVR RAGVAQLYGLAGEVNSTGDDQKKLDIMSNDMMINALINSGVCSILVSEENEEPIIVPVDK AGKFCVAFDPLDGSSNIDCNVSVGTIFSVFEKKDGVVEDLLRSGEECICAGYCAYSSAVE LVFTFKDSTVEGFCLDPTIGEFVHTRFNMQFPADGGKRIYSCNEGNFIHWDQPIKDAVDA FKNGIEGKPYSARYVGSMVADIHRTLLYGGIYIYPADKKSPKGKLRMLYEGIPMALIIEQ AGGIASTGYFNGKIGEVSKLTPDAIHCKCPIIMGGHRDVGVVYDCYKKAGVEVPDLKKDD >gnl|To_NUC_proteinmodels_ML|p380 MKLAIATLLVGSAAAFSPAATVSRNSALSMSTATEEKVRHRKDVSELFPPSRIYYDHTMR TNLHGHIATRMCSMVHSMLTYAKLPASVTSGVVTGQALVDLLDAAKDQGYAIPGVNIVGT NSINACMEAAAKYGGPIMVTFSKGGGQFIAGKSLPNDADQASIAGTIAGAKHVHEVAKLY GVPVVLHTDHCQKAWLPWIDGLLAASEEHFAKEGVPLFSSHMLDLSEEPLEENISICKSY LEKFAKVGILLEFELGVTGGEEDGVDNTDVDSSRLYTQPEEVFYSYEQLSQVENGAFTCA ASFGNVHGVYAPGNVDLQPIILYNSQKYIQEKIGSDSDKPMKFVFHGGSGSSVEDIQYAI GAGVVKMNIDTDTQWAFWDGVRAYEAKYHDYLQGQIGNPEGDDKPNKKYYDPRMSLRAGE ESMAARLCKAAEDLKCVNVLN >gnl|To_NUC_proteinmodels_ML|p381 XGGVSSDLGTPCEEECALESFPNMPESVHPGVLTGQAQVDLLNHAKENGYAIPAVNCVSS SGINACLEAARRNDAPIIIQFSSGGSQFYGGKGLDNKNYAAAIAGAVSGAFHVRTMAEQY GVPVILHTDHCAKKLLPWIDGLLAASERYYEKHGEPLFSSHMIDLSEEPIEENIDICKDY LSRFSKIGMLLEMELGITGGEEDGVNNEDVPLEDLYSKPEEIYQVYDALTPISDQFTVAA AFGNVHGVYSPGNVKLEPKILTRAQAYISEKLGGSAPSDGKPVKFVFHGGSGSDVSDIQE AIGYGVIKMNIDTDTQWSYWEGIKNFESKYHDYLQGQIGNPEGPTKPNKKYYDPRECMRA AEVNTVTRLDQCFADLKCVNILGLGEKADAENVLGPRRGGLPV >gnl|To_NUC_proteinmodels_ML|p384 MAPPCDVPVGVITGADVLKLLKHGEFIEARRLLSKGKRSKLALRAAPPGSSVGDPRCFDM DNELTLRPLSMHGGTQPRTMAMPSPHSTAPGASSVCNTVMEAAAKFNSPVIIQASNGGAQ FMAGKSIKDKKGAAAGAVALALHVRQMAPYYGVPVILHTDHCAKKLLPWMDGMLEANQAY YDQYGEPLYSMHMLDLSEEPDEENIAICTEYFKKMAAMDIFLEMEIGVTGGEEDGVDNTD VDQDKLYTSPEQVWSIYEALSPISPMFSIAAAFGNVHGVYKPGNVKLSPERLAKAQAYAK EKLSSDEKNPLFLVMHGGSGSTEQEIGDSVAAGVIKMNVDTDTQWAYWDGARKFIAEKHD FLQGQIGNPTGADAPNKKFYDPRVWTRKAEESMCERCGLSMKTLGCAETFPCKKPESGSI PMWNFAPKK >gnl|To_NUC_proteinmodels_ML|p386 MLEETTYTATHHYELCTKYKRKSNGTIASVEKIVADFNGAGPIPANAEVVIGCPYIHIPL LLSTLRSDIEVAAQNCSLTGMGAYTGEITASQLKDLGVNWVILGHSERREGFGMAGEDSE LVAKKTKMALSEGLKVMLCIGEKKEEREAGTTMDVCAEQLKPAAAMLSKEDWANVSIAYE PVWAIGTGLTATPEMAQETHASIRQWMSENVGADVAAAVQIQYGGSMKGANAAGLLGQTD IDGGLIGGASLKMDFFACINAVPEP >gnl|To_NUC_proteinmodels_ML|p387 MIGQTLAFTTRSASVASKRIGSSLRMASSRPYLIGGNWKANGTVASVEKLVAEFNAAGPI PPNTDVVIGAPFIHIPMLLNTLRPDIKVAAQNSALTGTGAHTGEVCASQLKDLGLEWVIL GHSERREGFGMAGEDSKLVADKTKLAIDEGLKVMFCIGEKKEEREAGTTMDVCAEQMAPL AEILSEDEWANVSIAYEPVWAIGTGLTATPEMAQETHANIRQWVAANVSQKVADAVQIQY GGSMNGGNAKALLAENDIDGGLIGGASLKLEFFDVINGAGAPVAA >gnl|To_NUC_proteinmodels_ML|p388 MKLAVALLAAASSAGAFTPGSNGVRSSTQLNARQPIVSTNGRASSPDDFVFSRXXCGRRK LTFENANCLFHFVFKMNPATEVKDIVSKEGITVGSQQIFYEDKGAFTGSVSASMVKSIGC EYVLCGHSERRTLFSDSDSNINAKVKKVLEFGMKPILCIGETQQEYDLDITQEVCAMQVS KDLKGVSKEEMQNVVIAYEPVWAIGTGLVCDADAANEVHKFIRTVLAKMYGDEVAQATRI QYGGSVTPDSVDGLLAKSDIDGCLVGGASLFSDKFARICNFKAE >gnl|To_NUC_proteinmodels_ML|p390 MKLQIVASLLSTAALVDAFSTSASIRASSTTSLYERKPFISGNWKLNPQSKDEAVALASG IAGSITESSPDAEVALFVPYVFIESAQSAVGGKLSIGAEGVCPQGNGAYTGAISAPMLNS IGVNWVLAGHSERREIFGETDEYINEQCLKIIDEGMSVMLCIGESLAEFEQDLAGSVCAV QLKKGLKGIKKEDLGRVAIAYEPVWAIGTGKVATPEIAQSVHETCRSIIAKIYDDEAAQD MRILYGGSVSPESVDDLMKQPDIDGALVGGASLDSDKFGRIINFQV >gnl|To_NUC_proteinmodels_ML|p391 XDWRGGRAACYNIIPSSTGAAKAVGKVIPELNGKLTGMAFRVPTANVSVVDLTAQLDKGA DYETISAKIKDASEGSMKGFLGYQDEDIVSSDFIGDARSSIFDKKAGIALTDNFVKLVSW YDNEAGYSNRVLDLISHMEAQK >gnl|To_NUC_proteinmodels_ML|p392 MKLINAALALLALPAANAFAPLAPSAGLRTQTAGSFNFELEAKKSIEDLTEDELKGKKVL VRCDVNVPLDGKTITDDTRIRSSIPTVEFLKSKGAIVTVCSHLGRPKDGPEDKFSLGPCA DRMGELLGQEVKLAPDCIGDEVAAIVDAASEGDVIMLENTRFYQEETKNEAGFVEKLAAP FDMFVNDAFGTAHRAHASTEGVTKFLQPSVSGFLLAKELEYLDGAIQNGEKPMAAIVGGS KVSSKITVLNALLDKCEKIVIGGGMVFTFLKAKGFSVGTSLVEEDFVDTAKEVLAKAEEL GKTILLPSDIIVADNFAADANTQVVSADAIPDGWMGLDNGPASTAEQKEFLSDCKTVIMN GPMGVFEMKAFEKGTFGIVDILADLTKENGATTIIGGGDSVAATELSGRAGDMSHISTGG GASLELLEGKVLPGVAALNEK >gnl|To_NUC_proteinmodels_ML|p394 MLKSLVILSVCASSALGFVAPGSKETARQSTSAGVVSLSGVSRSTSQLEAHKLVLIRHGE SEWNDLNIFTGWADAALNEKGLREAAQGGQLLKDXXXXXXXXXXXXXXXXXXXXXXXXXX XXXXXXXXXXXWRLNERHYGGLQGLNKQETVDKHGKDQVLVWRRSYDIPPPACDESSEYY PGNDPRYANVDKEDLPFTESLKITGERFMPLWENEIAPKILAGEKILIAAHGNTLRALVK HLDNISEEEICGLNIPTGVPLVYELDDDLNVIPNENGIGSLQGMYLGNQEDIRARIEGVA NQTK >gnl|To_NUC_proteinmodels_ML|p395 MLRHFISLRAAATARNYSSINCNTHYNVVFVRHGQSTWNRDNRFIGWTDTPLTXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPVSKHWRLNERSYGDLVGK NKKEVVKQFGQDQVKIWRRSYDQPPPPMSDDHEHHPKRDARYRLMLDDIPKSESLKCTRN RTKIYWDDVLAPALRSGKTLLIVGHENNLRSIIMRLEGISETDIINLSLPRAIPLAYRLD KDLKPLTRTDGGKDEATGMLSGYWLGGDDSVREVLERDHAQVX >gnl|To_NUC_proteinmodels_ML|p396 MMTPRDLARREGTRMDGPLTTSGGGLPRGPLPQVPSSPAVLVVRFYPASLSHDKRRPTIA NIAQHPFPSFVAGDMFAFARNTITLGRSASRVAPNVRSLAAFQQTADKHTLVLVRHGEST WNLENKFTGWYDCPLSPKGEEEVVEAGRQIKAAGIVPSVAFTSMQKRAIKTLQGCLEETD LMWLPTTKAWELNERHYGALQGLDKQETVDKHGKDQVLIWRRSYDIPPPTVDKSSPHHPS NDPRYADQDFPEEWTESLKTTLERVTPYYEKNIVPELKAGKDSSEMMRGRTMKRTNDSAH GNSLRALVKHLDDIGEDVIAELNIPTGTPLIYELDDDLKPIPQEGAIAPLSGRYLGDQDA IRARIEGVKNQTK >gnl|To_NUC_proteinmodels_ML|p397 MNRSAFVAFIHATFFGEKKEIASLVLHSTPTTGPTHTLILTRHGDSIWNGKHVGCKETFT GWTDVPLSPLGEKEAIRTGEILADLTREVEIDALFTSTLSRAKMTAHYVWWAYNENLEEQ YRREKHYQYYNRPGNMPQQNSMQYQGPAQWLADYRLNERHYGSLQGLVKVEAERGDFGHS ADEVYKWRRSWHAKPPILDDDDPRRLEERRRFGPICGLENIPRGESLNCVAKNRIRPFVE EKLTPLLEEAARRKPGDEGGTALIVAHANSLRALIGVICKADQNETAAAFSKLEAMKIPT ASPLVLQFRLVEKQFTPVDSIGEGSNPLPVYPIQYLF >gnl|To_NUC_proteinmodels_ML|p398 XVELCTEDGKFVASVPSGASTGAYEACELRDGGDRYLGKGVLQAVANVNDVLGPAVIGMD PADQQAVDDKMIEIDGTKNKSNMGANGAAANNLPLYAHYAKLAGNDQSKYTMPVPCFNVI NGGSHAGNKLAFQEYFVIPTGATSFSEAMQIGCEVYHTLGKIIKSKFGGDATLIGDEGGF APPCDNREGCELIMEALKKAGYEDKCSIGLDVAASEFRVKGEDKYDLDFKYDGNVISGQE LGDLYQSLAADYPIVTVEDPFDEDDWENWSKFTDKNGEAFQVVGDDLTVTNIEKIDRAID EKACTCLLLKVNQIGSISESIDAVKKSKQAGWGVMTSHRSGETEDTYIADLAVGLCTGQI KTGAPCRSERLAKYNQLLRIEEELGAENTDYAGKGFRTTSWMG >gnl|To_NUC_proteinmodels_ML|p399 MLASKFIRPAARSMAAKRSMSAITGVQGREVIDSRGNPTVEVDITTADGTFTASVPSGAS TGAYEAVELRDGGSRYMGKGVSKAVANVNSVLADAVKGLDAADQRTVDDVMIKADGTPNK GALGANAILGISLAASKAGAAAKGVPLWKHYADIAGNPTPDVLPVPCFNVINGGEHAGNK LAFQEFFVIPTGAETFSESMQIGCEVFHNLKKVIQGKFGGDATLIGDEGGFAPPCDVESG LEMIMEATANAGYLDKVTVGLDVASSEFKVAGENAYDLDFKTTGADKDASLKLSGDELIA FYKELIAKYPIVTIEDPFDQPYRVLKDDWDNWTKFCKDVGTDVQVVGDDLTVTNPTFIKK AIDQGSANCLLLKVNQIGSISESIDAVKLSKQNGWGVMTSHRSGETEDAYIADLAVGLST GEIKTGAPCRGERTAKYNQLLRIEAEIGAAAAKYPGMNFRKPGWMG >gnl|To_NUC_proteinmodels_ML|p401 MQSQSXXXXXXXXXXXXXXXXXRLRPDASSRKTKIICTLGPACWSVETLEQMIDAGMNVA RFNFSHGDHEGHKACLDRLRQAAKNKGVNVGVLLDTKGPEIRSGFFADGAKKIELTKGEK ITLTSDYAFKGSSKRLACSYATLATSVKPGQSILVADGSLVLTVLSCHPAEGEVVCRIEN DCSIGERKNMNLPGVVVDLPTLTEKDIDDIQNWGIKNNVDYVAASFVRKASDVHKLREVL GESGSKVKIYCKIENQEGMENYGEILDATDGIMVARGDLGMEIPPEKVFLAQKMMIREAN IAGKPVITATQMLESMIVNPRPTRAECSDVANAVLDGTDCVMLSGETANGEHPIAAVSIM GRTCVEAEGAVNFDSLYQAVRNSTLARYGFITTSESIASSAVKTAIDVNAKAIIVMSESG NTARQVAKFRPGMPVKVVTTSPQVARQCYGTLKGCSAYVVESMEHEDEGTKQCMEDLKAA GKASPGDSVVIVHGSVAKAGATNTMKIEYF >gnl|To_NUC_proteinmodels_ML|p403 MVHEHCQALQRDLHCPPRFFMHAFVLNDPSTPPVLHPIHQAENLLAQSKRSMATVDMTVR DAINSAIDEEMDRDEKVYVLGEEVAQYDGAYKVTKGLYQKYGAKRVIDTPITEMGFTGMA IGSAYKDLRPIVEFMTWNFSMQAIDQVINSAAKQYYMSAGDIACPIVFRGPNGNAAGTSA QHSQCFAAWYSSVPGLKVVSPYNSEDARGLTKAAIRDNNPVVILEHELMYGTSFPX >gnl|To_NUC_proteinmodels_ML|p404 MIFNRLAVAAALCAAGSTAFIAPKNAGQVSFSHRVSSPLSAATLETAATETEEVAKPEVM RPPIDYVWNDVAGQLKEAFGYSDAEIESYSNEVEGSKDNLMQIYKALQMARGFENACNQQ YMQGKIRGFMHLDNGQESIPALVDYAIKNGDKKYSYYREHTHAIASGVDPGEVMAELFMK ETGSCKGAGGSMHIFDKSAYFQGGWALVSEQLPYAGGAAKSILLDRALGISDDENYEKQD VAPPADDDRISGGAQNGRTAELLNSAAKDNLPLLLLVIDNGRAINTYTGDVATNGDVYQQ GKHYGVPGLKVDGHDAADVAKGGKAVIDYIRSGKGPAILQVHTYRFNGHSPADPEHERGR KDEKAWARSDQDPIKKFEDKYTANGVFTEDELKAAKKEILAEVKAAVKFADESPMPPVEL AKELEYPDAPDTDYNQRPAPTYADAVNKRTISDEQMATVTAHIDALREKAKAGDISIGDA INLAIHEEMLRDPATTIHAEDLQAGSSYDIPKLTQQTYGQIRAADEIIDEGHFIGKALGE GMNGYRPIVELMNTNFGIFGMAELSSAGNTYATTGGQFDMPMTIVGAGGTAPNQSLGAEH SQPFHAYVMGIPGLKICTAASPDAAYGITKSMIRDNGPCFLFAPVKMMKEAKGVLDLDVC APLNKAALLHEASADSVANGNAVTVLTYLHGVKEAQLVTEEITDEGFDIDLIELRSLKPL DMDTIRTSLERTNKVVILDESTQSGGVGATVSARINEELFDLLDAPVKRLCMDDAPVPYA STMEVGVVKRGSDLVQAVFEI >gnl|To_NUC_proteinmodels_ML|p406 MFASRAASLLLRRSAVPSAASNAACNSLRGGLSVRQMSSSDEVTFDLTGSFETHNLETAP SDTITMTKDELLSHFELMYTMRRMEITCDNEYKARNIRGFCHLYDGQEAVATGINAAFDL EDSWITSYRCHCTALARGGTVESVLGELFGNAGGQTKGKGGSMHFYNKAHNFFGGQGIVG AQVPVGLGLAFTNHYNAKPGETMNVAIACYGDGAANQGQIWEAANMASLWKLPMVFCIEN NHYGMGTSMERHSSHSDYYKMGNHIPGVRIDGMNVLAVKEGMRFVKDFVGSGNGPMYVEM MTYRYHGHSMSDPGTTYRNREEIALTRSTRDPLEFVKKTLIDAGFADAEQIKETEKRIRK DVASQVKAAKASPKPTVEPHLFEYVFTSDGGKEQNEFPPHIRMPDFAKSKWY >gnl|To_NUC_proteinmodels_ML|p407 MRAAIRTGLLGWTISFGEQPKPEKVADTELLIKVENAAINPVDFKIPRAIGGKVVGIDVS GTVERVGSSVTDFQVGDEVFGRAIGSSGMRGSLAEYTVVSQDEVAKKPGDLPFNEAAALG VAYLTGLQSMKVGNVGEGSAVLVIGASGGCGVAGVQLAKALGAERIVGICSGKNFDFVRE QVGYDALELVDYNDSELMKEFRDENKAKFDCIYDTSTGSTADENYSTSVMSMLKEEGQYV QINGGMADWVKHAAGRKTPRKTMVLTASKGRKSLEEIASLLQKAEVKPHLNIKKFDEAGV KEGFEMLKGRRTRGKIVFNIN >gnl|To_NUC_proteinmodels_ML|p409 MVGFSGKGVVLAAALASSCNAFTQTTPTSLSGVTGSSSALSAAPATIEQVDKRTGKPTGT SFLPGEAIDRAAKGNPIEKAKLAKDGTSAFVDVYEYAAKIREGSMTWDEVEKADLNTRLK FVGMLHRDKRTPGQFMMRLKVPNGIVNADQMRFYADCVEKYGEERGVVDITTRQNIQLRG VKIEDAPDIIDGLHARNQTSFQSALDSVRNMVGNPLAGIDDMELVDTREFCNALNDLVSL DPETNTRGNPKWGNLPRKFNIAISGSRDDYAHTHINDIGLVPVAHAETGEMGFNVVLGGY MSIKRVAESIDSNMWIPADRESVVTLSEAILRIFRDESERKDRQKARLMWLVEKYGVEDF KAAVTKEVESYGRGVTVGDAQPLPTDGPFERRELLGVHAQKQEGKSRVGILVPTGRLSPK ECRDIADLADEYSGGEVRLTVEQNVILPNVDDGKVKALLKEDSLGQDSRLKVNAGFIEGN TVSCTGAQFCGLALIETKSHAEDISKKLESLVDVDRPIRIHWTGCPNSCGQVQVADIGIM GGPARQLNEETGKMMAVPGCKIFVGGRIGEDAHLALEPYKEGVPLAEEVLVPELVEILKN EFGAKDKKRGFRSKVKSLIGRG >gnl|To_NUC_proteinmodels_ML|p410 XGADESYVLEKAEGATPISAYLDIDQIIGIAKDGGVDAIHPGYGFLSESPQFAQACADAG IAFVGPTVENLDTFSDKTSAREAAIAAGVPVVPGSDALKDADEVNAFVDEIGLPVIIKAA MGGGGKGMRVVRERSDLIPFFESASSEALASFGDGSVFIERFVDRPRHIEVQIIGDGNGN VIHLWERDCSIQRRHQKVIEMAPAWSLPDDLRAALHKYAVDLTSKAKYKNAGTVEFLVDQ ENNPYFIEVNPRIQVEHTVTEEVTGIDVVQTQIRIAGGASFEEIGLRQEDITPRGVAIQC RVTTENPERDFAPDTGIISLYRHSAGAGIRMDGVGYTGLAITPYFDSMIVKYTARGSSFA ETVARMRRVLIECRIRGVKTNIPFLLNVLTHPEFESGVVTTAFIDENPGLKKISESTWDF ASEEQSDQRKVGKSERLIRYLANLAVNGHPPELGADPTKIVSERTSGTTTVMAVPDEVKA KSTGGMRKILLEQGPEGYAKHVREHKGLLLMDTTWRDAHQSLLATRMRTKELERCAEYTN AALSNAFSLEMWGGATFDVAMRFLHECPWERLESLREKCPDVPFQMLLRGANAVGYTNYP DNVVKKFCKQAKESGVDIFRVFDSLNYLDNLKLGVEAAGEAGGFVEGTMSYTGNVADPTK GKYNLEYYMKLADDLVGMGVHSLAVKDMAGLLTPAATTMLIGALREAHPDTPIHVHTHDT PGTGVASMIAAAQAGADVVDVATDAMSGLTSQPSMGALVSVLAGTDLDTGIDKSSIGPLN TYWENVRSMYLPFESGQLSGSSDVYEHEIPGGQYTNLLYQSRQLGLTDRWPEIKRKYAQA NVLLGDIPKVTPSSKVVGDLAQFMVSQNLEPDQVLAEAETLAFPESVVQYLRGEIGVPPG GFPEPLRTKVLSESTERPGEALADYNFDEATELLTEKYGEKFVNEKDVLSHALYPNVFTE WKEFEAVYGEVGNLPTDLFLNPMKEGDEVEFEQSTGKRVIIKLVSIQPPREDGSRTCTFE VNGERWFMSVTDQSVVDSADIRRKASGPNEIGSPMPGVIVGLKVKEGDSVEEGDPLATLS AMKMETVIPATRAGVVKQVSVNVGDKIDGDDLLLTIEDE >gnl|To_NUC_proteinmodels_ML|p411 MIRGGGYAAAATALLALSVNSFAPQHSAFTPQSIAPQKAGVFGTIKSAFFNKHETTSALS AATLDAPASTVQTVEDYVEARGGNRPIRKVLIANNGMAATKSIMSMRQWAYMEFGDEKAI QFVAMATPEDLKANAEFIRLADSFVEVPGGKNLNNYANVDVITKIAQEQGVDAVWPGWGH ASENPKLPNTLNALGIKFIGPTGPVMSVLGDKIAANILAQTAKVPSIPWSGSFGGPDDGP LEANLNAEGTIPDEIFEKGTARTVEEAIEAARRIGYDNGIMIKASEGGGGKGIRFVDNEE DLAAAFETVKSEVVGSPIFIMQLCKNARHIEVQIVGDQHGNAVALNGRDCSTQRRFQKIF EEGPPSIVPKETFHEMELAAQRLTQNIGYQGAGTVEYLFNADTNEYYFLELNPRLQVEHP VTEGITGANLPATQLQVAMGIPLYNIPEIRMLYGREDPYGVDPIDFLEERYRDMDTHVIA ARITAENPDEGFKPTSGSIERIKFQSKPNCWGYFSVGANGGIHEFADSQFGHLFAKGPDR ESARKNLVLALKEMEVRGDIRNSVEYLVKLLETDDFKANNIDTSWLDGIIKEKSVTVDVP DHDVVLGAAVFKAFEHVKSATEEVKESFRKGQTSTGDIPGINSFSIEVAYLDTKYSFEVE RITPDTYRFTLGSNVLDIEVTQTAEGALLANFGGVGHRIIGMDEPLGLRLSLDGNTILMP TIFDPSELRTDVTGKVVRYLQDNGSAVDAGQPYVEVEAMKMIMPIKASESGKITHSLSPG SVISAGDLLASLELTDPSKVKKIGTFDGTLDIDRTEFELDAEKSVSNLLAGFNLDAEAVA TQAFDGADVDSATELVVNALNEFYRVESQFDGQIADDVVRSLTKANVDSLDVVISENQAH QQLNKRSQLVLGLIRQLDTYSDRFGTEIPDSIVEALDTLTTLKGKVYGEVSLAAGEKVRE AKIPSFDIRLAELKAKLLDPETDLVQLSQSSTLSAGVDLLTNLFDDEEDEVRAAAIEVYT RRVYRTYNIPSMTVEDVSGTPSCSFDFQFNDVPASDRVTRHGYHAVIEDAGTFASVLPDV LNQMGAKMEGDAAKDGPVNVVQISPLKGDASVEDLQAAVYANKDKLNMLGIRTVTVLVPR AKKDPMYYSFPQCEGFEEDPLRRGMRATFHHLLELNRLTDNYNVERLPAVGRSVQLYVGS EKTVRRNPAQVVSVRGITHTPGLTTFSGARRALLQGLDELERAQGNSKVSLQSSSRIYIH SLPVVEGSTPEEIAAEFNEVIDKLKGRLAQRLLKLRVDEVEAKVRVQSIDADGNPMIVPI RLVASSMEGEWLKTSAFIEKPDPVTGVTREFCTIGDTDGACVLDPYDGANIVQTKRAIAR RVGSTYAYDFLGLLEVGLIQEWDAYKESLGSDISTPANVFEAQELLEGEDGELYLGKREI GTNKVGMVAWKVTMKTPEYPEGREVVFIANDVTVQSGSFGVPEDEVFFKASKFARENKLP RVYIACNSGARIGLVEDLKPKFNIKFVDEANPSKGFEYLYLDDDTYKSLPEGSVNVEKCS EGWAIKDIIGTSEGIGVENLQGSGKIAGETSRAYDEIFTLSYVTGRSVGIGAYLVRLGQR IIQMKQGPMLLTGYGALNKLLGREVYTSQDQLGGPQVMYPNGCTHEVVDDDQEGVKSIIQ WLSFVPKTTDALPAARESSDPVNRPVEWKPTPTPYDPRLMLAGTDDASGFFDKGSFKEYL DAWGKSVVIGRGRLGGIPMGAISVETRLVERVIPADPADPNSREAILPQAGQVLFPDSSY KTAQALRDFNNEGLPVMIFANWRGFSGGSRDMSGEILKFGSMIVDALREYEHPIYIYFPP FGELRGGSWVVVDPTINEEKMTMFSDPEARGGILEPAGIVEIKFRAADQIKAMHRIDPQL QLLDAELESADDDSKADIEEQIAAREEILKPVYLQAATEFADLHDKTGRMKAKGVIKEAV PWADSRKYFFYLAKRRIAQDNYVKTLKAAGSSLDSTSALDIIKX >gnl|To_NUC_proteinmodels_ML|p412 MMVGPRSTNSSISAKDEDKRAPQLDSAHRTTPETMLFGLLRIDMPKSNAATQRNPHLPNA KPQKADHELDLSRNTDPPPPHRRVAGGVPPAVASASSAARSVSTLQETLAEQVPGKQVAL ASLKKEHGSKVIGSVTIDQLIGGARGVKCMLWETSNLDPEEGIRFRGLTIPECQEVLPTY SGKKGDGEPLLESLIWLLLTSEVPTKEQVDTLTAELHARSAKLPSHVVPLLNSLPRDMHP MTQFSIGLSAAQTSSAFAKAYADGVPKTDYHKYALEDILDVFAVLPEIAATIYRNVYFDG VVSKDTTLDYSGNFCRMLGYDDPSFDELMRLYLCIHTDHERGNASAHTTHLVGSTLADPY LSYAAGLNALAGPLHGLANQEVLKWIQALQAKFEKEGKDVTAETITEFAWETLNAKKVIP GYGHAVLRKTDPRYTCQREFGLKHMPDDPLFKIVDTIYQVMPGILTEHGKVSNPYPNVDS HSGVLLWHYGFTQYQYYTVLFGVSRAVGGLCQLYWDRALGLPLERPKSHTPEWLEAFAKN NPDA >gnl|To_NUC_proteinmodels_ML|p413 XGRSALNPTVRRGGAAAFSTQAQQKSDSFLSGASSIYAESMLDMYETDPESVPESWRVYF RSLESDGGPEIDETTFNTPTVVLSSGNLKDAKSNAVVSATLPSDSLGIAHLIRAYQVNGH RSANLDPLGLHSNESFPFRPGNVRSRDDLDDGYADTLNVGFHGFTEKDMDRELNLKGVHT GGNKGYLADLTSMPGKITLRSVLDRLRMTYCGTIGVEYMHIGSTHQCNWVRERVEDPSFW TYDKDKKMHVFERLCFADTFESFLAHKFNTTKRFGLDGGEAVVPALKCAIDRASELGAHS FIIGMPHRGRMNVLANVMRKPMDQIFSEFQGTHFDVEEHMKDAEDWGSAGDVKVSPMADG LVRLVCERNPRPTFFSAKYHLGTSVERAYPDGRKVHLSLVANPSHLECVNPVVLGKTRAK QVYCGDSPEDVRNVVPILLHGDAAFAGQGVVYETMQMAGVEDFNVGGTIHVIVNNQIGFT TNPINSRSTPYASDLGKAFNAPIFHVNGDDAVAVSRALEFAVEWRHEWGTDVVIDMICYR RLGHNELDQPSFTQPILYKAIQKHKSTLDIYERRLIDEGTMTKDEAKEVRAFVLDNYEKE YEASKTYKPKPSDWLSSKWEGFKSPRQHSRVRPTGVDPDVLRHIGMKSGEVPEGFKLHRQ MAKIFKQRVQTSEAGVNIDWGLAEAMAFGSLLIEGNHVRLTGQDVQRGTFSHRHAVVKDQ DTEEEHTPLNSLAKMLNMSAPLEELRLSDTQAKITVRNSILSEFAVLGFEHGYSLENPNS LILWEAQFGDFINGAQIILDQFIAAGEDKWLRQSGLVMLLPHGYDGQGAEHSSCRVERYL QMMEEDPHNVPDMTFDNRTQIQKANWQIVNCTTPANYFHCLRRQVHRDFRKPLIVVAPKN LLRNKRCVSTLEDMGPGTKFERAFDEMDEEISSNPDGVKTLVFCTGQIYYELLTERESQG RTDVAIVRLEQIAPFAFDKVAKYCQKYDSAEVVWAQQEPKNMGAYSYVSPRLMTASREIN KNEKRARYVGRPVSSAPATGMGAVHKREYNAILEGVFGKLSE >gnl|To_NUC_proteinmodels_ML|p414 MKVMSSLLLTAASAAAFQVRPTAKTYLRAMNRVASSASASSLSAEASNGESYDYDLVVIG GGSGGVRASRIAAGHGAKVALLESKLKHGVSPYFAAIGGTCVNVGCVPKKLMVFASRYPG EIGEMAGYGWEGATPGEFNWDTFMAAKNEEITRLNNVYSNVILKNAGVEIIEATGSLDGP NAVNMHITETGETKKVTAKKILIGVGGWPFKPSIPGIEHAITSNEIFYLKEQPKSMVVVG GGFIALEFATIMDGLGTDVKLMYRGDLFLRGFDQDMREHLKEEMTNNSNIDLQFNTDPKE IIKNDDGSLTVVTSNGDSVTVDAVMYATGRKGKIEGLNLESAGVENSGSFIPVNEYSETN VPSVYAVGDITNRIALTPVALMEGHCMADTVFGGMDRPSDHEYVAATVFTTPEIGTVGYT EEQAAEKFGDLDVYKSRFRPMKHSFPKSEMYTLFKIIVDAASDRVVGVHIATDGAGEMMQ GIGIAVKMGATKKDFDNTIGIHPTSAEELVTMRTPSYYYRGGKKVDSLEEVKEAVAA >gnl|To_NUC_proteinmodels_ML|p416 XKTACVSKLFPTRSHTVAAQGGINAALGNMGEDDWRWHMYDTVKGSDWLGDQDAIHYMCR EAPKAVLELEEFGLPFSRTEEGKIYQRAFGGQSLEYGKGGQAYRCACAADRTGHAMLHTL YGRSLAFDTTYFIEYFAMDLLMTPDGRCVGAMAINMEDGTIHRIHANNTVLATGGYGRAY FSCTSAHTCTGDGNAMAMRAGLANQDAEFVQFHPTGIYGAGCLITEGCRGEGGILRNSEG ERFMERYAPSAKDLASRDVVSRAMTMEIREGRGVGPKKDHCYLHLDHLPADLLAERLPGI SETAAIFAGVDVTKEPIPVLPTVHYNMGGIPTNHHGEVIRTNFGKDGEFESDEVVPGLFA AGEAASASVHGANRLGANSLLDIVVFGRACANRIADIASPGDKIPDAPADVGMDSVAELD KLRYADGAIPTSEIRSEMQHVMQDKAAVYRTESSLAEGAAEIDEVVKKIDDVKVTDRSLV WNTDLVETLELRNLLPSAATTMHSADRRKESRGAHAHENYPDRDDDNWMKHTLAYYDEDT KKTSVAYRPIHYYTLDEEECAIVPPVARVY >gnl|To_NUC_proteinmodels_ML|p417 MKRTRQFTHLGGRKASTATLFAATALALVEAVNVSDSSAPLFPASVEMPFPSVTGPLEDA SCDVEQLEQANDSQLHVILAELMATSYFRSFAVDLKQKCPLVAWDRSTKKEKVDQSKSKE EPEESCAGGLPEADAGAEPACSVDVGSPFGTSVGGVGNMAAYPSDSDSEVGGDDAEEFEC TGGRDELDEEAEPLCSLTTDESAGPFSSSALHSISEQIGTNNRWESESQQNTFAWKQETD PVVEHDDEPCEDDSSAGNLPETFWLDICSEIKSGDGMSIVDLQLNPERNTGYNGTHIWNA IYDENCLAVDTTKSEMCYEERVLYRLLSGLHTSTTLSIAKNYYPPSKRKGRVNWEPNPQY FIDKFSDHPEHLRNLHFSYVVLLRALKKASPFLYNFQISTGDTLDDTTASLLLRRLLDTS ILRSCQDVFSAFDESLMFKDQDSVMLQENFKGVFHNVSSILDCVQCQQCKLHGKMAMLGY GAALKILFTREDLIALSRNELVAFLNTIGKLSESGMIDEQAEARLVKQAFQRNPELMVLS KHYAGDLRKFFNFLPNIGGAVGDSARTEEPDAIVVGEKHLRHDATHVEFPLTLCRTFQGT GLAGLTATLNILDRGGRVVLIEKEHRMGGNSNKASSGINACCPQNSTYGDDLDTFRKDTT RSASSSAKPHLIETLVDNSEAAVNWLKSRVGVDLSVLAQLGGHGHKRTHRPNQGMVGAEI IFHISRAVKSYSNSGALKIMMDTKVESLIRNDKGSVIGVHVVSTDGDNSTEAITAPNVVL ATGGFASDRSPGSYLEKYRPELMKMPASDGITLGSSVGAGLVDMDKIQVHPTGWVDPSDP TNPGKVLAGELMRGVGGILFNSKGERFCNELGTRAYVTDKMFSHDRYYNESKTWNITRDI PVFSLVLSSSAAEGARKHVDHYLNKGLLRKIEGVESLANWMNVPISDLLGTLRQYQRDAK TGHDEWGKTSFFGVPATDLSSETFFAGTVTPVLHYCMGGLTIDKYGSVIDEDDNVIPGLH AAGEVSGGVHGDNRLGGNSLLECTVFGSIVGKKIPIKSARTQQANRPALSPAASTDTPLL TMADVEKHNTDDDVWIAIHGKVYDLTDFAEEHPAGPESILELAGKDGTEEFAAVHSAGIL DDFDPVGRIEN >gnl|To_NUC_proteinmodels_ML|p418 MISPRLLLRSPLSTRCTSAASAAAARSVSAHPLSTPTSRAFSTVDQPGDPSRTVVVVDGV RLPFAMASTIYEDQLAVDLQRLAYQGLITKTALDKSDVDYVLAGTVIQEVRTSNIAREAA INAGFPSNIGAHTVAMACISSSVAITSAAEKIMCGKATAVLAGGVETFSDVPIRLTRPIR QKLITMPKAMKKGGPLGAVRHLLKDLKGKDISLETPAIANYTTGEVMGVSSDRLSAKFGV SRIEQDTFTVRSHTLAGKAHSDGWYDGEVIPYKGSTEENGIKADSTVEKVSKLKPAFIKP HGTHTAANSSFLTDGASASLIMSEARALELGFKPMAYIRDWSFKSCDPFEELLLGPTYCS QEVLTRNNLSLETDIGVFEIHEAFAGQILSNLTAMNSQKFADEKFGGKKVGDVDMDKLNT KGGSLAVGHPFGATGSRLVTTASRRLQDENQRFALIAACADGGAGHACILERYDN >gnl|To_NUC_proteinmodels_ML|p419 MVPKVSRGLRVLPLPLPTSDKSDVKAERDALLKEIAQIETWWREPRWDDTKRAYTGEPVG QIXXXXXXXXXXXXXXXXXXXXXXXXHCSNSLKDTFGALDTVQVVQMAPHLSSIYVSGWQ CSSTASTTNEPGPDFADYPMNTVPNKCDQLVRAQLHHDRRQNEERASAILSGKDPGPKVD YLTPVVADGDTGHGGLSAVMKLVKLFVEAGAAGLHLEDQKPGTKKCGHMGGKVLVSTQEH VDRLIAARLAADVLGVELIVVARTDAEAATLLDSNIDSRDHPFIIGASVPGTASLQGEID SSEGANPAEIEREWTTKARPMTFGEAVMEKIRKLPVNEARKQQMTKMWESSNPFALSNAA ARRVADSIFGEKGSVYFDWEACRVREGYYRIKPGVEYCIQRARAYAPHSDLIWMETSKPG IPVARKFSQGVKAVFPHQMLAYNLSPSFNWDASGMSDEQLAKFNDDLGKLGYVWQFITLA GFHSNGLIITKLARSFGDEGMLAYVRDIQRQEKVQEVELLKHQKWSGAELVDRMVNVASG GQSSTAAMGAGVTESQFGKH >gnl|To_NUC_proteinmodels_ML|p421 MVVVNVGEYQVLCRERVPRPLYEYLASGERELDGEPTRTRLISRVTGTDDEQTLSENQSA YKQIYLRPRSMRPVGDLSTRTSLFGSELDFPVFVSPAGVHALCHDEGECATSRACARHGT MFGLSQHSTRSIEDVADATRQLLLRTNNWYQSYILKXXXXXXXXXXXXXXXXXXXXXXXL DSLRLVNYDESPETSNLHSSSDKSKVYNAKESDAWDQNTERLFEDNPTWDDVSRLKDACG DLPLVVKGIMTAEDALAAVNAGADGIMISNHGGRALDGCLASIDVLPEIAEAVGDRVPIL LDGGIRRGTDVLKALALGATCVGIGKPIFFALAVGGEDAVFHVLSMLKTELESAMALCGA RTVQDITEQLVTRHPHGGGHAGRYMRAKL >gnl|To_NUC_proteinmodels_ML|p422 MRPLSLSPVIVHLRQLTLHLQHIPLNAIFDAPAKARLRKAVNIADLRLCAKQRAHKMVFD YLDAGADDEISLRRGKDAYSELEMHFHILSGLKPPLDLSTKIFGQDVKLPFFGCPTAGNR MFHWEGETAAARAAAHHGTLYGLSSLATTGITEIGKLCSGPKVFQLYVWKDRELVREVLA RAKEGGFHAMALTVDFTWYGNRERDIRNDFSIPPKYSLNQMIEAVKKPAWTYDFLSHEPY TYACINTEVPADSLAAFVNSQIAPEFDWKDAEWLLGEWNNASAVKGVCRPDDAIKAVETG FTTMWVSNHGARQLETSPATIDVLPSIREAVGPNVEIIMDGGVQRGTDICKALALGADSV GVGKPYLYGLAAGGTEGVIKAYDILKVELDRAMGLLGTGTVDDLKKRGPGLIKRRHSSAR DYPDRYAYERGYGGGVI >gnl|To_NUC_proteinmodels_ML|p423 MADEFDLHICCMGAGYVGGPTMAVIAAKCPKVRVCVVDLSQKQIDAWNSPDLPIYEPGLP EVVAQCLGKNLFFSTDIDAEIKKADIVFISVNTPTKTMGIGAGRAANVKNCELCARKIAE VSESDKIVVEKSTVPVRTAQAVRRVLDCNERGLKFQVLSNPEFLAEGTAIPDLMSPDRVL IGGVQTPEGIAAAEKLAGVYANWVPREQILTTNLWSSELSKLVANAFLAQRVSSINSISA LCEATGANVSEVSRCVGMDDRIGKRFLNSSIGFGGSCFQKDILNLVYLCETYGLQECADY WNQVILMNNYQKKRFSEKMVSSMFNTVTGKKIAILGFAFKKDTGDVRETPSMFVVRDLVM EQAKVHVYDPQVKREDMWVEMNYTCNLSEETHPGVEAAVTTSTSAYDACDGAHALAVLTE WDEFKELDYQKIYQKMAKPAFVFDGRNILDHEALRKIGFEVHAIGKPDPNKFSDL >gnl|To_NUC_proteinmodels_ML|p426 MSGAMGGGGASEAATSGPGGSHHQSQCCAINFRVRCEDARHGESVFLYQNDFTAGGAKIP LYTTPKSYPWYTTRTPVTVPLSTSRTSISYRYAVYRAGVFHRAEDGAPSLHSVPLSLLQA GELYTVNDCLGTFTDQPDVDHVRLRRSQGGSAASLARRSSFGNYGINPSQSGTSLGSATG KKRVGFAPDRGGGLRAGPAAARQAVNLTSSDGLIVVSAFLPVHLNRSEAGEWSADWDYEA LLSMQTHLRVTRVGTVKWRGWHGNVGGGESSESGVPVEERSKVEAALRPFNCVPVWVPTT LFGEMYNGFCKGVLWPILHNVASVYSSPTDVDHTGDDAQAESDREEYDADYAEYSMDDVA QGPIHGDGGREGELWGAFTAVNRYFRDTIIQCFNEGDLIWIHGFHLMILPSFLTRRIPMA KIGIFFHTPFPSSEIFRTLWCREDLLRGMLNADQVGFHLFEYARHFLTCCRRLLGLNYGM FPDSSGGHNLAIDTNGRHVSVTSIHAGVEPHIINQVLGHRSTVAQVHNIRNQLAGKVIFA AIDRMESLKGIPLKLIALERFLQRCPQWAGKFVLVQVGISAFERGDDYTKTRAEVIAMVN KINDRWPGTVQFQECAESEMRLQQRMALLRAADVAMVTTIRDGLNLIPLEFTIAHMDAKN DTGPNSRKRGICILSEFSSCTRVMRGALHVNPWKISEIATAFYQALTMSDSERIRRMSIA SEFVTRVTTQRWALAVMLDLKGVMKNEGAGRYAGAGLGLGFRLLGMDSGFNSLDVPSVSR GYRNALSRLILLDYGGTILANDNLDGLQRYQFVKKSRAPSVPKECLIETLKELCADCRNT VFVVSGKERHSLTKTLCHIPNLGLACEHGMFVSWPTPKVGGKRVWETLVPNQDQTWKSIA IAIMEVYTSRTHGSYIEETEMKVLWQYRDADPEFGFLQSKELEDHLSNVLRGFSVDILHG GVEEGGYVEVRPKGVNKGVAAMHIVNNLEKIPGKRKVDFALVMGDDHCDEPMLSVMRQVG RRAAESRSTKSGTQLAPLPATIARVDVSSCDDHISPALQVFTSTVGKKPSAAANYVNDVD EVHELLASLVKVTNRELVQTARDGFYSVLDLKSMAPNVNSSMTSSLKPVLSMQDLKTESG DDFEVSFPTQRVSTNLADFLGMTTIEDGNDEEEDDMFF >gnl|To_NUC_proteinmodels_ML|p428 MIRGRLTRVLLAALMLLESCAFTPPHRMENRLFLDTADTAEWRSLLPTGMLFNTDTHIEN MLKKFHGVTCNPTLLERANETCTVSNLHKLAAEALTYTNEFMCQSWGSDSQEMYDNGMKI SEPDRDKIVIKVPVTKEGTVAATQLQKSGVRICLTACYGSKQALVAAGMGVEYIAPYLGR MTDNGKNGFDECVKMQQIVDGLQADTRILAASIRDAESLADLAAEGLDTFTFSPDVARQL FDEPLTLSAAEEFEKAAARGSQT >gnl|To_NUC_proteinmodels_ML|p430 MKFSMLAIAGAALAAGVEGFSTARSFNVARQVSSLKMAEAATEVDKAAALATAANEARGL AMDSIAAAHSGHMGLPLGCAEIGSVLYGSQMQYNPSDTTWINRDRFILSAGHGSMFLYSW LNLAGFDLSIEELKNFRQHHSQTPGHPEFPNSEHTTPGIECTTGPLGAGVSNAVGFAVSE KMAAAKYNTEDHTIFDHHIFALAGDGCFQEGVSAESATFAAHEKLDNLIVLYDSNEVTLD KMAEYTQSEDIIKRYDAYGWETYEIDGHDLAAVEETIAAAKASNNGKPKFIKCNTIIGKG MEETEGTNAAHGEAGVPYVDKARASIGLPEEKWFVSEGTRDFFKGVQEKNGKIYDDWQAT YASWKEANPALASELEDAVADKTMPAEDMIADIPEMGGDAEATRVSGFKVIQDIARLVPN YISGSADLHGSTRNYINDGGNFGAGFDKSYAGKNLYFGIREHSMGAILNGIAYHGIFKAS GSTFLVFVDYFRPTIRVASLAELNRVSYILTHDSIGVGEDGPTHQPVETVSGLRVIPNLD VYRPADAEETVAAYVSSVTRQEGPTALILSRQNLEQNTDMPAMERRAGALKGAYVAKKES ADLDLIIIATGSEVQHALKAAADMPGARVVSMPCMEAYERQSDDYKESVLPSTCTKRVAM EAGVSAPWYKYASKVVGVDRFGFSAPGDIVMKELGMSPENLASEIATMN >gnl|To_NUC_proteinmodels_ML|p432 MVYSVKSKMEPPSKKEKVDPQEAQAASYSGTDLELKCVNTIRCLAADMVQKANSGHPGAP MGCAPMAHLLWSSSSSGGAGRKHSARNPEWWNRDRFVLSNGHACALQYVMLHLGGYEDCK MDDLKAFRQVGSTTPGHPENFCTKGVEVCTGPLGQGISNAVGMAIAAKHLGAAFNTADFP NIIDSKTYVICGDGCLQEGISGEACSLAGHLGLGDLVVLYDDNHITIDGDTDLAFTEDVK MRYEAYGWQVLTVDDVANGLDGLRAAIDEAKNTADRPTLIKVRTDIGYGSPAKQGKASSH GAPLGDAEIEAVKGKLYGMDPAKKFSVDDDVAAYYKQQAEEGEKAMAEWDATFAKYSEAH PDKAAELTRRFNRELPDGVYDDLPTFVFGKDSAKATRKFSEACLNAVAPKARELVGGSAD LTGSNCCAIKGEPDFQRATPEGRMIRFGVREHAMSAICNGMFAYGGLRPFCATFLQFAGY ALGAMRCSALSRFGVLYIMTHDSIGLGEDGPTHQPVEMLESLRSMPNMNVCRAADANEMA AAYQIAMDSVHTPTVVCCTRGTVSPLEHSCRTKAMRGGYVVLPEGGEGAPDLVIVATGSE VGPSVDAAKALGSEHSIRTRVVSMPCQEVFLAQDAEYRASVLPGNVPTLSVEASSEYGWH RFSHGQIGMTRYGMSGPASELFEKYGFGKGNIASKGRELVEFYKNVPVPDLNARPTFVNF SGEGH >gnl|To_NUC_proteinmodels_ML|p433 MKFSALFAASALSTASAFAPVASNNRAATSLNVWGDKDYLIAPSILSADFAKLGQEVDNM LEAGADVVHFDVMDNHYVPNLTIGPMVCKALRDHGVTAPIDVHLMVSPVDAAIQDFIDAG ASYITFHPEATNHIDRSLQLIKNGGCKAGLVFNPATSIDYCEFVKDKLDIILLMSVNPGF GGQAFIPATLDKAREARKFIDDNGLDCRLEIDGGVKVDNIKEVAEAGVNMMVAGSAILKE PRTKESYKETIDAMREQLAQV >gnl|To_NUC_proteinmodels_ML|p435 MKTIAAALTTLLSSQHVAGFAPAAGVSRSRSISSLRMAETALSQDELKKMVGYKSVDDYV ESGMVVGLGTGSTAYFAVERLGQKLKDGELKDIVAIPTSVRTKEQAESLGIPLVTLDTHS KLDVAIDGADEVDPDLNLVKGGGGALLREKMVEVCADKFIVIVDESKLCDGLGPGFPLPV EITPFCHEHTLRTIASLPTCEGCDAVLRMGSSSTNKPDGDEIAVTDNGNYIVDLHFKEAI KDAPAMASELKNTVGVVDHGLFCDMTTAVIIAGSDGISVKD >gnl|To_NUC_proteinmodels_ML|p437 MKIIAAALLASASAFTSFTPSNLRGQSALAALKDGETPIVIGLAADSGCGKSTFMRRVTS TFGGDQCGPLGGGFGNGGWETNTLVSDMATVICLDDYHLNDRQGRKVSGLTALNTAEQKF DLMFEHVKALKEGKSVMKPIYNHVNGTLDTPEEIEPTPVIIIEGLHPFVDERVRDLIDFS LYLDISPDVKLNWKIQRDMEERGHSLESIMASIEARKPDFDAYIEPQKAFADYVIEVLPT DLDKEDKKTLKVRAIQKKGVADFNPTYLFDEGSEVSWTPSADKLSSPAPGMKLSYTQEEY FGNDVSVLEMDGSFDNIQELVYVESNLGNANSKFYGEITQAMLSLADSPGSNNGTGLMQT LAAFAIRELYNKKAASAKLAASKEAVATA >gnl|To_NUC_proteinmodels_ML|p439 MARWFRSEHMEYISLIVNEDAAHDCLADLGKLSAIQFTDLNPDLTPFQRRYVSYVRRCDE LERKLRFFGSECDNFGLNLETAGDIDSFIESNSSIVGARSGVAGKAASGTQLLESLEVEL EGYESQLKELNAYSEKLTREYNEKVELQEVLEKARRFFMTDAPRLAVSELSSGRTKTGSH QGLLAGAHDDEARADLDMRFSSITGVVSTEEKVRFERMIFRATRGNCYVRFAPIKQPITD PESGALVEKCVFIIFFKSLSIETKLKKICDAFFAHRYSLPDMDDAPAVDRMLTENAQELV DSRTVLLKNQDTRFRLCQMLAQHTERWTWIVLREKAVYHTLNMFKADVSGMLRGEGWVIS EKFDDVRMSVNRAHSEMDSNMPSHVDQVAKPWPTPPTHFTTNKFTYGYQEFVNTYGIPRY REANPALFTAATFPFLFGVMYGDIGHGLFLFCAGLYLLWNEEKNDKAKLDELTAGLHTGR YMMAMMGFFAVYAGLVYNDCFSLGLNLFGTRWSFGSDQPEEGDVAEMTGQYGDGDSVYPF GLDPMWHVASNELLFFNSFKMKLSVIFGIVQMFSGTCLKGINALYFGKRLDFLFEFLPMV AFASSLFVYMVVLIFMKWSINWNSRMLSATCLDPNDAGWGSPDYPPNLITLLINIALAPG VVDEPLYSGQASIQNFLLLVAGLSVPTLLCAKPYFLSKEMASHTHSAHDDDDDDEEHNFG EIIIHQAIETIEFVLGMVSNTASYLRLWALSLAHSELATVFWEKAMLSTLNMNWFATFIG YGVFAGTTFGVLLMMDVLECFLHALRLHWVEFQNKFFHADGIRFAPYSFKQLLTDASA >gnl|To_NUC_proteinmodels_ML|p440 MPTSAEASYVVESCLEDLRIDGRSRNEYRPYTVTNGQSSTTAVQAPSFILSNGSSRVHLP GSTTEVVCSVKADLVHPSPSKPSEGMIGLNVDLSLCGGGGSVIGVSSSVGAAGQSQRRQQ REEESQIASLLQRLVLPHALNYEQLAVWPGKYVWRLAVDLVILRCDGCVLDTSSIALREA LRNTKLPKVQAVMEGKESNEGSSKNDLVVDGDFSKARSPAGADECPLVVTVSVLSEPLST TEGEPKRRRSVTIIDARTEEEACASSRICVSADPTGMICGVHTVGGSGTDGCIGDGSSIP LAMLNDVIGAAGSAAENLFNLTKKTFSTNSDGYGSLLKSHFLVQ >gnl|To_NUC_proteinmodels_ML|p441 XDERYGPVRRYNVHFDDGDVGTAIPEHLVFPSVEYDLNLQRDGNDEDSVSSTSGGGDSAR FDGVTNVTDESSPDPWAAKVGWWEAEVDERDVCFGLLTDALRARDSEQIRRDGANVARAS LGIPGDYDRLYNDVVTIKRNATKAANEEAREASHEYQRKLNERAEQAEAETRAKFQKKCD EEVRREVSEAVSKERKEGREAQSRIRRELEEKHEREMTSLRREHEETLAQHCRAESEQVA ERLKPVFDVGEEVYAAWWPAWDSKRDTNPSWYLGHVVSWKEQRGKGDYGSLRLYTVKFED DGSTLKRIPEHFVSSRRDYELHIEKDDDPTSDTWTGVRNVLDKNSEDDWARFVGYYVATV DGGAGEERYGRLHDALQAHDRAVVRRRGENTKRNELNLPGEWMWLLEDVDEEDTSSNKSA RSGQNDGGRTYTRAEVDEELDVLRKELSRQQNKRQREQEKDEQDFKRVKAQHIRELSKAV ENALIDAKIESDKNRAVAVRDALRKQRQELAGAHSRDKAGALEQLGDELHRIHEDELERQ RSDLEAYFDLEREKALHWARVESDRMIQDRVEEELDRARTRLDEERVRVRVESELERLRP SIERQVEANTLGYVEEMVAAEVERQRLALEKDVELKVEARVRARLERLGALEDDAVVNPP PSKRARIDAHPATTVSEVETREAANDLFRLRRGSGGGYVSPTSPSSVGSTRDIDLGDLHL VAVPSLAAEQQRVVDRRDGNIAPAGPASEGKENSLPSGGPSSSLPLDLARKIACMITGKS AAPSAAHSDDKSSAPSVAHSESSPERMRMRDDAAEVLLGMAQQQGVAC >gnl|To_NUC_proteinmodels_ML|p442 XMEGNGIRILQDLGPACTASINTLRSTLGRIMSDGQPGNPGNANPTELARLIHFFAAVSS KPTQDTASSSLSSAFVGVYLNTTDVSASTAPSEWNLEVIATVMQDYVNVNWKAVARSFDF PQFKVRDVRHAEVIHFLYRSGATAALNTGQIVVLPITLLTEDWVNREGQLSLLENLLRLP PTTYEFVVDSEEQKDAATVFQDGKLVATNCANPSGWACARVLQRLLIISDDSKNLHDQVR QVFIMGLLSCPEIILCALVRLQLTVARAAEGAASNLSSSSISAGMPMKGELMRELIPLFF KPNTKHVVRNLHGAVRRLWEISQNTVVAASIEAFRSTSAEVQSVRYQTMVHIIGVLRVVP APEVAIATILNNNKDLDFSFTMAFIMADLDMLQLEPWLKERFTSAGQNNVMFVVAILTFV GKNYATASPRSATEKPLYSIENIKITLEFIIGLDNNVLKNMVPAQGKALTIAETLETISK ACANRHPALQDVLKKVKTINMKTGKTTTASDSQDDIEEAANQYFQQIYQSEESAREVVKK LKQFKVSGSARENDIFACMIHNLFDEYRFFSKYPEKELRITGILFGLLIKEQLVSSITLG IALRYVLEALRKNPNNNLQSGKMFRFGMFALEQFKNRLHEWPQYCSHIIQIPHLKDGYAA LVGEIEGAVDDNQSAASSTAASAAGAATTSSATEAGRKGTPAPASSTDSSAPPPNQVSIG GFTGGIAASLNSLSSAGNTGPSKIDLPAPAIPLKDRRKAVFGPDLGRAVNAPEKSDESDS DKNEAPPDAVLDRVQFLVNNLSMSNCKEKSNDLREILDRKYFGWLGNFLVVKRISTQPNF HALYLSFLENLGEYGRGLVEAILASVYVNVGKLLRSQKITTSTSERSLLKNLGSWLGQIT LARNRPILQIMLDCKELLLQGYETGMLIAVTPFVAKILEGAKNSIVFRPPNPWLMGLLSV FRALYCVDDLKMNIKFEVEVLCKNLGVKLEDVPMRTDLSTRVPPVKEKNPDFNLKASSAA ATPSKAGQGGSGFNANAMMPSPDNKSSSASTGGDSTRSDGTSAADDQQQTVIPNLAAYVN VNPNLTQLFLQVQGGPLANNISADVLRKTVPIAVDRAIREIIQPVVERSVSIACITTKSI VTKDFAMESDENKMRKAAQLMVANLAGSLALVTCREVRPESNTQFKHIIIHFLTLRLQPL HTSISSHLRQLLTTAINNASAGTTVQLRDQETSALDQCVAICSTENLELGCMLIEKAATE KAVRDMDETLARDLQVRKTSREKTGQPYYDMSIFSVDGQRYPKELPDALRPKPGGLSSQQ LMIYEGFQRTPRLPAPAQSSDLTPGIGSDSATSDGSSLASPTGQINMGAMNAIALKLDKS VSTLLISAGPRAQEIHLSMIPAEHEIKQLIAAIPRVVNTSNRSLTSAETDLILSFSQVIF KRLYEVSLNERLKLEALIAMLELLNKACPALGRDMCTWATYAPTKTDGQRKLHRAVLLLL VRSSLIKMEDLDGHLVNNIDEGRDQVWLEFLFLFVRTACLEKIGTPATMPKMTDVIRKIA VDKSPWIPDAFQKAALRLIEELRNSGINLETDVLSPKAHVATLQESSSISPESLSTLSGA SLAIAKSSQVFSSTDPPNARSLVTEILVQWLRVHSEAAGNEQVLAQFLHRLQQQFGVGTS DAQTERFLRLTMEVVVESCVKNAEGGAGLNYQAVDGAVKMLSYLVKFTSDGGATDQGQHR LVLLNNVLGVAARSCATSYEKAQQLKTPWDQRPWFRLLLEMLTELTQNDQNLDSIKGGMV AMFGSAFHVLQPLVVPGFSFAWLELISHRLFIGNLLLPKDRKGWAVMHELLIDLFLFMEP HLGKVELTPALRQFYDGVLRVVLMIMHDFPDFIAAYHLSLCNVLPVTCIQLRNLILSALP KGVQLPDPVSTQFKIDKLKEITQSPLILSNVTGPINGFKSSIDGFLSKQQPSNLLLNLHS LVRMDGKSDAPVVNSLVLYVGMKGIARLQSEVESSVGHSPEMEVFHKLMEADDFSRYITL NAIANQLRYPNSHTHYFSCVMLCLFNESKESVKEQITRVLLERLITQRPHPFGLLTCFIE LIKNTKYDFWSHSFTRCSSEIERVFQNVASSVGSTAGDSTHNQIX >gnl|To_NUC_proteinmodels_ML|p445 MLSNTFRALQSPAAAARVAALSTAKRGMSISSLTQFEGKHFISIDQLSNDELRGLLDLSK KYRDTYGKGSSVNPVEAPKPLTGQSVSMIFQKRSTRTRVSTETGMNLLGGQSLFLGPSDI QLGVNESMRDTALVLSRFNSLVLARVFAHDDILELSKYATVPVINALSDLHHPLQTLADL MALEDHFGNLKGKTLAWVGDGNNVLHDLMLGGAKLGMNIRIATPAGYDANSGVLETTKQL AAENGTADVFETTVASEAVKGSDVVVTDTWVSMGQEDEYEQRVKEFDGYQVNSALMAQAN DGAVFLHCLPRHPEEVSDEVFYSEQSLVFPEAENRMWTVMAVMAAQLGKL >gnl|To_NUC_proteinmodels_ML|p446a MMKSVAVIATLASASAFAPAQIGQSSTQLNAFESEVGAQAPLGFFDPLGMLDDADQERFD RLRYVEIKHGRIAMLAFLGQIATRSGHYLDGNIDYAGHAFSSYPSGLAALVGPDAIPKDG LAQIVALIFFLEFGVMDNVKWNGGWEPEFPGDLRNGLGVASWDRFSDASKARKRGIELNN GRAAMMGILGLMVHEQLGTDMPIIGQL >gnl|To_NUC_proteinmodels_ML|p446b MMKSVAVIAATLASASAFAPAQIGQSSTQLNAFESEIGAQAPLGFFDPLGLLDDADQERF DRLRYVEIKHGRIAMLAFLGQIATRTGHFLGGDIDYAGHAFSSYPSGLAALFGPDAIPEN GLAQIIAFIGVLEFSVVDNEWNEGWEPEFPGDLRNGLGVASWDRFSDEWKARKRGIELNN GRAAMMGILGLMVHEKLGTDMPIIGQL >gnl|To_NUC_proteinmodels_ML|p447 MMKFSAALLATSLGFASAFTSSDVLRTQSTASTLRMSDEVAAAPEETSSTAVAAESSKAD LESMAKGLNPTIGYWDPLGLAGADFWNQGDDATVGFLRQAEIKHGRVAMAAFVGYWVQSN FVWPWPNTLAGDAFPSTDLSPEQQWDALPGPARWQIIAIIGLLELWDECGGGNPNHKHYM KGGKPGDYPSFQNFRDEVHWVLDLYDPFGFTKKMTQEKKERRLLIEVNNGRLAMLGIFGF LSADKIPGSVPALADIAIPYDGQVMAPFFADGSFFS >gnl|To_NUC_proteinmodels_ML|p448 MVRSALIASLACSAAAFAPQPAARNSVQLSETVSDLEALAQKANPVVKFYDPLSLSDQSF WGKSNEETIAWLRQSEIKHGRIAMFAFVGYCVQSNFVFPWAETLSGQPHPSADLVPEAQW DAVPAGAKWQIFAVISALELWDECGGGGAMPHYTKGRQPGKYPPFXXXXXXXXXXXXXXX XXXXXXXXXXXXXXXXXXXXXXXXXXLGIFGFLAADAVPGSVPALDGIAIPYSGDAMIPF >gnl|To_NUC_proteinmodels_ML|p449 MMKSAILASCLAAAAAFAPAQTGKASTALNSAFDGEIGAASAELGCWDPLGYVTGQSQER FDRLRYVELKHGRVAMLAAWGYATTWSGARFPGCEDFPAGHQAVFDIPKVELLLPILAIG GALELGGWKQAEGSFPGDFSASKFPVGFGPFARSEEDAIDLRTKELNNGRAAMMGILGMM VHEQLNGKPFIFFDTIDVYAPLAN >gnl|To_NUC_proteinmodels_ML|p450 MVRSALIASLACSAAAFAPSQVGRAASAVSETKADLEAIAEKSNPVLKFYDPLELSDTTI FGDTQDATIGFLRQAEIKHGRVSMAAFVGYCLQSNGVHWPWPTSTDGAPFPFEAGSPPEQ WDALSNAAKWQIILFIGFLEWWSEANGTHYMRGGVPGKFPEFIGPDSKIPHPIPFNLYDP FGFSKNRSEAAKADGLIKEINNGRLAMIGIIAFLQEQKLEGSVPLLKGVVSHYDGEPMAP FGTFFSF >gnl|To_NUC_proteinmodels_ML|p451a MKSAIIAATCLATASAFAPSSGAGRSATSLSATEAQIDFFGLPEQTDFSQELGVTAPLGF FDPLGLLKDGDQSTFDDLREKELKHGRISMLAVVGYLITAAGARFPGCEDIPDGFAAFSG LMGLFFVVAEIINREAYWAGCKSEFPGDYRNGWIDFGWDTFSDEVKFQKRTIELNNGRAA MMGIWGLMLHEQMAAMCLSMQRSWLVVEAFEEDTGLKVGSGVAENDEEEDQN >gnl|To_NUC_proteinmodels_ML|p451b MMKSIAALALVGSAAAFAPAPVQQKSTALYENPLANELGAQAPLGFFDPLGACADGDKEN FDRLRWVELKHGRISMLAVVGYLVTYAGVRFPGAEDIPSGFAAIDATPGMVWAQFAATTA MMEAANQDQFKGPWGTNQNSLGESPAEFKGDFRNGALDFGWDKLSPEAKMRKRAIELNNG RAAQMGILGLMVHEKLGNIDAILP >gnl|To_NUC_proteinmodels_ML|p452 MKTSALLASSLVCGASAFAPAPQSHARSTEMSAALPRETVLAEPDSIEFGSVWDPLGLSE MGSDETIAWFRHAEVKHGRVAMAAFTGWWAVGAGVRLPGEISHGLDFASLPSKGLEAWDA VPGWGKAQMLLFAGLIEFHDELFFSRRGTHYMKGGVPGKNMVPGLFDPFGISKGKSEEAL ARGRSSEIKNGRLAMIGIAX >gnl|To_NUC_proteinmodels_ML|p453 MVRSALIASLACSAAAFAPQPAARNSVQLSETVSDLEALAQKANPVVKFYDPLSLSDQSF WGKSNEETIAWLRQSEIKHGRIAMFAFVGYCVQSNFVFPWAETLSGQPHPSADLVPEAQW DAVPAGAKWQIFAVISALELWDECGGGGAMPHYTKGRQPGKYPPFTLFRDNFSKFDMSHF QMKVRSVWLQQEHVRGDEGETPDRRDQQRKACQYVRIGFF >gnl|To_NUC_proteinmodels_ML|p454 MKTCLTFALLGAAGAFAPSQQASRTASVAVQSSLIDDAFGISVETGNKCPPLGRWILEDA NENGVKWFQNAELKHGRVAMVATIDWFLSTSSGVTFSEIAKAAPLDAVKMVPNAGWLQVF FAAGLFELTAYNRQWTQGRSVPGDYGYDPLGFTKRPGGFDSEELTRMRMMELKNGRVAMM >gnl|To_NUC_proteinmodels_ML|p455 MMKSAIVACAVAGAAAFAPSQKSAYKSQLALGLDDIGGSSLPIPNFDPLNLATSGSDETL LWYRAAELKHGRVAMVATTGYLVNAAGIHFPGMLSSDVSFESLSSMSPIDAWDAVPSGGK AQILFTIFLAELITEGYQGTHYTKGGSLPTMVFPAIDFSNVSEEMLTKKRQAELNNGRLA MIAILSFISEHNIPGSVPALTNNGGHF >gnl|To_proteome_BD|BD.ferritin1 MSACRQNFHEESEAAINKQINTELYASYVYLSLAYHFDRDDVALAGFFKFFKKQSDEERE HAQKLMSYQNKRGGRIVLHDVKSPXXXXXXXXXXXEDALALEKKVNESLXXLHGVAGSHN DPHLCDFLESEFLAEQVESINEISKLITNAKRCGDGLGIYQFDKLTLSS >gnl|To_proteome_BD|BD.ferritin2 VLQDTKKPDRDEWGTALDAMQVALSLEKSVNQSLLDLHKTADSHNDAQMCDFLESEYLEE QVNAIKEISDHITQLKRVGSGLGEYMYDKESINGDS >gnl|To_proteome_BD|BD.ferritin3 XXXXXXXXXXXXXXXXXXERDCSWMEKGLTEIETKENRKGDEFIDRVYPYAKDLEKLLNF QIKSEMNAFYNYLSMSHFFEREXXXXXXXXXXXXXXXXXXXXXXXXXXXXKSKRGGRVKF FEITAPFYQTFNSGYHALQSAFDLEQNVTDEVLCLHKMAADRYNDVDFSNFIESEVIPEQ YKGLKVFKNASXXXXELGSSKGKTDMTKNYGIAEXXXXXXXXXXX >gnl|To_CP_proteins|psaJ psaJ Photosystem I reaction center subunit IX 33:158 reverse MW:4866 MNDFQKYLSTAPVLLTIWMTFTAGFIIEINRFFPDMLGLYF >gnl|To_CP_proteins|psaF psaF Photosystem I reaction center subunit III 197:754 reverse MW:20450 MKRVNLLTLLFAVLIALTPSQALADIGGLTKCSESAAFTKRLTTSVKKLEQRASKYEVDS PPALALQQQIERTQARFDKYSRSELLCGTDGLPHLVADGRWSHAAEFILPGFGFIYISGW IGWVGRKYLRAVSTSANPSESEIIINVPLALKIMTTGYIWPISAWQELISNDLVALKDEI TVSPR >gnl|To_CP_proteins|ftsH ftsH metallo-endopeptidase FtsH 1185:3113 forward MW:70057 MGKQILRSLLIGIILISTIDLSVKTFGIQTPESPEIVRNEAQLNQNVVSSRMTYGRFLEY LEMGWVKQVDLYDNSRNAIVQASSPELGNRPQTIRVEIPVGASQLIQKLKEYNIDFDAHP AEQKNLFVTIASNLLLPLIFIAGLVYFFQNSENFGGGNGQSPLSLGKSTARFERRPDTGV NFKDIAGIDEAKAEFEEIVSFLKEPEKYTVVGAKIPKGILLVGPPGTGKTLLAKAIANEA DVPFFSVAGSEFVEMFIGIGAARVRDLFQKASENAPCIVFIDEIDAVGRERGAGVGGGND EREQTLNQLLTEMDGFKENKGVIVVGATNRVDILDAALLRPGRFDRQVTVNLPDRLGRIG ILKVHAKNKPLGDDVSLVQLANRTPGFSGADLANLLNEAAILATRYKKETISKNEVNQAI DRIIGGIAGTPLEDGKNKKLIAYQEVGHAIVGTVLQSHDEVEKITITPRGAAKGLTWFTP EEDQTLISRSALLARIIGILGSRAAEQVVFGEPEITTGASSDLQQVTNLARQMVTRFGMS NIGPIALEDESNGQVFLGATMDQGSNYPETIADRIDDEVCKIINYAEEKALQIISDNRVI IDLVVEKLIDIETMDGTEFRELLSTYTILPNKKAAYVSKFPQ >gnl|To_CP_proteins|rps14 rps14 30S ribosomal protein S14 3312:3614 forward MW:12101 MAKKSMIEREKKRIKLHKKYDSKRQMLLNEYNNTSDFNLKLEIHSKIQRLPRNSSKIRIR NRCWKTGRPRGFYRDFGVSRHVLREMAHQCLLPGVTKSSW >gnl|To_CP_proteins|psaM psaM Photosystem I reaction center subunit XII 3793:3885 forward MW:3242 MLTDFQIYVALMIAGVASVLAIRLGATLYQ >gnl|To_CP_proteins|chlI chlI Magnesium chelatase, ATPase subunit ChlI 3919:4977 forward MW:39441 MVTQDLKKFSSPVFPFTAIVGQEEMKLALQLNVIDPKIGGVMIMGDRGTGKSTTIRAIAD LLPEIEVIKDDPFNSHKSDLDLMGNEVKTAIQNGETLETEFIKIPMVDLPLGATEDRVCG TIDIEKALTEGVKAFEPGLLAKANRGLLYVDEVNLLDDHLVDILLDSAASGWNTVEREGI SIRHPARFVLVGSGNPEEGELRPQLLDRFGMHAEIRTVKDPILRVKVVEERTSFDQTPMV WMENYEVQQQELRNRIVDAQKLLPTVQIDYDLRVKISEVCSQLDVDGLRGDIVTNRAAKA HAAYNKRDKVTLEDIESIITLCLRHRLRKDPLESIDSGDKVSKVFKEIFEIE >gnl|To_CP_proteins|dnaB dnaB replicative DNA helicase DnaB 5749:7116 reverse MW:52764 MKKFDNKIKFSISDTPLPHNFAAEKIVLSCLLINFEAIEITLKTVRIETFYFINHQEIYR VIVEMYRKKLPINVFSVNNFLREKGCLNKIGGTKVLLELLNQIPNLVYLEEYIQLLQDKF LRRSLIQLGYEAINSAYITNISLEKILNDFEKKSFALTNELIEEKKLTTAELFSSVFEEL KEKALKPELPGLASGFYDVDALTQGFQKSDLIILAGRPSMGKTALVLNITENILKKYKLP ILFFSLEMSKEQLIYRLLSNETGISQTRLKIGNLYKEDWYELKKTIQLYSRLPFFIDDQA NLTTQDIYSKIKKILFEQNKIGLIIIDYLQLLLNLNSKSENRVQELSQITRSLKNIAKEF QIPVIALSQLSRNVETRLNKRPVLSDLRESGSIEQDADLVLMLYRENYYSMNKEKEEKMT TAELIVAKHRNGPLGIIELLFQNDPAKFFNTFSSL >gnl|To_CP_proteins|rpl11 rpl11 50S ribosomal protein L11 7440:7865 forward MW:14818 MPKKITALIKLALPAGKATPAPPVGPALGQHGVNIAAFCKEYNAKTTEKAGLIIPVEISV YEDRSYTFLLKTPPASVLLANAAKVKKGSSTPNRVNVGSVTQAQLEEIANIKLPDLNTTK ISSAVRIVEGTARNMGITIVD >gnl|To_CP_proteins|rpl1 rpl1 50S ribosomal protein L1 7901:8593 forward MW:25131 MQKLSRRHSENLKKIKNVAHSSLEEAITSLQETATAKFIESVELHANLNIDPKYADQQLR TTVTLPHGIGKSMRIAVLTNEANFNEAKEGGADIVGSQDLIDDISQGTINFDLLVATPDM MPKLAKLGRVLGPKGLMPSPKSGTVSTTLTATLSEFKKGKFEYKADKAGVVHVNFGKVDF SQSQLVENLTSLYQSIEQNRPSGVKGKYFKSLFICTSMGPSIQLDLNAFD >gnl|To_CP_proteins|rpl12 rpl12 50S ribosomal protein L7/L12 8651:9037 forward MW:13340 MSDKINQIVEELKTLTLLEASELVTAIEETFGVDASASVGGGVVMAAAPAAAEEAEEKTE FNVMLDEVPADKKIAVLKVVRTLTGLGLKEAKELVESAPKMVQEALGKDAAEDAKKQIEA AGGKVSLT >gnl|To_CP_proteins|rps2 rps2 30S ribosomal protein S2 9226:9924 reverse MW:26268 MADINLAQLLKAGVHFGHKAYRWNPKMFPYIYTERNNIHILDLVQTAQLLKRANSYVTKA ASNNKTFLFIGTKRQASAVIAEQANSCNAYYVNHRWLGGMLTNWVTLKSRIERLKTLEQQ EVDGVFALLPKKEASLRSKELEKLRKHLNGVKNMTRLPDVAIIIDQKREMTAVQECRKLG IPIISILDTNCDPDLVDIPIPGNDDAVRSIKLILSSLTESIKNGNVDFPQID >gnl|To_CP_proteins|rpoC2 rpoC2 DNA-directed RNA polymerase subunit beta'' 10014:14375 reverse MW:167705 MKDYIYQNTLINKKQLKELLAWSFSKYDAMQASLLADELKYLGFKYATQAGISISIEDLK VPATKNEMLEKANQDILNAEKICLKGKITDVERFQKIIDTWSLASESLKENVVSYFKIYD PLNSVYIMAFSGARGNLSQVRQLVGMRGLMADPSGEIMRVPIKKNFREGLTITDYLMSGY GARKGIVDTALKTANSGYLTRRLIDIAQDIIIREKDCHTTTSFSVEISNKFDTEQIIGRV LAKSVYDPKTKKLIAIASTQLTAKLLEIFKEKQILRFHIRSPLTCNLYHSICQMCYGWDL SNQNLVDLGEAVGILAGQSIGEPGTQLTMRTFHTGGIFTSEARQQVISPVNGIIKFSKLL KTIILRTNRGDEVLLTKNSGSLILIPEQKSQKIIQIELLRNTMLFIKNNQYVKKSAIIGE LISTEKQTLTEQKPILSDTAGEIFIPKVKTKVNLVTQNRLLWILSGQVYQAPTNSFLNFY TDYKINKNSYIFRTKVINQHPGHITVFDNQNSLVEQKIEIAIKNSFLQNSYIEKIRKPKN NKNYLINWDHSKYLVTLKDSNSETFKRCNIFKPFAVTLSNKFRTITGGIMYYDQRIKKKK VITDKSFSYFIPDIISKTSYNEISKDHYKQYLRLLKRKINKSNFDIYQNFIKYQRKEDNT LKLKRKILHNSIIWLSEETYKLNCDKNILLVENGNFISKNFEVIPNIISKTAGIINVVQK NNIIEEIAIKEGTVYQGKQCEQLDKNVYYPGEILFNNVEITQPSLCEHIKTKASSQLLIR PFTIYEIPKEKKLKKNLEKFFNSSKSSEISFDKVIKINYLYKANQKIKTSDTVNLISQSI NLRSKDSLQNNIEIALSNSFRKRHLTFTILENLALKQYLPPHLKYTKLQSCLLFEPKQFI DSYTNLGYLEHLTRKSVEIVKFKSKRANKKQVFLISNDNCITVDKKKAKNKTINELLIDD INVNQIGKILIDNGKFLTIQKGRPYFFPNCKTENSREKVDLQYKLIKVDNYISNFKTDTF INYSDITQQSLVERVDPNKKLSGLKIKFSKMLIKKNGKFYSSPIPFFLNNFTLVKKNQET VKKKQIIQEIVKRNTDNIKIKQCLPLLIKSSELIFNTNKEDKSSDINLTGLKFLKYPFHK SIGIHSITEDYFEQEVNNVYCKNGEFIEKGEVIGLLNFEKEITGDIVQGLPRIEELLEAR KRKPTNKHLATNQKKSLLIQKTSLDSNFEFQKLGTTIKENDKINPHNLLKIYFNYYGMKK RFFNTQNDFEEVEGFVVSYRLTNNYEASYRTFKKIQLLILNAVQSVYQSQGVSIANKHLE VIIKQMTTKVLITHEGHTPLLPREVVDLYHIQYINQIIEAHNKRPAYYVPLLRGITKAAL NNPSFISAASFQETTRVLTKAAIEGRVDWLRGLKENIIIGHLIPSGTGYQSYTNCFNKIQ QNKTISEKIKMKM >gnl|To_CP_proteins|rpoC1 rpoC1 DNA-directed RNA polymerase subunit beta' 14392:16530 reverse MW:82868 MIRSEKEFDYIKIKLASPMRILEWSHRKLPNGQFVGEVQKSETINYRTFKPEMDGLFCER IFGPSKSLECACGKYKRVRYEGLICERCGVELTESRVRRHRMGHINLIYPVTHVWYTNSR PNYVALLLEVEQCEKRLDTGWNDFISCTSDTQLNSEEILNKEEFIKEKSKGKKSKDVNFF DERIKRIKLASLAYFIAEDEIAFYGLHWDLQQYRRCRELGFSGYPLKPKSKSNNRRRNTP KYLLRSTPNYLIGAVLIKRELEKLNLDQEIFKTRNFITICSKVLHKEQPFYNFSHWAKKW EYQRIYKLRDQSIKRIRILENLLTTGVNPAWMIITILPVIPPALRPMIQLEGGRFATSDL NELYRRIITRNNRLLRLLEIDAPQLIIRNEKRMLQEAVDTLIDNGKRGKIALSASNRPLK SLSDIIKGKHGRFRQNLLGKRVDYSGRSVIVVGPSLKLNQCGLPYEMAIELFQPFIIREL INQGLASNIKVAKNLLQQNEPSIDTVLEKVLMNHPIFLNRAPTLHRLGIQAFEPILVQGR AIKLHPLVCSAFNADFDGDQMAVHVPLTLEAQAECYMLMLAPYNFLSPANGDPIIMPSQD MVLGCYYLTVNNINGLLGTNHYFADLNDVILAYNQNQIEIHTSIWVRYKHKISKPSNFIK KVTLKDNSYIEYYENIQIRKDKNHKTIVQYLQTTTGRVLLNYTIKTTLNLKL >gnl|To_CP_proteins|rpoB rpoB DNA-directed RNA polymerase subunit beta 16540:20634 reverse MW:156823 MMNYTTALPDFIEMQRVSFCWFIAQGLNEELSTFSRIYDFSQNTEYILFGQEYSLVKPVY NIVRAKKYTANYSAQLIIPLEVRNKKTNSIKYHNKFPIINLPLMTSAATFVINGCERVIV SQIIRSPGVYFEKNKHQKKNRKLKKILSNDIGKLKNFIPPSEILPTEKTLYFLKKVTDIK DLNEKGEEIDELWKWKRKISTLYSFKNLKASEVNFISLFIEYFKIYNSLFKITDLSEKTK RIKIFLKWLSTNKKSFISNNNKDRIFLLINYFNLLIKVLHKYKSLKNKKYNDSNISEHLE QIKKEHNNIFQKAKTLFEQNININLFNFVICDLSHFNQINDFSFLNSTISPKIRSILTKS NVQSNLYFTDGFKELIKYNKNDKNKTKYLKTKSQILEYKDEHSIRDIYKKKYEEKELYTA TLIPEYGSWVRFGFQKNTNTNISKYPIKNQEDEIIIQLDKVTQKPIIHLLKEMGVNDIEI CQNLQNSEFFYFNKPVLTPSIYAPRKLLRFTLSKNYYKNISEFSRIFDPAYYRLGKIGRS KLNNRLNIQLSKRIVTITYQDIFAIIDKLITLSISKQISDDIDHLKNRRVRSVGELLQNL FRIGFQRLSRKLRSQTNKTYSSQLSSFNIVGATIREFFGASQLSQYMDQTNPLSALTHRR RISGLGPGGFDRDRISFAVRDIHPSHYGRICPIETPEGQNVGLIASLTTSARVNESGFLE TPFWRVLNGKVIKTGNPIYLTADIEDFYKIAPADISTNEDNYLTKELIPIRYKQDFLTVT PSEVDFIAVSPIQVVSVAASLIPFFEHDDANRALMGSNMQRQSVPLLLPQKPIVGTGLEN QIALDSGMVINAQRDGIVSSVTADKIIILENSGRQCKYDLQKYQRSNQETCINYRPIVWE GEQIKSGQMLTDGPGIISSELSLGQNVLVAYMPWQGYNFEDAILINERLVYEDIFTSIHI ERYEIEIDRTAEISEQTTNNIPNLSISDVKNLNEDGIVTIGTFVKPGDILVGKIVPKDDS EQLPESKLLRAIFGAKAKGVRDTSFRMPEGEYGRVIETLTFNRRTKLAYKFEKIQIFIAQ IRKIQVGDKIAGRHGNKGIISRILARQDMPFLPDGTPIDIILNPLGVPSRMNVGQLYECL LGLAGHKLNRRFKILPFDEMYGPEVSRILINKKLRQASLENDEAWLFNPYSPGKMVLLDG RTGKEFDNPITVGNAYMLKLIHLVDDKMHSRATGPYSLVTQQPLGGKAQHGGQRFGEMEV WALEGFGASFTLKEILTIKSDDMEGRNETLNSIIKGQTIPTSGIPESFKVLVQELRSIGL DLSTYRIDEFSGHQSYEIELNLIEKYNPSLKTFSHTSNINNISF >gnl|To_CP_proteins|rps20 rps20 30S ribosomal protein S20 21067:21375 reverse MW:11696 MANNKSAKKRILISKRNNLQNRFYKSSVKTLTKKFLNDLEVFKSYEPTTEKKEELNLHLE KTLASIYSLIDKGLKKNVYHKNTAARKKAKLAALLKNACGLV >gnl|To_CP_proteins|rpl33 rpl33 50S ribosomal protein L33 21612:21806 forward MW:7509 MAKNKGARILITLECTECRTNPNKRSSGVSRYLTQKNRRNNPQRIELKKYCPHCNKPTIH KEIK >gnl|To_CP_proteins|rps18 rps18 30S ribosomal protein S18 21811:22029 forward MW:8165 MLSQKHKLAPIGINQKIDYKDIDLLKLFITEQGKILPRRATGVTVQQQRQIAKAIKRARV LSLLPFVASNEL >gnl|To_CP_proteins|ycf3 ycf3 photosystem I assembly protein Ycf3 22082:22636 forward MW:21286 MGVRNFIDRVFTVISDLILKLLPASKQEKQAFAYYKAGMAAQAEGDYAEALENYYESLYL DEDQYDRSYTLYNIGLIYGKNENYPRALEYYHQAVSLNSNLPQALNNIAAIYHRQGLLAL EMASQDYDSAMEISEEYEYVELAKGLFDKAAEYWYQALKLAPDNYPRARNWLRITGRAKS LESF >gnl|To_CP_proteins|atpB atpB ATP synthase CF1 subunit beta 22805:24229 forward MW:51205 MVETTNKGYVSQIIGPVLDIEFPSGNLPPIYSAIKVETADGLGNIVEVQQLLGDNKVRAV SMRSTDGLKRGVEAVDLGTPISVPVGTPTLGRIFNVIGEPVDEQGDVSLDETLPIHREAP AFTELETKPSIFETGIKVVDLLAPYRRGGKIGLFGGAGVGKTVLIMELINNIAKAHGGVS VFGGVGERTREGNDLYEEMKESGVINSNNFAESKVALVYGQMNEPPGARMRVGLTALTMA EYFRDVNKQDVLLFIDNIFRFTQAGSEVSALLGRMPSAVGYQPTLATEMGALQERITSTT QGSITSIQAVYVPADDLTDPAPATTFAHLDATTVLSRNLAAKGIYPAVDPLDSTSTMLQP GIVTEEHYAIAENVKETLQRYKELQDIIAILGIDELSEEDRLTVARARKVERFLSQPFFV AEIFTGSPGKYVSLEDTIKGFTMVLKGELDDLPEQSFYLVGNIDEAISKAETLK >gnl|To_CP_proteins|atpE atpE ATP synthase CF1 subunit epsilon 24243:24644 forward MW:14200 MVMNIRVLTPDRVICSTTADEVILPGLTGQVGVLDGHATLITALDTGLLRIKLADKWTPI ILCGGLAEVDSDRVTVLVNDVEELVAVELSEATKELEKATSAIENAETSKARLDASVELK KATARLEGINYLS >gnl|To_CP_proteins|tatC tatC Sec-independent protein translocase protein TatC 24778:25518 forward MW:27822 MVTNSDFNFTTNETVILELPFSEHIEELRQRLFHIFWIILFLTCAAFIEVKLLVKILELP VDNVKFFQLSPGEYFVSTVKISFYTGLLFGSPFAIGQIILFLLPGLTKKETKVILPLLVS SVCLFGLGLLFSYYALIPAALNFFLNYSDEVIEPLWSFDQYFEFVLVLFYSTGLAFQIPI IQILLGLLNFVSAQQMLGAWRYVILVSTIIGAILTPSTDPLTQLLLSCAILLLYFSGVGI LFLIKN >gnl|To_CP_proteins|petA petA Cytochrome f 25552:26496 forward MW:34042 MATNKFFKSLLFTLTIAISVFGFSVENSFAYPVFAQQNYSNPRAANGKLACANCHLNQKA IEIEAPQGVLPNSVFEVEIKVPYDLSRKQIAANGKPAGLNVGGILILPKGFKLASKNQIS EEVKAKNKGVFISPYSTEFDNILVVGPIAGKKHQELIFPVVAPDPAKDPDVKYLTYPFYA GGNRGRGQVYPTGEKSNINAFGATQAGQITNITTGEKGESQITIVNSEGNSTSQTLSAGL QLLVKQGDIVKQDQPLNIDPNVGGFGQEESEIVLQDSARILGYLAFCFCLLLTQIFLVIK KKQYEKVQAAELNF >gnl|To_CP_proteins|psbI psbI Photosystem II reaction center protein I 27387:27503 reverse MW:4451 MLTLKILVYTTVIFFVSLFIFGFLSSDPSRNPNRRDLE >gnl|To_CP_proteins|ycf41 ycf41 putative uncharacterized protein Ycf41 27614:27928 reverse MW:11858 MNCIILTVKVIKNAGQSFFKDGTALTELVVQLPQIRKNNTSILIHLSVWGKLSHDVAKYY QPDDYIIIEGYISLKNINNDKDLNLLDKQIEISVLKLYPLLLKS >gnl|To_CP_proteins|ycf39 ycf39 putative uncharacterized protein Ycf39 27945:28907 reverse MW:36459 MSLLVIGGTGTLGRQIVLQALTKGYPVRCLVRNFRKANFLKEWGAELIYGNLSKPETIPP CLKGITAVIDASTSRPSDLDIVKTVDWDGKLALIEAAKVAKVKRFIFCSTQNLDQFSNIP LMKMKQGIEVKLKESQIPYTIFRLTGFYQGLIEQYAIPILENLPIWVTNENTCVSYMDTQ DIAKFCLKAFQLPETENKTFFLGGPKGWVSSEIINLCEQLAGQSAKVNKIPLFVLKLITK VLGFFEWGQNISDRLAFVEILNVENDFSKSTFDLYKTFKIQPVGEINQLDSYFLEYFIRL LKRLRDINFEDIQKQKNLII >gnl|To_CP_proteins|cbbX cbbX CbbX protein 28957:29820 reverse MW:32354 MNSINLQEEYAKTDIAKLLNLLDEELVGLAPVKARIREIAALLLIDKLRKNLGITANSPG LHMSFTGSPGTGKTTVGLKMADILFQLGYVKKGHLLTVTRDDLVGQYIGHTAPKTKEVLK KAMGGVLFIDEAYYLYKPDNERDYGSEAIEILLQVMENQRDDLVVILAGYKEPMDKFYES NPGLSSRIANHIDFPDYTVEELSTIAKMMLEEQQYQLTPQAEIALTDYITRRKQKPLFAN ARSIKNALDRARMRQANRIFDSRGQVLTKKELVNIEADDILQSTVFN >gnl|To_CP_proteins|psaL psaL Photosystem I reaction center subunit XI 29896:30342 reverse MW:15734 MANFIKPYNDDPFVGHLATPITSSSLTRALLKNLPAYRFGLTPLLRGLEIGLAHGYFLIG PFAKLGPLRNSDIGLLAGFLSTVGLILILTLGLTIYGAASFSQNKSTGNELQTKKSWDQF KGGFFVGACGSAGFAFICLSSIPAFTLN >gnl|To_CP_proteins|petL petL Cytochrome b6-f complex subunit VI 30518:30613 forward MW:3406 MSILINYFLLVGFCFALASGLFLGLKSIKLI >gnl|To_CP_proteins|ycf4 ycf4 photosystem I assembly protein Ycf4 30624:31169 forward MW:20510 MQNQIRQDKIVGSRRFSNYFWAILLLIGGLAFLLAGVSSYLKIKLLPFVNTTELVFIPQG IVMMFYGTLSFGLSLYISATLFWDIGSGYNEYNKIESLVKVVRKGFPGKNREILLTYPLN NIQSIGIKISEGLNPQRIIYLCLKDERKIPLTPVQQPDSISDLEDEAADLAKFLDLKLEN L >gnl|To_CP_proteins|psbE psbE Cytochrome b559 subunit alpha (PSII reaction center subunit V) 31341:31595 forward MW:9513 MSGGSTGERPFSDIITSVRYWIIHTITIPSLFVSGWLFISTGLAYDVFGTPRPNEYFTQD RQQVPLVNDRFSAKQELEDLTKGL >gnl|To_CP_proteins|psbF psbF Cytochrome b559 subunit beta (PSII reaction center subunit VI) 31607:31738 forward MW:4881 MSNNINQPVAYPIFTFRWLAVHGLAIPTVFFLGGITAMQFIQR >gnl|To_CP_proteins|psbL psbL Photosystem II reaction center protein L 31772:31888 forward MW:4365 MTGPNPNKQAVELNRTSLYWGLLLIFVLAVLFSSYFFN >gnl|To_CP_proteins|psbJ psbJ Photosystem II reaction center protein J 31913:32032 forward MW:4028 MANTGRIPLWLVGLVGGFAVITLLSLFIYGAYSGLGSSL >gnl|To_CP_proteins|ycf90 ycf90 putative uncharacterized protein Ycf90 32129:33535 reverse MW:55654 MFVNNLYFLFNSVLSLIINIDEETLFTFLGIKGPFSTSTEIIHMATPVTSDTFKYILQEY WTYYRHGLGFVDIENIALFILIIRFIFLSKKYNIRTGFLITLCGLGAGYLWYMHFRDLAF YYMRSLWMCPLTHNLASDFSEIHFQQVTEFQRIGNRADSDKFYGAAVRGFVDLTNDGKYR YDPLSLLWNYLPTDIKFLSDKIYYFVTLKAIPKFYAFFNSEMSALSGMLWYTFLVRINKR FCPYLIRWHWTFLIGLEFVERPFIYVQHRLIYYLNEILIPNSYFIEAELVTNLLVTLVAA QYIFISLGLLHALCGQYFYFPLLTENTELHIGLRPKDSIYSGGYTIWQNKRAYMISRQAS AKLKKYRFGTLRHAYSILPRMWYGWLGRGTLDDLTTEEYEKYLKNKANSDSLRRAKQRDS KIRWAYRRERLKKRFINLLAKLGIKIHNSTNDDDGDDLYEEFKNFSKK >gnl|To_CP_proteins|petD petD Cytochrome b6-f complex subunit IV 33745:34227 reverse MW:17801 MSIIKKPDLTDPKLRAKLAKGMGHNYYGEPAWPNDLLYLFPVCILGTFACCIGLAVMAPT QMGEPADPFNTPLEILPEWYFFPTFNLLRVLPNKLLGVLAMAAVPAGLITVPFIENVNKF QNPFRRPIASLVFIFGFFFAVWFGIGACLPIDKAVSLGYW >gnl|To_CP_proteins|petB petB Cytochrome b6 34274:34921 reverse MW:23908 MGKVYDWFEERLEVQAIADDISSKYVPPHVNIFYCFGGIVFTCFLVQVATGFAMTFYYRP SVVDAFASVEYIMTSVNFGWLIRSIHRWSASMMVMMLVLHVFRVYLTGGFKKPRELTWVT GVILAVVTVSFGVTGYSLPWDQVGFWACKIVTGVPAAVPVVGQPLVLVLRGGESVGQSTL TRFYSAHTFVLPVAAAVLMLTHFLMIRKQGISGPL >gnl|To_CP_proteins|psaD psaD Photosystem I reaction center subunit II 35110:35529 reverse MW:15333 MTVNLKTSFPTFGGSTGGWLRAAEVEEKYAITWTSSKEQIFEMPTGGSAIMRNGENLLYL ARKEQCLALSTQLKTFKVNDYKIYRIFPSGEVQYLHPKDGVAPEKVNPGRTAANSRSFSI GKNPNPASIKFSGIATYES >gnl|To_CP_proteins|secG secG preprotein translocase protein SecG 35821:36030 reverse MW:7728 MLKIIWVILSFVLISLIFLRTPQNQGLASFSSKSDLLGSPSSAEQFLNNLTVLLMISYFS FAIFLNTIN >gnl|To_CP_proteins|psbD psbD Photosystem II D2 protein (Photosystem Q(A) protein) 36271:37326 forward MW:39000 MTIAIGQNQERGLFDLIDDWLKKDRFVFIGWSGILLFPTAYLAAGGWLTGTTFVSSWYTH GLASSYLEGCNFLTAAVSTPANSMGHSLLLLWGPEAQGDFTRWCQIGGLWAFIALHGAFG LIGFCLRQFEIARLVGIRPYNAIAFSGPIAVFVSVFLLYPLGQASWFFAPSFGVAAIFRF LLFLQGFHNWTLNPFHMMGVAGILGGALLCAIHGATVENTLFEDGDAANTFRAFTPTQSE ETYSMVTANRFWSQIFGVAFSNKRWLHFFMLFVPVAGLWTSAIGIVGLALNLRAYDFVSQ ELRAAEDPEFETFYTKNILLNEGIRSWMAAQDQPHENFVFPEEVLPRGNAL >gnl|To_CP_proteins|psbC psbC Photosystem II CP43 chlorophyll apoprotein 37274:38689 forward MW:51909 MKTLYSLRRYYHVETPFNSSIAGRDIESTGFAWWSGNARLINVSGKLLGAHVAHAGLMVF WAGAMILFEVSHFVPEKPLYEQGFICMQHLATLGYGIGPGGEITSTVPYFAVGVIHLISS AVLGFGGIYHSLLGPDTLEESFPFFGYDWRDKNKMTTILGIHLCLLGLGSFLLVAKAMYL GGIYDTWAPGGGDVRYITTPTLNPIVIFGYVFRSPFGGDGWVVSVNNMEDLVGGHIWVGL LCIVGGVWHIFTKPFAWARRAFVWSGEAYLSYSLAAISLMGFTASLYSWYNNTAYPSELY GPTGPEASQSQAFTFLVRDQRLGANVSSAQGPTGLGKYLMRSPSGEIIFGGETMRFWDLR APWVEPLRGPNGLDINKIKNDIQPWQERRAAEYMTHAPLGSLNSVGGVATEINSVNYVSP RSWLCCSHFFLGFFFLIGHWWHSGRARAAAAGFEKGINRANEPVLSMRPID >gnl|To_CP_proteins|ycf12 ycf12 photosystem II reaction center protein Ycf12 39087:39191 forward MW:3591 MVNWQVIGQLVSTGTIMLLGPAIVVLLALKKGNL >gnl|To_CP_proteins|psbZ psbZ Photosystem II reaction center protein Z 39247:39432 forward MW:6300 MITALTALLVLVSLALVVTVPVALATPGEWESSKDQFNKAFTLWVGLVVAIATADGISSS I >gnl|To_CP_proteins|psaI psaI Photosystem I reaction center subunit VIII 40409:40519 reverse MW:3893 MAASFLPSILVPLVGLIFPAFSMALFFLYSQSDDIA >gnl|To_CP_proteins|psbK psbK Photosystem II reaction center protein K 40666:40800 forward MW:4996 MESLLLARLPEAYVVFSPIVDVFPVIPVFFLLLAFVWQAAIGFR >gnl|To_CP_proteins|petG petG Cytochrome b6-f complex subunit V 40906:41019 forward MW:4088 MVEPLLSGIVLGMITVSAFGLFVAAFLQYRRGNQFEI >gnl|To_CP_proteins|petM petM Cytochrome b6-f complex subunit VII 41578:41706 reverse MW:4489 MALVLQIFPFANAEIVTAAVTCIFMTLFGLSLGFALLKVQGE >gnl|To_CP_proteins|petN petN Cytochrome b6-f complex subunit VIII 41746:41835 reverse MW:3234 MDIISLGWAGVMTMFTFTLALVVWGRNGF >gnl|To_CP_proteins|ycf33 ycf33 putative uncharacterized protein Ycf33 42481:42675 forward MW:7713 MNNFWTNIVRYPRFFISSLIGLILIILTPFRNLLKVPKLRWILILFSLVFILSLYFIIRN MIAL >gnl|To_CP_proteins|psbX psbX Photosystem II reaction center protein X 42905:43021 forward MW:3885 MTTSLANFIASLTAGALVLAAIGIALIIISKNDRVERG >gnl|To_CP_proteins|ycf66 ycf66 putative 26S protease regulatory subunit Ycf66 43167:43478 forward MW:11617 MLNFVFSPNVLLGFILGSSVIILYFLRLVKPEVARDEDIFFATIGLLYSGILVIHGWRLD PILLFSQVLVITAVLAAGWENIRLRGVLAMLAMRDIEENKEIN >gnl|To_CP_proteins|psbV psbV Cytochrome c550 43522:44013 forward MW:17983 MFKRYSKFCACILFCIFNLFVVSASAIDLDEATRTVTVDSSGKTTVLTPEQVKRGKRLYN ATCGACHTGGITKTNPNVGLDPEALSLATPRRDNIEALVDYLKNPTTYDGLESIAEIHPS IKSADIYPRMRSLTDEDLYSIAGHIMLQPKIVAEKWGGGKIYF >gnl|To_CP_proteins|psbB psbB Photosystem II CP47 chlorophyll apoprotein 44806:46335 forward MW:56333 MALPWYRVHTVVLNDPGRLISVHLMHTALVAGWAGSMALYELAVFDPSDPVLNPMWRQGM FVMPFMTRLGITDSWGGWSITGESVSNPGLWSFEGVALSHIVLSGMCFLAAIWHWVYWDL ELFRDPRTGEPALDLPKIFGIHLLLSGLLCFGFGAFHVTGLFGPGIWVSDAYGITGKVQP VAPSWGADGFNPFNPGGIAAHHIAAGIFGILAGIFHLTVRPPQRLYRALRMGNIETVLSS SISAVFFAAFVTSGTMWYGAAATPIELFGPTRYQWDSGYFQQEIERQVEASVTEGLSESQ AWSRIPDKLAFYDYIGNNPAKGGLFRSGPMDKGDGIAEAWLGHPIFRDKEGRELTVRRMP AFFETFPVILVDKDGIIRADIPFRRAESKYSIEQVGVTVDFYGGKLNGQTFKDAPTVKKF ARKAQLGEVFEFDRTSLESDGVFRSSPRGWYTFGHANFALLFFLGHLWHGGRTIFRDVFT GIGAEVTEQVEFGAFQKLGDKTTKKQGAV >gnl|To_CP_proteins|psbT psbT Photosystem II reaction center protein T 46385:46483 forward MW:3808 MEALVYTFLLIGTLMVIFFAVFFRETPRILKK >gnl|To_CP_proteins|psbN psbN Photosystem II reaction center protein N 46510:46641 reverse MW:4747 METATIIVIFVSSLLLGITAYSVYTAFGPASKNLRDPFEEHED >gnl|To_CP_proteins|psbH psbH Photosystem II reaction center protein H 46725:46925 forward MW:7331 MALRTRLGEILRPLNAEYGKVVPGWGTTPIMGVTMGLFLVFLLIILQIYNSSLIIENVDV DWANAI >gnl|To_CP_proteins|psaE psaE Photosystem I reaction center subunit IV 47277:47474 forward MW:7656 MIKRNSKVRILRPESYWFYKIGTVATVDNSGIRYPIVVRFENVNYNGTSTNNFTLDELVE IKKEQ >gnl|To_CP_proteins|ycf42 ycf42 putative peroxiredoxin Ycf42 47614:48249 forward MW:24197 MITFPQIGKRAPNFITVGVFKKRLGRIRLSDYRGKKYVMLIFYPANFTPVAPTELIALSD QVSEFRKLSTQILAISIDSPFSHLQSLLVSRKKGGLANLNYPLISDLNHTITTDYRLLTE EGLAVPGVFIIDKEGIIQYYTVNNLLCSRSINELLRIIESIQYVKKNPGQACPVNWQSSS FKNFYQYEQVLYSHPLKSKLYFKELYSSKKN >gnl|To_CP_proteins|rpl35 rpl35 50S ribosomal protein L35 48256:48450 forward MW:7382 MPKLKTRKAALKRYKKTGAGNFLRRHAYKGHLLMKKSNVQKRRLSQRVCVAAGDSKPIKL MLPY >gnl|To_CP_proteins|rpl20 rpl20 50S ribosomal protein L20 48460:48819 forward MW:13978 MVRVKRGNVARKRRKKILQLAKGYKGAHSRLFRVANQQVMKALRYAYVGRKQKKRVFRKL WISRINASARQSGTTYSQLINCLKENKINLNRKMLAQMAVLDYSSFYEIIKQTKLTKVS >gnl|To_CP_proteins|ycf45 ycf45 putative uncharacterized protein Ycf45 49047:50405 forward MW:52264 MNIDDDLEKLLKNLPFFIYQHVHNHSNKEKLIEIVLDLGRRPEARFTTGPEYLSQKVISW QDIDYTTRRISKFSGENRAGIERTLHRISCIRNRQFLINGLTCRVGRAIFGTISIIRDLL ESRKSILILGKPGVGKTTIIREIARILSDEMEKRVIIIDTSNEIAGDSDVPHNGIGRARR MQVAKTELQHKIMLEAIENHMPQVIVIDEIGTELEALAARTIAEKGVQLVGTTHGNCLEN LIKNPSLSDLVGGIQYVTISDEEAKRRGTQKSILERKSYPAFQLAVEVNNVYSWTIHENV ENSVDLILRDNFTILQTRSIKKNEKLSITYKKLQKDFLTKNSWFLNREMIAIHRNWFEMY KPKTLGSFKNTTLVIYPYSLSKNLIREILVKLGNKVVITRKIKQANLIIGLKKHLRQNFR LKNLAHQRKIPIYTINQRSIYQIMRLLQFFIS >gnl|To_CP_proteins|rbcS rbcS RbcS, small subunit of RuBisCO 52328:52747 reverse MW:15925 VRLTQGCFSFLPDLTDQQIEKQVQYAMAKGWAMNVEWTDDPHPRNNYWELWGLPLFDIKD PATVMFELNEARKSCAAGYIRMNAFNASYGTESCVMSFITNRPANEPGFYLDRTDGEGRN IVYSIKSYSVQANPEGSRY >gnl|To_CP_proteins|rbcL rbcL RbcL, large subunit of RuBisCO 52786:54258 reverse MW:54295 MSQSVSERTRIKSDRYESGVIPYAKMGYWDAAYTVKDTDVLALFRITPQPGVDPVEAAAA VAGESSTATWTVVWTDLLTACERYRAKAYRVDPVPNTPDQYFAFIAYECDLFEEASLSNL TASIIGNVFGFKAVSALRLEDMRIPHSYLKTFQGPATGIIVERERLNKYGTPLLGATVKP KLGLSGKNYGRVVYEGLKGGLDFLKDDENINSQPFMRWRERFLNCLEGINRASAATGEVK GSYLNVTAATMEEVYKRAEYAKQIGSIVIMIDLVMGYTAIQSIAYWARENDMLLHLHRAG NSTYARQKNHGINFRVICKWMRMSGVDHIHAGTVVGKLEGDPLMIKGFYDILRLTELEVN LPFGIFFEMDWASLRRCMPVASGGIHCGQMHQLIHYLGDDVVLQFGGGTIGHSDGIQAGA TANRVALEAMVLARNEGADYFNTQVGPQILRDAAKTCGPLQTALDLWKDISFNYTSTDTA DFAETATANR >gnl|To_CP_proteins|sufB sufB FeS cluster assembly protein SufB 54546:56003 forward MW:54489 MNQSNKTLNNNITKLVNQPYKYGFSTTIEKDIIEKGLNEKIIRLLSKKKNEPQFLLEFRL KAYKKWKQMTCPEWAQLKFSEIDYQDIIYYSAPKTKKKLNSLDEVDPELLETFDKLGITL SEQKRLTNVAVDVVFDSVSIATTFKQELAEYGVIFSSISEAIHDYEELVEKYLGTVVPIG DNYFSALNSAVFTDGSFCYIPKDIICPLDLSTYFRINDENSGQFERTLIVAEENSQVSYL EGCTAPQYDNNQLHAAIVELIALNNASIKYSTVQNWYSGDEKGQGGVYNFVTKRGLCSGT ASKISWTQVETGSSITWKYPSCLLVGENTQGEFYSVALTNNYQQADTGTKMIHIGKNSRS RIVSKGISAGNSKNTYRGFVNINKKAIGARNYSQCDSLLIGNLSNSNTFPFISVQNSSTK IEHEASTSKIGEEQIFYFLQRGISIEKGIELMISGFCREVFTELPLEFASEADRLLSLKL EGSVG >gnl|To_CP_proteins|sufC sufC FeS cluster assembly protein SufC 56003:56758 forward MW:28209 MTLYSPILEIKDLKASINNNEILKSLNLTIKRGEIHAIMGPNGSGKSTFSKIIAGHPAYE VISGDILFNGSSILDLDPEERSHLGIFLAFQYPIEIPGVSNEDFLRLSYNSKQKFLNKPE VDPIQFFSIINEKLKLVDMSSVFLSRNVNEGFSGGEKKRNEILQMILLESELSILDETDS GLDIDALKIISKGINTFMSSDKSIILITHYQRLLDYIKPDYVHVMQNGKIIKTGTANLAK ELEIKGYEWLI >gnl|To_CP_proteins|atpI atpI ATP synthase CF0 subunit IV (A chain) 56886:57614 forward MW:26977 MYPDNFSSSTFTFLAEAEVGKHFYWNIAGFLVHGQVLVVIWFVTLILLLFAFLGSREASR IPHGWQNFMESALDFVTDIARNQLGESFYREWIPFIGTLFLFIFGCNWAGAVIPWKLIQL PEGEFAAPTNDINTTVALALLTSFSYFYAGLSKKGLGYFKRYIAPIPLLLPINILEDFTK PLSLSFRLFGNVLADELTITVLTSLVPLVIPLPIMLLGIFAGSVQALIFSTLAAAYIAEA LE >gnl|To_CP_proteins|atpH atpH ATP synthase CF0 subunit III (C chain) 57680:57928 forward MW:8181 MDSIISAASVIAAGLSIGLAAIGPGIGQGNAAGQAVEGIARQPEAENKIRGTLLLSLAFM EALTIYGLVVALALLFANPFNS >gnl|To_CP_proteins|atpG atpG ATP synthase CF0 subunit II (B' chain) 58100:58570 forward MW:17325 MINPSILISNSEVSGPGGLFDIGATLPLVAIQFILLMVLLNIILYSPLLTIIEERKEYIL SHLAEASEKLAQAKELTTQYEQDLETARKEAQLEIANSQNIHKEILDIELDISQKYIDNL LETISSDLLNKKKTALDTLDSSVQALCTEVESKLSI >gnl|To_CP_proteins|atpF atpF ATP synthase CF0 subunit I (B chain) 58634:59173 forward MW:20041 MENFNQIFTLLSENGSIGLNLDILETGVLNIAALVGILIYTGKDFLGSILQERKSTIVKS VQDAEDRLNEANRRLSEAQKQLSQAHVVISEIRNETQTAKTNLLKSDANIAKKELTTRFN RAVASFRSKERVIFLDVKQQIISLVLNRSVVQAKETFGSKKRARALINETIEKLEGDLL >gnl|To_CP_proteins|atpD atpD ATP synthase CF1 subunit delta 59170:59733 forward MW:21137 MSANPLALKIATPYARALYDFSVEQNIMHQVTADFQNLEVFLTKTPELTEYLSNPVIGVK QKEDVLTKTLKSQLNSETFKFLMVLVKRDRINLLSSVIVSYLELVYKTASVKMIEVSTAF PFTTVQKLNLIKKLKELTNAREIRLVVTVDSSLIGGFLIKTNSKVIDFTIKNQLENLAKH LDGVLEI >gnl|To_CP_proteins|atpA atpA ATP synthase CF1 subunit alpha 59780:61291 forward MW:54049 MINIRPDEISSIIREQIEQYDQDVKVDNIGTVLQVGDGIARVYGLDQVMSGELLEFEDKT IGIALNLENDNVGVVLMGNGRQILEGGTVKTTGQIAQIPVGEAFLGRVVNPLGVPIDGKG DISTTESRLVESMAPGIISRKSVCEPLQTGITSIDAMIPIGRGQRELIIGDRQTGKTSIA VDTIINQKTEDVVCVYVGIGQKASTIAQVVNVLDEKEAMAYTIIVSASANDPATLQYIAP YSGAALAEYFMYNGKATLIIYDDLTKQAMAYRQMSLLLRRPPGREAYPGDVFYLHSRLLE RAAKLSDALGGGSMTALPVIETQASDVSAYIPTNVISITDGQIFLSNDLFNSGIRPAINV GISVSRVGSAAQTKAMKQVAGKLKLELAQFAELEAFSQFASDLDEATQKQLARGTRLREI LKQPQNSPLSVPQQVALIYAGINGFLDDLAVSDVKNFSATLLSTLDSQKSYIEAVGSTNQ FTQEAETLLKDAIASTKTGFTLI >gnl|To_CP_proteins|rpl19 rpl19 50S ribosomal protein L19 62795:63157 forward MW:13923 MLKFDNQKVIDNLHQNFIKPNLPKIQIGDSIKLGVKIIEGNKERVQFYEGTVIAKKNSSI NTTITVRKVLQGIGIERIFLIHSPKIASIEIIRHAKVRRSKLYYLRNLRGKASRLKQRFE >gnl|To_CP_proteins|orf127 orf127 To_Orf127 63712:64134 forward MW:15230 MINYYDLSTEKNTNKISVSLVFKRVLKCTLIIMPTLLFSSVASAKDVVKIVKTSKKLSKS EKALLIVIKTSGLYIGGKACTEEAKKATELTQQGPIFPLTATTCVLCGAFIATHVLEDQM ADVKIDDAIPQWKTAISCSI >gnl|To_CP_proteins|psaB psaB Photosystem I P700 chlorophyll a apoprotein A2 65614:67815 reverse MW:82057 MATKFPKFSQALAQDPATRRIWYGIATAHDLEAHDGMTEENLYQKIFASHFGHLAIIFLW TAGNLFHVAWQGNFEKWVSNPLKTRPIAHAIWDPHFGEKALKAFSKGNTYPVNISFSGLY QWWYTIGFRTNQELYRASIGVLLLASVLLIAGWLHLQPKFRPSLSWFKNNESRLNHHLSG LLGFSSLAWTGHIVHVAIPASRGVHVGWDNFLTTPPHPAGLTPFFTGNWTVYAENPDTAE HAFNTADGAGTAILTFLGGFHPQTQSLWLTDIAHHHLAIAVVFIVAGHMYRTNFGIGHNM KEILDAHRPPGGRLGAGHVGLFETITNSLHMQLGLALACLGVATSLTAQHMYALTPYAYL SKDFTTEAALYTHHQYIAGFLMVGAFAHGAIFFVRDYDPELNKDNVLARMLEHKEAIISH LSWAALFLGFHTLGLYIHNDTVVAFGQPEKQILFEPLFAEYIQAASGKAVYEFNVLLASS TSPATAAGNQVWLPGWLEAINNPKNDLFLKIGPGDFLVHHAIALGLHTTALILVKGALDA RGSKLMPDKKDFGYSFPCDGPGRGGTCDISAWDAFYLAMFWMLNTISWVTFYWHWKHMTI WAGNPGLFNESSNYIMGWLRDYLWLNSSPLINGYNPFGMNNLSVWAWTFLFGHLIWATGF MFLISWRGYWQELIETLVWAHERTPLANLIRWRDKPVALSIVQARLVGLVHFTVGFILTF AAFVIASTSGKYA >gnl|To_CP_proteins|psaA psaA Photosystem I P700 chlorophyll a apoprotein A1 67919:70177 reverse MW:83581 MAISSTERRSKNVQVFVEKDAVETSFAKWAQPGHFSRTLAKGPKTTTWIWNLHADAHDFD SQTNSLEEVSRKIFSAHFGQLAIIFLWISGMHFHGAYFSNYSAWLSDPISIKQSSQVVWP IVGQEILNADVGGNFQGVQTTSGWFQMWRAEGITSEVELYWIAIGGLAMSAIMLFAGWFH YHKAAPKLEWFQNAESMMNHHLAGLLGLGCLSWSGHQIHIALPINKLLDAGVSPQEIPLP HEFLINRDLMAQLYPSFGKGLAPFFSGNWGEYSDFLTFKGGLNPVTGGLWLSDIAHHHLA LSVLFIVAGHMYRTNWGIGHNMKEILEAHKGPFTGEGHKGLYEILTTSWHAQLAINLAMM GSLSIIVAHHMYAMPPYPYLATDYATTLSLFTHHMWIGGFCVVGGAAHGAIFMVRDYTPA NNYNNLLDRVLRHRDAIISHLNWVCIFLGCHAFGFYIHNDTMRALGRPQDMFSDKAIQLQ PIFAQWIQNIHLLAPGTTAPNALATTSYAFGGDVIEVGGKIAIMPIKLGTADFMVHHIHA FTIHVTVLILLKGVLYARSSKLIPDKANLGFRFPCDGPGRGGTCQSSSWDHVFLGLFWMY NCISVVIFHFSWKMQSDVWGSVTPDGTIAHITGGNFAQSSITINGWLRDFLWSQASQVIQ SYGSASSAYGLIFLGAHFIWAFSLMFLFSGRGYWQELIESIVWAHNKLNFAPTIQPRALS ITQGRAVGLAHYLLGGIGTTWAFFLARAVSIT >gnl|To_CP_proteins|ycf89 ycf89 putative uncharacterized protein Ycf89 70918:71904 forward MW:38469 MIFNTYIKELQNIIYKGFKIFFDSTARLFGYPNNPGMPIIPFDEDAYDKVYGRDLLPKHV TYLPPGQPQYPETLTEALFGTFPYTEPIEKHFYQHPSEGYYNFYVENYRELYFLPDWLSG YIQMHFNITINHSNLELCRDVFFYVVLLYGAIVTIRSNLFWMLAINPYTYPWVFAVDFVD WIYDALAGIVPCFVGIDMVPTFFMMLIGKIADSANHLVFTMPFLPSEGNKVRMLLDGDIN PTEVIQFHYLPYLWYKYPIPENVRQFWYSERPDILEFMEKNYSQFGLDFRPSTVSIDEIT SVVDSAVKLTTPIVSLVSDSTDFLSTNF >gnl|To_CP_proteins|psbY psbY Photosystem II reaction center protein Y 77715:77825 reverse MW:4005 MDTRLLVIAAPVLVAASWALFNIGRLAIQQLQRLSR >gnl|To_CP_proteins|rpl32 rpl32 50S ribosomal protein L32 78276:78440 forward MW:6043 MAVPKKRTSKAKKNSRKANWKGKAAKSAQKSLSLAKSILQGKTTSFVYRLYVEE >gnl|To_CP_proteins|rbcR rbcR transcription regulatory protein RbcR 78689:79612 forward MW:34587 MVLPFTLQQLRIFKAIASEKSFTQAAEILFVSQPSLSKQMKTLENRLGILLLTRTGNKIS LTEAGDVFLQYSERILALCEESCRALNDLKDGERGNLKVGASQTIGAYLMPRVLTLFAQS YPQINLNIDIDSTRIIAKKVADRIIDIAIVGGDIPTSLKKNLEIEDFVEDELILIIPKSH PFARKKKKKISKEDLYHLNFITLNSNSTIHKFIDNTLIQNNIQTKQFNVIMELNSIEAIK TAVSLGLGSAFVSSSAIEKEIELKTVEIIAIEKIKITRTLSIITNTDSHRSKAFDFFYNE LWLLKNL >gnl|To_CP_proteins|rpl21 rpl21 50S ribosomal protein L21 79695:80012 forward MW:12527 MKYAIVEISGRQFWIEAGKYYDLNRIPTELGKQIKLNRVLFVNNEEELLIGKPYLENVVV EGKILKHFRSRKTVVYKMRPKKKTRKKQGHRQDLTRVLIQEINIT >gnl|To_CP_proteins|rpl27 rpl27 50S ribosomal protein L27 80120:80371 forward MW:9006 MAHKKGAGSTKNGRDSNANRLGVKRFGGEKVKAGSILVRQRGMKFKPGSNVGSGKDFTLF ALVDGTVKFDYKDAQHKRVNIII >gnl|To_CP_proteins|secA secA preprotein translocase protein SecA 80395:83022 forward MW:101201 MLKNLFIKDSVTNQYQALVNQINALENNLKTLTDTELRNKTFQLKKQYEEEQNLDRLIAE SFAITREASLRTLGLRHFDVQLIGGLVLNSGKISEMRTGEGKTLVATLPAYLNALTNRGV HIVTVNDYLASRDQISMGQIYRFLGLDTGLIQEDMSFLERQTNYNADITYVTNNEVAFDY LRDNMAPNLDQVVLPPFNYCIVDEVDSIFIDEAQVPLIISQAVETCIDKYIVAAEVAQYL EVNVHFKVDEKNRNIILTEQGTTQIEKILQVEDLYNPNDPWIPYILSAIKATALFFRNVH YIVQNNQIIIVDEFTGRIMPDRRWNEGLHQAVEAKEGVPIRQNTETAASITYQNFFLLYP KLSGMTGTAKTSEVEFEKIYNLPVEEIPTARPNLRKDLPDFVYKDSLTKWTAIARECKSI AKTSQPILIGTTTVENSEMLADLLKEYQLSYRVLNAKPENVKRESEIVAQAGEIGSITIA TNMAGRGTDIILGGNITFKVRKFLYNILVSCKNGQSTINYYEFCNSVSLKFLSVFRSLLH DQTFLNLSSTAILKFLNEIDQIRIPKIPYQCSIKFLLEELSRFEKKNQKVNNKIVKNLGG LYIIGTERNNSRRIDNQLRGRCGRQGDPGTSRFFLSLEDSLFRNFGSSNLQNFMQSQLLD DLPLESNLLTKSLDAAQKRVEERDYDGRKYLFDYDDILNKQRNIVYYERRKLLESQSLRE TILAYGEQVIKDIINLAKDPKFIQYIKTNSIIEELFKTRFINLTDSLNSLDVVELKTYLF QEFWLSYETKVLEFEICQVGLIRSFERTIILYYTDIAWKEHLQKISLLRDAVGWRTYGQR NPLFEFKEEAYNLFQNRNMTIRHLLIRDFLHSYIL >gnl|To_CP_proteins|rpl34 rpl34 50S ribosomal protein L34 83068:83214 forward MW:5510 MTKRTLSNKTRSSVLKVSGFRARMATAQGRKVLRNRRKKGRKKLAISH >gnl|To_CP_proteins|ccsA ccsA cytochrome c-type biogenesis protein CcsA 83470:84447 forward MW:37006 MDWNITQNFSSNIVFGILLLAMIIYWVNLSLFRGNKNLIQLGKLSAILANIILFFILCSR WIIAGYFPLSNLYESLLFLTWTLLTIYLYLEFKTKSKVLGAILLPIALLINGFANLTLSA DMQKSSPLVPALQSNWLMMHVSMMMLSYATLIIGSLLCILFLVLSKLQEVDLQLIDESTS LVFYNNIFDYYEAKVFLEGNKESTETTLLVEEDIQEIAFLKLLKSLDNWSYRIIGLGFPF LTIGIIAGGVWANEAWGSYWSWDPKETWALITWLVFATYLHSRITKGWEGKKTAILGGLG FFVIWICYLGVNFLGKGLHSYGWLS >gnl|To_CP_proteins|psaC psaC Photosystem I reaction center subunit VII (Photosystem I iron-sulfur center) 84543:84788 forward MW:8798 MSHTVKIYDTCIGCTQCVRACPTDVLEMVPWDGCKSGQIASSPRVEDCVGCKRCETACPT DFLSVRVYLGAETTRSLGLAY >gnl|To_CP_proteins|psbA psbA Photosystem II D1 protein (Photosystem Q(B) protein) 85497:86579 reverse MW:39727 MIATLERREGVSLWERFCAWITSTENRLYIGWFGCLMFPLLLTATSCFIIAFIAAPPVDI DGIREPVAGSLLYGNNIISGAVIPSSNAIGMHFYPIWEAASVDEWLYNGGPYQLIVLHFL LGVASYMGREWELSYRLGMRPWIFVAFSAPVAAASAVFLVYPIGQGSFSDGMPLGISGTF NFMLVFQAEHNILMHPFHMAGVAGVFGGSLFSAMHGSLVTSSLIRETTENESTNYGYKFG QEEETYNIVAAHGYFGRLIFQYASFNNSRALHFFLALWPVLGIWLTSMGISTMAFNLNGF NFNQSVVDSQGRVINTWADIINRADLGMEVMHERNAHNFPLDLASGDVLPVAFTAPAVNA >gnl|To_CP_proteins|ccs1 ccs1 cytochrome c-type biogenesis protein Ccs1 86673:87938 reverse MW:48745 MKQKIFKSIADLRFAIFILLLIAIISVIGTVIEQDQSIETYKINYPLTNRVFGFLSWDII IKFGLDHVYKTWWFISLIVLFGLSLLTCTFLQQFPSLKIARRCQFFRTTQQFCRLNISTN LQHLSFPQLLFKIKENNYSIFQQKNIVYCYKGLIGRIAPIIVHFSMIIILIGAIIGSIGG FEAQEIVPKTETFHIQNILNNGQVSLIPKISVRINDFWIAYTKQTTINQFYSDLSILNND GNEVERKTIFVNSPEKYNNIDYYQTDWNIIGLRIRNKDFSIFQYPFINLPENTEKIWLTW ISNNSQLNQGLTILIDNLQGYCSIYNTEGTFLGNLELNESLKAENSIVLMDILSSTGLQI KADPGIVLIYSGFLFLMISTLISYITYSQIWIVQDQKKIFIGGNTTRATFDFELEFLKLI K >gnl|To_CP_proteins|ycf46 ycf46 putative uncharacterized protein Ycf46 87946:89439 reverse MW:57079 MKFTDELTLLLKARYPIIYINTIEEDRVEYIIRKYIKTSLNRSIYSWDFIDGYTNNPNNE GFAKRNPVQALELVERLTAQTPALFLLKDFNRFLTDVSISRKLKNISRILKLQPKTIIII GSELNIPKELYDLITILQFQLPIESEIKYELKRLIESLNIEIDQQVLESLISACQGLSLE RIRRVLSKIIATYKTIDENSIKLLLNEKKQIISQTEILEYWSVDETISKIGGVDELKNWL KKRKTSFGIQALNYGLPTPRGLLLVGIQGTGKSLTAKAIANEWQLPLLKLDVGKLFGGIV GESESRLRQMIEVAETISPCILWIDEIDKAFSNNNNTGDSGTSNRVLATFISWLSEKTKP VFVVATANNVEVLPLEIIRKGRFDEIFFLDLPQKQEREQIFKIHIQEFRPNRWESFDYSK LAQLSEAFSGAEIRQSIIEAMYQAFYEKREFITEDICLALTQLIPLSQLDNTQTLKLKNW AVSGRIRLASSKSISTN >gnl|To_CP_proteins|ycf35 ycf35 putative uncharacterized protein Ycf35 89605:90009 forward MW:15705 MSHFTNLKTSFKNLLHLENALKNLGIKYKKSKKIINNDQNMSKRYNDNVIIPQSNNYDIT FDWNGEEYDLVLDASFWIQPYPVESFINKLSQQYANDVIITESQKIGFQPIKSKQHIDGS NTITLQRWNANTIY >gnl|To_CP_proteins|clpC clpC chaperonin ClpC 90779:93577 forward MW:105156 MFEKFTEGAIKVIMLGQEEARRMGHNFVGTEQLLLGSIGQRHGIAGRALAQLNITLKKTR KEVEKYIGRGKGYVASEIPFTPRAKRVLEMSIHEGQDLGVNYVGTEHILLSILTEGEGIA IRTLDVLRVNIPKLRHLILTYIEEQQEDILKPHLTDQEKWAIARDKNGLKTPSLDAYSEN ITKEVIENKLDPVIGRDEEISALVKVLGRRRKNNPVLVGEAGVGKTACAEGLAILMQSVD CPDFLREHVIRSLDLGSVLAGTKYRGEFEERMKHIIEEVNNHGKTILVIDEIHTIVGAGA AEGAVDAANLLKPSLARGTLRVIGATTLDEYRRYIEKDPALERRFHSVIVEEPSIETTID ILRGLKPEFQRHHALAYEDSCLELAATLSSRFIADRNLPDKAIDVIDEAGARVRLRNQTL PEGLQKLLLELHETLGDKHDAIAQNQFSLAKELLNHEIEVRTHLRIMRFAMTTKKEFSKY STPGGPRIDGVVDSDVTKVVSAWTGIPTTKINQSENERLKLMEDILHERLIGQHHAITAV SKAVRRARVGIQDPKRPIASFIFAGPTGVGKTELTKTLAEYMFDDEKNLIRFDMSEFMER HTVAKLIGSPPGYVGFQDGGQLTEAVRKKPYSIVLFDEVEKAHPDVFNIFLQILDDGHLT DSTGRVVSFKNTIIVMTTNLGSRVIERESPVLAKEKIGFGRNLDPAAIEGKRIELQINSD GTFSLKPPVKKEVTAEDEEKYLHISELVQEELKKFFRPEFLNRIDDIIVFNHLSKHDIWE ICGLMLNSLKKRLGEQGAVLNIDNAVRFFLAEEGYDPIYGARPLRRSITKFLEDKLAEEC LSYTLHEGTNINVTQKIKPDIYGSRPVYSPIFDDIYKREIEITFDYSNVDFSQVADVEEK DKTNDRIEKEPSLRQKIIEANKRLEGRKIEHA >gnl|To_CP_proteins|groEL groEL chaperonin GroEL 93990:95582 reverse MW:57330 MAKKILYQDNARRALERGMEIMVEAVSVTLGPKGRNVVLEKTYGSPQIVNDGVTIAKEIN LEDHIENTGVSLIRQAAAKTNDVAGDGTTTATVLAYAMVKEGLKNVAAGANPISIKLGME KATQYLVMQINEFAQPVEDIQAIKQVASISAGNDNIIGALIADALAKVGKEGVISLEEGK GIVTELEITEGMKLEKGFISPYFITDTEKMEVSYENPYILLTDKRITLVQQDLLPILEQI TKTKRPLLIIAEDVEKEALATLILNKLRGIVNVVAVRAPGFGELRKLMLQDIAVLTGGTV ITQDAGLSLDNIQLNLLGEARRIIVTKDTTTIVGDGLEIENIKARCEQLRKQVNLVETAY EKEKLQDRIAKLSGGIAVLRVGAVTETEMKDKKLRLEDAINATRAAVEEGIVPGGGATLT HLSENLVTWAKNNLKEDELIGALIISRAIVAPLKRIAENAGINGPVIIGKVQEQEFEIGY DAAKNVFGNMYKEGVVDPAKVTRSGLQNATSIASMILTTECIIVDDMKKN >gnl|To_CP_proteins|psb28 psb28 Photosystem II reaction center protein W 95989:96333 forward MW:13035 MKARIQFIKGTNENILPDIRLTRSRDGSTGTATFRFKNANILDKSLALNGEITGMYMIDI EGILETRDVTACFIKGKPKAVQAIYIMKSSDEWNRFMRFMKRYSQGNDLVFTKA >gnl|To_CP_proteins|thiS thiS thiamine biosynthesis protein ThiS 96377:96589 reverse MW:8111 MKNAKSFFLNGEKFFTSKTINLLDVITYFNYSSSLLVLEYNNFICSRENWTTIYINDQDR LEIVTIVGGG >gnl|To_CP_proteins|rps4 rps4 30S ribosomal protein S4 96819:97436 forward MW:23421 MSRYRGPKLRITRRLGKLPGLTQKTSKKTARPGQHGKTLGDGKKKTTEYGLRLEEKQKLK FNYGITESQLYHYIKEARRRKGVTGLILLQLLEMRLDTICFSLGFAPTIAAARQIVNHGH ITVNNKVVDIPSFQCQINDVIGVKAKSTSKNIIENNIKTINFIEPPTHLILDKSKLEGTV KNYCDRNELLLDLNELLVIEYYSRR >gnl|To_CP_proteins|rps16 rps16 30S ribosomal protein S16 97467:97718 forward MW:9871 MLRLRLKRNGRKRQPTYRLVIMENTTRRNGRPVDEVGYYNPITKESYFNEEKIAKWLNHG VKPTTTVFQLLLKANLIQKKIPS >gnl|To_CP_proteins|rps6 rps6 30S ribosomal protein S6 98487:98798 forward MW:12021 MSAPIKYEMMILLTEEFNDNELKNWAFNYAKALRKLNASEISVISRGKRDLAYEISNQKR GNFIQINFSSIPKHAEEFLSSLKFDINVLRFLILNKTNNVKSF >gnl|To_CP_proteins|thiG thiG thiazole biosynthesis protein ThiG 98962:99765 forward MW:28838 MKDIIDPLKIGNKSFSSRLMLGTGKYRTSKDAVKSITTSECEIITVAIRRLPTNINQDNI NFLKSLNWNKLWLLPNTAGSQTAEDAIRMAFLGHELACQLGQEDNFFVKLEVISDPKYLL PDPIGTLKAAEFLVKKGFTVLPYINADPMLALHLEDLGCATVMPLGSPIGSGQGLNNLSN LQIIIENANVPVVIDAGIGSPSEATLAMELGADAVLLNTAVAQSKNPEQMAKAMHLGVKA GRLGYLAGRMEKKYYATPSSPLNLISK >gnl|To_CP_proteins|dnaK dnaK chaperone DnaK 100196:102019 reverse MW:65912 MNKVVGIDLGTTNSVVAAIEGGQPTVITNAEGFRTTPSIVAYTKKEELLVGQLAKRQSVV NAANTFFSVKRFIGCKADEVSEESKELPYKVIKDDNGNIKIKCSSLKKDFSPEEISAQVI RKLITDAKDYLGQDVTKAVITVPAYFNDSQRQATVDAGKIAGIEVLRIINEPTAASLAYG LDKKQNETILVFDLGGGTFDVSVLEVGDGIFEVLSTAGDTNLGGDDFDKALVRWLVEDFK TKEGTDLTKDIQALQRLTEAAEKAKMELSNVETTTINLPFITADKTGPKHIQQDLTREKF ETLCSDLINRCRIPVEKALKDAKLDQSGINEVVLVGGSTRIPAIQELVQSLTGKKPNKSV NPDEVVAIGAAIQAGILAGEITDILLLDVTPLSLGVETVGGIMTKLIARNTTIPVKKSEL FSTAADNQTNVEIHVLQGEREIVSGNKSLGNFKLEGIPEAPKGRPQIEVTFDINVDGLLS VTAKENESGKEQTVTIQGSSNLSETDINNMLEDAEKYAASDNEQKVRRDIIFVATQTCQN VEAELEKLNSEESTLFTDEEKEEIKSVIQTIRDKVANTEESTESITQSCQELTKLVEGKL EGINPTM >gnl|To_CP_proteins|rpl3 rpl3 50S ribosomal protein L3 102319:102942 forward MW:22118 VSLGLLGNKIGMTQIFDESGNIIPVTILKVGPCIVTQVKTTLNDGYNAIQVGYGNVSTKS LTQPQLGHLQKSNIQPLKYLKEFRVNEPADFSVGQVLNVDSFIEGQLINVTGKSIGKGFS GLQKRYNFSRGPMTHGSKNHRAPGSIGMGTDPGRVLPGKKMAGQLGNKMTNIKKLKIVQV NNNENILVVKGAVPGKPGNLLSIVPSK >gnl|To_CP_proteins|rpl4 rpl4 50S ribosomal protein L4 102985:103629 forward MW:24180 MTVQKSIEFNEDVIKLNVLEDSGNYLIHRDILRQQISQKQGTVSTKTRSEVRGGGKKPWQ QKGTGRARAGSSRSPLWRGGGVIFGPKPRMTVLKLNKKERKLAVQTLLYNKRKNILVSAK LEELGKELEFTDNKTKKFYSFCKKELNIDLNKTKLLLVSESSGSASKTFWLATRNLKNVE VISSANLNTLSLLKAERILITPIALNNIKEIYCD >gnl|To_CP_proteins|rpl23 rpl23 50S ribosomal protein L23/L25 103655:103933 forward MW:10539 MNIIKYPIITDKATRLLANNQYSFVVNPKSDKPTIKAAIEYLFNVKVVKINTAHLPKKKK RVGQYMGWKSHYKKAIVTLSEGDTINLFAEEN >gnl|To_CP_proteins|rpl2 rpl2 50S ribosomal protein L2 103958:104785 forward MW:30677 MSIRLYKAYTPGTRNRALSSFSEITTGKPEKSLIRKNHRQKGRNNRGVITIRHRGGGHKK QYRLIDFKRNKYNVPAIVNSIEYDPNRNARIALVHFVDGEKRYILHPNNLNVGDTILSGK GISLDIGNTLPLEEIPLGTSVHNIELIPNRGGQIVRSAGTSAKILAKEGNYVTLRLPSKE IRLIRKECFATIGEISNNDAFLVQSGKAGRTRWLGKRPTVRGSVMNPCDHPHGGGEGRAP IGRSRPLTPWGKPALGKKTRKTKKLSNAYILRRRS >gnl|To_CP_proteins|rps19 rps19 30S ribosomal protein S19 104822:105100 forward MW:10445 MSRSLKKGPFVAYHLLKKINKMNAEGKKDVITTWSRSSTILPNMVGFTIAVYNGRQHVPV FISDQLVGHKLGEFVSTRTFKSHIKADKKTKR >gnl|To_CP_proteins|ycf88 ycf88 putative uncharacterized protein Ycf88 105111:105494 forward MW:14894 MAGLELTLNFPESFHIKTFNLKPKKKLSPLANSILSSIKFKHFYYVRDDIAYLLNSNPLE RDFLLQAFYSIILSLQNNSSVNFFDMWIYEIYITTVPVNNKFLNEKSQSLESDKYITIKL AYETSVF >gnl|To_CP_proteins|rpl22 rpl22 50S ribosomal protein L22 105575:105922 forward MW:13026 MSEGQSVKAVAKYVRISPHKVRKVLDQIRGRSYQEALMILEFLPYDAGSPIWQVVHSVAA NAKNNYNLDKKKLIISEIFADEGPKLKRIRPRAQGRAFKILKPTCHITVVVKSIN >gnl|To_CP_proteins|rps3 rps3 30S ribosomal protein S3 105949:106593 forward MW:24213 MGQKTHPLGFRLGITQEHRSSWYANFKHYSTLLKEDDQIRTYLNKFAKFASISNVHINRN GLGDQIELNIETGRPGILVGDNGSQIKTLATNIKKFLPDNHQITINVIEVENINSNASLI ADLVVQQLEDRVAFRRAIREGLKCAQDNQVNGIKIEVSGRLNGAEMARSEWIREGRVPLQ TLRADIDYATKEANTIYGVLGVKVWLFKNEILKK >gnl|To_CP_proteins|rpl16 rpl16 50S ribosomal protein L16 106649:107059 forward MW:15484 MLSPKRTKYRKYHRGRMRGKATRGNEVTFGQYGLQALEPSWITSRQIEATRRTITRYTKR GASLWIRIFPDKTVTARAAESRMGSGKGAVDYWVATVKPGTILFEISSVPEEVARAAFNL AAYKLPIKTKFIIRND >gnl|To_CP_proteins|rpl29 rpl29 50S ribosomal protein L29 107099:107275 forward MW:6808 MSMSTQELIKELKEAEKGLFDLRFKKATRQPFKPHQIKATKKKVAMLKTILRTKSLDF >gnl|To_CP_proteins|rps17 rps17 30S ribosomal protein S17 107438:107692 forward MW:9619 MPVKEKIGIVVSNKMQKTVVVKVESRYPHPIYSKTMIKTSKYLAHDEMSECNIGDQVLVR ECRPLSKKKRWSVAQIISKSSLIT >gnl|To_CP_proteins|rpl14 rpl14 50S ribosomal protein L14 107708:108073 forward MW:13445 MIYPQTMLTVADNTGAKKIMCIRVLGGNRKYGKIGDTIIGVVKEAIPNMPVKKSDVIRAV IVRTSKTIRRPDGMYIRFDDNAAVIVNMENNPRGTRVFGPVAREIRDKNFSKIVSLAPEV L >gnl|To_CP_proteins|rpl24 rpl24 50S ribosomal protein L24 108074:108307 forward MW:8701 MPKGKKLQIKIGDNVKIISGFDKNKTGEITKIYRNTGKILVKGINFKFKHVKPNTESDVG EIKQFEAPIHHSNVKLI >gnl|To_CP_proteins|rpl5 rpl5 50S ribosomal protein L5 108328:109044 forward MW:27532 MRSQHSLEEIAELYGLLENIKKEYENGIHSVLRKSNPELFSNPHTIPKLKKISINRGLGL AAQNSNILKKSITEFTRITGQKPLITRAKKAVAGFKIRENMELGLSSTLRGEKMYNFLTK LIFFTFAQIRDFRGLSVRSFDKAGNYTFSLKEQLIFPEIEYDDVDQIQGLSVTLVVDSST PKARSKTIDRVLNGMILLKFLRFPLNDCGYYDKYSSFGEINQVWDKKKHLRRKRWSQE >gnl|To_CP_proteins|rps8 rps8 30S ribosomal protein S8 109070:109468 forward MW:14781 VVTDSIADMLTRIRNANMVKHQIVEIPATKMSRAIAIILKNEGFIENFEIYNENISQYLL LSLKYKGQSRERVITKIKRISKPGLRVYANSKTLPRVLDGLGIAILSTSKGVMTNTQAKE LGVGGEVLCYIW >gnl|To_CP_proteins|rpl6 rpl6 50S ribosomal protein L6 109487:110023 forward MW:19685 MSRIGKLPVKLPASVDVTNDNSLLTIKGKFGTLEKTIPEIFVVEQSNGTLIVRLENQTRT NKALHGLYRTLINNMVIGVSEQFLKTLMLQGVGYRASVQGKTLVLNLGFSHPVNIDIPEA INVEVVQNTTINIKACDKEQLGLFAAQVRSWRPPEPYKGKGILYKDEQILRKAGKSGK >gnl|To_CP_proteins|rpl18 rpl18 50S ribosomal protein L18 110055:110462 forward MW:15383 MKFSKRVLLRLKNKNKKLKPDKYKKLKRESVRGLIKGTIERPRLSVFRSNENIYAQIIDD STAKTLVSCSTLDRDIKLNSQNGRTCDASRLMGQKLAELSKKKNITKIVFDRGPYLYHGR IKALADGARAGGLQF >gnl|To_CP_proteins|rps5 rps5 30S ribosomal protein S5 110489:111019 forward MW:18695 MSTKLKQVNKLKKAPHRNDNLNDVKFVERLIKISRVTKVTKGGKKLSFRAVVVIGDENGQ VGVGVGKAEDVVNAFKKAKTDGRKNLIQVPITKALSIPHGVVGDKGACKIIMRPSIEGSG VIAGGAVRTVLEVAGIKNVIAKQLGSDNLLNNARAAIVGLESLTTKAQVLKKRDLS >gnl|To_CP_proteins|secY secY preprotein translocase protein SecY 111075:112325 forward MW:45852 MKKTDDILLKRLLLSVGILLFIRMGTFLPIPGINHGHLAFYIQQHPLTKNLVSTFSGNDT FVIGLFTLNIFPYINASIMVQLITGLIPSISKLQKEGGGEGRRAITRLTRLITLGWALIQ SSSIAFYLKRALFDWSPLLAFEIVLWLTTGAMIVLWLSEVITEYGLGNGASLLIYTNIIS SLPNLGKKLISENTNNLTTLSTISIGLLFFIAIAGIITLQESARIVPLISSKQLGQSQSF VSSAKSNNYIPLRFNQAGVMPIILTTAILVLPNYIINLGVFPLLTLPIFLKSSKIIYWIS YFALILIFSSFYSTIVLNPKDISQELQKMAVSIPGVRPGLATTFYLKQVMKRVTFFGAII LAILATLPNIIEGILNVSSFNGLGTTSLLILVGVVLDISREMKSIILSNIYNEKFD >gnl|To_CP_proteins|rpl36 rpl36 50S ribosomal protein L36 112371:112484 forward MW:4390 MKVRPSVKKMCDKCRVIKRHGKIMVICKNPKHKQRQG >gnl|To_CP_proteins|rps13 rps13 30S ribosomal protein S13 112506:112877 forward MW:14137 VVRLLGIDLPKNKRIEYALTYIHGIGLTSAKKIVKLAEINPETRTNEITVEQSVALRNIL ENFELKLEGDLRRFNGLNIKRLNEINCHRGKRHRNSLPVRGQRTRTNARSRRGSKKTVTG KKK >gnl|To_CP_proteins|rps11 rps11 30S ribosomal protein S11 112923:113315 forward MW:13724 MAQKTRKPAIKKNRTNLGTGVVHIQSTFNNTIVTITNITGDTISWASSGSSGFKGARKGT PFAAQTAAEKAALDALSLGIKTVEILVKGSGSGRETAIRSIEGAGLEILSIQDITPVPHN GCRPRKRRRV >gnl|To_CP_proteins|rpoA rpoA DNA-directed RNA polymerase subunit alpha 113345:114343 forward MW:37259 MNSTKDFINSSNLSLKSSKMKQYTIQCLKSETIDSGAMYGQFLIDSLNSGQGITNGNLLR RVLLGDLEGIAITGVRIAGVKDEFSLIPGVREDILEILLNLKGVILKSNIVTRQFGRLRI QGPAVITASSIQLPPEIETINPNHYIATISTSDIVEIEFKIESGRQYRLANELFSDKFED FIETDAIFMPVQKVDFKVENVYDNSNNIKERLFLDVWTNGSIAPQDAIISASNSIIAFFQ SISEQKVKNALEKENIIEKNQVKPIDPYIHIAIEELQLSVRPYNCLKRARINTIGDLLEY SPEKLLELKNFGRKSADEVFATLKNKLGIVLK >gnl|To_CP_proteins|rpl13 rpl13 50S ribosomal protein L13 114367:114783 forward MW:15863 MNKTFIPSTNYNKRKWYVIDCKDKQLGRLASSIVPLLTGKTKSIYHPSLDVGDYVILINS EELKIDRDVERFHVYEPGRPGSSLKRLVNILPKRIIENCVFNMLPNGFPKKHLVKRLKVY QGAVHPHAAQDPKVLEII >gnl|To_CP_proteins|rps9 rps9 30S ribosomal protein S9 114811:115215 forward MW:15170 MNTIFQKDKIGLGKRKRAVARVFLVPGDGKITINKTPGAKYLQYNTNYLNSIWAPLEKLN LKTQFDIVAIVKGGGLTGQTDAIKLGVARLICKMENENRLVLKPFGFLSRDARIKERKKY GLRKARKASQYSKR >gnl|To_CP_proteins|rpl31 rpl31 50S ribosomal protein L31 115234:115452 forward MW:8239 MPKPEIHPTWFPEATVLCEGKTLCSIGSTKPQLQLDVWLGNHPFYTDSQTLVDSEGRVER FMKKYGLQTNDK >gnl|To_CP_proteins|rps12 rps12 30S ribosomal protein S12 115501:115875 forward MW:13886 MPTIQQLIRSKRIQINKKTKSPALVNCPQRRGVCTRVYTTTPKKPNSAIRKVARVRLTSG FEVTAYIPGEGHNLQEHSVVLIRGGRVKDLPGVRYHIVRGALDSGGVKDRTQRRSKYGVK KPKS >gnl|To_CP_proteins|rps7 rps7 30S ribosomal protein S7 115943:116413 forward MW:17659 MSRRNISKKRFPKADPTYNSYLVSLLVTRILKSGKKNLAQNIVNGAFEIIKSKTNEDPLV IFEKAIRNASPVVEVKARRIGGSTYQVPIEVSSFRATNLALRWIIQYSKQRVGRTMSSKL ASEIIDTANDIGNTIKKKEETHKMADANKAFAHFRY >gnl|To_CP_proteins|tufA tufA translation elongation factor EF-Tu 116491:117720 forward MW:44618 MAREKFERTKPHVNIGTIGHVDHGKTTLTAAITATLSLEGDSVAKDYADIDGAPEERARG ITINTAHVEYETKDRHYAHVDCPGHADYVKNMITGAAQMDGAILVVSAADGPMPQTREHI LLAKQVGVPHIVVFLNKQDQVDDDELLELVELEVRELLSAYDFPGDDIPICPGSALQAIE AISANPSIQRGDNPWVDKIYALMDSVDAYIPTPERDVEKTFLMAIEDVFSITGRGTVATG RIERGVVKVGDNVEIVGVAETQTTTITGIEMFQKTLDEGFAGDNVGILLRGVTREDIERG MVLSQPGTITPHTNFESEVYVLTKEEGGRHTPFFTGYRPQFYVRTTDVTGSIEQFTADDG SVVEMVMPGDRIKMTAELIYPVAIEEGMRFAIREGGRTIGAGVVSKIVK >gnl|To_CP_proteins|rps10 rps10 30S ribosomal protein S10 117733:118056 forward MW:12536 MEIKSNQKIRVRLESFNRELLNTSIQKISEIIRDTGVYKAGVASLPTDKRIYCVLRSPHV NKDSREHFEIRTYKRIIEIYYDASVNIFDLLVKSDLPPGVFYRICIS >gnl|To_MT_proteinmodels_ML|mt04 ribosomal protein L2 XYYFYYKHKFYSKNDNIKNNVKKSSFTKKKIKGKKNKAGRNNSGKITIAHKGSGHKRKYR TVDFLRNQNSVGIVTSIEYDPFRTAFIASIFDMIQCNYYYITAPKNLSTGDVIKSGFNAE IKTGHSLALTKIPVGCFIHNISLKPNKRAQLCRSAGTSAQLIEKTANYSKILLNSGKQIT LASTCCATIGIVSNEFSFFKKIGKAGRNRWLNKRPTVRGVAMNPVDHPHGGGEGKSSGGR SSVTPWGKPTKNGKTKKNX >gnl|To_MT_proteinmodels_ML|mt05 ribosomal protein L5 XFFYNKLTQIPQLKQITLSFGYKQSLFKSILSGLIALEFITLKKSNITKSKRLNVFLKIK KGAPIGCKVILKKENMYFFYLKLITLTSLKTVQFKKKIDSFKAITLQIQNPLFFNELENQ YLYFKNLPKLTVTINFNTKSRKELYFILKSNKFFV >gnl|To_MT_proteinmodels_ML|mt06 ribosomal protein L6 MNNIIKKHILKIPKNTSIYYCSDNQIVIVSNLQHYKVFKLKTKLIVKKAHKIIKVTRIPF FKISNNYQKKLKAIQATQVTLLKQLFLDNSSKFCKKLKLVGVGFRVSTFEILTNNFLHFK LGYSHSIYFKIPKNLKTFCMKSNTLFIVGDSYSFVTQIAALIKSLKIPELYKGKGILYTT EKIILKEGKKV >gnl|To_MT_proteinmodels_ML|mt07 ribosomal protein L14 MIQQQTKLKVADNSGAKVVKCIKVTNGFNKKFALLGDLVIISVIKLRNKARITSKVQKGE VYKAMIIRTKKNITKKDGMTTMFNTNAVSLLNKQNKLIASRILGPVPKVIKKRKIKIGTL STGFV >gnl|To_MT_proteinmodels_ML|mt08 ribosomal protein S2 MKIINIKRYKLFKYNLLKLQFYKQTTKNSILSQGVLEQIEMSLKQLLKIIYEYHVNNSKI LFIGFPIIFKKQQIKFINLTNHDFISQKSWLNGTFRNRFSVSTYLKNLQLKNFSKKLNTL LTLKTKPHLVVFFDSNLQTGVINEFYKAGIPIIVFNCKKINLDKALYKTLINLNFDNKNL KLTWFLLFYSILKKX >gnl|To_MT_proteinmodels_ML|mt09 ribosomal protein S3 MAQKTDARIFRQGILNKSWNFKNNQKNIEDSSFYLYKTLEIQKYLNRFFKLYKIKIHNCK IFYTNNSLQVFISFYVSPNTVNIFKKRITKTSEKLKNYSEQLQLNKKIDWNKIQKIKKVR KLKNFYSTPNLIEFQEILLNSLAVYTNNKLNIFIKLQNLNVSNKLPNNQIQHLKIIFKQL KKYIKNNFFKEALNILFITISKRKSAKMFADFIAEQFKLNQLKTDQITISRKDNYFLSFL KQMLELLIKYELSCLKGIKIVIKGRFNKAPRAKTVNFHFGNFALQSFNSKIDYYQSTAYT TNGTFGIKVWLAENF >gnl|To_MT_proteinmodels_ML|mt10 ribosomal protein S4 MIFVVLFYIEKITTLKKQTYLLKTFMYSTKNWVAGYYIKTFIFKKRYKKLRQNFKIWKKK WHKFQQILLKSKKKRFYDPVIHSLFNFKSFFWWKFKYNLHNKQRLSFFYGKLKKSYLKKL LKEIFKKSKISKKNFITLFIQRLENRLDTILCRTHFAYSLQNAKQLITHKKVYVNGKIIQ QSGYIVKKGDLITFKNDIHKSLYQNIFNLNIWPIPPKQCFVNYKIFSIVITQEYSNCFSS FSFWIDFNSFVNFYKK >gnl|To_MT_proteinmodels_ML|mt11 ribosomal protein S7 MANKIINQFLLHGKKTVCEKIWRQNVKLFQKSFTKNYKKFINRSIINVAPLVKVKQLQQK RKRSQIKEFPYITTYKIRIALALKFFLIKNQKVKTKTYKRIFTELLNASKNNGNFITKKK SLYEYAFFKKKYFYYRWF >gnl|To_MT_proteinmodels_ML|mt12 ribosomal protein S8 MKSILKNVAINLNNSQIARRNFIFEPKTKLIVSFLNILWNEGFIFGYKLYGANSNLLKIF LKYKKNKPVINSIKLFYNSGSIGNYKLSQLWKLDAKKNLIILNTSKGLMTIQDCKKNQIG GQPLFMVK >gnl|To_MT_proteinmodels_ML|mt13 ribosomal protein S10 MFVIIQIFSKNSNSISNFLKFLYKLKTNKTLNLNFTIIQSSPFKKSKKFSVLKSPHVNKK AQEQFEYNVFKKQLKIYVSQFNKFLIIWKTVKLTLFTDINLKFKFNQNPNNFITISKINS DKFIKNKQYFKFLDIKKKNKSLSSIMGYLWRILFEKLKFRWTLKKKINHSLLLWDIYGEF CLKNWSSDSSVGWSKGLKILVSAVQICF >gnl|To_MT_proteinmodels_ML|mt14 ribosomal protein S11 MFIVKIVNKSLILPKLTFYNYFNCLYYNIKLLKFIKKKQFSFFVSWLFTLIFLNSYKLKT KINYFIKLIFISAYQNQKLINYILNVNQLKTNTLINLNDIKGNPKIFYSAGMLNLQKKQK IRQPKAAITILRQLVSKSKNLKQKPAVLQFNNMFLNYRSYIFKKLKRKIFIKLIVNYSST AHNGCRTKKKKRS >gnl|To_MT_proteinmodels_ML|mt15 ribosomal protein S12 MPTINQLCRKKRVKKKLRNKVPALDRCPQKKGVCTKVFLRTPKKPNSALRKLVKLRLTNN KKTMAYIPGEGHNLQEYSTVLLRGGRVKDLPGIKYHLVRGKLDFSGLKGRITSRSKYGTK KL >gnl|To_MT_proteinmodels_ML|mt16 ribosomal protein S13 MIYILETDLNDKKSISFALIKIFGLNRAKAVKICKTLGLAQHLTFKDLKKDQLLKLVKFI ENSNININIDLKKLQISIFKKLIQIKAYRGIRKVRKLPVRGQRTHTNAKTAKRQ >gnl|To_MT_proteinmodels_ML|mt17 ribosomal protein S19 MKRAKWKGPFVNNNLLNKIKNYEFEILTRNSLIVPKFVGLSLRVYNGKTFIIIKIIKEMI GYKLGEFVPTRKQFFYKKKSK >gnl|To_MT_proteinmodels_ML|mt18 NADH dehydrogenase subunit 1a MILLIISSLLKILAIVVPLLISVAYFTIAERKIMGIIQRRKGPNVIGFVGLLQPLADGLK LFVKETIFPSNSNIILFIIAPMLTFILSLIGWAVLPLSNQIVLSDLNVGILYLFATSSLS VYGIIIAGWSSNSKYPFLGALRSAAQMISYEVSIGFIIVNVCICTGSFNLSTIVLAQQTI WFIIPLFPMFVIFCVSMLAETNRHPFDLPEAEAELVSGYNVEYSAMTFALFFLGEYANML LMSAFVSILFLGGWLPIINLFPFNLLPGSLWFSLKMSLGVIFFIVTRATLPRYRYDQLMY IGWKSFLPLSLGYLLFSIGLLISFNWLPT >gnl|To_MT_proteinmodels_ML|mt20 NADH dehydrogenase subunit 2 MLIFKEIQLLPEIFLAVSIIYLIIHGTFCSVNTKYLLIQNSILQLSVLILCLFCYLLINN CVEFSSFQIFNNTINIDYLGVCSKIWIALISIFVFLIVQHYIVFQKINYFEYSILILFSL LGIFFFCCSNDLITAYLSIELQSLSFYILASIKKDSSFSVDGGLKYFVLGAFASSLFLFG SSILYGVTGTTNLEDLKNLFFFXXXXXDLVYEINNYTFSKNVFLEGSLIQFGLSFLFVSL LFKLAVAPFHLWSLDVYEGSPSSSTIFFAIIPKLAIFVLFIRIFYLSFFEYTNK*RYYLV ILAIVSIIVGSFGGLEQKKLKTLLAYSSTSHMGYTLLAFSAGTLEGTQVLFSYLFVYMIA GLCTWSIFLMLQLKHKYAKKQNKDLTDLYSLNKSNNILALSFSIVLLSVAGLPPMIGFLV KIGIFLATIESSMYFVAIVSILCSVISTFYYIRILKIIYFEGSTSGKLYYPIKSYVSFIV SSLFYFLIFLFINPTLIYLFSYKITLLLN >gnl|To_MT_proteinmodels_ML|mt21 NADH dehydrogenase subunit 4 XNVKFFFLNVTNLLVFILVLPLVGTFIIFFIPAWNHSLIKMIALNISGFCYFLSLFLWVF FNKSISSFQFVNKLAWIPFLNFNFSLGVDGISLFFILLSTLLIFLCILISWTSVKANIKE FLIAFLVMEFFLIGVFSILDLLLFYIFFESILIPMYLIVGMWGSRERKIRAAYFFFLYTL LGSVLMLLSILYIYYQVGTTDYEVLITFIFCPLEQKFLWLSFFASFATKVPMVPVHLWLP EAHVEAPTSGSVILAGILLKLGTYGFLRFSFPMFPMASLYFTPVVYLLAIVGIVYTSFTA IRQTDFKRIIAYTSVAHMNLVIVGLFSFNIIGLEGAILQSLSHGFVASALFMAIGVVYER HHTRMVKYYGGLVHCMPIYTFIFLFFTMSNIGLPGTGSFVGEFLILAGSFKANTSATFIS ATGMIIGGCYSLWLFNRISYGNLKIQYLNSYIDINKREAFIFLPLILGTIIIGLYPEIFL NSMHMSVNMLIEITHIIGQL >gnl|To_MT_proteinmodels_ML|mt22 NADH dehydrogenase subunit 4L MIILNLNYILNIVVILFFIGVLGLVLNRKNILIILMCIELMLLAINLNFVVFSVYLNDVT GYVFVLFILTIAAAESAIGLAVLTVYYRLKNTIQIDKIKNIKS >gnl|To_MT_proteinmodels_ML|mt23 NADH dehydrogenase subunit 5 MYLLLIFLPFLGSISAGLFGRSLGPFGASYITVSCLLFTFILSSFAFYEVAFLNSCVYVK LLPWIDSELLNVDWGFLFDSLTVVMCCVVTFVSTIVHLYSTEYMAYDPHLPRFMSYLSLF TFFMLILVTADNFVQMFVGWEGVGLCSYLLINFWFTRIQANKAAIKAMVLNRIGDFGLVM GILLIFIEYKAIDYATVFALTPIFIEKSINFLSFDFNLISLICFFLFVGAVGKSAQLGLH TWLPDAMEGPTPVSALIHAATMVTAGVFLIARTSPLFEYSPNMLQIVTIVGACTAFFAAT VGLLQNDLKKVIAYSTCSQLGYMVFACGLSNYSVGVFHLVNHAFFKALLFLGAGSVIHAV SDEQDMRKMGGLKKLVPFTYSVMVIGSLALIGFPFLTGFYSKDVILEVAYGKYTTEGHFS YLLGSLGAFFTAFYSTRLVYLTFLSNPNGYKSVICSAYDSSYQITLSLFLLVIPSVLIGF YAKDMIIGFGTDFWGNSIFCKTENMNRIDSEFITHQFKILPVGLSLLGSFSSFLFYLTGS KLLIKLKLSSFGKKIYNFLNKKWFFDKIYNEYISQFFFTISYTVTYKMIDRGIIEVFGPM GLSSVLSKKGLNISKLQTGYLYHYTFVMLVGLTFVLGVRQFWIFLGIYTDFKIFMLFVVA SLFLASNRSNS >gnl|To_MT_proteinmodels_ML|mt24 NADH dehydrogenase subunit 7 XTKKKNLLKKVKNFTINFGPQHPAAHGVLRLVLELNGEIVNRADPHIGLLHRGTEKLIEY KNYVQALPYFDRLDYVSMMAQEHSYCLAIEKLFNCKIPKRAQYIRVMFAEITRILNHLLA VGCHAMDVGAMTPFL*AFEEREKLMEFYERVSGARMHAAYFRPGGLQVDIPKGLLNDIYM FTEQFTLRLTEMEDMLTENRIWKQRLVDIGVVTAKDASLWGFSGVMLRGSGVYWDLRKSQ PYEIYDKLQFDIPVGTNGDCYDRYLIRVFEMKESLKIIEQCLNLMPNGYVKTNDFKIVPP ARAEIKQSMEALIHHFKMYTQGVVIPTNETYIGTEAPKGEFGVYLISDNTNRPYRCKIKA PGFNHLQALDFMSKGHLIADVVTIIGTQDIVFGEVDR >gnl|To_MT_proteinmodels_ML|mt25 NADH dehydrogenase subunit 9 MNSLFIKNIQNLTKMCPLEKIQIYSQELVIIVKSKLLVNVLIFLKYHISYQFDVLTCITG VDYPNNKYRFKLVYELLSIKYNIRLRVKTFTHELFSVDSCDKLYFTAGWYECEIWDMYGV FFKNHSNLKRILTDYGFEGHPLRKDFPLSGFVEMKYNETEKRVINESIELCQEYRTFKFL SPW >gnl|To_MT_proteinmodels_ML|mt26 NADH dehydrogenase subunit 11 XKFNYDLEFKNNSKINFWKQNTPLIEYCETIGITIPHYCYHKNLSISGNCRMCLVELKNS PKPIVSCAMNAKSCLANGDIYTNSVLVKKARENVLEFLLLNHPLDCPICDQGGECDLQDQ SLFFGLTKKRFFNFKRIVLNKDLGPIVKTVMTRCIHCTRCVRFATEIAGTNDIGMFGRGL QSEIGTYVNQMFKSELSGNVIDLCPVGALTSKPYPFVNRNWELKNLKSFDFSDGLSTPIQ LFIKNNQIVKILPGFEQKTFKTNWITDKTRFAFDGMFSPTRIINSFLQTKKNTLNNLSWQ KILKEFFYILYFKTHLSQHYYKPKQITICLGVNTSIEVLGLLKTLNRKYSFFKLRKSNVQ NINNDLQQNYLLNSSVDEVKMVNSNICLLVGLNPRYEGSKLNLKLRTRYLKGDFKVLTLN SLIDLTFQNMNISSNIKTLQSIAEGNNVHCQNFVNSSSPLIISNNEMFQRADSLGLATIF RTLIENINIVSKSNANCINILNSTLNESGFLNLKNFKTISNKDIKKSCCVFFLNNSFSPN LKKLLNLKLLNFLQKYKTTSKMLVTQTNALDTQLVSKIKKNLDVRTHLHFPNSNFFETTG TYINTEGDFIKTNKAVTSFGQIKTDWQIIRKLMSYSSKLLFMNDFFKSNKIGFNSSNNSN YKNFISFHYYAISNLNNSTFKLITNSNQHTLKFFKFKLTQKKLHHSQVRSWMNDFYLDSK NFTNKSSSTMIKCSKLFRLNNTNFKF >gnl|To_MT_proteinmodels_ML|mt27 cytochrome b MSSFVVKTMRWSKNYLLSILDSHIIHYPSPLNLTYAWSFGSSAGICLVIQILSGVFLAMH YTPHIDLAFSSVEHIMRDVNNGWLIRYIHANGASMFFIVVYCHIFRGLYYGSYIQPRQLL WCSGVLIFILMMGTAFMGYVLPWGQMSFWGATVITSLVTAVPVVGQSIVDWLWGGFTVNN ATLNRFFSLHFLLPFLIAGLSLIHLALLHKDGSNNPLGIDSKSDKIPFYPYFVVKDLFAF FCFLTFLVFCFYFPNALGHPDNYIPADPMQTPAHIVPEWYFLPFYAILRSIPDKLGGVAA MGGALIILFLIPFTNTSEIRSTTFRPIFKIFYWLLVADFLLLGWIGQKPVKDVYVLVGQI ATVFYFLFFLILIPAIGIVEAKLTTYSNKK >gnl|To_MT_proteinmodels_ML|mt28_cytochrome c oxidase subunit I MANSAVQLNPIYRFATRWLFSTNHKDIGTLYLIFGAISGVAGTALSLYIRITLAQPNGSF LEYNHHLYNVIVTGHAILMIFFMVMPTLIGGFGNWFVPLMIGAPDMAFPRMNNISFWLLP PSLLLLFASMLTESGVGTGWTIYPPLSSATAHSGGSVDLAIFSLHVSGASSILGAINFIC TVFNMRVKSLSFHNLPLFVWSVLITAFLLLLSLPVLAGAITMLLTDRNFNTTFFDPAGGG DPVLFQHLFWFFGHPEVYILILPGFGIISHIIVSTAKKPIFGYIGMVYAMISIGVLGFIV WAHHMFTVGLDIDTRAYFTAATMIIAIPTGIKIFSWLATLWGGSIDLRTPGLFAIGFIFL FTVGGVTGVVLANSGIDIALHDTYYVVAHFHYVLSMGAVFSMLGGLYFWFEKITGVRYSE ILGKIHFWSFFIGVNLTFFPMHFLGVAGMPRRIPDYPDSYMTFNKIASWGSYISAISSLF FFYIVFEAFTANRKNFSKV >gnl|To_MT_proteinmodels_ML|mt29 cytochrome c oxidase subunit II MIFITQCDSPALWQTYLSDPASISMEGILIFNKHLLFLLTVIILFVAWLLFYTLYYFIEY NNKFSSKFVHSKELEIVWTSIPALILLILSTPSFTLLYAMDEISEPELTLKILGHQWFWS YEISEFNSCQKQEQSLKYVCYMMVLDGLPTTKQGYFRLLETNKRVILPTNTHLRLLVSAA DVLHSWTIPSFGLKVDACPGRLNQINLFIKRIGVFFGQCSEICGVNHGFMPIVIVSLPTV QFHHYIMTKLELN >gnl|To_MT_proteinmodels_ML|mt30 cytochrome c oxidase subunit III MSDKSRFKYLKTKHSFHLVDPSPWPLVASLGAFFMTSGTVLYMNKFLGGGQLASIGFFVI LYVMYTWWRDIVREATFEEQHSVAVQRGLRLGMILFIVSEIIVFFVAFFWAFFHSSLSPV FNLGGIWPPESIETIQTSGIPLTNTFFLLSSGATVTWAHHAIIVRAKKQAIIGLLFTIVL ATIFTLLQGMEYVEAPFNINDSVFGSCFYMATGFHGFHVFIGTCCLVVSLLRIVYNHFTS THHFGFESAAWYWHFVDVVWLFLFITVYWWGGIH >gnl|To_MT_proteinmodels_ML|mt31 unknown putative protein XKHKSLFFCYTFRNFKWVFYFFIIFEKSLKSKIPIKRSIKNWVKLMCLIKSKIFWIFWAY SSVVERTAHNGLVVGSNPTKPKINMDFNLKTYKHHKIKTSFQQFNLLFFLHSPFIGNKIS IKTNQKLLKFKLQHYKISNKLFNNTIKNSVFTHLTVLIHASIILINSSQSITFTQLTKIN PLTLLGLRLNNKIYSKKQLKN >gnl|To_MT_proteinmodels_ML|mt32 ATP synthase F0 subunit 6 MLHTPLEQFQIILLFPIKLFCFDFSITNLVLVNFLVLFIFMLLVISFSSNLNFFKETSFF FVPNTWQVLIETMYEVSAQLLFDNINKDGEKFFPFISVIFTFVLFSNLIGLVPYSFTTTS HLIVTFTLSLSIFIGINIICIQKHKFHMLSLFIPANTSFGLALLLVPIELLSYIFKPISL GVRLFANLMAGHTLLKVIVGFSWSMLLLEDFLAVLHVIPLLILVLLMGLELGVALIQAYV FTILTCIYLNDAVNLH >gnl|To_MT_proteinmodels_ML|mt33 ATP synthase F0 subunit 8 XLGIPQFDILTLGAQVFGLLLSLSFFYYFSITVVIPNFIEVKKFRTKKLVKNTQYISTMN LDLVNNKKLINNSYKLFMQ >gnl|To_MT_proteinmodels_ML|mt34 ATP synthase F0 subunit 9 MLLQAAKFVGAGLATIGLAGAGVGIGTVFGALVIGVSRNPSLKDELFKLAILGFALTEAI ALFSLMMAFLILFAL >gnl|To_MT_proteinmodels_ML|mt35 SecY-independent transporter protein MLSKYLFEIKYRLLFCLIAWIFIIINCYYFKEILLYVFLKPSLKLHSNFNLKFLTTNVTE VFFTYLKLSYFIANQFLTIYIFFQWLLQIANGLYKFEYIYFKTILIKWIIIWFFCIFFLN KFLFPLIWFFFFEFQNSSFNFEIKLDEYTNFYYSIYFTCNWVFKIIILLLILLDLIKTNL FIIKKFKKICYFFFAILATFITPPDVIQQLITGIFIIIIYELTVFNLIFKYELTAIK