Test Datasets [Download]
>rrm-1
IYVGNLMFQVTDKDIAREFSQQGRFVYIHLKSGRVTGRRDGFGYILFTTY
KDAHQALSTLGGFYVNSRNLSC
> rrm-2
LYIRNLDTVTEKQSMLTEFSPFGDDLHLRAIKNSDASSSANSKGFAFIDF
KDEDSAEAAFRKLGEIPINGSTLDV
> rrm-3 (Psi-blast)
AYVSRLSYDTTEEHVREAFGNFPPIVSALLINDCNNFGFIEFKDREETAH
IMEGLPKTDFNNVSVKK
> rrm-4
LFIIGLPDETSESQLRQIFAAFGGVCTFNLIKQPVTRLTSERNKCRGYAF
VIYQNKEDAANARNLNGIAINNRIVRV
> rrm-5
LYVGDVSYQIQTTSVLIYFSPEGDKTTISMILQKNNDGKGFVHFPKHPSA
TDIPDRMDGSELLGKKLAL
> rrm-6
YIGGLAEKINKSELSLLFGRFGSSMSAKVVKIRQTNRNQPMCFISFKTEE
DTNMAMKEMFNKEFLGCELDV
> rrm-7
LFVKNLEPMETTDEDVRAFFTKNGVLKECNMIVDKETGHLRGKGALNFED
PVSAEKAVRNLNGGKFNSNEATL
> rrm-8
VKIRGLDQGATGKAIRDAFNQYGNLIVIIVVMDTNSRQFNMGFIEFKDQL
DANVALREKDGIPVLNERSAV
> rrm-9(good seq for testing BLOSSUM matrices)
LYIGDLPDAIRDDDLKDLFRKFGEIETIKAISSGVPTRARGYAFVRFARY
EDIREALDEKEGYQPCDYAEKI
> rrm-10
LFLSNLPRDCNRHKLRDMFSKYGLIVKIRAHPDKDTGPTRGYAFVQFKSS
EDVDKAKALNKYFLGLRKIRV
GFP (SYG) residues 65-67 in Aequorea victoria
PHI patterns: x(5)SYGx(1,3)
> gi|18175269|gb|AAK02065.1| GFP
MSKGEELFTGIVPVLIELDGDVHGHKFSVRGEGEGDADYGKLEIKFICTT
GKLPVPWPTLVTTLSYGILCFARYPEHMKMNDFFKSAMPEGYIQERTIFF
QDDGKYKTRGEVKFEGDTLVNRIELKGMDFKEDGNILGHKLEYNFNSHNV
YIMPDKANNGLKVNFKIRHNIEGGGVQLADHYQTNVPLGDGPVLIPINHY
LSTQTAISKDRNETRDHMVFLEFFSACGHTHGMDELYK
Sequences for Clustalw, Mview and Jalview
Mirochondrial Porin sequences
>PORIN_HUMAN
MSWCNELRLPALKQHSIGRGLESHITMCIPPSYADLGKAARDIFNKGFGFGLVKLDVKTKSCSGVEFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEI
AIEDQICQGLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGYQMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVNDGTEFGGSIYQ
KVCEDLDTSVNLAWTSGTNCTRFGIAAKYQLDPTASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKVGSPWSWRLNPAERNLWEWISEDLALIYFHCDQ
QQAFFPPEDDQNKG
> POR1_MOUSE
MAVPPTYADLGKSARDVFTKGYGFGLIKLDLKTKSENGLEFTSSGSANTETTKVNGSLETKYRWTEYGLTFTEKWNTDNTLGTEITVEDQLARGLKLTFDSSFSPNTGKKN
AKIKTGYKREHINLGCDVDFDIAGPSIRGALVLGYEGWLAGYQMNFETSKSRVTQSNFAVGYKTDEFQLHTNVNDGTEFGGSIYQKVNKKLETAVNLAWTAGNSNTRFGIA
AKYQVDPDACFSAKVNNSSLIGLGYTQTLKPGIKLTLSALLDGKNVNAGGHKLGLGLEFQ
> POR3_BOVIN
MCNTPTYCDLGKAAKDVFNKGYGFGMVKIDLRTKSCSGVEFSTSGHAYTDTGKASGNLETKYKICNYGLTFTQKWNTDNTLGTEISWENKLAEGLKLTLDTIFVPNTGKKS
GKLKASYKRDCFSLGSNVDIDFSGPTIYGWAVLAFEGWLAGYQMSFDTAKSKLSQNNFALGYKAADFQLHTHVNDGTEFGGSIYQKVNEKIETSINLAWTAGSNNTRFGIA
AKYKLDCRTSLSAKVNNASLIGLGYTQTLRPGVKLTLSALIDGKNFNAGGHKVGLGFELE
> PORI_DROME
MAPPSYSDLGKQARDIFSKGYNFGLWKLDLKTKTSSGIEFNTAGHSNQESGKVFGSLETKYKVKDYGLTLTEKWNTDNTLFTEVAVQDQLLEGLKLSLEGNFAPQSGNKNG
KFKVAYGHENVKADSDVNIDLKGPLINASAVLGYQGWLAGYQTAFDTQQSKLTTNNFALGYTTKDFVLHTAVNDGQEFSGSIFQRTSDKLDVGVQLSWASGTSNTKFAIGA
KYQLDDDASVRAKVNNASQVGLGYQQKLRDGVTLTLSTLVDGKNFNAGGHKIGVGLELE
> POR4_SOLTU
GPGLYTEIGKKARDLLYKDYQSDHKFSITTYSPTGVVITSSGSKKGDLFLADVNTQLKNKNVTTDIKVDTNSNLFTTITVDEAAPGLKTILSFRVPDQRSGKLEVQYLHDY
AGICTSVGLTANPIVNFSGVVGTNIIALGTDVSFDTKTGDFTKCNAGLSFTNADLVASLNLNNKGDNLTASYYHTVSPLTSTAVGAEVNHSFSTNENIITVGTQHRLDPLT
SVKARINNFGKASALLQHEWRPKSLFTVSGEVDTKSVDKGAKFGLALALK
> POR1_WHEAT_1_274
MGGPGLYSGIGKKAKDLLYRDYQTDHKFTLTTYTANGPAITATSTKKADLTVGEIQSQIKNKNITVDVKANSASNVITTITADDLAAPGLKTILSFAVPDQKSGKVELQYL
HDYAGINASIGLTANPVVNLSGAFGTSALAVGADVSLDTATKNFAKYNAALSYTNQDLIASLNLNNKGDSLTASYYHIVEKSGTAVGAELTHSFSSNENSLTFGTQHTLDP
LTLVKARINNSGKASALIQHEFMPKSLCTISAEVDTKAIEKSSKVGIAIALK
> POR6_SOLTU
GPGLYSDIGKKARDLLYRDYVSDHKFTVTTYSTTGVAITASGLKKGELFLADVSTQLKNKNITTDVKVDTNSNVYTTITVDEPAPGLKTIFSFVVPDQKSGKVELQYLHEY
AGINTSIGLTASPLVNFSGVAGNNTVALGTDLSFDTATGNFTKCNAGLSFSSSDLIASLALNDKGDTVSASYYHTVKPVTNTAVGAELTHSFSSNENTLTIGTQHLLDPLT
TVKARVNSYGKASALIQHEWRPKSLFTISGEVDTRAIEKSAKIGLAVALK
> POR1_ARATH
GPGLYTEIGKKARDLLYKDHNSDQKFSITTFSPAGVAITSTGTKKGDLLLGDVAFQSRRKNITTDLKVCTDSTFLITATVDEAAPGLRSIFSFKVPDQNSGKVELQYLHEY
AGISTSMGLTQNPTVNFSGVIGSNVLAVGTDVSFDTKSGNFTKINAGLSFTKEDLIASLTVNDKGDLLNASYYHIVNPLFNTAVGAEVSHKLSSKDSTITVGTQHSLDPLT
SVKARVNSAGIASALIQHEWKPKSFFTISGEVDTKSIDKSAKVGLALALK
> PORI_NEUCR
MAVPAFSDIAKSANDLLNKDFYHLAAGTIEVKSNTPNNVAFKVTGKSTHDKVTSGALEGKFTDKPNGLTVTQTWNTANALETKVEMADNLAKGLKAEGIFSFLPATNARGA
KFNLHFKQSNFHGRAFFDLLKGPTANIDAIVGHEGFLAGASAGYDVQKAAITGYSAAVGYHAPTYSAAITATDNLSVFSASYYHKVNSQVEAGSKATWNSKTGNTVGLEVA
TKYRIDPVSFVKGKINDRGVAAIAYNVLLREGVTLGVGASFDTQKLDQATHKVGTSFTFE
> POR1_YEAST
PPVYSDISRNINDLLNKDFYHATPAAFDVQTTTANGIKFSLKAKQPVKDGPLSTNVEAKLNDKQTGLGLTQGWSNTNNLQTKLEFANLTPGLKNELITSLTPGVAKSAVLN
TTFTEPFFTARGAFDLCLKSPTFVGDLTMAHEGIVGGAEFGYDISAGSISRYAMALSYFAKDYSLGATLNNEQITTVDFFQNVNAFLQVGAKATMNCKLPNSNVNIEFATR
YLPDASSQVKAKVSDSGIVTLAYKQLLRPGVTLGVGSSFDALKLSEPVHKLGWSLSF
ATAB Test Sequences
>143B_BOVIN cyt
TMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERNLLSVAYKNVVGARRSSW
RVISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLQLLDKYLIPNATQPESKVFYL
KMKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYY
EILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTLIMQLLRDNLTLWTSENQGDEGDA
GEGEN
> DBI5_MOUSE cyt
MSQVEFEMACASLKQLKGPVSDQEKLLVYSFYKQATQGDCNIPVPPATDVRAKAKYEAWM
VNKGMSKMDAMRIYIAKVEELKKKEPC
> RT09_HUMAN mitochondrial
MAAPCVSYGGAVSYRLLLWGRGSLARKQGLWKTAAPELQTNVRSQILRLRHTAFVIPKKN
VPTSKRETYTEDFIKKQIEEFNIGKRHLANMMGEDPETFTQEDIDRAIAYLFPSGLFEKR
ARPVMKHPEQIFPRQRAIQWGEDGRPFHYLFYTGKQSYYSLMHDVYGMLLTLEKHQSHWQ
AKSLLPEKTVTRDVIGSRWLIKEELEEMLVGKLSDLDYMQFIRLLEKLLTSQCGAAEEEF
VQRFRRSVTLESKKQLIEPVQYDEQGMAFSKSEGKRKTAKAEAIVYKHGSGRIKVNGIDY
QLYFPITQDREQLMFPFHFVDRLGKHDVTCTVSGGGRSAQAGAIRLAMAKALCSFVTEDE
VEWMRQAGLLTTDPRVRERKKPGQEGARRKFTWKKR
> gi|16554607|ref|NP_060611.2| (NM_018141) mitochondrial
MAARTAFGAVCRRLWQGLGNFSVNTSKGNTAKNGGLLLSTNMKWVQFSNLHVDVPKDLTK
PVVTISDEPDILYKRLSVLVKGHDKAVLDSYEYFAVLAAKELGISIKVHEPPRKIERFTL
LQSVHIYKKHRVQYEMRTLYRCLELEHLTGSTADVYLEYIQRNLPEGVAMEVTKTQLEQL
PEHIKEPIWETLSEEKEESKS
N-Glycosylation Example
Top of Form 1
> gi|9256644|ref|NP_061349.1| solute carrier family 1 [Mus musculus]
MEKSGETNGYLDGTQAEPAAGPRTPETAMGKSQRCASFFRRHALVLLTVSGVLVGAGMGAALRGLQLTRT
QITYLAFPGEMLLRMLRMIILPLVVCSLVSGAASLDASSLGRLGGIAVAYFGLTTLSASALAVALAFIIK
PGAGAQTLQSSSLGLENSGPPPVSKETVDSFLDLLRNLFPSNLVVAAFTTSATDYTVVTHNTSSGNVTKE
KIPVVTDVEGMNILGLVLFALVLGVALKKLGPEGEDLIRFFNSFNEATMVLVSWIMWYVPIGIMFLIGSK
IVEMKDIVMLVTSLGKYIFASMLGHVIHGGIVLPLVYFAFTRKNPFTFLLGLLTPFATAFATCSSSATLP
SMMKCIEENNGVDKRISRFILPIGATVNMDGAAIFQCVAAVFIAQLNNVDLNAGQIFTILVTATASSVGA
AGVPAGGVLTIAIILEAIGLPTHDLSLILAVDWIVDRTTTVVNVEGDALGAGILNHLNQKVVKKGEQELQ
EVKVEAIPNSKSEEETSPLVTHQNPAGPVAIAPELESKESVL
Bottom of Form 1
Phosphorylation Example
> seq2
ASQKRPSQRHGSKYLATASTMDHARHGFLPRHRDTGILDSIGRFFGGDRGAPK
NMYKDSHHPARTAHYGSLPQKSHGRTQDENPVVHFFKNIVTPRTPPPSQGKGR
KSAHKGFKGVDAQGTLSKIFKLGGRDSRSGSPMARRELVISLIVES
>A60D_DROME (SEG-test sequence)
MNALLRHKGRNLRTSHLAQNVYKRFLKSNCCACSSVNVTDEPAKEDELPRRSASTSVLEL
SRSLGTYRRFQPHANYGYDYSGYGFRHLHTSRTLLETSSSKIDATVKKLKNQQKEKVEEI
MKEVANGQAAAVRASSAATATASSEKGQNASATAGSTSATASTTSLAKTADKSVAKPKKP
LRTRIWDELVHYYHGFRLLFIDVAICSKLLWRVLNGKTLTRRENKQLQRTTSDLFRLIPF
SVFIIVPFMELLLPLFIKFFPGMLPSTFQTSTDRQEKLRQSLSVRLEVAKFLQQTLDQMP
VQHKEHSSEEAKQFEAFFTKIRNPTEPVSNDEIIKFAKRFDDEITLDSLSREQLAALCRV
LELNTIGTTTLLRFQLRLKLRSLATDDRVIAREGVDSLDLLELQQACKARGMRAYGLTEE
RLRFQLKEWIDLSLNEQVPPTLLLLSRTMLISDDSITTDKLKETIRVLPDAVGAHTRHAI
GESEGKVDNKTKIEIIKEEERKIREEREEEREETIAKRSAIKEEIPAPYVFAEKLSGSQD
LLDHKEQSSVSETDKGISSTDVQLLSEALKTLSSDKQLVVEKETIKELKEELADYKEDVE
ELREVRQVVKEPVRESRAAKLLYNRVNKMISQLDNVLNDLEARQHQIKQAESSDYAASSP
TVEPQQMVHIDELVATIRRMKEASDEERFKVVGDLLVKLDADKDGVISVNEITKAVQSID
REATNIDKKQLEEFTELLSKLASRRRHEEIVHIDDLMNNIKVLKETSDEARLKHIEAVLE
KFDADKDGVVTVNDIRKVLESIGRDNIKLSDKAIEELISLLDKEQVLQAEQKIEKAIAKS
MKEAEKLKSEVDKADKDLSKLVNDIHDSAKEIQDIANEMRDKEETVPDKAKELKAEPAFK
DTAKTLKDNAKDLDDLAKDPKSDPKSPTKASTGSGPAGLSGGGPSSGSSGIATGSTTESA
LREAAERQMEKILPSTDIGLPPTIQTPSQPPTSKKATATASTLSTTITAKKLL
CLUSTAL W (1.83) multiple sequence alignment
PORIN_HUMAN MSWCNELRLPALKQHSIGRGLESHITMCIPPSYADLGKAARDIFNKG-FGFGLVKLDVKT
POR1_MOUSE_14_295 --------------------------MAVPPTYADLGKSARDVFTKG-YGFGLIKLDLKT
POR3_BOVIN_1_282 --------------------------MCNTPTYCDLGKAAKDVFNKG-YGFGMVKIDLRT
PORI_DROME_1_281 ---------------------------MAPPSYSDLGKQARDIFSKG-YNFGLWKLDLKT
PORI_NEUCR_1_282 ---------------------------MAVPAFSDIAKSANDLLNKDFYHLAAGTIEVKS
POR1_YEAST_3_281 -----------------------------PPVYSDISRNINDLLNKDFYHATPAAFDVQT
POR4_SOLTU_3_274 ----------------------------GPGLYTEIGKKARDLLYKD--YQSDHKFSITT
POR6_SOLTU_3_274 ----------------------------GPGLYSDIGKKARDLLYRD--YVSDHKFTVTT
POR1_ARATH_3_274 ----------------------------GPGLYTEIGKKARDLLYKD--HNSDQKFSITT
POR1_WHEAT_1_274 --------------------------MGGPGLYSGIGKKAKDLLYRD--YQTDHKFTLTT
: :.: .*:: :. : : :
PORIN_HUMAN KSCSGVEFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQICQ
POR1_MOUSE_14_295 KSENGLEFTSSGSANTETTKVNGSLETKYRWTEYGLTFTEKWNTDNTLGTEITVEDQLAR
POR3_BOVIN_1_282 KSCSGVEFSTSGHAYTDTGKASGNLETKYKICNYGLTFTQKWNTDNTLGTEISWENKLAE
PORI_DROME_1_281 KTSSGIEFNTAGHSNQESGKVFGSLETKYKVKDYGLTLTEKWNTDNTLFTEVAVQDQLLE
PORI_NEUCR_1_282 NTPNNVAFKVTGKS-THDKVTSGALEGKFTDKPNGLTVTQTWNTANALETKVEMADNLAK
POR1_YEAST_3_281 TTANGIKFSLKAKQPVKDGPLSTNVEAKLNDKQTGLGLTQGWSNTNNLQTKLEFAN-LTP
POR4_SOLTU_3_274 YSPTGVVITSSGSK--KGDLFLADVNTQLKNKN--VTTDIKVDTNSNLFTTITVDE-AAP
POR6_SOLTU_3_274 YSTTGVAITASGLK--KGELFLADVSTQLKNKN--ITTDVKVDTNSNVYTTITVDE-PAP
POR1_ARATH_3_274 FSPAGVAITSTGTK--KGDLLLGDVAFQSRRKN--ITTDLKVCTDSTFLITATVDE-AAP
POR1_WHEAT_1_274 YTANGPAITATSTK--KADLTVGEIQSQIKNKN--ITVDVKANSASNVITTITADDLAAP
: . :. . . : : : . . . :
PORIN_HUMAN GLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGY
POR1_MOUSE_14_295 GLKLTFDSSFSPNTGKKNAKIKTGYKREHINLGCDVDFDIAGPSIRGALVLGYEGWLAGY
POR3_BOVIN_1_282 GLKLTLDTIFVPNTGKKSGKLKASYKRDCFSLGSNVDIDFSGPTIYGWAVLAFEGWLAGY
PORI_DROME_1_281 GLKLSLEGNFAPQSGNKNGKFKVAYGHENVKADSDVNIDLKGPLINASAVLGYQGWLAGY
PORI_NEUCR_1_282 GLKAEGIFSFLPATNARGAKFNLHFKQSNFHGRAFFDL-LKGPTANIDAIVGHEGFLAGA
POR1_YEAST_3_281 GLKNELITSLTPGV-AKSAVLNTTFTEPFFTARGAFDLCLKSPTFVGDLTMAHEGIVGGA
POR4_SOLTU_3_274 GLKTILS---FRVPDQRSGKLEVQYLHDYAGICTSVGLTAN-PIVNFSGVVGTNIIALGT
POR6_SOLTU_3_274 GLKTIFS---FVVPDQKSGKVELQYLHEYAGINTSIGLTAS-PLVNFSGVAGNNTVALGT
POR1_ARATH_3_274 GLRSIFS---FKVPDQNSGKVELQYLHEYAGISTSMGLTQN-PTVNFSGVIGSNVLAVGT
POR1_WHEAT_1_274 GLKTILS---FAVPDQKSGKVELQYLHDYAGINASIGLTAN-PVVNLSGAFGTSALAVGA
**: ... .: : . ..: * . . *
PORIN_HUMAN QMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVN-DGTEFGGSIYQKVCEDLDTSVNLAWTS
POR1_MOUSE_14_295 QMNFETSKSRVTQSNFAVGYKTDEFQLHTNVN-DGTEFGGSIYQKVNKKLETAVNLAWTA
POR3_BOVIN_1_282 QMSFDTAKSKLSQNNFALGYKAADFQLHTHVN-DGTEFGGSIYQKVNEKIETSINLAWTA
PORI_DROME_1_281 QTAFDTQQSKLTTNNFALGYTTKDFVLHTAVN-DGQEFSGSIFQRTSDKLDVGVQLSWAS
PORI_NEUCR_1_282 SAGYDVQKAAITGYSAAVGYHAPTYSAAITATDNLSVFSASYYHKVNSQVEAGSKATWNS
POR1_YEAST_3_281 EFGYDISAGSISRYAMALSYFAKDYSLGATLN-NEQITTVDFFQNVNAFLQVGAKATMNC
POR4_SOLTU_3_274 DVSFDTKTGDFTKCNAGLSFTNADLVASLNLNNKGDNLTASYYHTVSPLTSTAVGAEVNH
POR6_SOLTU_3_274 DLSFDTATGNFTKCNAGLSFSSSDLIASLALNDKGDTVSASYYHTVKPVTNTAVGAELTH
POR1_ARATH_3_274 DVSFDTKSGNFTKINAGLSFTKEDLIASLTVNDKGDLLNASYYHIVNPLFNTAVGAEVSH
POR1_WHEAT_1_274 DVSLDTATKNFAKYNAALSYTNQDLIASLNLNNKGDSLTASYYHIVE-KSGTAVGAELTH
. : .: .:.: . . . :: . ..
PORIN_HUMAN GTNC--TRFGIAAKYQLDPTASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINA
POR1_MOUSE_14_295 GNSN--TRFGIAAKYQVDPDACFSAKVNNSSLIGLGYTQTLKPGIKLTLSALLDGKNVNA
POR3_BOVIN_1_282 GSNN--TRFGIAAKYKLDCRTSLSAKVNNASLIGLGYTQTLRPGVKLTLSALIDGKNFNA
PORI_DROME_1_281 GTSN--TKFAIGAKYQLDDDASVRAKVNNASQVGLGYQQKLRDGVTLTLSTLVDGKNFNA
PORI_NEUCR_1_282 KTGN-TVGLEVATKYRIDPVSFVKGKINDRGVAAIAYNVLLREGVTLGVGASFDTQKLDQ
POR1_YEAST_3_281 KLPNSNVNIEFATRYLPDASSQVKAKVSDSGIVTLAYKQLLRPGVTLGVGSSFDALKLSE
POR4_SOLTU_3_274 SFSTNENIITVGTQHRLDPLTSVKARINNFGKASALLQHEWRPKSLFTVSGEVDTKSVDK
POR6_SOLTU_3_274 SFSSNENTLTIGTQHLLDPLTTVKARVNSYGKASALIQHEWRPKSLFTISGEVDTRAIEK
POR1_ARATH_3_274 KLSSKDSTITVGTQHSLDPLTSVKARVNSAGIASALIQHEWKPKSFFTISGEVDTKSIDK
POR1_WHEAT_1_274 SFSSNENSLTFGTQHTLDPLTLVKARINNSGKASALIQHEFMPKSLCTISAEVDTKAIEK
: ..::: * : . .::.. . :. .* ..
PORIN_HUMAN GGHKVGSPWSWRLNPAERNLWEWISEDLALIYFHCDQQQAFFPPEDDQNKG
POR1_MOUSE_14_295 GGHKLG---------------------LGLEFQ------------------
POR3_BOVIN_1_282 GGHKVG---------------------LGFELE------------------
PORI_DROME_1_281 GGHKIG---------------------VGLELE------------------
PORI_NEUCR_1_282 ATHKVG---------------------TSFTFE------------------
POR1_YEAST_3_281 PVHKLG---------------------WSLSF-------------------
POR4_SOLTU_3_274 -GAKFG---------------------LALALK------------------
POR6_SOLTU_3_274 -SAKIG---------------------LAVALK------------------
POR1_ARATH_3_274 -SAKVG---------------------LALALK------------------
POR1_WHEAT_1_274 -SSKVG---------------------IAIALK------------------
*.* ..
>gi|75677554|ref|NM_181395.1| Mus musculus peroxidasin homolog (Drosophila) (Pxdn), mRNA
GGACCGGAGGGCTCAGTTGGGAGCCGGCGGTGGACGCGCCCGCCGAGGCCTCCTCCGCTGCTGTTCACGC
GTGCCAGCTCCTGTCCGCGCCATCTGCCATGGCCGTGCGCCCCACGCGCCGCTGCCTGCTGGCGCTCCTG
CTGTGCTTTGCCTGGTGGGCCATGGCGGTGGTCGCCTCGAAGCAAGGGGCAGGCTGTCCAAGCCGCTGCC
TGTGTTTCCGTACCACCGTGCGCTGCATGCATCTGTTGCTGGAGGCCGTGCCCGCCGTGGCGCCGCAGAC
CTCCATCCTAGATCTTCGGTTCAACAGAATCAGAGAGATCCAACCCGGGGCATTCAGGAGGCTGAGGAGC
CTGAACACACTGCTTCTTAACAACAACCAGATCAAGAAGATCCCCAATGGTGCATTTGAGGACCTGGAGA
ACTTAAAATACCTCTATTTGTACAAGAATGAGATCCAATCAATTGACAGGCAAGCATTTAAGGGACTTGC
CTCTCTAGAGCAACTGTACCTGCACTTTAATCAGATAGAAACGCTGGACCCTGAATCCTTCCAGCACCTG
CCAAAGCTGGAGAGACTGTTTTTGCACAACAACCGTATCACGCACTTAGTTCCTGGGACGTTCAGTCAGT
TAGAGTCCATGAAACGGCTGCGATTGGACTCGAATGCACTCCACTGTGACTGTGAAATCCTATGGCTAGC
GGATCTACTGAAGACCTACGCCCAATCTGGAAACGCACAAGCAGCAGCTACATGCGAGTATCCCAGACGC
ATCCAAGGACGCTCTGTGGCTACCATCACCCCAGAAGAGCTGAACTGTGAAAGGCCCCGGATTACCTCAG
AGCCACAGGATGCAGATGTCACCTCAGGGAACACAGTGTACTTCACCTGCAGAGCTGAGGGCAACCCCAA
ACCTGAGATCATCTGGCTTCGAAACAATAACGAGTTGAGCATGAAGACGGACTCTCGCTTAAACTTGCTG
GACGATGGCACGCTGATGATTCAGAACACACAGGAGGCGGATGAGGGTGTCTACCAGTGCATGGCGAAAA
ATGTGGCTGGAGAGGCGAAAACGCAGGAGGTGACCCTCAGGTACTTGGGGTCTCCAGCCCGACCCACTTT
TGTAATCCAGCCGCAGAACACAGAGGTACTGGTGGGTGAGAGTGTCACTCTGGAGTGCAGTGCCACAGGC
CACCCTCTGCCTCAGATCACCTGGACAAGAGGTGACCGCACACCCTTGCCAATTGACCCTCGAGTGAATA
TCACTCCCTCTGGAGGACTGTATATACAGAACGTCGCCCAGAGTGACAGCGGCGAGTACACGTGCTTTGC
ATCCAATAGTGTGGACAGTATCCATGCCACAGCCTTCATCATTGTACAAGCCCTTCCTCAGTTCACTGTG
ACCCCACAGAGCCGAGTGGTCATTGAAGGGCAGACTGTGGATTTCCAGTGTGCGGCTAAGGGACACCCTC
AGCCTGTCATAGCCTGGACCAAGGGAGGGAGCCAGCTCTCAGTGGACAGGCGGCACCTGGTGCTGTCCTC
AGGAACACTCAGGATTTCTGGGGTGGCCCTGCATGACCAGGGCCAGTATGAGTGCCAGGCCGTCAATATC
ATTGGCTCCCAGAAGGTCGTGGCCCACCTGACAGTACAGCCTAGAGTCACCCCGGTATTTGCCAGCATTC
CCAGTGACATGACTGTAGAGGTGGGCACCAACGTGCAGCTGCCTTGTAGCTCCCAGGGAGAACAAGAGCC
AGCCATCACCTGGAACAAGGATGGTGTTCAGGTAACAGAAAGTGGAAAATTTCACATCAGCCCCGAAGGA
TTCTTGACCATCAATGATGTTGGCACTGCCGATGCAGGTCGCTATGAGTGTGTAGCTCGGAACACAATTG
GATATGCCTCTGTGAGCATGGTACTCAGTGTGAATGTTCCTGATGTAAGTCGGAATGGGGATCCCTATGT
TGCTACCTCTATTGTTGAAGCCATTGCAACTGTTGACAGAGCCATCAACTCTACAAGGACACACTTGTTT
GACAGCCGTCCTCGTTCTCCAAATGACCTGCTCGCTCTGTTCCGGTACCCACGGGATCCATACACAGTGG
GACAAGCCAGGGCAGGAGAGATATTCGAGCGGACCCTGCAGCTGATCCAGGAGCATGTTCAGCATGGCTT
GATGGTGGACTTGAATGGAACAAGTTACCACTACAATGATCTGGTGTCCCCGCAGTACCTGAGCCTCATC
GCCAACCTGTCAGGCTGCACTGCACACCGCCGCGTGAACAACTGCTCAGACATGTGCTTCCACCAGAAGT
ATAGGACGCACGATGGCACGTGCAACAATCTACAGCACCCGATGTGGGGTGCCTCACTGACCGCCTTTGA
GCGCCTGTTGAAGGCTGTGTATGAGAATGGGTTCAACACACCCCGGGGCATTAATTCCCAGCGTCAGTAC
AATGGGCATGTACTACCCATGCCCCGCCTGGTGTCCACCACACTGATTGGGACAGAGGTGATCACCCCCG
ATGAGCAGTTTACACACATGCTGATGCAATGGGGCCAGTTCCTTGACCATGACCTAGACTCTACAGTGGT
AGCCCTGAGCCAGGCCCGCTTCTCTGACGGCCAGCATTGCAGCTCTGTGTGCAGCAATGACCCTCCCTGT
TTCTCGGTCATGATCCCCCCCAATGATCCCCGGGTGCGGAGTGGCGCCCGATGCATGTTCTTCGTGCGAT
CGAGCCCCGTGTGTGGCAGCGGCATGACGTCCCTGCTCATGAACTCTGTGTACCCTCGAGAGCAGATCAA
CCAGCTCACCTCCTACATCGATGCCTCCAATGTGTACGGCAGCACAGACCACGAAGCCCGCAGCATCCGG
GACCTGGCCAGCCACCGTGGCCTGCTGCGTCAGGGCATTGTGCAGAGGTCTGGCAAGCCCCTGCTTCCCT
TTGCCACCGGGCCACCCACTGAGTGCATGCGCGATGAGAACGAGAGCCCGATACCATGCTTTCTGGCCGG
CGACCACCGTGCCAACGAGCAGCTTGGCCTGACCAGCATGCATACGCTGTGGTTCCGGGAGCACAACCGC
ATTGCAGCAGAGCTGTTGAAGCTAAACCCGCACTGGGATGGGGACACTGTCTACCATGAGACCCGCAAGA
TAGTCGGGGCAGAGATACAGCACATCACCTACCGGCACTGGCTGCCCAAGATCCTGGGGGAGGTGGGCAT
GAAGATGCTCGGTGAGTACCGGGGCTACGACCCCAGTGTCAATGCTGGCATCTTTAATGCCTTTGCCACT
GCAGCCTTCAGGTTCGGTCACACTCTGATCAACCCTCTGCTCTACCGGCTGGATGAGAACTTTGAGCCCA
TCCCTCAGGGCCATGTGCCCCTCCACAAAGCCTTCTTCTCGCCCTTCCGGATTGTCAACGAGGGGGGCAT
CGACCCACTTCTCCGAGGGCTGTTTGGAGTGGCAGGGAAGATGCGCATTCCCTCTCAATTGCTGAACACA
GAGCTCACGGAGAGGCTGTTCTCCATGGCCCACACAGTGGCCCTGGACCTGGCTGCCATCAATATCCAGC
GAGGCCGGGACCATGGCATCCCACCCTACCATGACTACAGAGTCTACTGCAACTTGTCGGCTGCTTACAC
CTTTGAGGACCTGAAAAATGAGATCAAGAGCCCTGTGATCCGGGAGAAACTGCAGAGGCTGTATGGCTCG
ACTCTCAACATTGATCTGTTCCCAGCCCTCATGGTAGAAGACCTAGTACCTGGCAGCCGCTTGGGGCCCA
CACTCATGTGCCTGCTCAGCACACAGTTCCGACGCCTGCGGGATGGAGACAGGTTGTGGTATGAGAACCC
AGGCGTGTTCTCCCCCGCCCAGCTGACTCAGCTCAAGCAGACGTCCCTGGCGAGGATCCTTTGTGACAAC
TCAGACAACATCACCCGTGTGCAGCAGGATGTGTTCAGGGTGGCAGAGTTCCCCCACGGTTATAGCAGCT
GTGAGGACATCCCCAGGGTGGACCTGCGAGTGTGGCAGGACTGTTGTGAAGATTGTAGGACCAGGGGACA
ATTCAATGCTTTCTCCTACCATTTCCGGGGAAGACGGTCTCTAGAATTTAGCTATGAGGACGATAAGCCC
ACAAAGAGAGCCAGGTGGCGGAAAGCACTAAGTGTAAAGCATGGCAAACATCTTAGCAATGCCACATCAG
CCACCCACGAGCACTTGGAAGGGCCAGCAACTAATGATCTCAAGGAATTTGTTCTGGAAATGCAAAAGAT
CATCACAGACCTCAGAAAACAGATAAACAGCTTGGAGTCTCGGCTCAGCACCACAGAATGTGTGGATGAC
AGCGGTGAATCTCACGGCGGCAACACAAAGTGGAAAAAAGACCCATGCACAGTTTGTGAGTGCAAAAATG
GCCAGATCACCTGCTTTGTGGAAGCTTGCCAGCCTGCAGCCTGCCCCCAGCCTGTGAAAGTGGAAGGCGC
TTGCTGTCCCGTCTGCTTAAAGAACACTGCAGAGGAAAAGCCTTAGTGTTCCCGAGCTTAGCTCCTCGAA
GCTGGTCCACAGAATACTTGTGAGCCTAGATGACAACATGGGGAGCTTCAGACCACAGGACAAGTTGGGA
TCTCAGACATTTCAGGACCTCTGCTGTGCCATCGCAGAAGCAGCCGGGGCTGCTTCACACCCTGTGTTTG
TAGAAGGAAATTGAGCAGGCGGGAGTGGGTGCAGGCTCTGGCCCTCACTTCATGTTAGACTTCTCAGGTT
TATATTTAAGTGTTTTTAAATGGAAAATTGGTGCTACTATTAAATCACACAGTTG
>gi|27924101|gb|BC044828.1| Mus musculus peroxidasin homolog (Drosophila), mRNA (cDNA clone IMAGE:5008265), partial cds
CAGGGCAGGAGAGATATTCGAGCGGACCCTGCAGCTGATCCAGGAGCATGTTCAGCATGGCTTGATGGTG
GACTTGAATGGGACAAGTCAGTACCATGTTCTTCCTCCTCTGACTTGAGTGCGAGGTAGCAGACACTCTC
TGCTGCATGTGCATCATGTTACCACTACAATGATCTGGTGTCCCCGCAGTACCTGAGCCTCATCGCCAAC
CTGTCAGGCTGCACTGCACACCGCCGCGTGAACAACTGCTCAGACATGTGCTTCCACCAGAAGTATAGGA
CGCACGATGGCACGTGCAACAATCTACAGCACCCGATGTGGGGTGCCTCACTGACCGCCTTTGAGCGCCT
GTTGAAGGCTGTGTATGAGAATGGGTTCAACACACCCCGGGGCATTAATTCCCAGCGTCAGTACAATGGG
CATGTACTACCCATGCCCCGCCTGGTATCCACCACACTGATTGGGACAGAGGTGATCACCCCCGATGAGC
AGTTTACACACATGCTGATGCAATGGGGCCAGTTCCTTGACCATGACCTAGACTCTACAGTGGTAGCCCT
GAGCCAGGCCCGCTTCTCTGACGGCCAGCATTGCAGCTCAGTGTGCAGCAATGACCCTCCCTGTTTCTCG
GTCATGATCCCCCCCAATGATCCCCGGGTGCGGAGTGGCGCCCGATGCATGTTCTTCGTGCGATCGAGCC
CCGTGTGTGGCAGCGGCATGACGTCCCTGCTCATGAACTCTGTGTACCCTCGAGAGCAGATCAACCAGCT
CACCTCCTACATCGATGCCTCCAATGTGTACGGCAGCACAGACCACGAAGCCCGCAGCATCCGGGACCTG
GCCAGCCACCGTGGCCTGCTGCGTCAGGGCATAGTGCAGAGGTCTGGCAAGCCCCTGCTTCCCTTTGCCA
CCGGGCCACCCACTGAGTGCATGCGCGATGAGAACGAGAGCCCGATACCATGCTTTCTGGCCGGCGACCA
CCGTGCCAACGAGCAGCTTGGCCTGACCAGCATGCATACGCTGTGGTTCCGGGAGCACAACCGCATTGCA
GCAGAGCTGTTGAAGCTAAACCCGCACTGGGATGGGGACACTGTCTACCATGAGACCCGCAAGATAGTCG
GGGCAGAGATACAGCACATCACCTACCGGCACTGGCTGCCCAAGATCCTGGGGGAGGTGGGCATGAAGAT
GCTCGGTGAGTACCGGGGCTACGACCCCAGTGTCAATGCTGGCATCTTTAATGCCTTTGCCACTGCAGCC
TTCAGGTTCGGTCACACTCTGATCAACCCTCTGCTCTACCGGCTGGATGAGAACTTTGAGCCCATCCCTC
AGGGCCATGTGCCCCTCCACAAAGCCTTCTTCTCGCCCTTCCGGATTGTCAACGAGGGGGGCATCGACCC
ACTTCTCCGAGGGCTGTTTGGAGTGGCAGGGAAGATGCGCATTCCCTCTCAATTGCTGAACACAGAGCTC
ACGGAGAGGCTGTTCTCCATGGCCCACACAGTGGCCCTGGACCTGGCTGCCATCAATATCCAGCGAGGCC
GGGACCATGGCATCCCACCCTACCATGACTACAGAGTCTACTGCAACTTGTCGGCTGCTTACACCTTTGA
GGACCTGAAAAATGAGATCAAGAGCCCTGTGATCCGGGAGAAACTGCAGAGGCTGTATGGCTCGACTCTC
AACATTGATCTGTTCCCAGCCCTCATGGTGGAAGACCTGGTACCTGGCAGCCGCTTGGGGCCCACACTCA
TGTGCCTGCTCAGCACACAGTTCCGACGCCTGCGGGATGGAGACAGGTTGTGGTATGAGAACCCAGGCGT
GTTCTCCCCCGCCCAGCTGACTCAGCTCAAGCAGACGTCCCTGGCGAGGATCCTTTGTGACAACTCAGAC
AACATCACCCGTGTGCAGCAGGATGTGTTCAGGGTGGCAGAGTTCCCCCACGGTTATAGCAGCTGTGAGG
ACATCCCCAGGGTGGATCTGCGAGTGTGGCAGGACTGTTGTGAAGATTGTAGGACCAGGGGACAATTCAA
TGCTTTCTCCTACCATTTCCGGGGAAGACGGTCTCTAGAATTTAGCTATGAGGACGATAAGCCCACAAAG
AGAGCCAGGTGGCGGAAAGCGCTAAGTGTAAAGCATGGCAAACATCTTAGCAATGCCACATCAGCCACCC
ACGAGCACTTGGAAGGGCCAGCAACTAATGATCTCAAGGAATTTGTTCTGGAAATGCAAAAGATCATCAC
AGACCTCAGAAAACAGATAAACAGCTTGGAGTCTCGGCTCAGCACCACAGAATGTGTGGATGACAGCGGT
GAATCTCACGGCGGCAACACAAAGTGGAAAAAAGACCCATGCACAGTTTGTGAGTGCAAAAATGGCCAGA
TCACCTGCTTTGTGGAAGCTTGCCAGCCTGCAGCCTGCCCCCAGCCTGTGAAAGTGGAAGGCGCTTGCTG
TCCCGTCTGCTTAAAGAACACTGCAGAGGAAAAGCCTTAGTGTTCCCGAGCTTAGCTCCTCGAAGCTGGT
CCACAGAATACTTGTGAGCCTAGATGACAACATGGGGAGCTTCAGACCACAGGACAAGTTGGGATCTCAG
ACATTTCAGGACCTCTGCTGTGCCATCGCAGAAGCAGCCGGGGCTGCTTCACACCCTGTGTTTGTAGAAG
GAAATTGAGCAGGCGGGAGTGGGTGCAGGCTCTGGCCCTCACTTCATGTTAGACTTCTCAGGTTTATATT
TAAGTGTTTTTAAATGGAAAATTGGTGCTACTATTAAATCACACAGTTGAGACATGGGTTTCGAAATTGC
TTCGGCCTGATGATGCCACTTCTGTTTAAGCGAGGCAGAAGGTCTTTGACAGGCTTCATTTAAACAGAAG
CATTTGGCAAATGGAACAGGATCTTATAAAATAATATCCTGGGCCAAAATCCCTACAGGGATACTAGGGC
TTTCTCCCACTGCTCCCTCATGTCTGCCATCTTCCCCTTCAGGGCCCTTCAGCGTCTAATAACAATACCA
ACACAAACCTGCACCTTGCGACCATGTCCACCCTTCTACACAGGACTGTCACATTATCAAGCCACAGTTA
GAAATAATCTTGTTTCCACTTAGAACATGTGTAAATACTGCCTTGCTGTAAATTGAATGTGAAATCCTTT
GGTCCTATGCCATGTTGAGTGACCCGAACAGCCCCTTGCATCGGCAGGATGGTCTGCAGCCACCACCTGC
CTTGGTTCCTTGGTACACCGGCCATGGTGTAGGGAGGGTGATTCACTCTCCAGGTCCTTTGAATAGGTGA
CAGACAGGCAGAGCCTGAATGTCAGGAAATTCCATTGTATGAATGCTTTAATGGAAAACGTTCCCTATAC
CTCATTATTTTTTATTGCATGATGTTTTATAAAAATTTGCCCATTTTAGATGTTAGAAAAATTATTTCCA
CTATGCAAATTTCTTTTTAATTCAGTGAAAAGCAACTGTTATACCTCATAGTCTCTTGTTTTTAATTGAC
CAAAATATTCCATTCTATTCTCACAAGGTTCTGAGGTCTCTGCCTGAAAAAGCAAGTCTCACCCTATAGA
CACTGATGTGCCCAGCACTACGTGCCAGCCATTGTGGGAACACAAGAGGGTCACCTGCCCAAGGGCTAGG
GAGGAAGACCTCAAGAACACAGAGGAGGTGGAAAAGGACAAAATAGGTGCCATTTTAGGGAGACCATGGT
CAGATAGGGGGATGCTCAGTTCCTTGCAGCCTCGGGCGGGCTTATGTCGGCACTAAGGCCATTGTGGTGT
GTACTTATATGATCCCTATGCTGATAGGATTACCTTCCTAGACATAGCTAGACGCAAAGCCACATGTGTA
AGGCTGCTGAGCAAAGACAGCATCCCAGCATGGGTGTGTTCACGGTGGATTCACCACGTTGCATAGTAAA
GTGGTCCCCTTGGCTTACCCTTCACTTTGCTCATGAGATTCAGAAGCTTGTGGTCCAGCAGGGGTGAGCA
TTTGTGAAATAGTAAGCTGAACTTAGTGGTGAGATTTCAGAACAGACTTCTGTGAAGTAAGAGATGTAAC
CATGCATCTAAAATCGGATGGCCGTGTAACTGCTCGGGCATAGAAATGGTGGGAGAACCTGTCCTGGGTA
CCTGGCATTTCACATGAGCCCAGGGATATGTCTTGTGCCAAGGCACACAAGTGTCCGTGGACTTGGACAG
GTGCCAAGGGTTTTTGTCTCTGTTCCTATGTGGGAGGCTGGCTGTGATTTACATTAATTTCTGTATTTCA
AACGAAGATGTCTGCAGATCTCCATTTTGATGTTACAGCCTCATTGCCCAGGCAGTGGGCAGTGGCCAGA
CACCCTTTCTGACTAGCCACTGCATTGGGCTTCTGTGATTCAAAGTAGTGTATATATTTATTTACTTCTC
TGACTGTGGCCAACAGCCAAATGCCATTTTATGTTCCTTGTATTCAGTCCATTACCAAAGAGGTGTTTGC
ACTTTGTAATGATACCTTTCAGTTCAAATAAAAGGACCACATCGTTAAGTGGAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA
>gi|147905582|ref|NP_001081848.1| polysomal ribonuclease 1 [Xenopus laevis] Peroxidasin
MAPADLWLGVLLILASVQGTASSYYDGVEELDNNLILSCTKEAKHLVDTAYKNTRLMLKDRLRRRTVSAS
DLMAYFKQPVCSRNAIRAADYMGTTLQLLSHKLKPFHHRPFNITELLTETQIDAIYKLTGCAYQHLPSAC
QESPYRTITGQCNNRKNPILGASNTGFTRLLPVVYEDGLSVPRGWTENLPINGFPLPLARAVSNEIVRFP
NENLTLDEGRALIFMQWGQWTDHDLDLSPETPARSTFLEGIDCDTNCAKEPPCFPLKIPPNDPRISNQSD
CIPLFRSSPVCTPGSPVREQINILTSFIDGSQVYGSDWPLAVKLRNNTNQLGLMAINQRFTDNGLPFLPF
ETAEEDFCVLTNRSSGIPCFLGGDPRVSEQPGLTAFHTLFVRAHNNIAARLRELNPRWSGETLYQEARKI
IGGILQKITYKDWLPLLLGSEMAAVLPAYRSYNESVDPRVSNVFTVVFRMGHTLIQPFIYRLADGYRPLG
PEPQIPLHKTFFNSWRVVREGGIDPLLRGLMANRAKLNRQNQLVVDELRERLFVLFKRIGLDLTAINMQR
GREHGLPGYNAWRRFCGLSAPSNVNELAAVLNNRNLAEKFIKLYGSPENIDIWVGGVAESLVRNGRIGKL
LTCLIGNQFRRARDGDRFYYEQPSVFTNEQRASIERVTLARVICDNTKITEVPRNVFLGNRYPRDFVACS
RIPTLDLNPWKVA