Datasets

Test Datasets [Download]

>rrm-1
IYVGNLMFQVTDKDIAREFSQQGRFVYIHLKSGRVTGRRDGFGYILFTTY
KDAHQALSTLGGFYVNSRNLSC
> rrm-2
LYIRNLDTVTEKQSMLTEFSPFGDDLHLRAIKNSDASSSANSKGFAFIDF
KDEDSAEAAFRKLGEIPINGSTLDV
> rrm-3 (Psi-blast)
AYVSRLSYDTTEEHVREAFGNFPPIVSALLINDCNNFGFIEFKDREETAH
IMEGLPKTDFNNVSVKK
> rrm-4
LFIIGLPDETSESQLRQIFAAFGGVCTFNLIKQPVTRLTSERNKCRGYAF
VIYQNKEDAANARNLNGIAINNRIVRV
> rrm-5
LYVGDVSYQIQTTSVLIYFSPEGDKTTISMILQKNNDGKGFVHFPKHPSA
TDIPDRMDGSELLGKKLAL
> rrm-6
YIGGLAEKINKSELSLLFGRFGSSMSAKVVKIRQTNRNQPMCFISFKTEE
DTNMAMKEMFNKEFLGCELDV
> rrm-7
LFVKNLEPMETTDEDVRAFFTKNGVLKECNMIVDKETGHLRGKGALNFED
PVSAEKAVRNLNGGKFNSNEATL
> rrm-8
VKIRGLDQGATGKAIRDAFNQYGNLIVIIVVMDTNSRQFNMGFIEFKDQL
DANVALREKDGIPVLNERSAV
> rrm-9(good seq for testing BLOSSUM matrices)
LYIGDLPDAIRDDDLKDLFRKFGEIETIKAISSGVPTRARGYAFVRFARY
EDIREALDEKEGYQPCDYAEKI
> rrm-10
LFLSNLPRDCNRHKLRDMFSKYGLIVKIRAHPDKDTGPTRGYAFVQFKSS
EDVDKAKALNKYFLGLRKIRV

GFP (SYG) residues 65-67 in Aequorea victoria
PHI patterns: x(5)SYGx(1,3)
> gi|18175269|gb|AAK02065.1| GFP
MSKGEELFTGIVPVLIELDGDVHGHKFSVRGEGEGDADYGKLEIKFICTT
GKLPVPWPTLVTTLSYGILCFARYPEHMKMNDFFKSAMPEGYIQERTIFF
QDDGKYKTRGEVKFEGDTLVNRIELKGMDFKEDGNILGHKLEYNFNSHNV
YIMPDKANNGLKVNFKIRHNIEGGGVQLADHYQTNVPLGDGPVLIPINHY
LSTQTAISKDRNETRDHMVFLEFFSACGHTHGMDELYK

Sequences for Clustalw, Mview and Jalview
Mirochondrial Porin sequences

>PORIN_HUMAN
MSWCNELRLPALKQHSIGRGLESHITMCIPPSYADLGKAARDIFNKGFGFGLVKLDVKTKSCSGVEFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEI
AIEDQICQGLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGYQMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVNDGTEFGGSIYQ
KVCEDLDTSVNLAWTSGTNCTRFGIAAKYQLDPTASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINAGGHKVGSPWSWRLNPAERNLWEWISEDLALIYFHCDQ
QQAFFPPEDDQNKG
> POR1_MOUSE
MAVPPTYADLGKSARDVFTKGYGFGLIKLDLKTKSENGLEFTSSGSANTETTKVNGSLETKYRWTEYGLTFTEKWNTDNTLGTEITVEDQLARGLKLTFDSSFSPNTGKKN
AKIKTGYKREHINLGCDVDFDIAGPSIRGALVLGYEGWLAGYQMNFETSKSRVTQSNFAVGYKTDEFQLHTNVNDGTEFGGSIYQKVNKKLETAVNLAWTAGNSNTRFGIA
AKYQVDPDACFSAKVNNSSLIGLGYTQTLKPGIKLTLSALLDGKNVNAGGHKLGLGLEFQ
> POR3_BOVIN
MCNTPTYCDLGKAAKDVFNKGYGFGMVKIDLRTKSCSGVEFSTSGHAYTDTGKASGNLETKYKICNYGLTFTQKWNTDNTLGTEISWENKLAEGLKLTLDTIFVPNTGKKS
GKLKASYKRDCFSLGSNVDIDFSGPTIYGWAVLAFEGWLAGYQMSFDTAKSKLSQNNFALGYKAADFQLHTHVNDGTEFGGSIYQKVNEKIETSINLAWTAGSNNTRFGIA
AKYKLDCRTSLSAKVNNASLIGLGYTQTLRPGVKLTLSALIDGKNFNAGGHKVGLGFELE
> PORI_DROME
MAPPSYSDLGKQARDIFSKGYNFGLWKLDLKTKTSSGIEFNTAGHSNQESGKVFGSLETKYKVKDYGLTLTEKWNTDNTLFTEVAVQDQLLEGLKLSLEGNFAPQSGNKNG
KFKVAYGHENVKADSDVNIDLKGPLINASAVLGYQGWLAGYQTAFDTQQSKLTTNNFALGYTTKDFVLHTAVNDGQEFSGSIFQRTSDKLDVGVQLSWASGTSNTKFAIGA
KYQLDDDASVRAKVNNASQVGLGYQQKLRDGVTLTLSTLVDGKNFNAGGHKIGVGLELE
> POR4_SOLTU
GPGLYTEIGKKARDLLYKDYQSDHKFSITTYSPTGVVITSSGSKKGDLFLADVNTQLKNKNVTTDIKVDTNSNLFTTITVDEAAPGLKTILSFRVPDQRSGKLEVQYLHDY
AGICTSVGLTANPIVNFSGVVGTNIIALGTDVSFDTKTGDFTKCNAGLSFTNADLVASLNLNNKGDNLTASYYHTVSPLTSTAVGAEVNHSFSTNENIITVGTQHRLDPLT
SVKARINNFGKASALLQHEWRPKSLFTVSGEVDTKSVDKGAKFGLALALK
> POR1_WHEAT_1_274
MGGPGLYSGIGKKAKDLLYRDYQTDHKFTLTTYTANGPAITATSTKKADLTVGEIQSQIKNKNITVDVKANSASNVITTITADDLAAPGLKTILSFAVPDQKSGKVELQYL
HDYAGINASIGLTANPVVNLSGAFGTSALAVGADVSLDTATKNFAKYNAALSYTNQDLIASLNLNNKGDSLTASYYHIVEKSGTAVGAELTHSFSSNENSLTFGTQHTLDP
LTLVKARINNSGKASALIQHEFMPKSLCTISAEVDTKAIEKSSKVGIAIALK
> POR6_SOLTU
GPGLYSDIGKKARDLLYRDYVSDHKFTVTTYSTTGVAITASGLKKGELFLADVSTQLKNKNITTDVKVDTNSNVYTTITVDEPAPGLKTIFSFVVPDQKSGKVELQYLHEY
AGINTSIGLTASPLVNFSGVAGNNTVALGTDLSFDTATGNFTKCNAGLSFSSSDLIASLALNDKGDTVSASYYHTVKPVTNTAVGAELTHSFSSNENTLTIGTQHLLDPLT
TVKARVNSYGKASALIQHEWRPKSLFTISGEVDTRAIEKSAKIGLAVALK
> POR1_ARATH
GPGLYTEIGKKARDLLYKDHNSDQKFSITTFSPAGVAITSTGTKKGDLLLGDVAFQSRRKNITTDLKVCTDSTFLITATVDEAAPGLRSIFSFKVPDQNSGKVELQYLHEY
AGISTSMGLTQNPTVNFSGVIGSNVLAVGTDVSFDTKSGNFTKINAGLSFTKEDLIASLTVNDKGDLLNASYYHIVNPLFNTAVGAEVSHKLSSKDSTITVGTQHSLDPLT
SVKARVNSAGIASALIQHEWKPKSFFTISGEVDTKSIDKSAKVGLALALK
> PORI_NEUCR
MAVPAFSDIAKSANDLLNKDFYHLAAGTIEVKSNTPNNVAFKVTGKSTHDKVTSGALEGKFTDKPNGLTVTQTWNTANALETKVEMADNLAKGLKAEGIFSFLPATNARGA
KFNLHFKQSNFHGRAFFDLLKGPTANIDAIVGHEGFLAGASAGYDVQKAAITGYSAAVGYHAPTYSAAITATDNLSVFSASYYHKVNSQVEAGSKATWNSKTGNTVGLEVA
TKYRIDPVSFVKGKINDRGVAAIAYNVLLREGVTLGVGASFDTQKLDQATHKVGTSFTFE
> POR1_YEAST
PPVYSDISRNINDLLNKDFYHATPAAFDVQTTTANGIKFSLKAKQPVKDGPLSTNVEAKLNDKQTGLGLTQGWSNTNNLQTKLEFANLTPGLKNELITSLTPGVAKSAVLN
TTFTEPFFTARGAFDLCLKSPTFVGDLTMAHEGIVGGAEFGYDISAGSISRYAMALSYFAKDYSLGATLNNEQITTVDFFQNVNAFLQVGAKATMNCKLPNSNVNIEFATR
YLPDASSQVKAKVSDSGIVTLAYKQLLRPGVTLGVGSSFDALKLSEPVHKLGWSLSF

ATAB Test Sequences

>143B_BOVIN cyt
TMDKSELVQKAKLAEQAERYDDMAAAMKAVTEQGHELSNEERNLLSVAYKNVVGARRSSW
RVISSIEQKTERNEKKQQMGKEYREKIEAELQDICNDVLQLLDKYLIPNATQPESKVFYL
KMKGDYFRYLSEVASGDNKQTTVSNSQQAYQEAFEISKKEMQPTHPIRLGLALNFSVFYY
EILNSPEKACSLAKTAFDEAIAELDTLNEESYKDSTLIMQLLRDNLTLWTSENQGDEGDA
GEGEN
> DBI5_MOUSE cyt
MSQVEFEMACASLKQLKGPVSDQEKLLVYSFYKQATQGDCNIPVPPATDVRAKAKYEAWM
VNKGMSKMDAMRIYIAKVEELKKKEPC
> RT09_HUMAN mitochondrial
MAAPCVSYGGAVSYRLLLWGRGSLARKQGLWKTAAPELQTNVRSQILRLRHTAFVIPKKN
VPTSKRETYTEDFIKKQIEEFNIGKRHLANMMGEDPETFTQEDIDRAIAYLFPSGLFEKR
ARPVMKHPEQIFPRQRAIQWGEDGRPFHYLFYTGKQSYYSLMHDVYGMLLTLEKHQSHWQ
AKSLLPEKTVTRDVIGSRWLIKEELEEMLVGKLSDLDYMQFIRLLEKLLTSQCGAAEEEF
VQRFRRSVTLESKKQLIEPVQYDEQGMAFSKSEGKRKTAKAEAIVYKHGSGRIKVNGIDY
QLYFPITQDREQLMFPFHFVDRLGKHDVTCTVSGGGRSAQAGAIRLAMAKALCSFVTEDE
VEWMRQAGLLTTDPRVRERKKPGQEGARRKFTWKKR
> gi|16554607|ref|NP_060611.2| (NM_018141) mitochondrial
MAARTAFGAVCRRLWQGLGNFSVNTSKGNTAKNGGLLLSTNMKWVQFSNLHVDVPKDLTK
PVVTISDEPDILYKRLSVLVKGHDKAVLDSYEYFAVLAAKELGISIKVHEPPRKIERFTL
LQSVHIYKKHRVQYEMRTLYRCLELEHLTGSTADVYLEYIQRNLPEGVAMEVTKTQLEQL
PEHIKEPIWETLSEEKEESKS

N-Glycosylation Example

Top of Form 1
> gi|9256644|ref|NP_061349.1| solute carrier family 1 [Mus musculus]
MEKSGETNGYLDGTQAEPAAGPRTPETAMGKSQRCASFFRRHALVLLTVSGVLVGAGMGAALRGLQLTRT
QITYLAFPGEMLLRMLRMIILPLVVCSLVSGAASLDASSLGRLGGIAVAYFGLTTLSASALAVALAFIIK
PGAGAQTLQSSSLGLENSGPPPVSKETVDSFLDLLRNLFPSNLVVAAFTTSATDYTVVTHNTSSGNVTKE
KIPVVTDVEGMNILGLVLFALVLGVALKKLGPEGEDLIRFFNSFNEATMVLVSWIMWYVPIGIMFLIGSK
IVEMKDIVMLVTSLGKYIFASMLGHVIHGGIVLPLVYFAFTRKNPFTFLLGLLTPFATAFATCSSSATLP
SMMKCIEENNGVDKRISRFILPIGATVNMDGAAIFQCVAAVFIAQLNNVDLNAGQIFTILVTATASSVGA
AGVPAGGVLTIAIILEAIGLPTHDLSLILAVDWIVDRTTTVVNVEGDALGAGILNHLNQKVVKKGEQELQ
EVKVEAIPNSKSEEETSPLVTHQNPAGPVAIAPELESKESVL
Bottom of Form 1

Phosphorylation Example
> seq2
ASQKRPSQRHGSKYLATASTMDHARHGFLPRHRDTGILDSIGRFFGGDRGAPK
NMYKDSHHPARTAHYGSLPQKSHGRTQDENPVVHFFKNIVTPRTPPPSQGKGR
KSAHKGFKGVDAQGTLSKIFKLGGRDSRSGSPMARRELVISLIVES

>A60D_DROME (SEG-test sequence)
MNALLRHKGRNLRTSHLAQNVYKRFLKSNCCACSSVNVTDEPAKEDELPRRSASTSVLEL
SRSLGTYRRFQPHANYGYDYSGYGFRHLHTSRTLLETSSSKIDATVKKLKNQQKEKVEEI
MKEVANGQAAAVRASSAATATASSEKGQNASATAGSTSATASTTSLAKTADKSVAKPKKP
LRTRIWDELVHYYHGFRLLFIDVAICSKLLWRVLNGKTLTRRENKQLQRTTSDLFRLIPF
SVFIIVPFMELLLPLFIKFFPGMLPSTFQTSTDRQEKLRQSLSVRLEVAKFLQQTLDQMP
VQHKEHSSEEAKQFEAFFTKIRNPTEPVSNDEIIKFAKRFDDEITLDSLSREQLAALCRV
LELNTIGTTTLLRFQLRLKLRSLATDDRVIAREGVDSLDLLELQQACKARGMRAYGLTEE
RLRFQLKEWIDLSLNEQVPPTLLLLSRTMLISDDSITTDKLKETIRVLPDAVGAHTRHAI
GESEGKVDNKTKIEIIKEEERKIREEREEEREETIAKRSAIKEEIPAPYVFAEKLSGSQD
LLDHKEQSSVSETDKGISSTDVQLLSEALKTLSSDKQLVVEKETIKELKEELADYKEDVE
ELREVRQVVKEPVRESRAAKLLYNRVNKMISQLDNVLNDLEARQHQIKQAESSDYAASSP
TVEPQQMVHIDELVATIRRMKEASDEERFKVVGDLLVKLDADKDGVISVNEITKAVQSID
REATNIDKKQLEEFTELLSKLASRRRHEEIVHIDDLMNNIKVLKETSDEARLKHIEAVLE
KFDADKDGVVTVNDIRKVLESIGRDNIKLSDKAIEELISLLDKEQVLQAEQKIEKAIAKS
MKEAEKLKSEVDKADKDLSKLVNDIHDSAKEIQDIANEMRDKEETVPDKAKELKAEPAFK
DTAKTLKDNAKDLDDLAKDPKSDPKSPTKASTGSGPAGLSGGGPSSGSSGIATGSTTESA
LREAAERQMEKILPSTDIGLPPTIQTPSQPPTSKKATATASTLSTTITAKKLL

CLUSTAL W (1.83) multiple sequence alignment

PORIN_HUMAN            MSWCNELRLPALKQHSIGRGLESHITMCIPPSYADLGKAARDIFNKG-FGFGLVKLDVKT
POR1_MOUSE_14_295      --------------------------MAVPPTYADLGKSARDVFTKG-YGFGLIKLDLKT
POR3_BOVIN_1_282       --------------------------MCNTPTYCDLGKAAKDVFNKG-YGFGMVKIDLRT
PORI_DROME_1_281       ---------------------------MAPPSYSDLGKQARDIFSKG-YNFGLWKLDLKT
PORI_NEUCR_1_282       ---------------------------MAVPAFSDIAKSANDLLNKDFYHLAAGTIEVKS
POR1_YEAST_3_281       -----------------------------PPVYSDISRNINDLLNKDFYHATPAAFDVQT
POR4_SOLTU_3_274       ----------------------------GPGLYTEIGKKARDLLYKD--YQSDHKFSITT
POR6_SOLTU_3_274       ----------------------------GPGLYSDIGKKARDLLYRD--YVSDHKFTVTT
POR1_ARATH_3_274       ----------------------------GPGLYTEIGKKARDLLYKD--HNSDQKFSITT
POR1_WHEAT_1_274       --------------------------MGGPGLYSGIGKKAKDLLYRD--YQTDHKFTLTT
                                                       :  :.:  .*:: :.        : : :

PORIN_HUMAN            KSCSGVEFSTSGSSNTDTGKVTGTLETKYKWCEYGLTFTEKWNTDNTLGTEIAIEDQICQ
POR1_MOUSE_14_295      KSENGLEFTSSGSANTETTKVNGSLETKYRWTEYGLTFTEKWNTDNTLGTEITVEDQLAR
POR3_BOVIN_1_282       KSCSGVEFSTSGHAYTDTGKASGNLETKYKICNYGLTFTQKWNTDNTLGTEISWENKLAE
PORI_DROME_1_281       KTSSGIEFNTAGHSNQESGKVFGSLETKYKVKDYGLTLTEKWNTDNTLFTEVAVQDQLLE
PORI_NEUCR_1_282       NTPNNVAFKVTGKS-THDKVTSGALEGKFTDKPNGLTVTQTWNTANALETKVEMADNLAK
POR1_YEAST_3_281       TTANGIKFSLKAKQPVKDGPLSTNVEAKLNDKQTGLGLTQGWSNTNNLQTKLEFAN-LTP
POR4_SOLTU_3_274       YSPTGVVITSSGSK--KGDLFLADVNTQLKNKN--VTTDIKVDTNSNLFTTITVDE-AAP
POR6_SOLTU_3_274       YSTTGVAITASGLK--KGELFLADVSTQLKNKN--ITTDVKVDTNSNVYTTITVDE-PAP
POR1_ARATH_3_274       FSPAGVAITSTGTK--KGDLLLGDVAFQSRRKN--ITTDLKVCTDSTFLITATVDE-AAP
POR1_WHEAT_1_274       YTANGPAITATSTK--KADLTVGEIQSQIKNKN--ITVDVKANSASNVITTITADDLAAP
                        :  .  :.  .    .       :  :       :       . . .       :   

PORIN_HUMAN            GLKLTFDTTFSPNTGKKSGKIKSSYKRECINLGCDVDFDFAGPAIHGSAVFGYEGWLAGY
POR1_MOUSE_14_295      GLKLTFDSSFSPNTGKKNAKIKTGYKREHINLGCDVDFDIAGPSIRGALVLGYEGWLAGY
POR3_BOVIN_1_282       GLKLTLDTIFVPNTGKKSGKLKASYKRDCFSLGSNVDIDFSGPTIYGWAVLAFEGWLAGY
PORI_DROME_1_281       GLKLSLEGNFAPQSGNKNGKFKVAYGHENVKADSDVNIDLKGPLINASAVLGYQGWLAGY
PORI_NEUCR_1_282       GLKAEGIFSFLPATNARGAKFNLHFKQSNFHGRAFFDL-LKGPTANIDAIVGHEGFLAGA
POR1_YEAST_3_281       GLKNELITSLTPGV-AKSAVLNTTFTEPFFTARGAFDLCLKSPTFVGDLTMAHEGIVGGA
POR4_SOLTU_3_274       GLKTILS---FRVPDQRSGKLEVQYLHDYAGICTSVGLTAN-PIVNFSGVVGTNIIALGT
POR6_SOLTU_3_274       GLKTIFS---FVVPDQKSGKVELQYLHEYAGINTSIGLTAS-PLVNFSGVAGNNTVALGT
POR1_ARATH_3_274       GLRSIFS---FKVPDQNSGKVELQYLHEYAGISTSMGLTQN-PTVNFSGVIGSNVLAVGT
POR1_WHEAT_1_274       GLKTILS---FAVPDQKSGKVELQYLHDYAGINASIGLTAN-PVVNLSGAFGTSALAVGA
                       **:             ... .:  : .        ..:    *        . .    *

PORIN_HUMAN            QMTFDSAKSKLTRNNFAVGYRTGDFQLHTNVN-DGTEFGGSIYQKVCEDLDTSVNLAWTS
POR1_MOUSE_14_295      QMNFETSKSRVTQSNFAVGYKTDEFQLHTNVN-DGTEFGGSIYQKVNKKLETAVNLAWTA
POR3_BOVIN_1_282       QMSFDTAKSKLSQNNFALGYKAADFQLHTHVN-DGTEFGGSIYQKVNEKIETSINLAWTA
PORI_DROME_1_281       QTAFDTQQSKLTTNNFALGYTTKDFVLHTAVN-DGQEFSGSIFQRTSDKLDVGVQLSWAS
PORI_NEUCR_1_282       SAGYDVQKAAITGYSAAVGYHAPTYSAAITATDNLSVFSASYYHKVNSQVEAGSKATWNS
POR1_YEAST_3_281       EFGYDISAGSISRYAMALSYFAKDYSLGATLN-NEQITTVDFFQNVNAFLQVGAKATMNC
POR4_SOLTU_3_274       DVSFDTKTGDFTKCNAGLSFTNADLVASLNLNNKGDNLTASYYHTVSPLTSTAVGAEVNH
POR6_SOLTU_3_274       DLSFDTATGNFTKCNAGLSFSSSDLIASLALNDKGDTVSASYYHTVKPVTNTAVGAELTH
POR1_ARATH_3_274       DVSFDTKSGNFTKINAGLSFTKEDLIASLTVNDKGDLLNASYYHIVNPLFNTAVGAEVSH
POR1_WHEAT_1_274       DVSLDTATKNFAKYNAALSYTNQDLIASLNLNNKGDSLTASYYHIVE-KSGTAVGAELTH
                       .   :     .:    .:.:           . .      . :: .     ..      

PORIN_HUMAN            GTNC--TRFGIAAKYQLDPTASISAKVNNSSLIGVGYTQTLRPGVKLTLSALVDGKSINA
POR1_MOUSE_14_295      GNSN--TRFGIAAKYQVDPDACFSAKVNNSSLIGLGYTQTLKPGIKLTLSALLDGKNVNA
POR3_BOVIN_1_282       GSNN--TRFGIAAKYKLDCRTSLSAKVNNASLIGLGYTQTLRPGVKLTLSALIDGKNFNA
PORI_DROME_1_281       GTSN--TKFAIGAKYQLDDDASVRAKVNNASQVGLGYQQKLRDGVTLTLSTLVDGKNFNA
PORI_NEUCR_1_282       KTGN-TVGLEVATKYRIDPVSFVKGKINDRGVAAIAYNVLLREGVTLGVGASFDTQKLDQ
POR1_YEAST_3_281       KLPNSNVNIEFATRYLPDASSQVKAKVSDSGIVTLAYKQLLRPGVTLGVGSSFDALKLSE
POR4_SOLTU_3_274       SFSTNENIITVGTQHRLDPLTSVKARINNFGKASALLQHEWRPKSLFTVSGEVDTKSVDK
POR6_SOLTU_3_274       SFSSNENTLTIGTQHLLDPLTTVKARVNSYGKASALIQHEWRPKSLFTISGEVDTRAIEK
POR1_ARATH_3_274       KLSSKDSTITVGTQHSLDPLTSVKARVNSAGIASALIQHEWKPKSFFTISGEVDTKSIDK
POR1_WHEAT_1_274       SFSSNENSLTFGTQHTLDPLTLVKARINNSGKASALIQHEFMPKSLCTISAEVDTKAIEK
                               : ..:::  *  : . .::.. .                 :.  .*   ..

PORIN_HUMAN            GGHKVGSPWSWRLNPAERNLWEWISEDLALIYFHCDQQQAFFPPEDDQNKG
POR1_MOUSE_14_295      GGHKLG---------------------LGLEFQ------------------
POR3_BOVIN_1_282       GGHKVG---------------------LGFELE------------------
PORI_DROME_1_281       GGHKIG---------------------VGLELE------------------
PORI_NEUCR_1_282       ATHKVG---------------------TSFTFE------------------
POR1_YEAST_3_281       PVHKLG---------------------WSLSF-------------------
POR4_SOLTU_3_274       -GAKFG---------------------LALALK------------------
POR6_SOLTU_3_274       -SAKIG---------------------LAVALK------------------
POR1_ARATH_3_274       -SAKVG---------------------LALALK------------------
POR1_WHEAT_1_274       -SSKVG---------------------IAIALK------------------
                          *.*                      ..                    

>gi|75677554|ref|NM_181395.1| Mus musculus peroxidasin homolog (Drosophila) (Pxdn), mRNA
GGACCGGAGGGCTCAGTTGGGAGCCGGCGGTGGACGCGCCCGCCGAGGCCTCCTCCGCTGCTGTTCACGC
GTGCCAGCTCCTGTCCGCGCCATCTGCCATGGCCGTGCGCCCCACGCGCCGCTGCCTGCTGGCGCTCCTG
CTGTGCTTTGCCTGGTGGGCCATGGCGGTGGTCGCCTCGAAGCAAGGGGCAGGCTGTCCAAGCCGCTGCC
TGTGTTTCCGTACCACCGTGCGCTGCATGCATCTGTTGCTGGAGGCCGTGCCCGCCGTGGCGCCGCAGAC
CTCCATCCTAGATCTTCGGTTCAACAGAATCAGAGAGATCCAACCCGGGGCATTCAGGAGGCTGAGGAGC
CTGAACACACTGCTTCTTAACAACAACCAGATCAAGAAGATCCCCAATGGTGCATTTGAGGACCTGGAGA
ACTTAAAATACCTCTATTTGTACAAGAATGAGATCCAATCAATTGACAGGCAAGCATTTAAGGGACTTGC
CTCTCTAGAGCAACTGTACCTGCACTTTAATCAGATAGAAACGCTGGACCCTGAATCCTTCCAGCACCTG
CCAAAGCTGGAGAGACTGTTTTTGCACAACAACCGTATCACGCACTTAGTTCCTGGGACGTTCAGTCAGT
TAGAGTCCATGAAACGGCTGCGATTGGACTCGAATGCACTCCACTGTGACTGTGAAATCCTATGGCTAGC
GGATCTACTGAAGACCTACGCCCAATCTGGAAACGCACAAGCAGCAGCTACATGCGAGTATCCCAGACGC
ATCCAAGGACGCTCTGTGGCTACCATCACCCCAGAAGAGCTGAACTGTGAAAGGCCCCGGATTACCTCAG
AGCCACAGGATGCAGATGTCACCTCAGGGAACACAGTGTACTTCACCTGCAGAGCTGAGGGCAACCCCAA
ACCTGAGATCATCTGGCTTCGAAACAATAACGAGTTGAGCATGAAGACGGACTCTCGCTTAAACTTGCTG
GACGATGGCACGCTGATGATTCAGAACACACAGGAGGCGGATGAGGGTGTCTACCAGTGCATGGCGAAAA
ATGTGGCTGGAGAGGCGAAAACGCAGGAGGTGACCCTCAGGTACTTGGGGTCTCCAGCCCGACCCACTTT
TGTAATCCAGCCGCAGAACACAGAGGTACTGGTGGGTGAGAGTGTCACTCTGGAGTGCAGTGCCACAGGC
CACCCTCTGCCTCAGATCACCTGGACAAGAGGTGACCGCACACCCTTGCCAATTGACCCTCGAGTGAATA
TCACTCCCTCTGGAGGACTGTATATACAGAACGTCGCCCAGAGTGACAGCGGCGAGTACACGTGCTTTGC
ATCCAATAGTGTGGACAGTATCCATGCCACAGCCTTCATCATTGTACAAGCCCTTCCTCAGTTCACTGTG
ACCCCACAGAGCCGAGTGGTCATTGAAGGGCAGACTGTGGATTTCCAGTGTGCGGCTAAGGGACACCCTC
AGCCTGTCATAGCCTGGACCAAGGGAGGGAGCCAGCTCTCAGTGGACAGGCGGCACCTGGTGCTGTCCTC
AGGAACACTCAGGATTTCTGGGGTGGCCCTGCATGACCAGGGCCAGTATGAGTGCCAGGCCGTCAATATC
ATTGGCTCCCAGAAGGTCGTGGCCCACCTGACAGTACAGCCTAGAGTCACCCCGGTATTTGCCAGCATTC
CCAGTGACATGACTGTAGAGGTGGGCACCAACGTGCAGCTGCCTTGTAGCTCCCAGGGAGAACAAGAGCC
AGCCATCACCTGGAACAAGGATGGTGTTCAGGTAACAGAAAGTGGAAAATTTCACATCAGCCCCGAAGGA
TTCTTGACCATCAATGATGTTGGCACTGCCGATGCAGGTCGCTATGAGTGTGTAGCTCGGAACACAATTG
GATATGCCTCTGTGAGCATGGTACTCAGTGTGAATGTTCCTGATGTAAGTCGGAATGGGGATCCCTATGT
TGCTACCTCTATTGTTGAAGCCATTGCAACTGTTGACAGAGCCATCAACTCTACAAGGACACACTTGTTT
GACAGCCGTCCTCGTTCTCCAAATGACCTGCTCGCTCTGTTCCGGTACCCACGGGATCCATACACAGTGG
GACAAGCCAGGGCAGGAGAGATATTCGAGCGGACCCTGCAGCTGATCCAGGAGCATGTTCAGCATGGCTT
GATGGTGGACTTGAATGGAACAAGTTACCACTACAATGATCTGGTGTCCCCGCAGTACCTGAGCCTCATC
GCCAACCTGTCAGGCTGCACTGCACACCGCCGCGTGAACAACTGCTCAGACATGTGCTTCCACCAGAAGT
ATAGGACGCACGATGGCACGTGCAACAATCTACAGCACCCGATGTGGGGTGCCTCACTGACCGCCTTTGA
GCGCCTGTTGAAGGCTGTGTATGAGAATGGGTTCAACACACCCCGGGGCATTAATTCCCAGCGTCAGTAC
AATGGGCATGTACTACCCATGCCCCGCCTGGTGTCCACCACACTGATTGGGACAGAGGTGATCACCCCCG
ATGAGCAGTTTACACACATGCTGATGCAATGGGGCCAGTTCCTTGACCATGACCTAGACTCTACAGTGGT
AGCCCTGAGCCAGGCCCGCTTCTCTGACGGCCAGCATTGCAGCTCTGTGTGCAGCAATGACCCTCCCTGT
TTCTCGGTCATGATCCCCCCCAATGATCCCCGGGTGCGGAGTGGCGCCCGATGCATGTTCTTCGTGCGAT
CGAGCCCCGTGTGTGGCAGCGGCATGACGTCCCTGCTCATGAACTCTGTGTACCCTCGAGAGCAGATCAA
CCAGCTCACCTCCTACATCGATGCCTCCAATGTGTACGGCAGCACAGACCACGAAGCCCGCAGCATCCGG
GACCTGGCCAGCCACCGTGGCCTGCTGCGTCAGGGCATTGTGCAGAGGTCTGGCAAGCCCCTGCTTCCCT
TTGCCACCGGGCCACCCACTGAGTGCATGCGCGATGAGAACGAGAGCCCGATACCATGCTTTCTGGCCGG
CGACCACCGTGCCAACGAGCAGCTTGGCCTGACCAGCATGCATACGCTGTGGTTCCGGGAGCACAACCGC
ATTGCAGCAGAGCTGTTGAAGCTAAACCCGCACTGGGATGGGGACACTGTCTACCATGAGACCCGCAAGA
TAGTCGGGGCAGAGATACAGCACATCACCTACCGGCACTGGCTGCCCAAGATCCTGGGGGAGGTGGGCAT
GAAGATGCTCGGTGAGTACCGGGGCTACGACCCCAGTGTCAATGCTGGCATCTTTAATGCCTTTGCCACT
GCAGCCTTCAGGTTCGGTCACACTCTGATCAACCCTCTGCTCTACCGGCTGGATGAGAACTTTGAGCCCA
TCCCTCAGGGCCATGTGCCCCTCCACAAAGCCTTCTTCTCGCCCTTCCGGATTGTCAACGAGGGGGGCAT
CGACCCACTTCTCCGAGGGCTGTTTGGAGTGGCAGGGAAGATGCGCATTCCCTCTCAATTGCTGAACACA
GAGCTCACGGAGAGGCTGTTCTCCATGGCCCACACAGTGGCCCTGGACCTGGCTGCCATCAATATCCAGC
GAGGCCGGGACCATGGCATCCCACCCTACCATGACTACAGAGTCTACTGCAACTTGTCGGCTGCTTACAC
CTTTGAGGACCTGAAAAATGAGATCAAGAGCCCTGTGATCCGGGAGAAACTGCAGAGGCTGTATGGCTCG
ACTCTCAACATTGATCTGTTCCCAGCCCTCATGGTAGAAGACCTAGTACCTGGCAGCCGCTTGGGGCCCA
CACTCATGTGCCTGCTCAGCACACAGTTCCGACGCCTGCGGGATGGAGACAGGTTGTGGTATGAGAACCC
AGGCGTGTTCTCCCCCGCCCAGCTGACTCAGCTCAAGCAGACGTCCCTGGCGAGGATCCTTTGTGACAAC
TCAGACAACATCACCCGTGTGCAGCAGGATGTGTTCAGGGTGGCAGAGTTCCCCCACGGTTATAGCAGCT
GTGAGGACATCCCCAGGGTGGACCTGCGAGTGTGGCAGGACTGTTGTGAAGATTGTAGGACCAGGGGACA
ATTCAATGCTTTCTCCTACCATTTCCGGGGAAGACGGTCTCTAGAATTTAGCTATGAGGACGATAAGCCC
ACAAAGAGAGCCAGGTGGCGGAAAGCACTAAGTGTAAAGCATGGCAAACATCTTAGCAATGCCACATCAG
CCACCCACGAGCACTTGGAAGGGCCAGCAACTAATGATCTCAAGGAATTTGTTCTGGAAATGCAAAAGAT
CATCACAGACCTCAGAAAACAGATAAACAGCTTGGAGTCTCGGCTCAGCACCACAGAATGTGTGGATGAC
AGCGGTGAATCTCACGGCGGCAACACAAAGTGGAAAAAAGACCCATGCACAGTTTGTGAGTGCAAAAATG
GCCAGATCACCTGCTTTGTGGAAGCTTGCCAGCCTGCAGCCTGCCCCCAGCCTGTGAAAGTGGAAGGCGC
TTGCTGTCCCGTCTGCTTAAAGAACACTGCAGAGGAAAAGCCTTAGTGTTCCCGAGCTTAGCTCCTCGAA
GCTGGTCCACAGAATACTTGTGAGCCTAGATGACAACATGGGGAGCTTCAGACCACAGGACAAGTTGGGA
TCTCAGACATTTCAGGACCTCTGCTGTGCCATCGCAGAAGCAGCCGGGGCTGCTTCACACCCTGTGTTTG
TAGAAGGAAATTGAGCAGGCGGGAGTGGGTGCAGGCTCTGGCCCTCACTTCATGTTAGACTTCTCAGGTT
TATATTTAAGTGTTTTTAAATGGAAAATTGGTGCTACTATTAAATCACACAGTTG

>gi|27924101|gb|BC044828.1| Mus musculus peroxidasin homolog (Drosophila), mRNA (cDNA clone IMAGE:5008265), partial cds
CAGGGCAGGAGAGATATTCGAGCGGACCCTGCAGCTGATCCAGGAGCATGTTCAGCATGGCTTGATGGTG
GACTTGAATGGGACAAGTCAGTACCATGTTCTTCCTCCTCTGACTTGAGTGCGAGGTAGCAGACACTCTC
TGCTGCATGTGCATCATGTTACCACTACAATGATCTGGTGTCCCCGCAGTACCTGAGCCTCATCGCCAAC
CTGTCAGGCTGCACTGCACACCGCCGCGTGAACAACTGCTCAGACATGTGCTTCCACCAGAAGTATAGGA
CGCACGATGGCACGTGCAACAATCTACAGCACCCGATGTGGGGTGCCTCACTGACCGCCTTTGAGCGCCT
GTTGAAGGCTGTGTATGAGAATGGGTTCAACACACCCCGGGGCATTAATTCCCAGCGTCAGTACAATGGG
CATGTACTACCCATGCCCCGCCTGGTATCCACCACACTGATTGGGACAGAGGTGATCACCCCCGATGAGC
AGTTTACACACATGCTGATGCAATGGGGCCAGTTCCTTGACCATGACCTAGACTCTACAGTGGTAGCCCT
GAGCCAGGCCCGCTTCTCTGACGGCCAGCATTGCAGCTCAGTGTGCAGCAATGACCCTCCCTGTTTCTCG
GTCATGATCCCCCCCAATGATCCCCGGGTGCGGAGTGGCGCCCGATGCATGTTCTTCGTGCGATCGAGCC
CCGTGTGTGGCAGCGGCATGACGTCCCTGCTCATGAACTCTGTGTACCCTCGAGAGCAGATCAACCAGCT
CACCTCCTACATCGATGCCTCCAATGTGTACGGCAGCACAGACCACGAAGCCCGCAGCATCCGGGACCTG
GCCAGCCACCGTGGCCTGCTGCGTCAGGGCATAGTGCAGAGGTCTGGCAAGCCCCTGCTTCCCTTTGCCA
CCGGGCCACCCACTGAGTGCATGCGCGATGAGAACGAGAGCCCGATACCATGCTTTCTGGCCGGCGACCA
CCGTGCCAACGAGCAGCTTGGCCTGACCAGCATGCATACGCTGTGGTTCCGGGAGCACAACCGCATTGCA
GCAGAGCTGTTGAAGCTAAACCCGCACTGGGATGGGGACACTGTCTACCATGAGACCCGCAAGATAGTCG
GGGCAGAGATACAGCACATCACCTACCGGCACTGGCTGCCCAAGATCCTGGGGGAGGTGGGCATGAAGAT
GCTCGGTGAGTACCGGGGCTACGACCCCAGTGTCAATGCTGGCATCTTTAATGCCTTTGCCACTGCAGCC
TTCAGGTTCGGTCACACTCTGATCAACCCTCTGCTCTACCGGCTGGATGAGAACTTTGAGCCCATCCCTC
AGGGCCATGTGCCCCTCCACAAAGCCTTCTTCTCGCCCTTCCGGATTGTCAACGAGGGGGGCATCGACCC
ACTTCTCCGAGGGCTGTTTGGAGTGGCAGGGAAGATGCGCATTCCCTCTCAATTGCTGAACACAGAGCTC
ACGGAGAGGCTGTTCTCCATGGCCCACACAGTGGCCCTGGACCTGGCTGCCATCAATATCCAGCGAGGCC
GGGACCATGGCATCCCACCCTACCATGACTACAGAGTCTACTGCAACTTGTCGGCTGCTTACACCTTTGA
GGACCTGAAAAATGAGATCAAGAGCCCTGTGATCCGGGAGAAACTGCAGAGGCTGTATGGCTCGACTCTC
AACATTGATCTGTTCCCAGCCCTCATGGTGGAAGACCTGGTACCTGGCAGCCGCTTGGGGCCCACACTCA
TGTGCCTGCTCAGCACACAGTTCCGACGCCTGCGGGATGGAGACAGGTTGTGGTATGAGAACCCAGGCGT
GTTCTCCCCCGCCCAGCTGACTCAGCTCAAGCAGACGTCCCTGGCGAGGATCCTTTGTGACAACTCAGAC
AACATCACCCGTGTGCAGCAGGATGTGTTCAGGGTGGCAGAGTTCCCCCACGGTTATAGCAGCTGTGAGG
ACATCCCCAGGGTGGATCTGCGAGTGTGGCAGGACTGTTGTGAAGATTGTAGGACCAGGGGACAATTCAA
TGCTTTCTCCTACCATTTCCGGGGAAGACGGTCTCTAGAATTTAGCTATGAGGACGATAAGCCCACAAAG
AGAGCCAGGTGGCGGAAAGCGCTAAGTGTAAAGCATGGCAAACATCTTAGCAATGCCACATCAGCCACCC
ACGAGCACTTGGAAGGGCCAGCAACTAATGATCTCAAGGAATTTGTTCTGGAAATGCAAAAGATCATCAC
AGACCTCAGAAAACAGATAAACAGCTTGGAGTCTCGGCTCAGCACCACAGAATGTGTGGATGACAGCGGT
GAATCTCACGGCGGCAACACAAAGTGGAAAAAAGACCCATGCACAGTTTGTGAGTGCAAAAATGGCCAGA
TCACCTGCTTTGTGGAAGCTTGCCAGCCTGCAGCCTGCCCCCAGCCTGTGAAAGTGGAAGGCGCTTGCTG
TCCCGTCTGCTTAAAGAACACTGCAGAGGAAAAGCCTTAGTGTTCCCGAGCTTAGCTCCTCGAAGCTGGT
CCACAGAATACTTGTGAGCCTAGATGACAACATGGGGAGCTTCAGACCACAGGACAAGTTGGGATCTCAG
ACATTTCAGGACCTCTGCTGTGCCATCGCAGAAGCAGCCGGGGCTGCTTCACACCCTGTGTTTGTAGAAG
GAAATTGAGCAGGCGGGAGTGGGTGCAGGCTCTGGCCCTCACTTCATGTTAGACTTCTCAGGTTTATATT
TAAGTGTTTTTAAATGGAAAATTGGTGCTACTATTAAATCACACAGTTGAGACATGGGTTTCGAAATTGC
TTCGGCCTGATGATGCCACTTCTGTTTAAGCGAGGCAGAAGGTCTTTGACAGGCTTCATTTAAACAGAAG
CATTTGGCAAATGGAACAGGATCTTATAAAATAATATCCTGGGCCAAAATCCCTACAGGGATACTAGGGC
TTTCTCCCACTGCTCCCTCATGTCTGCCATCTTCCCCTTCAGGGCCCTTCAGCGTCTAATAACAATACCA
ACACAAACCTGCACCTTGCGACCATGTCCACCCTTCTACACAGGACTGTCACATTATCAAGCCACAGTTA
GAAATAATCTTGTTTCCACTTAGAACATGTGTAAATACTGCCTTGCTGTAAATTGAATGTGAAATCCTTT
GGTCCTATGCCATGTTGAGTGACCCGAACAGCCCCTTGCATCGGCAGGATGGTCTGCAGCCACCACCTGC
CTTGGTTCCTTGGTACACCGGCCATGGTGTAGGGAGGGTGATTCACTCTCCAGGTCCTTTGAATAGGTGA
CAGACAGGCAGAGCCTGAATGTCAGGAAATTCCATTGTATGAATGCTTTAATGGAAAACGTTCCCTATAC
CTCATTATTTTTTATTGCATGATGTTTTATAAAAATTTGCCCATTTTAGATGTTAGAAAAATTATTTCCA
CTATGCAAATTTCTTTTTAATTCAGTGAAAAGCAACTGTTATACCTCATAGTCTCTTGTTTTTAATTGAC
CAAAATATTCCATTCTATTCTCACAAGGTTCTGAGGTCTCTGCCTGAAAAAGCAAGTCTCACCCTATAGA
CACTGATGTGCCCAGCACTACGTGCCAGCCATTGTGGGAACACAAGAGGGTCACCTGCCCAAGGGCTAGG
GAGGAAGACCTCAAGAACACAGAGGAGGTGGAAAAGGACAAAATAGGTGCCATTTTAGGGAGACCATGGT
CAGATAGGGGGATGCTCAGTTCCTTGCAGCCTCGGGCGGGCTTATGTCGGCACTAAGGCCATTGTGGTGT
GTACTTATATGATCCCTATGCTGATAGGATTACCTTCCTAGACATAGCTAGACGCAAAGCCACATGTGTA
AGGCTGCTGAGCAAAGACAGCATCCCAGCATGGGTGTGTTCACGGTGGATTCACCACGTTGCATAGTAAA
GTGGTCCCCTTGGCTTACCCTTCACTTTGCTCATGAGATTCAGAAGCTTGTGGTCCAGCAGGGGTGAGCA
TTTGTGAAATAGTAAGCTGAACTTAGTGGTGAGATTTCAGAACAGACTTCTGTGAAGTAAGAGATGTAAC
CATGCATCTAAAATCGGATGGCCGTGTAACTGCTCGGGCATAGAAATGGTGGGAGAACCTGTCCTGGGTA
CCTGGCATTTCACATGAGCCCAGGGATATGTCTTGTGCCAAGGCACACAAGTGTCCGTGGACTTGGACAG
GTGCCAAGGGTTTTTGTCTCTGTTCCTATGTGGGAGGCTGGCTGTGATTTACATTAATTTCTGTATTTCA
AACGAAGATGTCTGCAGATCTCCATTTTGATGTTACAGCCTCATTGCCCAGGCAGTGGGCAGTGGCCAGA
CACCCTTTCTGACTAGCCACTGCATTGGGCTTCTGTGATTCAAAGTAGTGTATATATTTATTTACTTCTC
TGACTGTGGCCAACAGCCAAATGCCATTTTATGTTCCTTGTATTCAGTCCATTACCAAAGAGGTGTTTGC
ACTTTGTAATGATACCTTTCAGTTCAAATAAAAGGACCACATCGTTAAGTGGAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

>gi|147905582|ref|NP_001081848.1| polysomal ribonuclease 1 [Xenopus laevis] Peroxidasin
MAPADLWLGVLLILASVQGTASSYYDGVEELDNNLILSCTKEAKHLVDTAYKNTRLMLKDRLRRRTVSAS
DLMAYFKQPVCSRNAIRAADYMGTTLQLLSHKLKPFHHRPFNITELLTETQIDAIYKLTGCAYQHLPSAC
QESPYRTITGQCNNRKNPILGASNTGFTRLLPVVYEDGLSVPRGWTENLPINGFPLPLARAVSNEIVRFP
NENLTLDEGRALIFMQWGQWTDHDLDLSPETPARSTFLEGIDCDTNCAKEPPCFPLKIPPNDPRISNQSD
CIPLFRSSPVCTPGSPVREQINILTSFIDGSQVYGSDWPLAVKLRNNTNQLGLMAINQRFTDNGLPFLPF
ETAEEDFCVLTNRSSGIPCFLGGDPRVSEQPGLTAFHTLFVRAHNNIAARLRELNPRWSGETLYQEARKI
IGGILQKITYKDWLPLLLGSEMAAVLPAYRSYNESVDPRVSNVFTVVFRMGHTLIQPFIYRLADGYRPLG
PEPQIPLHKTFFNSWRVVREGGIDPLLRGLMANRAKLNRQNQLVVDELRERLFVLFKRIGLDLTAINMQR
GREHGLPGYNAWRRFCGLSAPSNVNELAAVLNNRNLAEKFIKLYGSPENIDIWVGGVAESLVRNGRIGKL
LTCLIGNQFRRARDGDRFYYEQPSVFTNEQRASIERVTLARVICDNTKITEVPRNVFLGNRYPRDFVACS
RIPTLDLNPWKVA