GeneSeqer. Version of February 10, 2003.
Date run: Mon Feb 24 10:29:27 2003
(Bayesian) Splice site model (species): Zea mays
________________________________________________________________________________
Sequence 1: 21326110, from 7800 to 11800, both strands analyzed.
********************************************************************************
Query protein sequence 13 (File: 18496651)
1 DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
61 MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
121 ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKSDG
181 PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RMIWSALGHL NDKEDAPSQL KIVGVQATGG
241 MIAGAVTSCV STPLDTIKTR LQVNQNKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
301 GTSMIVCYEY LKRVCAKVEE A-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 7969 7800 ( 170 n); Protein 1 54 ( 54 aa); score: 0.123
MATCH 21326110- 18496651 0.123 170 0.176 P
PGS_21326110-_18496651 (7969 7800)
Alignment:
TCCACCAATC CGCGACTCCC GAGGTGAGAA CCGAAGGGGG CGCAGGAGGC GTGGCGTGGA 7910
S T N P R L P R * E P K G A Q E A W R G
. | + | + | . | + . .
D T S T R A A K - I P S L P Q Q T E I N 19
CTCGATTCGG CAAGAATGGC GGGCGGGCGG GCTGCTGTGG TTGGCTTCCC CCCGTTTTCC 7850
L D S A R M A G G R A A V V G F P P F S
| + | . + | | | | |
W D N L D M T - - K L Y V V G A G M F S 37
CTCCCCACTT TCGTGCTGTT TT-TGTCCAC AACAACTCAG CGAGAATGGC G 7800
L P T F V L F V H N N S A R M A
| . | + | . . | |
C V T V A L Y P V S V I K T R M Q 54
********************************************************************************
Query protein sequence 14 (File: 12278522)
1 DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
61 MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
121 ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKADG
181 PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RVIWSALGRL DDKEDTPSQL KIVGVQATGG
241 MVAGAVTSCV STPLDTIKTR LQVNINKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
301 GTSMIVCYEY LKRVCAKVEE A-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 7969 7800 ( 170 n); Protein 1 54 ( 54 aa); score: 0.123
MATCH 21326110- 12278522 0.123 170 0.176 P
PGS_21326110-_12278522 (7969 7800)
Alignment:
TCCACCAATC CGCGACTCCC GAGGTGAGAA CCGAAGGGGG CGCAGGAGGC GTGGCGTGGA 7910
S T N P R L P R * E P K G A Q E A W R G
. | + | + | . | + . .
D T S T R A A K - I P S L P Q Q T E I N 19
CTCGATTCGG CAAGAATGGC GGGCGGGCGG GCTGCTGTGG TTGGCTTCCC CCCGTTTTCC 7850
L D S A R M A G G R A A V V G F P P F S
| + | . + | | | | |
W D N L D M T - - K L Y V V G A G M F S 37
CTCCCCACTT TCGTGCTGTT TT-TGTCCAC AACAACTCAG CGAGAATGGC G 7800
L P T F V L F V H N N S A R M A
| . | + | . . | |
C V T V A L Y P V S V I K T R M Q 54
********************************************************************************
Query protein sequence 18 (File: 13365793)
1 AAAAAAETSE ASTAGLALAE ANINWQRRIL RSDGIPGAFR GFGTSAVGAL PGRVFALTSL
61 EVSKEMAFKY SEHFDMSEAS RIAVANGIAG LVSSIFSSAY FVPLDVICQR LMAQGLPGMA
121 TYRGPFDVIS KVVRTEGLRG LYRGFGITML TQSPASALWW SSYGGAQHAI WRSLGYGIDS
181 QKKPSQSELV VVQATAGTIA GACSSIITTP IDTIKTRLQV MDNYGRGRPS VMKTTRVLLE
241 EDGWRGFYRG FGPRFLNMSL WGTSMIVTYE LIKRLSVKPE -
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 8399 8307 ( 93 n); Protein 1 32 ( 32 aa); score: 0.174
Intron 1 8306 7807 ( 500 n); Pd: 0.647 Pa: 0.000
Exon 2 7806 7801 ( 6 n); Protein 33 34 ( 2 aa); score: 0.583
MATCH 21326110- 13365793 0.174 99 0.117 P
PGS_21326110-_13365793 (8399 8307,7806 7801)
Alignment:
CGCCACGGTG ACGCCGCTGA ACATGCCTGC GCCCACCACG TAGAGCTTGG TCTTGTCGAG 8340
R H G D A A E H A C A H H V E L G L V E
. | | | + | . | . | . |
A A A A A A E T S E A S T A G L A L A E 20
GCT---GCAA CATTTCAGCG CCACGATTTG AGAACAGTAA AAGAATCGGA AACAACGGAA 8283
A A T F Q R H D L R T
| . + | | . | | +
A N I N W Q R R I L R S .... .......... .......... 32
TAAGCTACGA ATCCCCAAAT TGCGCTTCGT CCAGGAGCAT GAACAGCCAT GTCAAATCCT 8223
.......... .......... .......... .......... .......... .......... 32
CAAACAACAG TTCCTAATCC TAAGGCGGTA AAGCCAATCC GGAATAGGGC GGGGGACACA 8163
.......... .......... .......... .......... .......... .......... 32
GTTCAACGAT CGGAGCTAAA TTTCTAGAAG ACTAGGACCG CGAGAAAGGC GGAAATCGGC 8103
.......... .......... .......... .......... .......... .......... 32
ACGAAATGGG AACCTAAGGA TTACAAGACC GGTGGGTTGC TTACTTGTCC CAGTTGATCT 8043
.......... .......... .......... .......... .......... .......... 32
CCGTCTGGTG GAGCGACGGG ATCTTGGCGG CCCTAGAGGT TGTATCCATG GCCGCGCCGC 7983
.......... .......... .......... .......... .......... .......... 32
TCAAATCCTC CCCTCCACCA ATCCGCGACT CCCGAGGTGA GAACCGAAGG GGGCGCAGGA 7923
.......... .......... .......... .......... .......... .......... 32
GGCGTGGCGT GGACTCGATT CGGCAAGAAT GGCGGGCGGG CGGGCTGCTG TGGTTGGCTT 7863
.......... .......... .......... .......... .......... .......... 32
CCCCCCGTTT TCCCTCCCCA CTTTCGTGCT GTTTTTGTCC ACAACAACTC AGCGAGAATG 7803
N
+
.......... .......... .......... .......... .......... ...... D 33
GC 7801
G
|
G 34
********************************************************************************
Query protein sequence 10 (File: 21594326)
1 SLGALMEEKR RATTSSSSSQ VHMSNDIDWQ MLDKSRFFFL GAALFSGVST ALYPIVVLKT
61 RQQVSPTRVS CANISLAIAR LEGLKGFYKG FGTSLLGTIP ARALYMTALE ITKSSVGQAT
121 VRLGLSDTTS LAVANGAAGL TSAVAAQTVW TPIDIVSQRL MVQGDVSLSK HLPGVMNSCR
181 YRNGFDAFRK ILYTDGPRGF YRGFGISILT YAPSNAVWWA SYSLAQKSIW SRYKHSYNHK
241 EDAGGSVVVQ ALSSATASGC SALVTMPVDT IKTRLQVLDA EENGRRRAMT VMQSVKSLMK
301 EGGVGACYRG LGPRWVAMSM SATTMITTYE FLKRLATKKQ K-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7964 8058 ( 95 n); Protein 1 31 ( 31 aa); score: 0.110
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8749 ( 411 n); Protein 32 165 ( 134 aa); score: 0.385
Intron 2 8750 8848 ( 99 n); Pd: 0.000 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 166 231 ( 66 aa); score: 0.553
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9844 ( 147 n); Protein 232 278 ( 47 aa); score: 0.349
Intron 4 9845 10200 ( 356 n); Pd: 0.446 Pa: 0.975
Exon 5 10201 10207 ( 7 n); Protein 279 281 ( 3 aa); score: -0.467
Intron 5 10208 10515 ( 308 n); Pd: 0.994 Pa: 0.987
Exon 6 10516 10669 ( 154 n); Protein 282 332 ( 51 aa); score: 0.295
Intron 6 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 7 11330 11361 ( 32 n); Protein 333 341 ( 9 aa); score: 0.377
MATCH 21326110+ 21594326 0.374 1043 1.017 P
PGS_21326110+_21594326 (7964 8058,8339 8749,8849 9045,9698 9844,10201 10207,10516 10669,11330 11361)
Alignment:
GGTGGAGGGG AGGATTTGAG CGGCGCGGCC ATGGATACAA CCTCTAGGGC CGCCAAGATC 8023
G G G E D L S G A A M D T T S R A A K I
. | + . | | | + + .
S L G A L M E E K R R A T T S S S S S Q 20
CCGTCGCTCC ACCAGACGGA GATCAACTGG GACAAGTAAG CAACCCACCG GTCTTGTAAT 8083
P S L H Q T E I N W D N
+ . . + | + | .
V H M - S N D I D W Q M..... .......... .......... 31
CCTTAGGTTC CCATTTCGTG CCGATTTCCG CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT 8143
.......... .......... .......... .......... .......... .......... 31
TTAGCTCCGA TCGTTGAACT GTGTCCCCCG CCCTATTCCG GATTGGCTTT ACCGCCTTAG 8203
.......... .......... .......... .......... .......... .......... 31
GATTAGGAAC TGTTGTTTGA GGATTTGACA TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA 8263
.......... .......... .......... .......... .......... .......... 31
TTTGGGGATT CGTAGCTTAT TCCGTTGTTT CCGATTCTTT TACTGTTCTC AAATCGTGGC 8323
.......... .......... .......... .......... .......... .......... 31
GCTGAAATGT TGCAGCCTCG ACAAGACCAA GCTCTACGTG GTGGGCGCAG GCATGTTCAG 8383
L D K T K L Y V V G A G M F S
| | | + + . + + | | . + | |
.......... ..... L D K S R F F F L G A A L F S 46
CGGCGTCACC GTGGCGCTGT ATCCTGTCTC GGTGGTCAAG ACCCGGATGC AGGTTGCCTC 8443
G V T V A L Y P V S V V K T R M Q V A S
| | + . | | | | + | + | | | . | | +
G V S T A L Y P I V V L K T R Q Q V S P 66
TGGGGACGCC ATGAGGAGGA ACGCGCTGGC TACCTTCAAG AACATCCTCA AGATGGACGG 8503
G D A M R R N A L A T F K N I L K M D G
. | + + . | + + + |
T R V S C A N - I - S L A - I A R L E G 83
CGTGCCAGGG CTGTACCGGG GGTTTGCTAC CGTTATCATT GGGGCTGTAC CAACTAGGAT 8563
V P G L Y R G F A T V I I G A V P T R I
+ | . | + | | . | + + | . + | . |
L K G F Y K G F G T S L L G T I P A R A 103
CATCTTCCTC ACAGCGCTTG AGACAACCAA AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT 8623
I F L T A L E T T K A A S L K L V E P F
+ + + | | | | | | + + + . .
L Y M T A L E I T K S S V G Q A T V R L 123
CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT TGCCAATGGC CTTGCTGGTC TGTCAGCGTC 8683
K L S E P V R A A F A N G L A G L S A S
| | + . | | | | | | | + + +
G L S D T T S L A V A N G A A G L T S A 143
TACATGTTCG CAGGCTATTT TTGTTCCAAT TGATGTGGTA T-GCCTCTCA TGTGCCTTCT 8742
T C S Q A I F V P I D V V P L M C L L
. . + | . + + . | | | + | + +
V A A Q T V W T P I D I V S Q R L M V Q 163
ATGTGATGTT GTATAGAGAA AAAATATCTT ACAATATGTT GATGTTAAAT GCTAATTACA 8802
C D
|
G D ... .......... .......... .......... .......... .......... 165
ATACTAGACT ACTGTTTTCA TTCTGTTGTG CATTGGAATG TTTCAGATTA GCCAGAAATT 8862
I S Q K L
+ | .
.......... .......... .......... .......... ...... V S L S K 170
GATGGTTCAA GGATATTCTG GTAATGCCAG ATACAAAGGT GGATTAGATG TTGCTCGAAA 8922
M V Q G Y S G N A R Y K G G L D V A R K
+ | . + . | | + . | . | . | |
H L P G V M N S C R Y R N G F D A F R K 190
GGTCATAAAG GCTGATGGCA TTAGGGGGCT GTACAGAGGA TTTGGACTGT CTGTTATGAC 8982
V I K A D G I R G L Y R G F G L S V M T
+ + . | | | | . | | | | | + | + + |
I L Y T D G P R G F Y R G F G I S I L T 210
CTATGCTCCA TCCAGTGCTG TGTGGTGGGC AAGTTATGGT TCCAGCCAGC GCATAATTTG 9042
Y A P S S A V W W A S Y G S S Q R I I W
| | | | + | | | | | | | . + | + | |
Y A P S N A V W W A S Y S L A Q K S I W 230
GAGGTTAGCT TATCTGATTG GTTCATCGTT ATGTTCCTCT CAGCCCTGTG TACTATGTAA 9102
S
|
S....... .......... .......... .......... .......... .......... 231
TATTTACGAG AAAAAGACCA GTAATACATT TCTACTTAAT AGTTATTTGA ATTGGTACTT 9162
.......... .......... .......... .......... .......... .......... 231
TCCATCTGTC CAAAACCTTT TCAAACTTCC CCTCTTGATG CTCAAACTGC AGCTATAATT 9222
.......... .......... .......... .......... .......... .......... 231
GCAATTTTGT TTTCTGATGC TTGTTCTTCC ATGTCAATAT GTACATATCT TTTTTAGAAA 9282
.......... .......... .......... .......... .......... .......... 231
ACAAGAATGC ATCTCAATGC ATGTGCTGTA TTGTTTTGAT TAGATTTATC ATAGCGATCA 9342
.......... .......... .......... .......... .......... .......... 231
ATCACATTTT CTTTACAGAT AAAAATAGTC GGAAGGATAA GTTGGATAAC TGACCAAAGT 9402
.......... .......... .......... .......... .......... .......... 231
GGAAATATGA TCTTACATAT TTTTATCTCT GGCAGCTTAG AGAACTTAAT TACCAACCTG 9462
.......... .......... .......... .......... .......... .......... 231
AAACAATGTG ATGAAGTAAC TACACAAAAC CACATATAGT TTCATGCACT CTGCAAAACT 9522
.......... .......... .......... .......... .......... .......... 231
AAATTGAAAC TCTTAGTGTG CTCTTAATGC TGTTAAGAGG GTGTATGCAA GTTTACTGGA 9582
.......... .......... .......... .......... .......... .......... 231
ATCAGTACCT TTTGTTAGTT TATTTCTTTG TGGTTGATGG TTGAAAGATT ATATTTCTTG 9642
.......... .......... .......... .......... .......... .......... 231
TCTTGATAAC TTAGCCAAAA TAGTTAACTA TTGTGCTTTT TACATATTGG AACAGTGCTC 9702
A
.......... .......... .......... .......... .......... ..... R 232
TTGGCCATTT GCATGACAAA GAAGAGGCTC CTAGCCAATT GAAACTAGTT GGTGTTCAAG 9762
L G H L H D K E E A P S Q L K L V G V Q
| + + + | . . + | | |
Y K H S Y N H K E D A G G - S V V - V Q 250
CATCAGGGGG GGTTTTTGCC GGTGCCGTGA CCTCTTTTGT TACGACTCCC ATAGATACAA 9822
A S G G V F A G A V T S F V T T P I D T
| . . . | . . + + . | | | + | |
A L S S A T A S G C S A L V T M P V D T 270
TAAAGACCAG GCTGCAGGTA CTGTGTGACA TTCTGTTTGC TGATTACTCT TGTAATTTGA 9882
I K T R L Q V L
| | | | | | | |
I K T R L Q V L........ .......... .......... .......... 278
TTTGTGTGGG TATATTTTGT GAGGCTTACC CTTGTGACTT AATGATTCTT GTCTTTACAT 9942
.......... .......... .......... .......... .......... .......... 278
TTATGCTGCT CATTTGCAAT AATTTGATTC CTTATCAATG CAATGCCACT AAGTTTAGGG 10002
.......... .......... .......... .......... .......... .......... 278
GAATGGATAT TTTGTTTTGG AAGTATATTT GATGTCAGAC TTGAAGACCT AAATGTTCTT 10062
.......... .......... .......... .......... .......... .......... 278
TTATACTGAT ATTTCCTCCA ATGGCGGGCT ATTGAGGTGC TGGACTGGAA TGCTGTCTAT 10122
.......... .......... .......... .......... .......... .......... 278
ATTAAACAAT ATATACTTCT ATGTTTACAG CTGTTTGTTT TCTGCTGACA TACCATGACC 10182
.......... .......... .......... .......... .......... .......... 278
AATTTGTCAT GGTTTCAGT- --TATGAGGT CAGAAAAAAA GAAACTTCCA TTGGGAAAAC 10239
Y E
|
.......... ........ D A E .. .......... .......... .......... 281
TTGATATCTA TTACTTCATT ATTTATAGTG AGTAACAAAA GTTAGCACTT TCAAACTGAC 10299
.......... .......... .......... .......... .......... .......... 281
TAAAGTATGC CAGGGACGTA TCATGCATTT TACAACATGC TCCACATATC TCCAAATATC 10359
.......... .......... .......... .......... .......... .......... 281
ACATATTACG CTTGTAGTGG TAAACTGATA ATACATCTAC CAACACTGAA AGTTCTCACA 10419
.......... .......... .......... .......... .......... .......... 281
AGTCAGAACC CTATATTTGA CAGTTGTGGT CTCCCTCCTT CCCTCTGCAT TTGTTGCTAC 10479
.......... .......... .......... .......... .......... .......... 281
AGATGATTAC ACTGAGTTTT GTTTCTTGTC ATTTAGGTTA TGGATAATGA AAATAAGCCA 10539
V M D N E N K P
. . .
.......... .......... .......... ...... E N G R R R A M 289
AAAGCCAGGG AAGTTGTCAA AAGATTGATT GCTGAAGATG GATGGAAAGG TTTGTACAGA 10599
K A R E V V K R L I A E D G W K G L Y R
. + | | | + | | . | |
T V M Q S V K S L M K E G G V G A C Y R 309
GGGTTGGGTC CAAGATTTTT CAGCTCATCA GCTTGGGGAA CCTCAATGAT AGTATGCTAC 10659
G L G P R F F S S S A W G T S M I V C Y
| | | | | + + | . | + | | . |
G L G P R W V A M S M S A T T M I T T Y 329
GAGTACCTGA GTATGTTTCG TCTTCCCTTG TCAAATGTAC ACATGCATAT GTAGTGTTAT 10719
E Y L
| + |
E F L .......... .......... .......... .......... .......... 332
ATATCACTGC ATCCCATGCA GGTTAATTTT AAGTACCCAG ATACTTCTTC TCATTTAGAA 10779
.......... .......... .......... .......... .......... .......... 332
TTTAGTTAAA ATGACATCAT TCAGGTCAGT TGGCATCTCC AGTACACTGC TTTTGTAAGT 10839
.......... .......... .......... .......... .......... .......... 332
TGTATCATAA ATCCCATTTG CAATGAAATT TTTGACTCAA GTTGCAGCCT GTAACTTTTC 10899
.......... .......... .......... .......... .......... .......... 332
TATATTTTTC GAATAAAGCT ATCACCGTAC ATGAAACCTG CTTCTGTTAA TGCCAAGGAG 10959
.......... .......... .......... .......... .......... .......... 332
CGCACATTAT TTCCTGTAGA CCGGCTTGGA TGTTGAACAA TTGGCACATG CAAGTAGCAA 11019
.......... .......... .......... .......... .......... .......... 332
AGAGCAGCCT TGTGCTTGCA ACAATCTGGT CCACCTGTGG ATATGTTCGC TGTGAAAGAA 11079
.......... .......... .......... .......... .......... .......... 332
ACCAATTAGT CCTTGTATGA AACATGGTAT TAGCGCTTCA TGAATAAAAC CACTGATTCT 11139
.......... .......... .......... .......... .......... .......... 332
GATTTCTTAT TTTCAATGAA TGGATGGGCA TTACCAAAGT TATCATGATT AAAGATCTAT 11199
.......... .......... .......... .......... .......... .......... 332
TTCATATAAG TTTATTTTTA TACATTAGAG TTTATTTAGA GAACAAGGTA TATTTAGTTT 11259
.......... .......... .......... .......... .......... .......... 332
TGGTAATTTT GTGAACTGCA CTCAGACGAC TTTGGTATTC TTACTGTAAT TTTGTTTTGT 11319
.......... .......... .......... .......... .......... .......... 332
TTTCCTACAG AGCGCTTGTG TGCTAAAGTT GAAGAGGTCT GA 11361
K R L C A K V E E V *
| | | . . | + +
.......... K R L A T K - K Q K * 342
********************************************************************************
Query protein sequence 9 (File: 23308305)
1 NLGAAEEESA QEIHLPADIN WEMLDKSKFF VLGAALFSGV SGALYPAVLM KTRQQVCHSQ
61 GSCIKTAFTL VRHEGLRGLY RGFGTSLMGT IPARALYMTA LEVTKSNVGS AAVSLGLTEA
121 KAAAVANAVG GLSAAMAAQL VWTPVDVVSQ RLMVQGSAGL VNASRCNYVN GFDAFRKIVR
181 ADGPKGLYRG FGISILTYAP SNAVWWASYS VAQRMVWGGI GCYVCKKDEE SGNNSTTMKP
241 DSKTIMAVQG VSAAIAGSVS ALITMPLDTI KTRLQVLDGE DSSNNGKRGP SIGQTVRNLV
301 REGGWTACYR GLGPRCASMS MSATTMITTY EFLKRLSAKN HDGFYSKS-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7994 8058 ( 65 n); Protein 1 23 ( 23 aa); score: 0.076
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 24 147 ( 124 aa); score: 0.419
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 148 218 ( 71 aa); score: 0.536
Intron 3 9046 9437 ( 392 n); Pd: 0.462 Pa: 0.346
Exon 4 9438 9477 ( 40 n); Protein 219 229 ( 11 aa); score: 0.084
Intron 4 9478 9697 ( 220 n); Pd: 0.924 Pa: 0.863
Exon 5 9698 9844 ( 147 n); Protein 230 277 ( 48 aa); score: 0.386
Intron 5 9845 10200 ( 356 n); Pd: 0.446 Pa: 0.975
Exon 6 10201 10207 ( 7 n); Protein 278 280 ( 3 aa); score: -0.471
Intron 6 10208 10515 ( 308 n); Pd: 0.994 Pa: 0.987
Exon 7 10516 10669 ( 154 n); Protein 281 333 ( 53 aa); score: 0.330
Intron 7 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 8 11330 11378 ( 49 n); Protein 334 348 ( 15 aa); score: 0.207
MATCH 21326110+ 23308305 0.400 1041 0.994 P
PGS_21326110+_23308305 (7994 8058,8339 8720,8849 9045,9438 9477,9698 9844,10201 10207,10516 10669,11330 11378)
Alignment:
ATGGATACAA CCTCTAGGGC CGCCAAGATC CCGTCGCTCC AC---CAGAC GGAGATCAAC 8050
M D T T S R A A K I P S L H Q T E I N
. + . . . + | . + | |
N L G A A E E E S A Q E I H L P A D I N 20
TGGGACAAGT AAGCAACCCA CCGGTCTTGT AATCCTTAGG TTCCCATTTC GTGCCGATTT 8110
W D N
| +
W E M.. .......... .......... .......... .......... .......... 23
CCGCCTTTCT CGCGGTCCTA GTCTTCTAGA AATTTAGCTC CGATCGTTGA ACTGTGTCCC 8170
.......... .......... .......... .......... .......... .......... 23
CCGCCCTATT CCGGATTGGC TTTACCGCCT TAGGATTAGG AACTGTTGTT TGAGGATTTG 8230
.......... .......... .......... .......... .......... .......... 23
ACATGGCTGT TCATGCTCCT GGACGAAGCG CAATTTGGGG ATTCGTAGCT TATTCCGTTG 8290
.......... .......... .......... .......... .......... .......... 23
TTTCCGATTC TTTTACTGTT CTCAAATCGT GGCGCTGAAA TGTTGCAGCC TCGACAAGAC 8350
L D K T
| | | +
.......... .......... .......... .......... ........ L D K S 27
CAAGCTCTAC GTGGTGGGCG CAGGCATGTT CAGCGGCGTC ACCGTGGCGC TGTATCCTGT 8410
K L Y V V G A G M F S G V T V A L Y P V
| . + | + | | . + | | | | + | | | | .
K F F V L G A A L F S G V S G A L Y P A 47
CTCGGTGGTC AAGACCCGGA TGCAGGTTGC CTCTGGGGAC GCCATGAGGA GGAACGCGCT 8470
S V V K T R M Q V A S G D A M R R N A L
+ + | | | . | | . . . . . .
V L M K T R Q Q V C H S Q G S C I K T - 66
GGCTACCTTC AAGAACATCC TCAAGATGGA CGGCGTGCCA GGGCTGTACC GGGGGTTTGC 8530
A T F K N I L K M D G V P G L Y R G F A
| | . + + + + | + | | | | | | .
A - F - T L V R H E G L R G L Y R G F G 84
TACCGTTATC ATTGGGGCTG TACCAACTAG GATCATCTTC CTCACAGCGC TTGAGACAAC 8590
T V I I G A V P T R I I F L T A L E T T
| + + | . + | . | + + + | | | | . |
T S L M G T I P A R A L Y M T A L E V T 104
CAAAGCAGCC TCGCTTAAGC TTGTTGAGCC CTTCAAGCTG TCAGAGCCGG TGCGGGCTGC 8650
K A A S L K L V E P F K L S E P V R A A
| + . . . | + | | |
K S N V G S A A V S L G L T E A K A A A 124
CTTTGCCAAT GGCCTTGCTG GTCTGTCAGC GTCTACATGT TCGCAGGCTA TTTTTGTTCC 8710
F A N G L A G L S A S T C S Q A I F V P
| | . + . | | | | + . + | + + . |
V A N A V G G L S A A M A A Q L V W T P 144
AATTGATGTG GTATGCCTCT CATGTGCCTT CTATGTGATG TTGTATAGAG AAAAAATATC 8770
I D V
+ | |
V D V .......... .......... .......... .......... .......... 147
TTACAATATG TTGATGTTAA ATGCTAATTA CAATACTAGA CTACTGTTTT CATTCTGTTG 8830
.......... .......... .......... .......... .......... .......... 147
TGCATTGGAA TGTTTCAGAT TAGCCAGAAA TTGATGGTTC AAGGATATTC TGGT------ 8884
I S Q K L M V Q G Y S G
+ | | + | | | | | + |
.......... ........ V S Q R L M V Q G S A G L V 161
AATGCC---A GA------TA CAAAGGTGGA TTAGATGTTG CTCGAAAGGT CATAAAGGCT 8935
N A R Y K G G L D V A R K V I K A
| | | | . | . | . | | + + + |
N A S R C N Y V N G F D A F R K I V R A 181
GATGGCATTA GGGGGCTGTA CAGAGGATTT GGACTGTCTG TTATGACCTA TGCTCCATCC 8995
D G I R G L Y R G F G L S V M T Y A P S
| | + | | | | | | | + | + + | | | | |
D G P K G L Y R G F G I S I L T Y A P S 201
AGTGCTGTGT GGTGGGCAAG TTATGGTTCC AGCCAGCGCA TAATTTGGAG GTTAGCTTAT 9055
S A V W W A S Y G S S Q R I I W S
+ | | | | | | | . + | | + + | .
N A V W W A S Y S V A Q R M V W G .......... 218
CTGATTGGTT CATCGTTATG TTCCTCTCAG CCCTGTGTAC TATGTAATAT TTACGAGAAA 9115
.......... .......... .......... .......... .......... .......... 218
AAGACCAGTA ATACATTTCT ACTTAATAGT TATTTGAATT GGTACTTTCC ATCTGTCCAA 9175
.......... .......... .......... .......... .......... .......... 218
AACCTTTTCA AACTTCCCCT CTTGATGCTC AAACTGCAGC TATAATTGCA ATTTTGTTTT 9235
.......... .......... .......... .......... .......... .......... 218
CTGATGCTTG TTCTTCCATG TCAATATGTA CATATCTTTT TTAGAAAACA AGAATGCATC 9295
.......... .......... .......... .......... .......... .......... 218
TCAATGCATG TGCTGTATTG TTTTGATTAG ATTTATCATA GCGATCAATC ACATTTTCTT 9355
.......... .......... .......... .......... .......... .......... 218
TACAGATAAA AATAGTCGGA AGGATAAGTT GGATAACTGA CCAAAGTGGA AATATGATCT 9415
.......... .......... .......... .......... .......... .......... 218
TACATATTTT TATCTCTGGC AGCTTAGAGA ACTTAATTAC CAACCTGAAA CAATGTGATG 9475
L E N L I T N L K Q C D
. . | + |
.......... .......... .. G I G C Y V - C K K - D 228
AAGTAACTAC ACAAAACCAC ATATAGTTTC ATGCACTCTG CAAAACTAAA TTGAAACTCT 9535
E
|
E ........ .......... .......... .......... .......... .......... 229
TAGTGTGCTC TTAATGCTGT TAAGAGGGTG TATGCAAGTT TACTGGAATC AGTACCTTTT 9595
.......... .......... .......... .......... .......... .......... 229
GTTAGTTTAT TTCTTTGTGG TTGATGGTTG AAAGATTATA TTTCTTGTCT TGATAACTTA 9655
.......... .......... .......... .......... .......... .......... 229
GCCAAAATAG TTAACTATTG TGCTTTTTAC ATATTGGAAC AGTGCTCT-T -GGCCATTTG 9713
C S G H L
| | +
.......... .......... .......... .......... .. E S - G N - 233
CATGACAAAG AAGAGGCTCC TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG 9773
H D K E E A P S Q L K L V G V Q A S G G
+ . | . . + + . | | . . .
N S T T M K P D S K T I M A V Q G V S A 253
GTTTTTGCCG GTGCCGTGAC CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG 9833
V F A G A V T S F V T T P I D T I K T R
. . | | + | + + . + | | + | | | | | |
A I A G S V S A L I T M P L D T I K T R 273
CTGCAGGTAC TGTGTGACAT TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT 9893
L Q V L
| | | |
L Q V L......... .......... .......... .......... .......... 277
ATATTTTGTG AGGCTTACCC TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC 9953
.......... .......... .......... .......... .......... .......... 277
ATTTGCAATA ATTTGATTCC TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT 10013
.......... .......... .......... .......... .......... .......... 277
TTGTTTTGGA AGTATATTTG ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA 10073
.......... .......... .......... .......... .......... .......... 277
TTTCCTCCAA TGGCGGGCTA TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA 10133
.......... .......... .......... .......... .......... .......... 277
TATACTTCTA TGTTTACAGC TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG 10193
.......... .......... .......... .......... .......... .......... 277
GTTTCAGT-- -TATGAGGTC AGAAAAAAAG AAACTTCCAT TGGGAAAACT TGATATCTAT 10250
Y E
|
....... D G E ... .......... .......... .......... .......... 280
TACTTCATTA TTTATAGTGA GTAACAAAAG TTAGCACTTT CAAACTGACT AAAGTATGCC 10310
.......... .......... .......... .......... .......... .......... 280
AGGGACGTAT CATGCATTTT ACAACATGCT CCACATATCT CCAAATATCA CATATTACGC 10370
.......... .......... .......... .......... .......... .......... 280
TTGTAGTGGT AAACTGATAA TACATCTACC AACACTGAAA GTTCTCACAA GTCAGAACCC 10430
.......... .......... .......... .......... .......... .......... 280
TATATTTGAC AGTTGTGGTC TCCCTCCTTC CCTCTGCATT TGTTGCTACA GATGATTACA 10490
.......... .......... .......... .......... .......... .......... 280
CTGAGTTTTG TTTCTTGTCA TTTAGGTTAT GGATAATGAA AATAAG---- --CCAAAAGC 10544
V M D N E N K P K A
. | . . | | .
.......... .......... ..... D S S N N G K R G P S I 292
CAGGGAAGTT GTCAAAAGAT TGATTGCTGA AGATGGATGG AAAGGTTTGT ACAGAGGGTT 10604
R E V V K R L I A E D G W K G L Y R G L
+ . | + . | + | | | . | | | |
G Q T V R N L V R E G G W T A C Y R G L 312
GGGTCCAAGA TTTTTCAGCT CATCAGCTTG GGGAACCTCA ATGATAGTAT GCTACGAGTA 10664
G P R F F S S S A W G T S M I V C Y E Y
| | | | | . | + | | . | | +
G P R C A S M S M S A T T M I T T Y E F 332
CCTGAGTATG TTTCGTCTTC CCTTGTCAAA TGTACACATG CATATGTAGT GTTATATATC 10724
L
|
L ..... .......... .......... .......... .......... .......... 333
ACTGCATCCC ATGCAGGTTA ATTTTAAGTA CCCAGATACT TCTTCTCATT TAGAATTTAG 10784
.......... .......... .......... .......... .......... .......... 333
TTAAAATGAC ATCATTCAGG TCAGTTGGCA TCTCCAGTAC ACTGCTTTTG TAAGTTGTAT 10844
.......... .......... .......... .......... .......... .......... 333
CATAAATCCC ATTTGCAATG AAATTTTTGA CTCAAGTTGC AGCCTGTAAC TTTTCTATAT 10904
.......... .......... .......... .......... .......... .......... 333
TTTTCGAATA AAGCTATCAC CGTACATGAA ACCTGCTTCT GTTAATGCCA AGGAGCGCAC 10964
.......... .......... .......... .......... .......... .......... 333
ATTATTTCCT GTAGACCGGC TTGGATGTTG AACAATTGGC ACATGCAAGT AGCAAAGAGC 11024
.......... .......... .......... .......... .......... .......... 333
AGCCTTGTGC TTGCAACAAT CTGGTCCACC TGTGGATATG TTCGCTGTGA AAGAAACCAA 11084
.......... .......... .......... .......... .......... .......... 333
TTAGTCCTTG TATGAAACAT GGTATTAGCG CTTCATGAAT AAAACCACTG ATTCTGATTT 11144
.......... .......... .......... .......... .......... .......... 333
CTTATTTTCA ATGAATGGAT GGGCATTACC AAAGTTATCA TGATTAAAGA TCTATTTCAT 11204
.......... .......... .......... .......... .......... .......... 333
ATAAGTTTAT TTTTATACAT TAGAGTTTAT TTAGAGAACA AGGTATATTT AGTTTTGGTA 11264
.......... .......... .......... .......... .......... .......... 333
ATTTTGTGAA CTGCACTCAG ACGACTTTGG TATTCTTACT GTAATTTTGT TTTGTTTTCC 11324
.......... .......... .......... .......... .......... .......... 333
TACAGAGCGC TTGTGTGCTA AAGTTGAAG- AGGTCTGATT TCTGAGCTGC CTTAA 11378
K R L C A K V E G L I S E L P *
| | | | | . | . | +
.....K R L S A K N H D G F Y S K - S * 349
********************************************************************************
Query protein sequence 1 (File: 21326111)
1 DTTSRAAKIP SLHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVVKT RMQVASGDAM
61 RRNALATFKN ILKMDGVPGL YRGFATVIIG AVPTRIIFLT ALETTKAASL KLVEPFKLSE
121 PVRAAFANGL AGLSASTCSQ AIFVPIDVIS QKLMVQGYSG NARYKGGLDV ARKVIKADGI
181 RGLYRGFGLS VMTYAPSSAV WWASYGSSQR IIWSALGHLH DKEEAPSQLK LVGVQASGGV
241 FAGAVTSFVT TPIDTIKTRL QVMDNENKPK AREVVKRLIA EDGWKGLYRG LGPRFFSSSA
301 WGTSMIVCYE YLKRLCAKVE EV-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7997 8058 ( 62 n); Protein 1 21 ( 21 aa); score: 1.000
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 22 148 ( 127 aa); score: 1.000
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 149 214 ( 66 aa); score: 1.000
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 215 261 ( 47 aa); score: 1.000
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 262 312 ( 51 aa); score: 1.000
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 313 322 ( 10 aa); score: 1.000
MATCH 21326110+ 21326111 1.000 969 1.000 P
PGS_21326110+_21326111 (7997 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTCCACC AGACGGAGAT CAACTGGGAC 8056
D T T S R A A K I P S L H Q T E I N W D
| | | | | | | | | | | | | | | | | | | |
D T T S R A A K I P S L H Q T E I N W D 20
AAGTAAGCAA CCCACCGGTC TTGTAATCCT TAGGTTCCCA TTTCGTGCCG ATTTCCGCCT 8116
N
|
N........ .......... .......... .......... .......... .......... 21
TTCTCGCGGT CCTAGTCTTC TAGAAATTTA GCTCCGATCG TTGAACTGTG TCCCCCGCCC 8176
.......... .......... .......... .......... .......... .......... 21
TATTCCGGAT TGGCTTTACC GCCTTAGGAT TAGGAACTGT TGTTTGAGGA TTTGACATGG 8236
.......... .......... .......... .......... .......... .......... 21
CTGTTCATGC TCCTGGACGA AGCGCAATTT GGGGATTCGT AGCTTATTCC GTTGTTTCCG 8296
.......... .......... .......... .......... .......... .......... 21
ATTCTTTTAC TGTTCTCAAA TCGTGGCGCT GAAATGTTGC AGCCTCGACA AGACCAAGCT 8356
L D K T K L
| | | | | |
.......... .......... .......... .......... .. L D K T K L 27
CTACGTGGTG GGCGCAGGCA TGTTCAGCGG CGTCACCGTG GCGCTGTATC CTGTCTCGGT 8416
Y V V G A G M F S G V T V A L Y P V S V
| | | | | | | | | | | | | | | | | | | |
Y V V G A G M F S G V T V A L Y P V S V 47
GGTCAAGACC CGGATGCAGG TTGCCTCTGG GGACGCCATG AGGAGGAACG CGCTGGCTAC 8476
V K T R M Q V A S G D A M R R N A L A T
| | | | | | | | | | | | | | | | | | | |
V K T R M Q V A S G D A M R R N A L A T 67
CTTCAAGAAC ATCCTCAAGA TGGACGGCGT GCCAGGGCTG TACCGGGGGT TTGCTACCGT 8536
F K N I L K M D G V P G L Y R G F A T V
| | | | | | | | | | | | | | | | | | | |
F K N I L K M D G V P G L Y R G F A T V 87
TATCATTGGG GCTGTACCAA CTAGGATCAT CTTCCTCACA GCGCTTGAGA CAACCAAAGC 8596
I I G A V P T R I I F L T A L E T T K A
| | | | | | | | | | | | | | | | | | | |
I I G A V P T R I I F L T A L E T T K A 107
AGCCTCGCTT AAGCTTGTTG AGCCCTTCAA GCTGTCAGAG CCGGTGCGGG CTGCCTTTGC 8656
A S L K L V E P F K L S E P V R A A F A
| | | | | | | | | | | | | | | | | | | |
A S L K L V E P F K L S E P V R A A F A 127
CAATGGCCTT GCTGGTCTGT CAGCGTCTAC ATGTTCGCAG GCTATTTTTG TTCCAATTGA 8716
N G L A G L S A S T C S Q A I F V P I D
| | | | | | | | | | | | | | | | | | | |
N G L A G L S A S T C S Q A I F V P I D 147
TGTGGTATGC CTCTCATGTG CCTTCTATGT GATGTTGTAT AGAGAAAAAA TATCTTACAA 8776
V
|
V ...... .......... .......... .......... .......... .......... 148
TATGTTGATG TTAAATGCTA ATTACAATAC TAGACTACTG TTTTCATTCT GTTGTGCATT 8836
.......... .......... .......... .......... .......... .......... 148
GGAATGTTTC AGATTAGCCA GAAATTGATG GTTCAAGGAT ATTCTGGTAA TGCCAGATAC 8896
I S Q K L M V Q G Y S G N A R Y
| | | | | | | | | | | | | | | |
.......... .. I S Q K L M V Q G Y S G N A R Y 164
AAAGGTGGAT TAGATGTTGC TCGAAAGGTC ATAAAGGCTG ATGGCATTAG GGGGCTGTAC 8956
K G G L D V A R K V I K A D G I R G L Y
| | | | | | | | | | | | | | | | | | | |
K G G L D V A R K V I K A D G I R G L Y 184
AGAGGATTTG GACTGTCTGT TATGACCTAT GCTCCATCCA GTGCTGTGTG GTGGGCAAGT 9016
R G F G L S V M T Y A P S S A V W W A S
| | | | | | | | | | | | | | | | | | | |
R G F G L S V M T Y A P S S A V W W A S 204
TATGGTTCCA GCCAGCGCAT AATTTGGAGG TTAGCTTATC TGATTGGTTC ATCGTTATGT 9076
Y G S S Q R I I W S
| | | | | | | | | |
Y G S S Q R I I W S. .......... .......... .......... 214
TCCTCTCAGC CCTGTGTACT ATGTAATATT TACGAGAAAA AGACCAGTAA TACATTTCTA 9136
.......... .......... .......... .......... .......... .......... 214
CTTAATAGTT ATTTGAATTG GTACTTTCCA TCTGTCCAAA ACCTTTTCAA ACTTCCCCTC 9196
.......... .......... .......... .......... .......... .......... 214
TTGATGCTCA AACTGCAGCT ATAATTGCAA TTTTGTTTTC TGATGCTTGT TCTTCCATGT 9256
.......... .......... .......... .......... .......... .......... 214
CAATATGTAC ATATCTTTTT TAGAAAACAA GAATGCATCT CAATGCATGT GCTGTATTGT 9316
.......... .......... .......... .......... .......... .......... 214
TTTGATTAGA TTTATCATAG CGATCAATCA CATTTTCTTT ACAGATAAAA ATAGTCGGAA 9376
.......... .......... .......... .......... .......... .......... 214
GGATAAGTTG GATAACTGAC CAAAGTGGAA ATATGATCTT ACATATTTTT ATCTCTGGCA 9436
.......... .......... .......... .......... .......... .......... 214
GCTTAGAGAA CTTAATTACC AACCTGAAAC AATGTGATGA AGTAACTACA CAAAACCACA 9496
.......... .......... .......... .......... .......... .......... 214
TATAGTTTCA TGCACTCTGC AAAACTAAAT TGAAACTCTT AGTGTGCTCT TAATGCTGTT 9556
.......... .......... .......... .......... .......... .......... 214
AAGAGGGTGT ATGCAAGTTT ACTGGAATCA GTACCTTTTG TTAGTTTATT TCTTTGTGGT 9616
.......... .......... .......... .......... .......... .......... 214
TGATGGTTGA AAGATTATAT TTCTTGTCTT GATAACTTAG CCAAAATAGT TAACTATTGT 9676
.......... .......... .......... .......... .......... .......... 214
GCTTTTTACA TATTGGAACA GTGCTCTTGG CCATTTGCAT GACAAAGAAG AGGCTCCTAG 9736
A L G H L H D K E E A P S
| | | | | | | | | | | | |
.......... .......... . A L G H L H D K E E A P S 227
CCAATTGAAA CTAGTTGGTG TTCAAGCATC AGGGGGGGTT TTTGCCGGTG CCGTGACCTC 9796
Q L K L V G V Q A S G G V F A G A V T S
| | | | | | | | | | | | | | | | | | | |
Q L K L V G V Q A S G G V F A G A V T S 247
TTTTGTTACG ACTCCCATAG ATACAATAAA GACCAGGCTG CAGGTACTGT GTGACATTCT 9856
F V T T P I D T I K T R L Q
| | | | | | | | | | | | | |
F V T T P I D T I K T R L Q ....... .......... 261
GTTTGCTGAT TACTCTTGTA ATTTGATTTG TGTGGGTATA TTTTGTGAGG CTTACCCTTG 9916
.......... .......... .......... .......... .......... .......... 261
TGACTTAATG ATTCTTGTCT TTACATTTAT GCTGCTCATT TGCAATAATT TGATTCCTTA 9976
.......... .......... .......... .......... .......... .......... 261
TCAATGCAAT GCCACTAAGT TTAGGGGAAT GGATATTTTG TTTTGGAAGT ATATTTGATG 10036
.......... .......... .......... .......... .......... .......... 261
TCAGACTTGA AGACCTAAAT GTTCTTTTAT ACTGATATTT CCTCCAATGG CGGGCTATTG 10096
.......... .......... .......... .......... .......... .......... 261
AGGTGCTGGA CTGGAATGCT GTCTATATTA AACAATATAT ACTTCTATGT TTACAGCTGT 10156
.......... .......... .......... .......... .......... .......... 261
TTGTTTTCTG CTGACATACC ATGACCAATT TGTCATGGTT TCAGTTATGA GGTCAGAAAA 10216
.......... .......... .......... .......... .......... .......... 261
AAAGAAACTT CCATTGGGAA AACTTGATAT CTATTACTTC ATTATTTATA GTGAGTAACA 10276
.......... .......... .......... .......... .......... .......... 261
AAAGTTAGCA CTTTCAAACT GACTAAAGTA TGCCAGGGAC GTATCATGCA TTTTACAACA 10336
.......... .......... .......... .......... .......... .......... 261
TGCTCCACAT ATCTCCAAAT ATCACATATT ACGCTTGTAG TGGTAAACTG ATAATACATC 10396
.......... .......... .......... .......... .......... .......... 261
TACCAACACT GAAAGTTCTC ACAAGTCAGA ACCCTATATT TGACAGTTGT GGTCTCCCTC 10456
.......... .......... .......... .......... .......... .......... 261
CTTCCCTCTG CATTTGTTGC TACAGATGAT TACACTGAGT TTTGTTTCTT GTCATTTAGG 10516
.......... .......... .......... .......... .......... ......... 261
TTATGGATAA TGAAAATAAG CCAAAAGCCA GGGAAGTTGT CAAAAGATTG ATTGCTGAAG 10576
V M D N E N K P K A R E V V K R L I A E
| | | | | | | | | | | | | | | | | | | |
V M D N E N K P K A R E V V K R L I A E 281
ATGGATGGAA AGGTTTGTAC AGAGGGTTGG GTCCAAGATT TTTCAGCTCA TCAGCTTGGG 10636
D G W K G L Y R G L G P R F F S S S A W
| | | | | | | | | | | | | | | | | | | |
D G W K G L Y R G L G P R F F S S S A W 301
GAACCTCAAT GATAGTATGC TACGAGTACC TGAGTATGTT TCGTCTTCCC TTGTCAAATG 10696
G T S M I V C Y E Y L
| | | | | | | | | | |
G T S M I V C Y E Y L ....... .......... .......... 312
TACACATGCA TATGTAGTGT TATATATCAC TGCATCCCAT GCAGGTTAAT TTTAAGTACC 10756
.......... .......... .......... .......... .......... .......... 312
CAGATACTTC TTCTCATTTA GAATTTAGTT AAAATGACAT CATTCAGGTC AGTTGGCATC 10816
.......... .......... .......... .......... .......... .......... 312
TCCAGTACAC TGCTTTTGTA AGTTGTATCA TAAATCCCAT TTGCAATGAA ATTTTTGACT 10876
.......... .......... .......... .......... .......... .......... 312
CAAGTTGCAG CCTGTAACTT TTCTATATTT TTCGAATAAA GCTATCACCG TACATGAAAC 10936
.......... .......... .......... .......... .......... .......... 312
CTGCTTCTGT TAATGCCAAG GAGCGCACAT TATTTCCTGT AGACCGGCTT GGATGTTGAA 10996
.......... .......... .......... .......... .......... .......... 312
CAATTGGCAC ATGCAAGTAG CAAAGAGCAG CCTTGTGCTT GCAACAATCT GGTCCACCTG 11056
.......... .......... .......... .......... .......... .......... 312
TGGATATGTT CGCTGTGAAA GAAACCAATT AGTCCTTGTA TGAAACATGG TATTAGCGCT 11116
.......... .......... .......... .......... .......... .......... 312
TCATGAATAA AACCACTGAT TCTGATTTCT TATTTTCAAT GAATGGATGG GCATTACCAA 11176
.......... .......... .......... .......... .......... .......... 312
AGTTATCATG ATTAAAGATC TATTTCATAT AAGTTTATTT TTATACATTA GAGTTTATTT 11236
.......... .......... .......... .......... .......... .......... 312
AGAGAACAAG GTATATTTAG TTTTGGTAAT TTTGTGAACT GCACTCAGAC GACTTTGGTA 11296
.......... .......... .......... .......... .......... .......... 312
TTCTTACTGT AATTTTGTTT TGTTTTCCTA CAGAGCGCTT GTGTGCTAAA GTTGAAGAGG 11356
K R L C A K V E E
| | | | | | | | |
.......... .......... .......... ...K R L C A K V E E 321
TCTGA 11361
V *
|
V * 323
********************************************************************************
Query protein sequence 2 (File: 12061241)
1 DTTTRAKIPS LHHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVIKT RMQVATGEAV
61 RRNAAATFRN ILKVDGVPGL YRGFGTVITG AIPARIIFLT ALETTKAASL KLVEPFKLSE
121 PVQAAFANGL GGLSASLCSQ AVFVPIDVVS QKLMVQGYSG HVRYKGGLDV AQQIIKADGI
181 RGLYRGFGLS VMTYSPSSAV WWASYGSSQR IIWSAFDRWN DKESSPSQLT IVGVQATGGI
241 IAGAVTSCVT TPIDTIKTRL QVNQNKPKAM EVVRRLIAED GWKGFYRGLG PRFFSSSAWG
301 TSMIVCYEYL KRLCAKVEEV -
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7997 8058 ( 62 n); Protein 1 21 ( 21 aa); score: 0.750
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 22 148 ( 127 aa); score: 0.910
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 149 214 ( 66 aa); score: 0.932
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 215 261 ( 47 aa); score: 0.706
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 262 310 ( 49 aa); score: 0.863
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 311 320 ( 10 aa); score: 1.000
MATCH 21326110+ 12061241 0.864 969 1.006 P
PGS_21326110+_12061241 (7997 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG 8053
D T T S R A A K I P S L H Q T E I N W
| | | + | | | | | | | | | | | | | |
D T T T R A - K I P S L H H Q T E I N W 19
GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG 8113
D N
| |
D N..... .......... .......... .......... .......... .......... 21
CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG 8173
.......... .......... .......... .......... .......... .......... 21
CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA 8233
.......... .......... .......... .......... .......... .......... 21
TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT 8293
.......... .......... .......... .......... .......... .......... 21
CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA 8353
L D K T K
| | | | |
.......... .......... .......... .......... ..... L D K T K 26
GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC 8413
L Y V V G A G M F S G V T V A L Y P V S
| | | | | | | | | | | | | | | | | | | |
L Y V V G A G M F S G V T V A L Y P V S 46
GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC 8473
V V K T R M Q V A S G D A M R R N A L A
| + | | | | | | | + | + | + | | | | |
V I K T R M Q V A T G E A V R R N A A A 66
TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC 8533
T F K N I L K M D G V P G L Y R G F A T
| | + | | | | + | | | | | | | | | | . |
T F R N I L K V D G V P G L Y R G F G T 86
CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA 8593
V I I G A V P T R I I F L T A L E T T K
| | | | + | . | | | | | | | | | | | |
V I T G A I P A R I I F L T A L E T T K 106
AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT 8653
A A S L K L V E P F K L S E P V R A A F
| | | | | | | | | | | | | | | | + | | |
A A S L K L V E P F K L S E P V Q A A F 126
TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT 8713
A N G L A G L S A S T C S Q A I F V P I
| | | | . | | | | | | | | | + | | | |
A N G L G G L S A S L C S Q A V F V P I 146
TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA 8773
D V
| |
D V ... .......... .......... .......... .......... .......... 148
CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC 8833
.......... .......... .......... .......... .......... .......... 148
ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA 8893
I S Q K L M V Q G Y S G N A R
+ | | | | | | | | | | | + . |
.......... ..... V S Q K L M V Q G Y S G H V R 163
TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG 8953
Y K G G L D V A R K V I K A D G I R G L
| | | | | | | | + + + | | | | | | | | |
Y K G G L D V A Q Q I I K A D G I R G L 183
TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA 9013
Y R G F G L S V M T Y A P S S A V W W A
| | | | | | | | | | | + | | | | | | | |
Y R G F G L S V M T Y S P S S A V W W A 203
AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA 9073
S Y G S S Q R I I W S
| | | | | | | | | | |
S Y G S S Q R I I W S........ .......... .......... 214
TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT 9133
.......... .......... .......... .......... .......... .......... 214
CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC 9193
.......... .......... .......... .......... .......... .......... 214
CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA 9253
.......... .......... .......... .......... .......... .......... 214
TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT 9313
.......... .......... .......... .......... .......... .......... 214
TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG 9373
.......... .......... .......... .......... .......... .......... 214
GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG 9433
.......... .......... .......... .......... .......... .......... 214
GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC 9493
.......... .......... .......... .......... .......... .......... 214
ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT 9553
.......... .......... .......... .......... .......... .......... 214
GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT 9613
.......... .......... .......... .......... .......... .......... 214
GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT 9673
.......... .......... .......... .......... .......... .......... 214
TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC 9733
A L G H L H D K E E A P
| . . + | | | . + |
.......... .......... .... A F D R W N D K E S S P 226
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| | | + | | | | | + | | + . | | | | |
S Q L T I V G V Q A T G G I I A G A V T 246
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| | | | | | | | | | | | | |
S C V T T P I D T I K T R L Q .... .......... 261
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 261
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 261
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 261
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 261
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 261
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 261
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 261
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 261
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 261
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 261
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 261
AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG 10573
V M D N E N K P K A R E V V K R L I A
| | + | | | | | | | | + | | | |
.. V - - N Q N K P K A M E V V R R L I A 278
AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT 10633
E D G W K G L Y R G L G P R F F S S S A
| | | | | | . | | | | | | | | | | | | |
E D G W K G F Y R G L G P R F F S S S A 298
GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA 10693
W G T S M I V C Y E Y L
| | | | | | | | | | | |
W G T S M I V C Y E Y L .... .......... .......... 310
ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT 10753
.......... .......... .......... .......... .......... .......... 310
ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC 10813
.......... .......... .......... .......... .......... .......... 310
ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG 10873
.......... .......... .......... .......... .......... .......... 310
ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA 10933
.......... .......... .......... .......... .......... .......... 310
AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT 10993
.......... .......... .......... .......... .......... .......... 310
GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC 11053
.......... .......... .......... .......... .......... .......... 310
CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC 11113
.......... .......... .......... .......... .......... .......... 310
GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC 11173
.......... .......... .......... .......... .......... .......... 310
CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA 11233
.......... .......... .......... .......... .......... .......... 310
TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG 11293
.......... .......... .......... .......... .......... .......... 310
GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG 11353
K R L C A K V E
| | | | | | | |
.......... .......... .......... ......K R L C A K V E 318
AGGTCTGA 11361
E V *
| |
E V * 321
********************************************************************************
Query protein sequence 3 (File: 18496651)
1 DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
61 MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
121 ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKSDG
181 PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RMIWSALGHL NDKEDAPSQL KIVGVQATGG
241 MIAGAVTSCV STPLDTIKTR LQVNQNKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
301 GTSMIVCYEY LKRVCAKVEE A-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7997 8058 ( 62 n); Protein 1 22 ( 22 aa); score: 0.754
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 23 149 ( 127 aa); score: 0.846
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 150 215 ( 66 aa); score: 0.854
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 216 262 ( 47 aa); score: 0.835
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 263 311 ( 49 aa); score: 0.866
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 312 321 ( 10 aa); score: 0.857
MATCH 21326110+ 18496651 0.843 969 1.003 P
PGS_21326110+_18496651 (7997 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG 8053
D T T S R A A K I P S L H Q T E I N W
| | + + | | | | | | | | . | | | | | |
D T S T R A A K I P S L P Q Q T E I N W 20
GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG 8113
D N
| |
D N..... .......... .......... .......... .......... .......... 22
CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG 8173
.......... .......... .......... .......... .......... .......... 22
CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA 8233
.......... .......... .......... .......... .......... .......... 22
TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT 8293
.......... .......... .......... .......... .......... .......... 22
CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA 8353
L D K T K
| | | |
.......... .......... .......... .......... ..... L D M T K 27
GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC 8413
L Y V V G A G M F S G V T V A L Y P V S
| | | | | | | | | | | | | | | | | | |
L Y V V G A G M F S C V T V A L Y P V S 47
GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC 8473
V V K T R M Q V A S G D A M R R N A L A
| + | | | | | | | | | + | | | | | | | |
V I K T R M Q V A S G E A M R R N A L A 67
TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC 8533
T F K N I L K M D G V P G L Y R G F A T
| | | | | | | + | | | | | | | | | | . |
T F K N I L K V D G V P G L Y R G F G T 87
CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA 8593
V I I G A V P T R I I F L T A L E T T K
| | | | + | . | | | | | | | | | | |
V I T G A I P A R I I F L T A L E K T K 107
AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT 8653
A A S L K L V E P F K L S E P V R A A F
| . | | | | | | | . + | | | + . | | .
A T S L K L V E P L Q L S E S M E A A L 127
TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT 8713
A N G L A G L S A S T C S Q A I F V P I
| | | | . | | + | | | | | | + | | | |
A N G L G G L T A S L C S Q A V F V P I 147
TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA 8773
D V
| |
D V ... .......... .......... .......... .......... .......... 149
CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC 8833
.......... .......... .......... .......... .......... .......... 149
ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA 8893
I S Q K L M V Q G Y S G N A R
+ | | | | | | | | | | | + . |
.......... ..... V S Q K L M V Q G Y S G H V R 164
TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG 8953
Y K G G L D V A R K V I K A D G I R G L
| | | | + | | . + | + + | + | | | | |
Y K G G I D V V Q K I M K S D G P R G L 184
TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA 9013
Y R G F G L S V M T Y A P S S A V W W A
| | | | | | | | | | | | | | | | | | | |
Y R G F G L S V M T Y A P S S A V W W A 204
AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA 9073
S Y G S S Q R I I W S
| | | | | | + | | |
S Y G F S Q R M I W S........ .......... .......... 215
TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT 9133
.......... .......... .......... .......... .......... .......... 215
CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC 9193
.......... .......... .......... .......... .......... .......... 215
CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA 9253
.......... .......... .......... .......... .......... .......... 215
TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT 9313
.......... .......... .......... .......... .......... .......... 215
TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG 9373
.......... .......... .......... .......... .......... .......... 215
GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG 9433
.......... .......... .......... .......... .......... .......... 215
GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC 9493
.......... .......... .......... .......... .......... .......... 215
ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT 9553
.......... .......... .......... .......... .......... .......... 215
GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT 9613
.......... .......... .......... .......... .......... .......... 215
GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT 9673
.......... .......... .......... .......... .......... .......... 215
TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC 9733
A L G H L H D K E E A P
| | | | | + | | | + | |
.......... .......... .... A L G H L N D K E D A P 227
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| | | | + | | | | | + | | + . | | | | |
S Q L K I V G V Q A T G G M I A G A V T 247
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| | + | | + | | | | | | | |
S C V S T P L D T I K T R L Q .... .......... 262
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 262
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 262
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 262
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 262
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 262
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 262
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 262
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 262
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 262
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 262
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 262
AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG 10573
V M D N E N K P K A R E V V K R L I A
| | + | | | | | | | | + | | | |
.. V - - N Q N K P K A S E V V R R L I A 279
AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT 10633
E D G W K G L Y R G L G P R F F S S S A
| | | | | | . | | | | | | | | | | | | |
E D G W K G F Y R G L G P R F F S S S A 299
GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA 10693
W G T S M I V C Y E Y L
| | | | | | | | | | | |
W G T S M I V C Y E Y L .... .......... .......... 311
ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT 10753
.......... .......... .......... .......... .......... .......... 311
ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC 10813
.......... .......... .......... .......... .......... .......... 311
ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG 10873
.......... .......... .......... .......... .......... .......... 311
ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA 10933
.......... .......... .......... .......... .......... .......... 311
AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT 10993
.......... .......... .......... .......... .......... .......... 311
GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC 11053
.......... .......... .......... .......... .......... .......... 311
CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC 11113
.......... .......... .......... .......... .......... .......... 311
GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC 11173
.......... .......... .......... .......... .......... .......... 311
CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA 11233
.......... .......... .......... .......... .......... .......... 311
TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG 11293
.......... .......... .......... .......... .......... .......... 311
GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG 11353
K R L C A K V E
| | + | | | | |
.......... .......... .......... ......K R V C A K V E 319
AGGTCTGA 11361
E V *
| .
E A * 322
********************************************************************************
Query protein sequence 4 (File: 12278522)
1 DTSTRAAKIP SLPQQTEINW DNLDMTKLYV VGAGMFSCVT VALYPVSVIK TRMQVASGEA
61 MRRNALATFK NILKVDGVPG LYRGFGTVIT GAIPARIIFL TALEKTKATS LKLVEPLQLS
121 ESMEAALANG LGGLTASLCS QAVFVPIDVV SQKLMVQGYS GHVRYKGGID VVQKIMKADG
181 PRGLYRGFGL SVMTYAPSSA VWWASYGFSQ RVIWSALGRL DDKEDTPSQL KIVGVQATGG
241 MVAGAVTSCV STPLDTIKTR LQVNINKPKA SEVVRRLIAE DGWKGFYRGL GPRFFSSSAW
301 GTSMIVCYEY LKRVCAKVEE A-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 7997 8058 ( 62 n); Protein 1 22 ( 22 aa); score: 0.754
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 23 149 ( 127 aa); score: 0.846
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 150 215 ( 66 aa); score: 0.871
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 216 262 ( 47 aa); score: 0.778
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 263 311 ( 49 aa); score: 0.851
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 312 321 ( 10 aa); score: 0.857
MATCH 21326110+ 12278522 0.836 969 1.003 P
PGS_21326110+_12278522 (7997 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
GATACAACCT CTAGGGCCGC CAAGATCCCG TCGCTC---C ACCAGACGGA GATCAACTGG 8053
D T T S R A A K I P S L H Q T E I N W
| | + + | | | | | | | | . | | | | | |
D T S T R A A K I P S L P Q Q T E I N W 20
GACAAGTAAG CAACCCACCG GTCTTGTAAT CCTTAGGTTC CCATTTCGTG CCGATTTCCG 8113
D N
| |
D N..... .......... .......... .......... .......... .......... 22
CCTTTCTCGC GGTCCTAGTC TTCTAGAAAT TTAGCTCCGA TCGTTGAACT GTGTCCCCCG 8173
.......... .......... .......... .......... .......... .......... 22
CCCTATTCCG GATTGGCTTT ACCGCCTTAG GATTAGGAAC TGTTGTTTGA GGATTTGACA 8233
.......... .......... .......... .......... .......... .......... 22
TGGCTGTTCA TGCTCCTGGA CGAAGCGCAA TTTGGGGATT CGTAGCTTAT TCCGTTGTTT 8293
.......... .......... .......... .......... .......... .......... 22
CCGATTCTTT TACTGTTCTC AAATCGTGGC GCTGAAATGT TGCAGCCTCG ACAAGACCAA 8353
L D K T K
| | | |
.......... .......... .......... .......... ..... L D M T K 27
GCTCTACGTG GTGGGCGCAG GCATGTTCAG CGGCGTCACC GTGGCGCTGT ATCCTGTCTC 8413
L Y V V G A G M F S G V T V A L Y P V S
| | | | | | | | | | | | | | | | | | |
L Y V V G A G M F S C V T V A L Y P V S 47
GGTGGTCAAG ACCCGGATGC AGGTTGCCTC TGGGGACGCC ATGAGGAGGA ACGCGCTGGC 8473
V V K T R M Q V A S G D A M R R N A L A
| + | | | | | | | | | + | | | | | | | |
V I K T R M Q V A S G E A M R R N A L A 67
TACCTTCAAG AACATCCTCA AGATGGACGG CGTGCCAGGG CTGTACCGGG GGTTTGCTAC 8533
T F K N I L K M D G V P G L Y R G F A T
| | | | | | | + | | | | | | | | | | . |
T F K N I L K V D G V P G L Y R G F G T 87
CGTTATCATT GGGGCTGTAC CAACTAGGAT CATCTTCCTC ACAGCGCTTG AGACAACCAA 8593
V I I G A V P T R I I F L T A L E T T K
| | | | + | . | | | | | | | | | | |
V I T G A I P A R I I F L T A L E K T K 107
AGCAGCCTCG CTTAAGCTTG TTGAGCCCTT CAAGCTGTCA GAGCCGGTGC GGGCTGCCTT 8653
A A S L K L V E P F K L S E P V R A A F
| . | | | | | | | . + | | | + . | | .
A T S L K L V E P L Q L S E S M E A A L 127
TGCCAATGGC CTTGCTGGTC TGTCAGCGTC TACATGTTCG CAGGCTATTT TTGTTCCAAT 8713
A N G L A G L S A S T C S Q A I F V P I
| | | | . | | + | | | | | | + | | | |
A N G L G G L T A S L C S Q A V F V P I 147
TGATGTGGTA TGCCTCTCAT GTGCCTTCTA TGTGATGTTG TATAGAGAAA AAATATCTTA 8773
D V
| |
D V ... .......... .......... .......... .......... .......... 149
CAATATGTTG ATGTTAAATG CTAATTACAA TACTAGACTA CTGTTTTCAT TCTGTTGTGC 8833
.......... .......... .......... .......... .......... .......... 149
ATTGGAATGT TTCAGATTAG CCAGAAATTG ATGGTTCAAG GATATTCTGG TAATGCCAGA 8893
I S Q K L M V Q G Y S G N A R
+ | | | | | | | | | | | + . |
.......... ..... V S Q K L M V Q G Y S G H V R 164
TACAAAGGTG GATTAGATGT TGCTCGAAAG GTCATAAAGG CTGATGGCAT TAGGGGGCTG 8953
Y K G G L D V A R K V I K A D G I R G L
| | | | + | | . + | + + | | | | | | |
Y K G G I D V V Q K I M K A D G P R G L 184
TACAGAGGAT TTGGACTGTC TGTTATGACC TATGCTCCAT CCAGTGCTGT GTGGTGGGCA 9013
Y R G F G L S V M T Y A P S S A V W W A
| | | | | | | | | | | | | | | | | | | |
Y R G F G L S V M T Y A P S S A V W W A 204
AGTTATGGTT CCAGCCAGCG CATAATTTGG AGGTTAGCTT ATCTGATTGG TTCATCGTTA 9073
S Y G S S Q R I I W S
| | | | | | + | | |
S Y G F S Q R V I W S........ .......... .......... 215
TGTTCCTCTC AGCCCTGTGT ACTATGTAAT ATTTACGAGA AAAAGACCAG TAATACATTT 9133
.......... .......... .......... .......... .......... .......... 215
CTACTTAATA GTTATTTGAA TTGGTACTTT CCATCTGTCC AAAACCTTTT CAAACTTCCC 9193
.......... .......... .......... .......... .......... .......... 215
CTCTTGATGC TCAAACTGCA GCTATAATTG CAATTTTGTT TTCTGATGCT TGTTCTTCCA 9253
.......... .......... .......... .......... .......... .......... 215
TGTCAATATG TACATATCTT TTTTAGAAAA CAAGAATGCA TCTCAATGCA TGTGCTGTAT 9313
.......... .......... .......... .......... .......... .......... 215
TGTTTTGATT AGATTTATCA TAGCGATCAA TCACATTTTC TTTACAGATA AAAATAGTCG 9373
.......... .......... .......... .......... .......... .......... 215
GAAGGATAAG TTGGATAACT GACCAAAGTG GAAATATGAT CTTACATATT TTTATCTCTG 9433
.......... .......... .......... .......... .......... .......... 215
GCAGCTTAGA GAACTTAATT ACCAACCTGA AACAATGTGA TGAAGTAACT ACACAAAACC 9493
.......... .......... .......... .......... .......... .......... 215
ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT 9553
.......... .......... .......... .......... .......... .......... 215
GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT 9613
.......... .......... .......... .......... .......... .......... 215
GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT 9673
.......... .......... .......... .......... .......... .......... 215
TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC 9733
A L G H L H D K E E A P
| | | . | | | | + . |
.......... .......... .... A L G R L D D K E D T P 227
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| | | | + | | | | | + | | + | | | | |
S Q L K I V G V Q A T G G M V A G A V T 247
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| | + | | + | | | | | | | |
S C V S T P L D T I K T R L Q .... .......... 262
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 262
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 262
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 262
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 262
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 262
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 262
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 262
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 262
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 262
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 262
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 262
AGGTTATGGA TAATGAAAAT AAGCCAAAAG CCAGGGAAGT TGTCAAAAGA TTGATTGCTG 10573
V M D N E N K P K A R E V V K R L I A
| | | | | | | | | | + | | | |
.. V - - N I N K P K A S E V V R R L I A 279
AAGATGGATG GAAAGGTTTG TACAGAGGGT TGGGTCCAAG ATTTTTCAGC TCATCAGCTT 10633
E D G W K G L Y R G L G P R F F S S S A
| | | | | | . | | | | | | | | | | | | |
E D G W K G F Y R G L G P R F F S S S A 299
GGGGAACCTC AATGATAGTA TGCTACGAGT ACCTGAGTAT GTTTCGTCTT CCCTTGTCAA 10693
W G T S M I V C Y E Y L
| | | | | | | | | | | |
W G T S M I V C Y E Y L .... .......... .......... 311
ATGTACACAT GCATATGTAG TGTTATATAT CACTGCATCC CATGCAGGTT AATTTTAAGT 10753
.......... .......... .......... .......... .......... .......... 311
ACCCAGATAC TTCTTCTCAT TTAGAATTTA GTTAAAATGA CATCATTCAG GTCAGTTGGC 10813
.......... .......... .......... .......... .......... .......... 311
ATCTCCAGTA CACTGCTTTT GTAAGTTGTA TCATAAATCC CATTTGCAAT GAAATTTTTG 10873
.......... .......... .......... .......... .......... .......... 311
ACTCAAGTTG CAGCCTGTAA CTTTTCTATA TTTTTCGAAT AAAGCTATCA CCGTACATGA 10933
.......... .......... .......... .......... .......... .......... 311
AACCTGCTTC TGTTAATGCC AAGGAGCGCA CATTATTTCC TGTAGACCGG CTTGGATGTT 10993
.......... .......... .......... .......... .......... .......... 311
GAACAATTGG CACATGCAAG TAGCAAAGAG CAGCCTTGTG CTTGCAACAA TCTGGTCCAC 11053
.......... .......... .......... .......... .......... .......... 311
CTGTGGATAT GTTCGCTGTG AAAGAAACCA ATTAGTCCTT GTATGAAACA TGGTATTAGC 11113
.......... .......... .......... .......... .......... .......... 311
GCTTCATGAA TAAAACCACT GATTCTGATT TCTTATTTTC AATGAATGGA TGGGCATTAC 11173
.......... .......... .......... .......... .......... .......... 311
CAAAGTTATC ATGATTAAAG ATCTATTTCA TATAAGTTTA TTTTTATACA TTAGAGTTTA 11233
.......... .......... .......... .......... .......... .......... 311
TTTAGAGAAC AAGGTATATT TAGTTTTGGT AATTTTGTGA ACTGCACTCA GACGACTTTG 11293
.......... .......... .......... .......... .......... .......... 311
GTATTCTTAC TGTAATTTTG TTTTGTTTTC CTACAGAGCG CTTGTGTGCT AAAGTTGAAG 11353
K R L C A K V E
| | + | | | | |
.......... .......... .......... ......K R V C A K V E 319
AGGTCTGA 11361
E V *
| .
E A * 322
********************************************************************************
Query protein sequence 5 (File: 21553961)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATTAPSKS KIVMVQAAGG
241 IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
301 SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 8000 8058 ( 59 n); Protein 1 20 ( 20 aa); score: 0.467
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 21 147 ( 127 aa); score: 0.712
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 148 213 ( 66 aa); score: 0.837
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 214 262 ( 49 aa); score: 0.496
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 263 314 ( 52 aa); score: 0.754
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 315 322 ( 8 aa); score: 0.590
MATCH 21326110+ 21553961 0.697 966 0.997 P
PGS_21326110+_21553961 (8000 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG 8059
T T S R A A K I P S L H Q T E I N W D N
| . + + | | . | | | | | | | .
D T P P T S R I A S F G Q T E I N W D K. 20
TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC 8119
.......... .......... .......... .......... .......... .......... 20
TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT 8179
.......... .......... .......... .......... .......... .......... 20
TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG 8239
.......... .......... .......... .......... .......... .......... 20
TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT 8299
.......... .......... .......... .......... .......... .......... 20
CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA 8359
L D K T K L Y
| | | + . |
.......... .......... .......... ......... L D K R R F Y 27
CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT 8419
V V G A G M F S G V T V A L Y P V S V V
+ | | | + | + | | | | | | | | | | | |
I N G A G L F T G V T V A L Y P V S V V 47
CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT 8479
K T R M Q V A S G D A M R R N A L A T F
| | | + | | | | + . | + | . + .
K T R L Q V A S K E I A E R S A F S V V 67
CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT 8539
K N I L K M D G V P G L Y R G F A T V I
| . | | | | | | | | | | | | | . | | |
K G I L K N D G V P G L Y R G F G T V I 87
CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC 8599
I G A V P T R I I F L T A L E T T K A A
| | | | . | | | | | | | | | | | | +
T G A V P A R I I F L T A L E T T K I S 107
CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA 8659
S L K L V E P F K L S E P V R A A F A N
+ . | | | | . + | | | | . + | | . | |
A F K L V A P L E L S E P T Q A A I A N 127
TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT 8719
G L A G L S A S T C S Q A I F V P I D V
| + | | + + | | | | | + | | | | | |
G I A G M T A S L F S Q A V F V P I D V 147
GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT 8779
......... .......... .......... .......... .......... .......... 147
GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA 8839
.......... .......... .......... .......... .......... .......... 147
ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA 8899
I S Q K L M V Q G Y S G N A R Y K
+ | | | | | | | | | | | + | |
......... V S Q K L M V Q G Y S G H A T Y T 164
GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA 8959
G G L D V A R K V I K A D G I R G L Y R
| | + | | | | + | | + | + | | | | |
G G I D V A T K I I K S Y G V R G L Y R 184
GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT 9019
G F G L S V M T Y A P S S A V W W A S Y
| | | | | | | | | + | | | | . | | | | |
G F G L S V M T Y S P S S A A W W A S Y 204
GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTATGTTCC 9079
G S S Q R I I W S
| | | | | + | |
G S S Q R V I W R.... .......... .......... .......... 213
TCTCAGCCCT GTGTACTATG TAATATTTAC GAGAAAAAGA CCAGTAATAC ATTTCTACTT 9139
.......... .......... .......... .......... .......... .......... 213
AATAGTTATT TGAATTGGTA CTTTCCATCT GTCCAAAACC TTTTCAAACT TCCCCTCTTG 9199
.......... .......... .......... .......... .......... .......... 213
ATGCTCAAAC TGCAGCTATA ATTGCAATTT TGTTTTCTGA TGCTTGTTCT TCCATGTCAA 9259
.......... .......... .......... .......... .......... .......... 213
TATGTACATA TCTTTTTTAG AAAACAAGAA TGCATCTCAA TGCATGTGCT GTATTGTTTT 9319
.......... .......... .......... .......... .......... .......... 213
GATTAGATTT ATCATAGCGA TCAATCACAT TTTCTTTACA GATAAAAATA GTCGGAAGGA 9379
.......... .......... .......... .......... .......... .......... 213
TAAGTTGGAT AACTGACCAA AGTGGAAATA TGATCTTACA TATTTTTATC TCTGGCAGCT 9439
.......... .......... .......... .......... .......... .......... 213
TAGAGAACTT AATTACCAAC CTGAAACAAT GTGATGAAGT AACTACACAA AACCACATAT 9499
.......... .......... .......... .......... .......... .......... 213
AGTTTCATGC ACTCTGCAAA ACTAAATTGA AACTCTTAGT GTGCTCTTAA TGCTGTTAAG 9559
.......... .......... .......... .......... .......... .......... 213
AGGGTGTATG CAAGTTTACT GGAATCAGTA CCTTTTGTTA GTTTATTTCT TTGTGGTTGA 9619
.......... .......... .......... .......... .......... .......... 213
TGGTTGAAAG ATTATATTTC TTGTCTTGAT AACTTAGCCA AAATAGTTAA CTATTGTGCT 9679
.......... .......... .......... .......... .......... .......... 213
TTTTACATAT TGGAACAGTG CTCTTGGCCA TTTGCATGAC AAAGAA---- --GAGGCTCC 9733
A L G H L H D K E E A P
| | + | . + | |
.......... ........ F L G Y G G D S D A T T A P 227
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| + | + | | | | + | | + . | | | . .
S K S K I V M V Q A A G G I I A G A T A 247
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| + | | | + | | | | | | | |
S S I T T P L D T I K T R L Q .... .......... 262
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 262
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 262
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 262
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 262
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 262
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 262
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 262
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 262
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 262
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 262
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 262
AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG 10570
V M D N E N K P K A R E V V K R L I
| | + | | + | . | + + | | | + | +
.. V M G H Q E N R P S A K Q V V K K L L 281
CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG 10630
A E D G W K G L Y R G L G P R F F S S S
| | | | | | | . | | | | | | | | | | |
A E D G W K G F Y R G L G P R F F S M S 301
CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT 10690
A W G T S M I V C Y E Y L
| | | | | | | + | | | |
A W G T S M I L T Y E Y L . .......... .......... 314
CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA 10750
.......... .......... .......... .......... .......... .......... 314
AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT 10810
.......... .......... .......... .......... .......... .......... 314
GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT 10870
.......... .......... .......... .......... .......... .......... 314
TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA 10930
.......... .......... .......... .......... .......... .......... 314
TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT 10990
.......... .......... .......... .......... .......... .......... 314
GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC 11050
.......... .......... .......... .......... .......... .......... 314
CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT 11110
.......... .......... .......... .......... .......... .......... 314
AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT 11170
.......... .......... .......... .......... .......... .......... 314
TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT 11230
.......... .......... .......... .......... .......... .......... 314
TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT 11290
.......... .......... .......... .......... .......... .......... 314
TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG 11350
K R L C A K V
| | | | | +
.......... .......... .......... .........K R L C A - I 320
AAGAGGTCTG A 11361
E E V *
| +
E D - * 323
********************************************************************************
Query protein sequence 6 (File: 15292889)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATAAPSKS KIVMVQAAGG
241 IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
301 SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 8000 8058 ( 59 n); Protein 1 20 ( 20 aa); score: 0.467
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 21 147 ( 127 aa); score: 0.712
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9045 ( 197 n); Protein 148 213 ( 66 aa); score: 0.837
Intron 3 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 4 9698 9839 ( 142 n); Protein 214 262 ( 49 aa); score: 0.498
Intron 4 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 5 10516 10669 ( 154 n); Protein 263 314 ( 52 aa); score: 0.754
Intron 5 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 6 11330 11361 ( 32 n); Protein 315 322 ( 8 aa); score: 0.590
MATCH 21326110+ 15292889 0.697 966 0.997 P
PGS_21326110+_15292889 (8000 8058,8339 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG 8059
T T S R A A K I P S L H Q T E I N W D N
| . + + | | . | | | | | | | .
D T P P T S R I A S F G Q T E I N W D K. 20
TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC 8119
.......... .......... .......... .......... .......... .......... 20
TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT 8179
.......... .......... .......... .......... .......... .......... 20
TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG 8239
.......... .......... .......... .......... .......... .......... 20
TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT 8299
.......... .......... .......... .......... .......... .......... 20
CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA 8359
L D K T K L Y
| | | + . |
.......... .......... .......... ......... L D K R R F Y 27
CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT 8419
V V G A G M F S G V T V A L Y P V S V V
+ | | | + | + | | | | | | | | | | | |
I N G A G L F T G V T V A L Y P V S V V 47
CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT 8479
K T R M Q V A S G D A M R R N A L A T F
| | | + | | | | + . | + | . + .
K T R L Q V A S K E I A E R S A F S V V 67
CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT 8539
K N I L K M D G V P G L Y R G F A T V I
| . | | | | | | | | | | | | | . | | |
K G I L K N D G V P G L Y R G F G T V I 87
CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC 8599
I G A V P T R I I F L T A L E T T K A A
| | | | . | | | | | | | | | | | | +
T G A V P A R I I F L T A L E T T K I S 107
CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA 8659
S L K L V E P F K L S E P V R A A F A N
+ . | | | | . + | | | | . + | | . | |
A F K L V A P L E L S E P T Q A A I A N 127
TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT 8719
G L A G L S A S T C S Q A I F V P I D V
| + | | + + | | | | | + | | | | | |
G I A G M T A S L F S Q A V F V P I D V 147
GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT 8779
......... .......... .......... .......... .......... .......... 147
GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA 8839
.......... .......... .......... .......... .......... .......... 147
ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA 8899
I S Q K L M V Q G Y S G N A R Y K
+ | | | | | | | | | | | + | |
......... V S Q K L M V Q G Y S G H A T Y T 164
GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA 8959
G G L D V A R K V I K A D G I R G L Y R
| | + | | | | + | | + | + | | | | |
G G I D V A T K I I K S Y G V R G L Y R 184
GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT 9019
G F G L S V M T Y A P S S A V W W A S Y
| | | | | | | | | + | | | | . | | | | |
G F G L S V M T Y S P S S A A W W A S Y 204
GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTATGTTCC 9079
G S S Q R I I W S
| | | | | + | |
G S S Q R V I W R.... .......... .......... .......... 213
TCTCAGCCCT GTGTACTATG TAATATTTAC GAGAAAAAGA CCAGTAATAC ATTTCTACTT 9139
.......... .......... .......... .......... .......... .......... 213
AATAGTTATT TGAATTGGTA CTTTCCATCT GTCCAAAACC TTTTCAAACT TCCCCTCTTG 9199
.......... .......... .......... .......... .......... .......... 213
ATGCTCAAAC TGCAGCTATA ATTGCAATTT TGTTTTCTGA TGCTTGTTCT TCCATGTCAA 9259
.......... .......... .......... .......... .......... .......... 213
TATGTACATA TCTTTTTTAG AAAACAAGAA TGCATCTCAA TGCATGTGCT GTATTGTTTT 9319
.......... .......... .......... .......... .......... .......... 213
GATTAGATTT ATCATAGCGA TCAATCACAT TTTCTTTACA GATAAAAATA GTCGGAAGGA 9379
.......... .......... .......... .......... .......... .......... 213
TAAGTTGGAT AACTGACCAA AGTGGAAATA TGATCTTACA TATTTTTATC TCTGGCAGCT 9439
.......... .......... .......... .......... .......... .......... 213
TAGAGAACTT AATTACCAAC CTGAAACAAT GTGATGAAGT AACTACACAA AACCACATAT 9499
.......... .......... .......... .......... .......... .......... 213
AGTTTCATGC ACTCTGCAAA ACTAAATTGA AACTCTTAGT GTGCTCTTAA TGCTGTTAAG 9559
.......... .......... .......... .......... .......... .......... 213
AGGGTGTATG CAAGTTTACT GGAATCAGTA CCTTTTGTTA GTTTATTTCT TTGTGGTTGA 9619
.......... .......... .......... .......... .......... .......... 213
TGGTTGAAAG ATTATATTTC TTGTCTTGAT AACTTAGCCA AAATAGTTAA CTATTGTGCT 9679
.......... .......... .......... .......... .......... .......... 213
TTTTACATAT TGGAACAGTG CTCTTGGCCA TTTGCATGAC AAAGAA---- --GAGGCTCC 9733
A L G H L H D K E E A P
| | + | . + | |
.......... ........ F L G Y G G D S D A T A A P 227
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| + | + | | | | + | | + . | | | . .
S K S K I V M V Q A A G G I I A G A T A 247
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| + | | | + | | | | | | | |
S S I T T P L D T I K T R L Q .... .......... 262
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 262
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 262
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 262
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 262
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 262
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 262
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 262
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 262
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 262
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 262
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 262
AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG 10570
V M D N E N K P K A R E V V K R L I
| | + | | + | . | + + | | | + | +
.. V M G H Q E N R P S A K Q V V K K L L 281
CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG 10630
A E D G W K G L Y R G L G P R F F S S S
| | | | | | | . | | | | | | | | | | |
A E D G W K G F Y R G L G P R F F S M S 301
CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT 10690
A W G T S M I V C Y E Y L
| | | | | | | + | | | |
A W G T S M I L T Y E Y L . .......... .......... 314
CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA 10750
.......... .......... .......... .......... .......... .......... 314
AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT 10810
.......... .......... .......... .......... .......... .......... 314
GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT 10870
.......... .......... .......... .......... .......... .......... 314
TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA 10930
.......... .......... .......... .......... .......... .......... 314
TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT 10990
.......... .......... .......... .......... .......... .......... 314
GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC 11050
.......... .......... .......... .......... .......... .......... 314
CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT 11110
.......... .......... .......... .......... .......... .......... 314
AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT 11170
.......... .......... .......... .......... .......... .......... 314
TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT 11230
.......... .......... .......... .......... .......... .......... 314
TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT 11290
.......... .......... .......... .......... .......... .......... 314
TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG 11350
K R L C A K V
| | | | | +
.......... .......... .......... .........K R L C A - I 320
AAGAGGTCTG A 11361
E E V *
| +
E D - * 323
********************************************************************************
Query protein sequence 7 (File: 11358653)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRLAMNVLS FLEFGFATKA TIPLIQYLLL
241 LGRFLGYGGD SDATAAPSKS KIVMVQAAGG IIAGATASSI TTPLDTIKTR LQVMGHQENR
301 PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 8000 8058 ( 59 n); Protein 1 20 ( 20 aa); score: 0.467
Intron 1 8059 8338 ( 280 n); Pd: 1.000 Pa: 0.998
Exon 2 8339 8720 ( 382 n); Protein 21 147 ( 127 aa); score: 0.712
Intron 2 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 3 8849 9101 ( 253 n); Protein 148 233 ( 86 aa); score: 0.657
Intron 3 9102 9437 ( 336 n); Pd: 0.000 Pa: 0.346
Exon 4 9438 9477 ( 40 n); Protein 234 246 ( 13 aa); score: 0.042
Intron 4 9478 9697 ( 220 n); Pd: 0.924 Pa: 0.863
Exon 5 9698 9839 ( 142 n); Protein 247 292 ( 46 aa); score: 0.555
Intron 5 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 6 10516 10669 ( 154 n); Protein 293 344 ( 52 aa); score: 0.754
Intron 6 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 7 11330 11361 ( 32 n); Protein 345 352 ( 8 aa); score: 0.590
MATCH 21326110+ 11358653 0.667 1062 1.003 P
PGS_21326110+_11358653 (8000 8058,8339 8720,8849 9101,9438 9477,9698 9839,10516 10669,11330 11361)
Alignment:
ACAACCTCTA GGGCCGCCAA GATCCCGTCG CTCCACCAGA CGGAGATCAA CTGGGACAAG 8059
T T S R A A K I P S L H Q T E I N W D N
| . + + | | . | | | | | | | .
D T P P T S R I A S F G Q T E I N W D K. 20
TAAGCAACCC ACCGGTCTTG TAATCCTTAG GTTCCCATTT CGTGCCGATT TCCGCCTTTC 8119
.......... .......... .......... .......... .......... .......... 20
TCGCGGTCCT AGTCTTCTAG AAATTTAGCT CCGATCGTTG AACTGTGTCC CCCGCCCTAT 8179
.......... .......... .......... .......... .......... .......... 20
TCCGGATTGG CTTTACCGCC TTAGGATTAG GAACTGTTGT TTGAGGATTT GACATGGCTG 8239
.......... .......... .......... .......... .......... .......... 20
TTCATGCTCC TGGACGAAGC GCAATTTGGG GATTCGTAGC TTATTCCGTT GTTTCCGATT 8299
.......... .......... .......... .......... .......... .......... 20
CTTTTACTGT TCTCAAATCG TGGCGCTGAA ATGTTGCAGC CTCGACAAGA CCAAGCTCTA 8359
L D K T K L Y
| | | + . |
.......... .......... .......... ......... L D K R R F Y 27
CGTGGTGGGC GCAGGCATGT TCAGCGGCGT CACCGTGGCG CTGTATCCTG TCTCGGTGGT 8419
V V G A G M F S G V T V A L Y P V S V V
+ | | | + | + | | | | | | | | | | | |
I N G A G L F T G V T V A L Y P V S V V 47
CAAGACCCGG ATGCAGGTTG CCTCTGGGGA CGCCATGAGG AGGAACGCGC TGGCTACCTT 8479
K T R M Q V A S G D A M R R N A L A T F
| | | + | | | | + . | + | . + .
K T R L Q V A S K E I A E R S A F S V V 67
CAAGAACATC CTCAAGATGG ACGGCGTGCC AGGGCTGTAC CGGGGGTTTG CTACCGTTAT 8539
K N I L K M D G V P G L Y R G F A T V I
| . | | | | | | | | | | | | | . | | |
K G I L K N D G V P G L Y R G F G T V I 87
CATTGGGGCT GTACCAACTA GGATCATCTT CCTCACAGCG CTTGAGACAA CCAAAGCAGC 8599
I G A V P T R I I F L T A L E T T K A A
| | | | . | | | | | | | | | | | | +
T G A V P A R I I F L T A L E T T K I S 107
CTCGCTTAAG CTTGTTGAGC CCTTCAAGCT GTCAGAGCCG GTGCGGGCTG CCTTTGCCAA 8659
S L K L V E P F K L S E P V R A A F A N
+ . | | | | . + | | | | . + | | . | |
A F K L V A P L E L S E P T Q A A I A N 127
TGGCCTTGCT GGTCTGTCAG CGTCTACATG TTCGCAGGCT ATTTTTGTTC CAATTGATGT 8719
G L A G L S A S T C S Q A I F V P I D V
| + | | + + | | | | | + | | | | | |
G I A G M T A S L F S Q A V F V P I D V 147
GGTATGCCTC TCATGTGCCT TCTATGTGAT GTTGTATAGA GAAAAAATAT CTTACAATAT 8779
......... .......... .......... .......... .......... .......... 147
GTTGATGTTA AATGCTAATT ACAATACTAG ACTACTGTTT TCATTCTGTT GTGCATTGGA 8839
.......... .......... .......... .......... .......... .......... 147
ATGTTTCAGA TTAGCCAGAA ATTGATGGTT CAAGGATATT CTGGTAATGC CAGATACAAA 8899
I S Q K L M V Q G Y S G N A R Y K
+ | | | | | | | | | | | + | |
......... V S Q K L M V Q G Y S G H A T Y T 164
GGTGGATTAG ATGTTGCTCG AAAGGTCATA AAGGCTGATG GCATTAGGGG GCTGTACAGA 8959
G G L D V A R K V I K A D G I R G L Y R
| | + | | | | + | | + | + | | | | |
G G I D V A T K I I K S Y G V R G L Y R 184
GGATTTGGAC TGTCTGTTAT GACCTATGCT CCATCCAGTG CTGTGTGGTG GGCAAGTTAT 9019
G F G L S V M T Y A P S S A V W W A S Y
| | | | | | | | | + | | | | . | | | | |
G F G L S V M T Y S P S S A A W W A S Y 204
GGTTCCAGCC AGCGCATAAT TTGGAGGTTA GCTTATCTGA TTGGTTCATC GTTAT-GTTC 9078
G S S Q R I I W R L A Y L I G S S L F
| | | | | + | | | | | + | | |
G S S Q R V I W R L A M N V L S F L E F 224
---CTCTCAG CCCTGTGTAC TATGTAATAT TTACGAGAAA AAGACCAGTA ATACATTTCT 9135
L S A L C T M Y
. + . . | +
G F A T K A T I P.... .......... .......... .......... 233
ACTTAATAGT TATTTGAATT GGTACTTTCC ATCTGTCCAA AACCTTTTCA AACTTCCCCT 9195
.......... .......... .......... .......... .......... .......... 233
CTTGATGCTC AAACTGCAGC TATAATTGCA ATTTTGTTTT CTGATGCTTG TTCTTCCATG 9255
.......... .......... .......... .......... .......... .......... 233
TCAATATGTA CATATCTTTT TTAGAAAACA AGAATGCATC TCAATGCATG TGCTGTATTG 9315
.......... .......... .......... .......... .......... .......... 233
TTTTGATTAG ATTTATCATA GCGATCAATC ACATTTTCTT TACAGATAAA AATAGTCGGA 9375
.......... .......... .......... .......... .......... .......... 233
AGGATAAGTT GGATAACTGA CCAAAGTGGA AATATGATCT TACATATTTT TATCTCTGGC 9435
.......... .......... .......... .......... .......... .......... 233
AGCTTAGAGA ACTTAATTAC CAACCTG-A- AACAATGTGA TGAAGTAACT ACACAAAACC 9493
L E N L I T N L N N V M N
| . + | . . + .
.. L I Q Y L L L L - G R F L G...... .......... 246
ACATATAGTT TCATGCACTC TGCAAAACTA AATTGAAACT CTTAGTGTGC TCTTAATGCT 9553
.......... .......... .......... .......... .......... .......... 246
GTTAAGAGGG TGTATGCAAG TTTACTGGAA TCAGTACCTT TTGTTAGTTT ATTTCTTTGT 9613
.......... .......... .......... .......... .......... .......... 246
GGTTGATGGT TGAAAGATTA TATTTCTTGT CTTGATAACT TAGCCAAAAT AGTTAACTAT 9673
.......... .......... .......... .......... .......... .......... 246
TGTGCTTTTT ACATATTGGA ACAGTGCTCT TGGCCATTTG CATGACAAAG AAGAGGCTCC 9733
A L G H L H D K E E A P
| | | |
.......... .......... .... - Y G G D S D A T A A P 257
TAGCCAATTG AAACTAGTTG GTGTTCAAGC ATCAGGGGGG GTTTTTGCCG GTGCCGTGAC 9793
S Q L K L V G V Q A S G G V F A G A V T
| + | + | | | | + | | + . | | | . .
S K S K I V M V Q A A G G I I A G A T A 277
CTCTTTTGTT ACGACTCCCA TAGATACAAT AAAGACCAGG CTGCAGGTAC TGTGTGACAT 9853
S F V T T P I D T I K T R L Q
| + | | | + | | | | | | | |
S S I T T P L D T I K T R L Q .... .......... 292
TCTGTTTGCT GATTACTCTT GTAATTTGAT TTGTGTGGGT ATATTTTGTG AGGCTTACCC 9913
.......... .......... .......... .......... .......... .......... 292
TTGTGACTTA ATGATTCTTG TCTTTACATT TATGCTGCTC ATTTGCAATA ATTTGATTCC 9973
.......... .......... .......... .......... .......... .......... 292
TTATCAATGC AATGCCACTA AGTTTAGGGG AATGGATATT TTGTTTTGGA AGTATATTTG 10033
.......... .......... .......... .......... .......... .......... 292
ATGTCAGACT TGAAGACCTA AATGTTCTTT TATACTGATA TTTCCTCCAA TGGCGGGCTA 10093
.......... .......... .......... .......... .......... .......... 292
TTGAGGTGCT GGACTGGAAT GCTGTCTATA TTAAACAATA TATACTTCTA TGTTTACAGC 10153
.......... .......... .......... .......... .......... .......... 292
TGTTTGTTTT CTGCTGACAT ACCATGACCA ATTTGTCATG GTTTCAGTTA TGAGGTCAGA 10213
.......... .......... .......... .......... .......... .......... 292
AAAAAAGAAA CTTCCATTGG GAAAACTTGA TATCTATTAC TTCATTATTT ATAGTGAGTA 10273
.......... .......... .......... .......... .......... .......... 292
ACAAAAGTTA GCACTTTCAA ACTGACTAAA GTATGCCAGG GACGTATCAT GCATTTTACA 10333
.......... .......... .......... .......... .......... .......... 292
ACATGCTCCA CATATCTCCA AATATCACAT ATTACGCTTG TAGTGGTAAA CTGATAATAC 10393
.......... .......... .......... .......... .......... .......... 292
ATCTACCAAC ACTGAAAGTT CTCACAAGTC AGAACCCTAT ATTTGACAGT TGTGGTCTCC 10453
.......... .......... .......... .......... .......... .......... 292
CTCCTTCCCT CTGCATTTGT TGCTACAGAT GATTACACTG AGTTTTGTTT CTTGTCATTT 10513
.......... .......... .......... .......... .......... .......... 292
AGGTTATGGA TAAT---GAA AATAAGCCAA AAGCCAGGGA AGTTGTCAAA AGATTGATTG 10570
V M D N E N K P K A R E V V K R L I
| | + | | + | . | + + | | | + | +
.. V M G H Q E N R P S A K Q V V K K L L 311
CTGAAGATGG ATGGAAAGGT TTGTACAGAG GGTTGGGTCC AAGATTTTTC AGCTCATCAG 10630
A E D G W K G L Y R G L G P R F F S S S
| | | | | | | . | | | | | | | | | | |
A E D G W K G F Y R G L G P R F F S M S 331
CTTGGGGAAC CTCAATGATA GTATGCTACG AGTACCTGAG TATGTTTCGT CTTCCCTTGT 10690
A W G T S M I V C Y E Y L
| | | | | | | + | | | |
A W G T S M I L T Y E Y L . .......... .......... 344
CAAATGTACA CATGCATATG TAGTGTTATA TATCACTGCA TCCCATGCAG GTTAATTTTA 10750
.......... .......... .......... .......... .......... .......... 344
AGTACCCAGA TACTTCTTCT CATTTAGAAT TTAGTTAAAA TGACATCATT CAGGTCAGTT 10810
.......... .......... .......... .......... .......... .......... 344
GGCATCTCCA GTACACTGCT TTTGTAAGTT GTATCATAAA TCCCATTTGC AATGAAATTT 10870
.......... .......... .......... .......... .......... .......... 344
TTGACTCAAG TTGCAGCCTG TAACTTTTCT ATATTTTTCG AATAAAGCTA TCACCGTACA 10930
.......... .......... .......... .......... .......... .......... 344
TGAAACCTGC TTCTGTTAAT GCCAAGGAGC GCACATTATT TCCTGTAGAC CGGCTTGGAT 10990
.......... .......... .......... .......... .......... .......... 344
GTTGAACAAT TGGCACATGC AAGTAGCAAA GAGCAGCCTT GTGCTTGCAA CAATCTGGTC 11050
.......... .......... .......... .......... .......... .......... 344
CACCTGTGGA TATGTTCGCT GTGAAAGAAA CCAATTAGTC CTTGTATGAA ACATGGTATT 11110
.......... .......... .......... .......... .......... .......... 344
AGCGCTTCAT GAATAAAACC ACTGATTCTG ATTTCTTATT TTCAATGAAT GGATGGGCAT 11170
.......... .......... .......... .......... .......... .......... 344
TACCAAAGTT ATCATGATTA AAGATCTATT TCATATAAGT TTATTTTTAT ACATTAGAGT 11230
.......... .......... .......... .......... .......... .......... 344
TTATTTAGAG AACAAGGTAT ATTTAGTTTT GGTAATTTTG TGAACTGCAC TCAGACGACT 11290
.......... .......... .......... .......... .......... .......... 344
TTGGTATTCT TACTGTAATT TTGTTTTGTT TTCCTACAGA GCGCTTGTGT GCTAAAGTTG 11350
K R L C A K V
| | | | | +
.......... .......... .......... .........K R L C A - I 350
AAGAGGTCTG A 11361
E E V *
| +
E D - * 353
********************************************************************************
Query protein sequence 8 (File: 13365793)
1 AAAAAAETSE ASTAGLALAE ANINWQRRIL RSDGIPGAFR GFGTSAVGAL PGRVFALTSL
61 EVSKEMAFKY SEHFDMSEAS RIAVANGIAG LVSSIFSSAY FVPLDVICQR LMAQGLPGMA
121 TYRGPFDVIS KVVRTEGLRG LYRGFGITML TQSPASALWW SSYGGAQHAI WRSLGYGIDS
181 QKKPSQSELV VVQATAGTIA GACSSIITTP IDTIKTRLQV MDNYGRGRPS VMKTTRVLLE
241 EDGWRGFYRG FGPRFLNMSL WGTSMIVTYE LIKRLSVKPE -
Predicted gene structure (within gDNA segment 7800 to 11800):
Exon 1 8403 8720 ( 318 n); Protein 1 106 ( 106 aa); score: 0.385
Intron 1 8721 8848 ( 128 n); Pd: 0.797 Pa: 0.243
Exon 2 8849 9045 ( 197 n); Protein 107 172 ( 66 aa); score: 0.596
Intron 2 9046 9697 ( 652 n); Pd: 0.462 Pa: 0.863
Exon 3 9698 9839 ( 142 n); Protein 173 219 ( 47 aa); score: 0.591
Intron 3 9840 10515 ( 676 n); Pd: 0.947 Pa: 0.987
Exon 4 10516 10669 ( 154 n); Protein 220 272 ( 53 aa); score: 0.500
Intron 4 10670 11329 ( 660 n); Pd: 0.990 Pa: 0.997
Exon 5 11330 11361 ( 32 n); Protein 273 280 ( 8 aa); score: 0.379
MATCH 21326110+ 13365793 0.496 843 1.000 P
PGS_21326110+_13365793 (8403 8720,8849 9045,9698 9839,10516 10669,11330 11361)
Alignment:
TATCCTGTCT CGGTGGTCAA GACCCGGATG CAGGTTGCCT CTGGGGACGC CATGAGGAGG 8462
Y P V S V V K T R M Q V A S G D A M R R
. + . . + | . + | | + .
A A A A A A E T S E A S T A G L A L A E 20
AACGCGCTGG CTACCTTCAA GAACATCCTC AAGATGGACG GCGTGCCAGG GCTGTACCGG 8522
N A L A T F K N I L K M D G V P G L Y R
+ + . | | + | | + | | + |
A N I N W Q R R I L R S D G I P G A F R 40
GGGTTTGCTA CCGTTATCAT TGGGGCTGTA CCAACTAGGA TCATCTTCCT CACAGCGCTT 8582
G F A T V I I G A V P T R I I F L T A L
| | . | + | | + | | + . | | + |
G F G T S A V G A L P G R V F A L T S L 60
GAGACAACCA AAGCAGCCTC GCTTAAGCTT GTTGAGCCCT TCAAGCTGTC AGAGCCGGTG 8642
E T T K A A S L K L V E P F K L S E P V
| . + | + . | | | + | |
E V S K E M A F K Y S E H F D M S E A S 80
CGGGCTGCCT TTGCCAATGG CCTTGCTGGT CTGTCAGCGT CTACATGTTC GCAGGCTATT 8702
R A A F A N G L A G L S A S T C S Q A I
| | | | | + | | | + | | . |
R I A V A N G I A G L V S S I F S S A Y 100
TTTGTTCCAA TTGATGTGGT ATGCCTCTCA TGTGCCTTCT ATGTGATGTT GTATAGAGAA 8762
F V P I D V
| | | + | |
F V P L D V .. .......... .......... .......... .......... 106
AAAATATCTT ACAATATGTT GATGTTAAAT GCTAATTACA ATACTAGACT ACTGTTTTCA 8822
.......... .......... .......... .......... .......... .......... 106
TTCTGTTGTG CATTGGAATG TTTCAGATTA GCCAGAAATT GATGGTTCAA GGATATTCTG 8882
I S Q K L M V Q G Y S
| | + | | . | |
.......... .......... ...... I C Q R L M A Q G L P 117
GTAATGCCAG ATACAAAGGT GGATTAGATG TTGCTCGAAA GGTCATAAAG GCTGATGGCA 8942
G N A R Y K G G L D V A R K V I K A D G
| | | + | . | | | | + + . + |
G M A T Y R G P F D V I S K V V R T E G 137
TTAGGGGGCT GTACAGAGGA TTTGGACTGT CTGTTATGAC CTATGCTCCA TCCAGTGCTG 9002
I R G L Y R G F G L S V M T Y A P S S A
+ | | | | | | | | + + + + | + | + | |
L R G L Y R G F G I T M L T Q S P A S A 157
TGTGGTGGGC AAGTTATGGT TCCAGCCAGC GCATAATTTG GAGGTTAGCT TATCTGATTG 9062
V W W A S Y G S S Q R I I W S
+ | | + | | | . + | . | |
L W W S S Y G G A Q H A I W R....... .......... 172
GTTCATCGTT ATGTTCCTCT CAGCCCTGTG TACTATGTAA TATTTACGAG AAAAAGACCA 9122
.......... .......... .......... .......... .......... .......... 172
GTAATACATT TCTACTTAAT AGTTATTTGA ATTGGTACTT TCCATCTGTC CAAAACCTTT 9182
.......... .......... .......... .......... .......... .......... 172
TCAAACTTCC CCTCTTGATG CTCAAACTGC AGCTATAATT GCAATTTTGT TTTCTGATGC 9242
.......... .......... .......... .......... .......... .......... 172
TTGTTCTTCC ATGTCAATAT GTACATATCT TTTTTAGAAA ACAAGAATGC ATCTCAATGC 9302
.......... .......... .......... .......... .......... .......... 172
ATGTGCTGTA TTGTTTTGAT TAGATTTATC ATAGCGATCA ATCACATTTT CTTTACAGAT 9362
.......... .......... .......... .......... .......... .......... 172
AAAAATAGTC GGAAGGATAA GTTGGATAAC TGACCAAAGT GGAAATATGA TCTTACATAT 9422
.......... .......... .......... .......... .......... .......... 172
TTTTATCTCT GGCAGCTTAG AGAACTTAAT TACCAACCTG AAACAATGTG ATGAAGTAAC 9482
.......... .......... .......... .......... .......... .......... 172
TACACAAAAC CACATATAGT TTCATGCACT CTGCAAAACT AAATTGAAAC TCTTAGTGTG 9542
.......... .......... .......... .......... .......... .......... 172
CTCTTAATGC TGTTAAGAGG GTGTATGCAA GTTTACTGGA ATCAGTACCT TTTGTTAGTT 9602
.......... .......... .......... .......... .......... .......... 172
TATTTCTTTG TGGTTGATGG TTGAAAGATT ATATTTCTTG TCTTGATAAC TTAGCCAAAA 9662
.......... .......... .......... .......... .......... .......... 172
TAGTTAACTA TTGTGCTTTT TACATATTGG AACAGTGCTC TTGGCCATTT GCATGACAAA 9722
A L G H L H D K
+ | | + | .
.......... .......... .......... ..... S L G Y G I D S 180
GAAGAGGCTC CTAGCCAATT GAAACTAGTT GGTGTTCAAG CATCAGGGGG GGTTTTTGCC 9782
E E A P S Q L K L V G V Q A S G G V F A
+ + | | | + | | | | | + . | . . |
Q K K P S Q S E L V V V Q A T A G T I A 200
GGTGCCGTGA CCTCTTTTGT TACGACTCCC ATAGATACAA TAAAGACCAG GCTGCAGGTA 9842
G A V T S F V T T P I D T I K T R L Q
| | + | . + | | | | | | | | | | | |
G A C S S I I T T P I D T I K T R L Q ... 219
CTGTGTGACA TTCTGTTTGC TGATTACTCT TGTAATTTGA TTTGTGTGGG TATATTTTGT 9902
.......... .......... .......... .......... .......... .......... 219
GAGGCTTACC CTTGTGACTT AATGATTCTT GTCTTTACAT TTATGCTGCT CATTTGCAAT 9962
.......... .......... .......... .......... .......... .......... 219
AATTTGATTC CTTATCAATG CAATGCCACT AAGTTTAGGG GAATGGATAT TTTGTTTTGG 10022
.......... .......... .......... .......... .......... .......... 219
AAGTATATTT GATGTCAGAC TTGAAGACCT AAATGTTCTT TTATACTGAT ATTTCCTCCA 10082
.......... .......... .......... .......... .......... .......... 219
ATGGCGGGCT ATTGAGGTGC TGGACTGGAA TGCTGTCTAT ATTAAACAAT ATATACTTCT 10142
.......... .......... .......... .......... .......... .......... 219
ATGTTTACAG CTGTTTGTTT TCTGCTGACA TACCATGACC AATTTGTCAT GGTTTCAGTT 10202
.......... .......... .......... .......... .......... .......... 219
ATGAGGTCAG AAAAAAAGAA ACTTCCATTG GGAAAACTTG ATATCTATTA CTTCATTATT 10262
.......... .......... .......... .......... .......... .......... 219
TATAGTGAGT AACAAAAGTT AGCACTTTCA AACTGACTAA AGTATGCCAG GGACGTATCA 10322
.......... .......... .......... .......... .......... .......... 219
TGCATTTTAC AACATGCTCC ACATATCTCC AAATATCACA TATTACGCTT GTAGTGGTAA 10382
.......... .......... .......... .......... .......... .......... 219
ACTGATAATA CATCTACCAA CACTGAAAGT TCTCACAAGT CAGAACCCTA TATTTGACAG 10442
.......... .......... .......... .......... .......... .......... 219
TTGTGGTCTC CCTCCTTCCC TCTGCATTTG TTGCTACAGA TGATTACACT GAGTTTTGTT 10502
.......... .......... .......... .......... .......... .......... 219
TCTTGTCATT TAGGTTATGG ATAAT----- -GAAAATAAG CCAAAAGCCA GGGAAGTTGT 10556
V M D N E N K P K A R E V V
| | | | . . + | . . + . .
.......... ... V M D N Y G R G R P S V M K T T 235
CAAAAGATTG ATTGCTGAAG ATGGATGGAA AGGTTTGTAC AGAGGGTTGG GTCCAAGATT 10616
K R L I A E D G W K G L Y R G L G P R F
+ | + | | | | + | . | | | . | | | |
R V L L E E D G W R G F Y R G F G P R F 255
TTTCAGCTCA TCAGCTTGGG GAACCTCAAT GATAGTATGC TACGAGTACC TGAGTATGTT 10676
F S S S A W G T S M I V C Y E Y L
. + | | | | | | | | | | +
L N M S L W G T S M I V T Y E L I ....... 272
TCGTCTTCCC TTGTCAAATG TACACATGCA TATGTAGTGT TATATATCAC TGCATCCCAT 10736
.......... .......... .......... .......... .......... .......... 272
GCAGGTTAAT TTTAAGTACC CAGATACTTC TTCTCATTTA GAATTTAGTT AAAATGACAT 10796
.......... .......... .......... .......... .......... .......... 272
CATTCAGGTC AGTTGGCATC TCCAGTACAC TGCTTTTGTA AGTTGTATCA TAAATCCCAT 10856
.......... .......... .......... .......... .......... .......... 272
TTGCAATGAA ATTTTTGACT CAAGTTGCAG CCTGTAACTT TTCTATATTT TTCGAATAAA 10916
.......... .......... .......... .......... .......... .......... 272
GCTATCACCG TACATGAAAC CTGCTTCTGT TAATGCCAAG GAGCGCACAT TATTTCCTGT 10976
.......... .......... .......... .......... .......... .......... 272
AGACCGGCTT GGATGTTGAA CAATTGGCAC ATGCAAGTAG CAAAGAGCAG CCTTGTGCTT 11036
.......... .......... .......... .......... .......... .......... 272
GCAACAATCT GGTCCACCTG TGGATATGTT CGCTGTGAAA GAAACCAATT AGTCCTTGTA 11096
.......... .......... .......... .......... .......... .......... 272
TGAAACATGG TATTAGCGCT TCATGAATAA AACCACTGAT TCTGATTTCT TATTTTCAAT 11156
.......... .......... .......... .......... .......... .......... 272
GAATGGATGG GCATTACCAA AGTTATCATG ATTAAAGATC TATTTCATAT AAGTTTATTT 11216
.......... .......... .......... .......... .......... .......... 272
TTATACATTA GAGTTTATTT AGAGAACAAG GTATATTTAG TTTTGGTAAT TTTGTGAACT 11276
.......... .......... .......... .......... .......... .......... 272
GCACTCAGAC GACTTTGGTA TTCTTACTGT AATTTTGTTT TGTTTTCCTA CAGAGCGCTT 11336
K R L
| | |
.......... .......... .......... .......... .......... ...K R L 275
GTGTGCTAAA GTTGAAGAGG TCTGA 11361
C A K V E E V *
. | |
S V K - P E - * 281
********************************************************************************
Query protein sequence 19 (File: 23308305)
1 NLGAAEEESA QEIHLPADIN WEMLDKSKFF VLGAALFSGV SGALYPAVLM KTRQQVCHSQ
61 GSCIKTAFTL VRHEGLRGLY RGFGTSLMGT IPARALYMTA LEVTKSNVGS AAVSLGLTEA
121 KAAAVANAVG GLSAAMAAQL VWTPVDVVSQ RLMVQGSAGL VNASRCNYVN GFDAFRKIVR
181 ADGPKGLYRG FGISILTYAP SNAVWWASYS VAQRMVWGGI GCYVCKKDEE SGNNSTTMKP
241 DSKTIMAVQG VSAAIAGSVS ALITMPLDTI KTRLQVLDGE DSSNNGKRGP SIGQTVRNLV
301 REGGWTACYR GLGPRCASMS MSATTMITTY EFLKRLSAKN HDGFYSKS-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11690 ( 110 n); Protein 292 327 ( 36 aa); score: 0.126
Intron 1 11689 10943 ( 747 n); Pd: 0.998 Pa: 0.506
Exon 2 10942 10876 ( 67 n); Protein 328 348 ( 21 aa); score: 0.000
MATCH 21326110- 23308305 0.080 177 0.169 P
PGS_21326110-_23308305 (11799 11690,10942 10876)
Alignment:
ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAACAGCAT -CTCCTGGAA ACATTGTACC 11742
I I E T M K R R V K T A S W K H C T
| + | + + . | + . . | |
I G Q T - V R N L V R E G - G W T A C Y 309
ATGTCATTAC AA-C-TGGTC TCAATCATCC AGCCTCGCTA AAACAAGCCT TCACGTATAA 11684
M S L Q W S Q S S S L A K T S L H
. | | | + + | + +
R G L G P R C A S M S M S A T T M I ...... 327
AAGAATGATA AATGTGTACA TCTGATCTCG TTTATATTCA TCTAAAATGA TCAGCCTAAT 11624
.......... .......... .......... .......... .......... .......... 327
CCATATTCAG CATAAGAGGC AAAAAAAAAT ATAGGCCCCT GCATTTTTTT GAGAATACTC 11564
.......... .......... .......... .......... .......... .......... 327
TGCTCAGAGA ACCAAAGTTT GTAAGGACTA TTGTCTAGCA TCAAATGACG TCTTTAACAC 11504
.......... .......... .......... .......... .......... .......... 327
CATCAAACAT CCATGTTTAA TACTTTCACC TATGTCAGCA AGAACTGAAG CTTCCGTTGG 11444
.......... .......... .......... .......... .......... .......... 327
TGTACAGCAT TATTACACTT CTGCTAATAC AAAACTTCCA GAATTGTAGG AATTGCGTTC 11384
.......... .......... .......... .......... .......... .......... 327
TGAGTTTAAG GCAGCTCAGA AATCAGACCT CTTCAACTTT AGCACACAAG CGCTCTGTAG 11324
.......... .......... .......... .......... .......... .......... 327
GAAAACAAAA CAAAATTACA GTAAGAATAC CAAAGTCGTC TGAGTGCAGT TCACAAAATT 11264
.......... .......... .......... .......... .......... .......... 327
ACCAAAACTA AATATACCTT GTTCTCTAAA TAAACTCTAA TGTATAAAAA TAAACTTATA 11204
.......... .......... .......... .......... .......... .......... 327
TGAAATAGAT CTTTAATCAT GATAACTTTG GTAATGCCCA TCCATTCATT GAAAATAAGA 11144
.......... .......... .......... .......... .......... .......... 327
AATCAGAATC AGTGGTTTTA TTCATGAAGC GCTAATACCA TGTTTCATAC AAGGACTAAT 11084
.......... .......... .......... .......... .......... .......... 327
TGGTTTCTTT CACAGCGAAC ATATCCACAG GTGGACCAGA TTGTTGCAAG CACAAGGCTG 11024
.......... .......... .......... .......... .......... .......... 327
CTCTTTGCTA CTTGCATGTG CCAATTGTTC AACATCCAAG CCGGTCTACA GGAAATAATG 10964
.......... .......... .......... .......... .......... .......... 327
TGCGCTCCTT GGCATTAACA GAAGCAGGTT TCATGTACGG TGATA-G-CT TTATTCGAAA 10906
K Q V S C T V I L Y S K
. | + |
.......... .......... . T T Y E F L K R - L S A K 339
AATATAGAAA AGTTACAGGC TGCAACTTGA 10876
N I E K L Q A A T *
| + . + +
N H D G F Y S K S * 349
********************************************************************************
Query protein sequence 11 (File: 21326111)
1 DTTSRAAKIP SLHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVVKT RMQVASGDAM
61 RRNALATFKN ILKMDGVPGL YRGFATVIIG AVPTRIIFLT ALETTKAASL KLVEPFKLSE
121 PVRAAFANGL AGLSASTCSQ AIFVPIDVIS QKLMVQGYSG NARYKGGLDV ARKVIKADGI
181 RGLYRGFGLS VMTYAPSSAV WWASYGSSQR IIWSALGHLH DKEEAPSQLK LVGVQASGGV
241 FAGAVTSFVT TPIDTIKTRL QVMDNENKPK AREVVKRLIA EDGWKGLYRG LGPRFFSSSA
301 WGTSMIVCYE YLKRLCAKVE EV-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11740 ( 60 n); Protein 271 289 ( 19 aa); score: 0.205
Intron 1 11739 11069 ( 671 n); Pd: 0.880 Pa: 0.693
Exon 2 11068 10966 ( 103 n); Protein 290 322 ( 33 aa); score: 0.042
MATCH 21326110- 21326111 0.104 163 0.168 P
PGS_21326110-_21326111 (11799 11740,11068 10966)
Alignment:
ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAA-CAGCA TCTCCTGGAA AC-ATTGTAC 11743
I I E T M K R R V K S I S W K L Y
| . + | | + . . | | | |
A R E V - V K R L I A - E D G W K G L Y 288
CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA 11683
H
.
R ....... .......... .......... .......... .......... .......... 289
AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC 11623
.......... .......... .......... .......... .......... .......... 289
CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT 11563
.......... .......... .......... .......... .......... .......... 289
GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC 11503
.......... .......... .......... .......... .......... .......... 289
ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT 11443
.......... .......... .......... .......... .......... .......... 289
GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT 11383
.......... .......... .......... .......... .......... .......... 289
GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG 11323
.......... .......... .......... .......... .......... .......... 289
AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA 11263
.......... .......... .......... .......... .......... .......... 289
CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT 11203
.......... .......... .......... .......... .......... .......... 289
GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA 11143
.......... .......... .......... .......... .......... .......... 289
ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT 11083
.......... .......... .......... .......... .......... .......... 289
GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT 11025
R T Y P R W T R L L Q A Q G
| | + . | |
.......... .... G L G P - R F F S S - S A W G 302
GCTCTTTGCT AC-TTGCATG TGCCAATTGT TCAACATCCA AGCCGGTCTA CAGGAA-A-T 10968
C S L L C M C Q L F N I Q A G L Q E
| + + | | . + | + + |
T S M I V C Y E Y L K R L C A K V E E V 322
AA 10966
*
* 323
********************************************************************************
Query protein sequence 12 (File: 12061241)
1 DTTTRAKIPS LHHQTEINWD NLDKTKLYVV GAGMFSGVTV ALYPVSVIKT RMQVATGEAV
61 RRNAAATFRN ILKVDGVPGL YRGFGTVITG AIPARIIFLT ALETTKAASL KLVEPFKLSE
121 PVQAAFANGL GGLSASLCSQ AVFVPIDVVS QKLMVQGYSG HVRYKGGLDV AQQIIKADGI
181 RGLYRGFGLS VMTYSPSSAV WWASYGSSQR IIWSAFDRWN DKESSPSQLT IVGVQATGGI
241 IAGAVTSCVT TPIDTIKTRL QVNQNKPKAM EVVRRLIAED GWKGFYRGLG PRFFSSSAWG
301 TSMIVCYEYL KRLCAKVEEV -
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11740 ( 60 n); Protein 269 287 ( 19 aa); score: 0.176
Intron 1 11739 11069 ( 671 n); Pd: 0.880 Pa: 0.693
Exon 2 11068 10966 ( 103 n); Protein 288 320 ( 33 aa); score: 0.042
MATCH 21326110- 12061241 0.094 163 0.169 P
PGS_21326110-_12061241 (11799 11740,11068 10966)
Alignment:
ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAA-CAGCA TCTCCTGGAA AC-ATTGTAC 11743
I I E T M K R R V K S I S W K L Y
+ | . + + | + . . | | . |
A M E V - V R R L I A - E D G W K G F Y 286
CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA 11683
H
.
R ....... .......... .......... .......... .......... .......... 287
AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC 11623
.......... .......... .......... .......... .......... .......... 287
CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT 11563
.......... .......... .......... .......... .......... .......... 287
GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC 11503
.......... .......... .......... .......... .......... .......... 287
ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT 11443
.......... .......... .......... .......... .......... .......... 287
GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT 11383
.......... .......... .......... .......... .......... .......... 287
GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG 11323
.......... .......... .......... .......... .......... .......... 287
AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA 11263
.......... .......... .......... .......... .......... .......... 287
CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT 11203
.......... .......... .......... .......... .......... .......... 287
GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA 11143
.......... .......... .......... .......... .......... .......... 287
ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT 11083
.......... .......... .......... .......... .......... .......... 287
GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT 11025
R T Y P R W T R L L Q A Q G
| | + . | |
.......... .... G L G P - R F F S - S S A W G 300
GCTCTTTGCT AC-TTGCATG TGCCAATTGT TCAACATCCA AGCCGGTCTA CAGGAA-A-T 10968
C S L L C M C Q L F N I Q A G L Q E
| + + | | . + | + + |
T S M I V C Y E Y L K R L C A K V E E V 320
AA 10966
*
* 321
********************************************************************************
Query protein sequence 20 (File: 21594326)
1 SLGALMEEKR RATTSSSSSQ VHMSNDIDWQ MLDKSRFFFL GAALFSGVST ALYPIVVLKT
61 RQQVSPTRVS CANISLAIAR LEGLKGFYKG FGTSLLGTIP ARALYMTALE ITKSSVGQAT
121 VRLGLSDTTS LAVANGAAGL TSAVAAQTVW TPIDIVSQRL MVQGDVSLSK HLPGVMNSCR
181 YRNGFDAFRK ILYTDGPRGF YRGFGISILT YAPSNAVWWA SYSLAQKSIW SRYKHSYNHK
241 EDAGGSVVVQ ALSSATASGC SALVTMPVDT IKTRLQVLDA EENGRRRAMT VMQSVKSLMK
301 EGGVGACYRG LGPRWVAMSM SATTMITTYE FLKRLATKKQ K-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11740 ( 60 n); Protein 291 309 ( 19 aa); score: 0.209
Intron 1 11739 11069 ( 671 n); Pd: 0.880 Pa: 0.693
Exon 2 11068 10966 ( 103 n); Protein 310 341 ( 32 aa); score: 0.044
MATCH 21326110- 21594326 0.108 163 0.159 P
PGS_21326110-_21594326 (11799 11740,11068 10966)
Alignment:
ATAATTGAAA CCG-CATGAA AAGAAGAGTC AAAACAGCAT CTCCTGGA-A -ACATTGTAC 11743
I I E T M K R R V K T A S P G T L Y
+ + + + + | + | . . | . |
V M Q S - V K S L M K E G G V G - A C Y 308
CATGTCATTA CAACTGGTCT CAATCATCCA GCCTCGCTAA AACAAGCCTT CACGTATAAA 11683
H
.
R ....... .......... .......... .......... .......... .......... 309
AGAATGATAA ATGTGTACAT CTGATCTCGT TTATATTCAT CTAAAATGAT CAGCCTAATC 11623
.......... .......... .......... .......... .......... .......... 309
CATATTCAGC ATAAGAGGCA AAAAAAAATA TAGGCCCCTG CATTTTTTTG AGAATACTCT 11563
.......... .......... .......... .......... .......... .......... 309
GCTCAGAGAA CCAAAGTTTG TAAGGACTAT TGTCTAGCAT CAAATGACGT CTTTAACACC 11503
.......... .......... .......... .......... .......... .......... 309
ATCAAACATC CATGTTTAAT ACTTTCACCT ATGTCAGCAA GAACTGAAGC TTCCGTTGGT 11443
.......... .......... .......... .......... .......... .......... 309
GTACAGCATT ATTACACTTC TGCTAATACA AAACTTCCAG AATTGTAGGA ATTGCGTTCT 11383
.......... .......... .......... .......... .......... .......... 309
GAGTTTAAGG CAGCTCAGAA ATCAGACCTC TTCAACTTTA GCACACAAGC GCTCTGTAGG 11323
.......... .......... .......... .......... .......... .......... 309
AAAACAAAAC AAAATTACAG TAAGAATACC AAAGTCGTCT GAGTGCAGTT CACAAAATTA 11263
.......... .......... .......... .......... .......... .......... 309
CCAAAACTAA ATATACCTTG TTCTCTAAAT AAACTCTAAT GTATAAAAAT AAACTTATAT 11203
.......... .......... .......... .......... .......... .......... 309
GAAATAGATC TTTAATCATG ATAACTTTGG TAATGCCCAT CCATTCATTG AAAATAAGAA 11143
.......... .......... .......... .......... .......... .......... 309
ATCAGAATCA GTGGTTTTAT TCATGAAGCG CTAATACCAT GTTTCATACA AGGACTAATT 11083
.......... .......... .......... .......... .......... .......... 309
GGTTTCTTTC ACAGCGAACA TATCCA-C-A GGTGGACCAG ATTGTTGCAA GCACAAGGCT 11025
R T Y P R W T R L L Q A Q G
| | | . + . + .
.......... .... G L G P - R W V A M S M S - A 322
GCTCTTTGCT ACTTGCATGT GCCAATTGTT CAACATCCAA GCCGGTCTAC AGGAAATAA 10966
C S L L L A C A N C S T S K P V Y R K *
+ + + . . + |
T T M I T T Y E F L K R L A T K K Q K * 342
********************************************************************************
Query protein sequence 15 (File: 21553961)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATTAPSKS KIVMVQAAGG
241 IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
301 SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11659 ( 141 n); Protein 276 322 ( 47 aa); score: 0.023
MATCH 21326110- 21553961 0.023 141 0.146 P
PGS_21326110-_21553961 (11799 11659)
Alignment:
ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A 11744
I I E R M K E E S K Q H L L E T L
+ + + + + | + . + . . |
V V K - K L L A E D G W K G F Y R G L G 294
CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT 11686
P C H Y N W S Q S S S S L K Q A F T Y
| + + | . + | + . + |
P R F F S M S A W G T - S M I L T Y E Y 313
---AAAAGAA TGATAAATGT GTACATCTGA 11659
K R M I N V Y I *
| | + +
L K R L C A I E D * 323
********************************************************************************
Query protein sequence 16 (File: 15292889)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRFLGYGGD SDATAAPSKS KIVMVQAAGG
241 IIAGATASSI TTPLDTIKTR LQVMGHQENR PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM
301 SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11659 ( 141 n); Protein 276 322 ( 47 aa); score: 0.023
MATCH 21326110- 15292889 0.023 141 0.146 P
PGS_21326110-_15292889 (11799 11659)
Alignment:
ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A 11744
I I E R M K E E S K Q H L L E T L
+ + + + + | + . + . . |
V V K - K L L A E D G W K G F Y R G L G 294
CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT 11686
P C H Y N W S Q S S S S L K Q A F T Y
| + + | . + | + . + |
P R F F S M S A W G T - S M I L T Y E Y 313
---AAAAGAA TGATAAATGT GTACATCTGA 11659
K R M I N V Y I *
| | + +
L K R L C A I E D * 323
********************************************************************************
Query protein sequence 17 (File: 11358653)
1 DTPPTSRIAS FGQTEINWDK LDKRRFYING AGLFTGVTVA LYPVSVVKTR LQVASKEIAE
61 RSAFSVVKGI LKNDGVPGLY RGFGTVITGA VPARIIFLTA LETTKISAFK LVAPLELSEP
121 TQAAIANGIA GMTASLFSQA VFVPIDVVSQ KLMVQGYSGH ATYTGGIDVA TKIIKSYGVR
181 GLYRGFGLSV MTYSPSSAAW WASYGSSQRV IWRLAMNVLS FLEFGFATKA TIPLIQYLLL
241 LGRFLGYGGD SDATAAPSKS KIVMVQAAGG IIAGATASSI TTPLDTIKTR LQVMGHQENR
301 PSAKQVVKKL LAEDGWKGFY RGLGPRFFSM SAWGTSMILT YEYLKRLCAI ED-
Predicted gene structure (within gDNA segment 11800 to 7800):
Exon 1 11799 11659 ( 141 n); Protein 306 352 ( 47 aa); score: 0.023
MATCH 21326110- 11358653 0.023 141 0.133 P
PGS_21326110-_11358653 (11799 11659)
Alignment:
ATAATTGAAA -CCGCATG-A -AAAGAAGAG TCAAAACAGC ATCTCCTGGA AACATTGT-A 11744
I I E R M K E E S K Q H L L E T L
+ + + + + | + . + . . |
V V K - K L L A E D G W K G F Y R G L G 324
CCATGTCATT ACAACTGGTC TCAATCATCC AGC-C-TCGC TAAAACAAGC CTTCACGTAT 11686
P C H Y N W S Q S S S S L K Q A F T Y
| + + | . + | + . + |
P R F F S M S A W G T - S M I L T Y E Y 343
---AAAAGAA TGATAAATGT GTACATCTGA 11659
K R M I N V Y I *
| | + +
L K R L C A I E D * 353