MidgeBase gene description page [Pn.05895]

Outline

Link to gbrowse

Gene ID Pn.05895
Type Protein coding gene
Scaffold PnScaf5243
Start 1457
End 5830
Direction +

Sequence

Transcript: 2598 (bp)

 ATGAGAATTTGGCTTTACAAGCTCCTGCTCGTAACAACGGCCATTTGCGTGAGTCAATCGCGCGGCGACATCTCGATAAACGACGACGAGGCGAATTATGAGGACAGCAACGAGATAACCAGTGATCCGGCAAATTGTAATGTCAAGAGCAGCCTCGACGAGCTGAAGAAGATGATGACGAGCTTCCAGAAGGAGCTGTCCTATCAGAATGCCGAGATAAAGTATGTGCAGTCGCTCATCGAGAACTGCGCGTCCTGCAAAATAGTGCCCGAGACAGACACTTGCGCAACGGCCAATCCATGCTTCCATGGCGTCGAGTGCTTCGACACTGACAGCGGCATTGTTTGCGGCAAATGTCCGCGCGGCTACAAGGGCGACGGCAAGGATTGCTTCAGGATCGAGATGTGCAGCGACCGTCCGTGCTTCTCAGGCGTGTCGTGCACAGACAGCGACGATGGAGTTGTTTGCGGCGCTTGCCCTTCGGGCTACGAAGGAAACGGAAGAGTTTGCACGCTGAGACGCAACTACTGCGACGACAATCCCTGCATCGACGGCTACCAGTGTGTGCAAATGAATGAAGCGCCTTTCTACAGATGCTCCTCGTGCCCACCGGGCTTTACGAGCTTCGACGGTACAACTTGCGTCGACATCGACGAGTGCCAGACTTCGAATCCTTGCGACCCGCTTGTGAAGTGCACAAACCTCTCTCCCGGCGTCCGCTGCGAGGCTTGCCCTCCCGGTTACCAGGGCTATTACTCCGAAGGATTCTACCTCGAGTCGGTCAATGAGCTTTCATTCCAGCTTCAACGGTGCGAGGACATCGACGAGTGTCTCGAGGGAACGGCGCAGTGCGGGCAATTCGCGCTCTGCCACAATCTTCTCGGCTCCTATGAGTGCATCTGCCCCGTCGGCTACATGAAGTTGAACGGATCTGAGGACTGCGTTCTGCAGCCGGGCGCTTGCCTTGATGGAACTTTCTGCGATAAAAACGCCGTTTGCAAGCACATCGGCTACGGAAAGTACTCGTGCAGATGCAAGGTCGGCTATGCGGGAGACGGCCAGTATTGTGGCAGCGACAGAGACCTCGATGGCTTTCCCGACTACGACTTGGGATGCTCGAGTGCGACATGCAGAAAGGACAACTGCCCGAATGTTCCGAACTCGGGTCAAGAGGACGCCGACGGTGACGGAATCGGTGATGTTTGCGACAACGACATCGACAACGATGGAGTGCTTAATCACAACGACAACTGCGTTTACTTCTACAATCCCGATCAGTATGACGGTGATGGCGACAAAGTTGGCAGCGCCTGCGACAACTGCCCTGATATTTTCAATGGCGATCAGATGGACATTGACGACGATGGCCTCGGTGACAAGTGCGACCCTGACATGGACAATGATGGTATTTACAACCAAGAAGACAACTGCCCGAAAATTGCGAACAGAGATCAGAAAGACTCGGACATGGACAGAGTTGGAGACGTTTGCGACAACTGCCCGCTAACACCGAACATGGACCAAAAGGACTCTGACGCCGACCTAATCGGAGATGCTTGCGACAATAATCTGGACCGCGACCAAGACGGAATTCAAGACAACAAGGACAACTGCCCGAATGTTCCAAACGCCGATCAAGTAGACACCGATTCCGATGGAAAAGGCGACGCTTGTGACTACGACATTGACAACGACGGCATCGTCAACGAGCGCGACAACTGCCGCTTCATCAAGAACGCAGACCAGCTCGACTCGGACAACAACGGAATCGGCGACGCGTGCGAGCTCGACTCGGACGGCGACGGCCACGACAACCTCGTCGACAACTGCCCGTACAACAACAAAATCTACACGACGGACTTCCGCGAGTTCCAGCAGGTGCGCCTCGACCCGGAGGGCGACGCCCAAATCGACCCCAAATGGGTCATCTACAACAAGGGCGCCGAGATGGTGCAGACGATGAACTCGGACCCGGGCCTGGCCATCGGCTACGACTCGTTCAGCGGCGTCGACTTCGAGGGCACGCTCTTCGTGGACACGGACGTCGACGACGACTACATCGGCTTCATCTTCTCGTACCAGAGTAGCCATCGCTTCTACGCCATCATGTGGAAGCGGAACCCGCAGGAGTACTGGCACAAGAAGCCCTTCGTCGCCAACGCCGAGCCCGGCATCCAGATCAAGCTCATCGACAGCAAGACCGGGCCCGGGACGCTGCTGAGGAACAGCCTGTGGCACACCGGCAACACCACCGACCAGGTGAAGCTGCTGTGGAAGGATCCGAAGAATGTCGGCTGGAAGGAGCGCACCGCCTACCGCTGGCATCTCATCCACCGACCCAAAATCGGCCTCATCAACATGAAGATCTTCAACGGGAAGCGGCTGGTTGTCGACTCGGGTAACATCTACGACTCGACCTTGAAGGGCGGCCGCTTGGGGATGTTTGTCTTCTCGCAGGAGATGATCATCTGGTCCAACTTGGTCTACAAGTGCAATGAAAACGTTCCGGCGAAGCTGTACGCGAAGCTTCCGGCCAACCTTCAGCGCGAGTGCCAGGTCGAGGTGATGATGAAGCGAAACGAGAAGATGGAACAGAAC 

Protein: 866 (aa)

 MRIWLYKLLLVTTAICVSQSRGDISINDDEANYEDSNEITSDPANCNVKSSLDELKKMMTSFQKELSYQNAEIKYVQSLIENCASCKIVPETDTCATANPCFHGVECFDTDSGIVCGKCPRGYKGDGKDCFRIEMCSDRPCFSGVSCTDSDDGVVCGACPSGYEGNGRVCTLRRNYCDDNPCIDGYQCVQMNEAPFYRCSSCPPGFTSFDGTTCVDIDECQTSNPCDPLVKCTNLSPGVRCEACPPGYQGYYSEGFYLESVNELSFQLQRCEDIDECLEGTAQCGQFALCHNLLGSYECICPVGYMKLNGSEDCVLQPGACLDGTFCDKNAVCKHIGYGKYSCRCKVGYAGDGQYCGSDRDLDGFPDYDLGCSSATCRKDNCPNVPNSGQEDADGDGIGDVCDNDIDNDGVLNHNDNCVYFYNPDQYDGDGDKVGSACDNCPDIFNGDQMDIDDDGLGDKCDPDMDNDGIYNQEDNCPKIANRDQKDSDMDRVGDVCDNCPLTPNMDQKDSDADLIGDACDNNLDRDQDGIQDNKDNCPNVPNADQVDTDSDGKGDACDYDIDNDGIVNERDNCRFIKNADQLDSDNNGIGDACELDSDGDGHDNLVDNCPYNNKIYTTDFREFQQVRLDPEGDAQIDPKWVIYNKGAEMVQTMNSDPGLAIGYDSFSGVDFEGTLFVDTDVDDDYIGFIFSYQSSHRFYAIMWKRNPQEYWHKKPFVANAEPGIQIKLIDSKTGPGTLLRNSLWHTGNTTDQVKLLWKDPKNVGWKERTAYRWHLIHRPKIGLINMKIFNGKRLVVDSGNIYDSTLKGGRLGMFVFSQEMIIWSNLVYKCNENVPAKLYAKLPANLQRECQVEVMMKRNEKMEQN 
Type Start End Length
CDS 1457 1544 88
CDS 1628 1681 54
CDS 1825 2110 286
CDS 2244 2846 603
CDS 3166 3485 320
CDS 3558 3595 38
CDS 3661 3754 94
CDS 3994 4048 55
CDS 4107 4248 142
CDS 4321 4374 54
CDS 4433 5195 763
CDS 5727 5827 101
intron 1545 1627 83
intron 1682 1824 143
intron 2111 2243 133
intron 2847 3165 319
intron 3486 3557 72
intron 3596 3660 65
intron 3755 3993 239
intron 4049 4106 58
intron 4249 4320 72
intron 4375 4432 58
intron 5196 5726 531

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_308033 AGAP002157-PA [Anopheles gambiae str. PEST] gb|EAA03707.5| AGAP002157-PA [Anopheles gambiae str. PEST] 0.0
InterPro IPR001881 EGF-like calcium-binding
InterPro IPR000742 Epidermal growth factor-like domain
InterPro IPR013320 Concanavalin A-like lectin/glucanase, subgroup
InterPro IPR013032 EGF-like, conserved site
InterPro IPR000152 EGF-type aspartate/asparagine hydroxylation site
InterPro IPR018097 EGF-like calcium-binding, conserved site
InterPro IPR006210 Epidermal growth factor-like
InterPro IPR008859 Thrombospondin, C-terminal
InterPro IPR017897 Thrombospondin, type 3 repeat
InterPro IPR008985 Concanavalin A-like lectin/glucanase
InterPro IPR003367 Thrombospondin, type 3-like repeat
Gene Ontology(BP) GO:0007155 cell adhesion
Gene Ontology(CC) GO:0005576 extracellular region
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0005509 calcium ion binding
Pfam PF12662.2 Complement Clr-like EGF-like 1.6e-11
Pfam PF02412.13 Thrombospondin type 3 repeat 1.7e-56
Pfam PF12947.2 EGF domain 4.8e-10
Pfam PF12661.2 Human growth factor-like EGF 0.0023
Pfam PF05735.7 Thrombospondin C-terminal region 2.7e-104
Pfam PF07645.10 Calcium-binding EGF domain 1.3e-13
Pfam PF12946.2 MSP1 EGF domain 1 0.0021
Pfam PF00008.22 EGF-like domain 0.00072
Pfam PF11598.3 Cartilage oligomeric matrix protein 0.00032

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.07257
Pn.07271

Orthologous genes

Species Gene ID
P. vanderplanki Pv.11089
P. humanus PHUM370970-PA
C. quinquefasciatus CPIJ011343
A. aegypti AAEL008062
H. sapiens ENSP00000439156
H. sapiens ENSP00000357362
H. sapiens ENSP00000355751
D. plexippus DPOGS210481PA
H. sapiens ENSP00000444792
H. sapiens ENSP00000422298
A. aegypti AAEL007455
H. melpomene HMEL009724-PA
H. sapiens ENSP00000222271
C. quinquefasciatus CPIJ016496
S. invicta SI2.2.0_80288
H. sapiens ENSP00000392207
H. sapiens ENSP00000437353
N. vitripennis NV16059-PA
M. musculus ENSMUSG00000031849
A. gambiae AGAP002157
M. musculus ENSMUSG00000021702
B. mori BGIBMGA001836-TA
M. musculus ENSMUSG00000023885
C. quinquefasciatus CPIJ016495
C. quinquefasciatus CPIJ016498
D. melanogaster FBgn0031850
T. castaneum TC008608
H. sapiens ENSP00000339730
M. musculus ENSMUSG00000040152
A. mellifera GB13833-PA
C. quinquefasciatus CPIJ016497
H. sapiens ENSP00000404040
P. vanderplanki Pv.04547
H. sapiens ENSP00000260356
H. sapiens ENSP00000403792
C. quinquefasciatus CPIJ009462