MidgeBase gene description page [Pn.05895]
Outline
Gene ID | Pn.05895 |
Type | Protein coding gene |
Scaffold | PnScaf5243 |
Start | 1457 |
End | 5830 |
Direction | + |
Sequence
Transcript: 2598 (bp)
ATGAGAATTTGGCTTTACAAGCTCCTGCTCGTAACAACGGCCATTTGCGTGAGTCAATCGCGCGGCGACATCTCGATAAACGACGACGAGGCGAATTATGAGGACAGCAACGAGATAACCAGTGATCCGGCAAATTGTAATGTCAAGAGCAGCCTCGACGAGCTGAAGAAGATGATGACGAGCTTCCAGAAGGAGCTGTCCTATCAGAATGCCGAGATAAAGTATGTGCAGTCGCTCATCGAGAACTGCGCGTCCTGCAAAATAGTGCCCGAGACAGACACTTGCGCAACGGCCAATCCATGCTTCCATGGCGTCGAGTGCTTCGACACTGACAGCGGCATTGTTTGCGGCAAATGTCCGCGCGGCTACAAGGGCGACGGCAAGGATTGCTTCAGGATCGAGATGTGCAGCGACCGTCCGTGCTTCTCAGGCGTGTCGTGCACAGACAGCGACGATGGAGTTGTTTGCGGCGCTTGCCCTTCGGGCTACGAAGGAAACGGAAGAGTTTGCACGCTGAGACGCAACTACTGCGACGACAATCCCTGCATCGACGGCTACCAGTGTGTGCAAATGAATGAAGCGCCTTTCTACAGATGCTCCTCGTGCCCACCGGGCTTTACGAGCTTCGACGGTACAACTTGCGTCGACATCGACGAGTGCCAGACTTCGAATCCTTGCGACCCGCTTGTGAAGTGCACAAACCTCTCTCCCGGCGTCCGCTGCGAGGCTTGCCCTCCCGGTTACCAGGGCTATTACTCCGAAGGATTCTACCTCGAGTCGGTCAATGAGCTTTCATTCCAGCTTCAACGGTGCGAGGACATCGACGAGTGTCTCGAGGGAACGGCGCAGTGCGGGCAATTCGCGCTCTGCCACAATCTTCTCGGCTCCTATGAGTGCATCTGCCCCGTCGGCTACATGAAGTTGAACGGATCTGAGGACTGCGTTCTGCAGCCGGGCGCTTGCCTTGATGGAACTTTCTGCGATAAAAACGCCGTTTGCAAGCACATCGGCTACGGAAAGTACTCGTGCAGATGCAAGGTCGGCTATGCGGGAGACGGCCAGTATTGTGGCAGCGACAGAGACCTCGATGGCTTTCCCGACTACGACTTGGGATGCTCGAGTGCGACATGCAGAAAGGACAACTGCCCGAATGTTCCGAACTCGGGTCAAGAGGACGCCGACGGTGACGGAATCGGTGATGTTTGCGACAACGACATCGACAACGATGGAGTGCTTAATCACAACGACAACTGCGTTTACTTCTACAATCCCGATCAGTATGACGGTGATGGCGACAAAGTTGGCAGCGCCTGCGACAACTGCCCTGATATTTTCAATGGCGATCAGATGGACATTGACGACGATGGCCTCGGTGACAAGTGCGACCCTGACATGGACAATGATGGTATTTACAACCAAGAAGACAACTGCCCGAAAATTGCGAACAGAGATCAGAAAGACTCGGACATGGACAGAGTTGGAGACGTTTGCGACAACTGCCCGCTAACACCGAACATGGACCAAAAGGACTCTGACGCCGACCTAATCGGAGATGCTTGCGACAATAATCTGGACCGCGACCAAGACGGAATTCAAGACAACAAGGACAACTGCCCGAATGTTCCAAACGCCGATCAAGTAGACACCGATTCCGATGGAAAAGGCGACGCTTGTGACTACGACATTGACAACGACGGCATCGTCAACGAGCGCGACAACTGCCGCTTCATCAAGAACGCAGACCAGCTCGACTCGGACAACAACGGAATCGGCGACGCGTGCGAGCTCGACTCGGACGGCGACGGCCACGACAACCTCGTCGACAACTGCCCGTACAACAACAAAATCTACACGACGGACTTCCGCGAGTTCCAGCAGGTGCGCCTCGACCCGGAGGGCGACGCCCAAATCGACCCCAAATGGGTCATCTACAACAAGGGCGCCGAGATGGTGCAGACGATGAACTCGGACCCGGGCCTGGCCATCGGCTACGACTCGTTCAGCGGCGTCGACTTCGAGGGCACGCTCTTCGTGGACACGGACGTCGACGACGACTACATCGGCTTCATCTTCTCGTACCAGAGTAGCCATCGCTTCTACGCCATCATGTGGAAGCGGAACCCGCAGGAGTACTGGCACAAGAAGCCCTTCGTCGCCAACGCCGAGCCCGGCATCCAGATCAAGCTCATCGACAGCAAGACCGGGCCCGGGACGCTGCTGAGGAACAGCCTGTGGCACACCGGCAACACCACCGACCAGGTGAAGCTGCTGTGGAAGGATCCGAAGAATGTCGGCTGGAAGGAGCGCACCGCCTACCGCTGGCATCTCATCCACCGACCCAAAATCGGCCTCATCAACATGAAGATCTTCAACGGGAAGCGGCTGGTTGTCGACTCGGGTAACATCTACGACTCGACCTTGAAGGGCGGCCGCTTGGGGATGTTTGTCTTCTCGCAGGAGATGATCATCTGGTCCAACTTGGTCTACAAGTGCAATGAAAACGTTCCGGCGAAGCTGTACGCGAAGCTTCCGGCCAACCTTCAGCGCGAGTGCCAGGTCGAGGTGATGATGAAGCGAAACGAGAAGATGGAACAGAAC
Protein: 866 (aa)
MRIWLYKLLLVTTAICVSQSRGDISINDDEANYEDSNEITSDPANCNVKSSLDELKKMMTSFQKELSYQNAEIKYVQSLIENCASCKIVPETDTCATANPCFHGVECFDTDSGIVCGKCPRGYKGDGKDCFRIEMCSDRPCFSGVSCTDSDDGVVCGACPSGYEGNGRVCTLRRNYCDDNPCIDGYQCVQMNEAPFYRCSSCPPGFTSFDGTTCVDIDECQTSNPCDPLVKCTNLSPGVRCEACPPGYQGYYSEGFYLESVNELSFQLQRCEDIDECLEGTAQCGQFALCHNLLGSYECICPVGYMKLNGSEDCVLQPGACLDGTFCDKNAVCKHIGYGKYSCRCKVGYAGDGQYCGSDRDLDGFPDYDLGCSSATCRKDNCPNVPNSGQEDADGDGIGDVCDNDIDNDGVLNHNDNCVYFYNPDQYDGDGDKVGSACDNCPDIFNGDQMDIDDDGLGDKCDPDMDNDGIYNQEDNCPKIANRDQKDSDMDRVGDVCDNCPLTPNMDQKDSDADLIGDACDNNLDRDQDGIQDNKDNCPNVPNADQVDTDSDGKGDACDYDIDNDGIVNERDNCRFIKNADQLDSDNNGIGDACELDSDGDGHDNLVDNCPYNNKIYTTDFREFQQVRLDPEGDAQIDPKWVIYNKGAEMVQTMNSDPGLAIGYDSFSGVDFEGTLFVDTDVDDDYIGFIFSYQSSHRFYAIMWKRNPQEYWHKKPFVANAEPGIQIKLIDSKTGPGTLLRNSLWHTGNTTDQVKLLWKDPKNVGWKERTAYRWHLIHRPKIGLINMKIFNGKRLVVDSGNIYDSTLKGGRLGMFVFSQEMIIWSNLVYKCNENVPAKLYAKLPANLQRECQVEVMMKRNEKMEQN
Type | Start | End | Length |
CDS |
1457 |
1544 |
88 |
CDS |
1628 |
1681 |
54 |
CDS |
1825 |
2110 |
286 |
CDS |
2244 |
2846 |
603 |
CDS |
3166 |
3485 |
320 |
CDS |
3558 |
3595 |
38 |
CDS |
3661 |
3754 |
94 |
CDS |
3994 |
4048 |
55 |
CDS |
4107 |
4248 |
142 |
CDS |
4321 |
4374 |
54 |
CDS |
4433 |
5195 |
763 |
CDS |
5727 |
5827 |
101 |
intron |
1545 |
1627 |
83 |
intron |
1682 |
1824 |
143 |
intron |
2111 |
2243 |
133 |
intron |
2847 |
3165 |
319 |
intron |
3486 |
3557 |
72 |
intron |
3596 |
3660 |
65 |
intron |
3755 |
3993 |
239 |
intron |
4049 |
4106 |
58 |
intron |
4249 |
4320 |
72 |
intron |
4375 |
4432 |
58 |
intron |
5196 |
5726 |
531 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_308033 |
AGAP002157-PA [Anopheles gambiae str. PEST] gb|EAA03707.5| AGAP002157-PA [Anopheles gambiae str. PEST] |
0.0 |
InterPro |
IPR001881 |
EGF-like calcium-binding |
|
InterPro |
IPR000742 |
Epidermal growth factor-like domain |
|
InterPro |
IPR013320 |
Concanavalin A-like lectin/glucanase, subgroup |
|
InterPro |
IPR013032 |
EGF-like, conserved site |
|
InterPro |
IPR000152 |
EGF-type aspartate/asparagine hydroxylation site |
|
InterPro |
IPR018097 |
EGF-like calcium-binding, conserved site |
|
InterPro |
IPR006210 |
Epidermal growth factor-like |
|
InterPro |
IPR008859 |
Thrombospondin, C-terminal |
|
InterPro |
IPR017897 |
Thrombospondin, type 3 repeat |
|
InterPro |
IPR008985 |
Concanavalin A-like lectin/glucanase |
|
InterPro |
IPR003367 |
Thrombospondin, type 3-like repeat |
|
Gene Ontology(BP) |
GO:0007155 |
cell adhesion |
|
Gene Ontology(CC) |
GO:0005576 |
extracellular region |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Gene Ontology(MF) |
GO:0005509 |
calcium ion binding |
|
Pfam |
PF12662.2 |
Complement Clr-like EGF-like |
1.6e-11 |
Pfam |
PF02412.13 |
Thrombospondin type 3 repeat |
1.7e-56 |
Pfam |
PF12947.2 |
EGF domain |
4.8e-10 |
Pfam |
PF12661.2 |
Human growth factor-like EGF |
0.0023 |
Pfam |
PF05735.7 |
Thrombospondin C-terminal region |
2.7e-104 |
Pfam |
PF07645.10 |
Calcium-binding EGF domain |
1.3e-13 |
Pfam |
PF12946.2 |
MSP1 EGF domain 1 |
0.0021 |
Pfam |
PF00008.22 |
EGF-like domain |
0.00072 |
Pfam |
PF11598.3 |
Cartilage oligomeric matrix protein |
0.00032 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Gene ID |
Pn.07257 |
Pn.07271 |
Orthologous genes
Species |
Gene ID |
P. vanderplanki |
Pv.11089 |
P. humanus |
PHUM370970-PA |
C. quinquefasciatus |
CPIJ011343 |
A. aegypti |
AAEL008062 |
H. sapiens |
ENSP00000439156 |
H. sapiens |
ENSP00000357362 |
H. sapiens |
ENSP00000355751 |
D. plexippus |
DPOGS210481PA |
H. sapiens |
ENSP00000444792 |
H. sapiens |
ENSP00000422298 |
A. aegypti |
AAEL007455 |
H. melpomene |
HMEL009724-PA |
H. sapiens |
ENSP00000222271 |
C. quinquefasciatus |
CPIJ016496 |
S. invicta |
SI2.2.0_80288 |
H. sapiens |
ENSP00000392207 |
H. sapiens |
ENSP00000437353 |
N. vitripennis |
NV16059-PA |
M. musculus |
ENSMUSG00000031849 |
A. gambiae |
AGAP002157 |
M. musculus |
ENSMUSG00000021702 |
B. mori |
BGIBMGA001836-TA |
M. musculus |
ENSMUSG00000023885 |
C. quinquefasciatus |
CPIJ016495 |
C. quinquefasciatus |
CPIJ016498 |
D. melanogaster |
FBgn0031850 |
T. castaneum |
TC008608 |
H. sapiens |
ENSP00000339730 |
M. musculus |
ENSMUSG00000040152 |
A. mellifera |
GB13833-PA |
C. quinquefasciatus |
CPIJ016497 |
H. sapiens |
ENSP00000404040 |
P. vanderplanki |
Pv.04547 |
H. sapiens |
ENSP00000260356 |
H. sapiens |
ENSP00000403792 |
C. quinquefasciatus |
CPIJ009462 |