MidgeBase gene description page [Pn.11881]

Outline

Gene ID: Pn.11881
Type: Protein-coding gene
Scaffold: PnScaf14803
Start: 31464
End: 39586
Direction: - (minus strand)

Sequence

Transcript: 4920 (bp)

 ATGAGGTTTCTACTGTTGCTGCTGCTCGTGATTACGACTCGAGCGCAACGTCCACTAAATACCACAAACACATCAGGCGTTCTCACGGAGCCATTTTTCATCAAGCCAACGAGCACATCCACTTCCAAGTACCATCCGGAGCAGCGCGGAGGTGGCAATAATCCCTACTTCACGTCGAACGACTCGCCAGGAAAGGGCCGAGGCAGCGGCGACCGTCTCCTCGCTGTGCAGCCACACTCAAATGCCGAAATCTTCATTCCGTCGCTGAACGGCATCGTAGGCGAGCATGGCAGTGCCCAGTCGCAAGTGCCGGCAGCCAATCGACTGCACAAGGTCGTCAAGTCGCGCCAGAACAAATTCACGCAGCTCTACCAGCTCACAAAGAATCCCAACTTCACGCTCAACAATCCGAACCGCGAGAAGAGCCAGAACTACTCGTATGGCGGCCACCGCGGCGACTATCAGCACGGCAACATCATCACGTCCAAGAAGGAGGCGCGACCCGAGAAATTCTTCAGCGGACCCAACGACAACTTCGACACGCAGAAGACGTCGTGGTCGGACGGCCGGGCTTTCCAGGAAGTGCACAAACACTTGAGCAACAAGAACCACATGCACACGCCGTCGCCGCCCTCGTACACCGTCGACGACATCGATTTTGACGAGAAGCTCGGCGTGAAGTGCACCTTCGAGAAGCCGTGCGCGTGGACCTATGACACTGACGTGGTCGGCACAAATTTCGAGGTGACCACCGGAGCGAATCTCACCAACGCCAACATCACAGGCGTCATGCCGGGACCGTCGGCCGATAATTTGAAGGACGCGAACGGACACTTCCTGCACTTGCCGCTGACGCCGAACACGACTACACGCATACTGCGATCGCCCGTGTTTAGCTCGACGCGCGAGCGCTGCTACCTCGAGGTCTTCATGCATCAGAGCTCGATGGCCTCTGGCTCGATAAAGGTCGTCATCGAGCCGGCGGCGTCGCGCGAAAACTCCTGGGTGCCCGCTGAAATTGTCGGTGACGACTTGAGGAAGTGGAAGTATCATCATTTTGAGGTTGACAGGATAACAAACGACTTCCGCATTCTCTTCGAGATTGTTCCGAACGGCTTGGGCGGTGCATTGCGCGGCCACGTGAGCATCGACAACCTCAAGATGAGCAATTGTTTTGCTGATCCGCCGCGCAAGGACTCGTGCAAGCTCTCGCAAATCAAGTGCAGTCAGAGCAAGATTCCGGTGTGCATCGAGAACAACCGCATTTGCGACTTGGTGCAGGATTGCGACGACGCTGAGGACGAAAATCTCAATTGCGACAAGATACCGTTCGGAGGACTGTGCGACTTCGAGAACGGCAACTGCGGCTGGCAGAACTCGGGCAAGGCGATCATGGACTGGAAACGACACCTCGGGCCGACACCGACCGAGAAGACCGGGCCCGAGTTCGACCACACGTTCCAGCACACTAACAAGTCCGGCCACTATATGTTTGTAAATATGAACCAGCACGCCGACGACCCCGAGCGCAAAAACCTTGCCGGCTTCGCGAGCAACGCCATCATTAATTCGGTCGTGTTTAACCCGCCGCCGCCCTGCCACTCGAACGTCTCGTCGCCATACCACAACACATGCATGGCACGCCTCTTCGTCCACCAGTTCGGCCACAATCCCGGCAGCTTCAACGTGTCCGTTGTGGAGATGAAGGCGAAGGAGAACATCACGACCACGCTGTGGTGGAGCTCGCGTTCGATCGGCGACAAATGGGAGCGCGTTGACGTGGTGCTGCCGAACATTACGTCCAAGTACTACTTCCAGATCGAGGCGCGCAAAGGCATGCGGATCTATAGCGACGTCGCCATCGATGACTTCTCGATGAGTCCCGAGTGCTTCGGCTTCAACATTCCGGCCGAGCACCTTAACAACTACAACTACTGGGACCCGAGAATCGGCATTCACAAGAAACCCCATAGCGATTTTGTAGATAAGAAATTTTATG
AACTCTCAACATGTGGTTCGCGTGGAATATTCGGACCCACACCTGAAGCTTGCTCAACTCACTATAATAATACTGAAATAATGAATCATGTGCGTGTGTCTGACAAAGTACCTTTCAAAGGCGTGCAAGTGTGGAAGGTGCCGAGTGAAGGCTACTACACAATAATAGCGAAAGGCGCAAGCGGCGGACTCGGCAGCGGCGGAGTGTCGTCATCGCGCGGAGCGATTGTGAGCTCAGTCATGGAGCTCCACAAGGACGAGGAGATTTATATCCTGGTCGGTCAGAAGGGCGAGAATGCGTGCATCAAGTCGATGGGCATTTCCGACGAGGGCTGTTCCCCAAAATTTCCCAAATACAACGACCACCCGAAGGACTCCGTCTCAATTATCAATCAACTGCGCAATGGCCTCCTCGAAAACGGAGCCGGCGGCGGCGGAGGAGCATCGTTTGTGTTTCTGCTGAATCAAGTGAACAAAGCCGTGCCGCTGATTGTGGCAGGCGGAGGTGGCGGTTTGGGAATCGGCCGCTATCTGGACGAGGACATTCAACAGGCTAAGGGAATCGTAATCGAGCGCGGCGACGTCAGTGGCCAGGTGGAGTACGACAATGGCGACTCTCTTGTCGCAGGACCGGGAGGCGGTTGGCGTCCGCGTCAGGACTCGGCGCTCGACTCTCACCACGGCGCATCGCTGCTGGAAGGTGCGCGTGGCGGAATCGCCTGCTACCAGACAGTCGGCCAGGATCACGGCTACGGAAATGGAGCGTTTGGCGGTGGCGGTGGCTCTTGTCAGTCGGGCGGCGGAGGCGGCGGTTATGCTGGCGGCAACTCGATGCTCAACTACACGACTAACTCGTCGGCAAACCCGAATGCATTCTCCACCAACGGCGAGGGCGGCTCTTCCTATTTGGGCCTTACGCGCAGCATCCACGAGCTCTCTTTCGTCTATCCGGGCTCGAACAGCGGCCACGGCTCAGTCATTCTCATTCCGGCCATCGAGGGATGCGGCTGCGATTACCGGTGTGTGGCGCTCGACGAATATCGCTCGCTCGTCGCCTGCATCTGTCCCGAGGGGTGGCGTCTCAAGCCCGACAACCTCACCTCATGCGAAATAATCGAAGAGGTCAACAATGACAACATTATAATCTACATACTCGTAGTTGTATCAGTCTTTTTAATCATCTCTCTTGTACTATTGATGTTGATGTTATATAATCGTTTCCAAAGAAGACGACAAGCGGAGATGCGCCACAAGATGCTTCTTGAACAAGAAGTTCAATTAAGTCGATTGCGACACACAGACGACTCAGCTTTAACTAATTTTAATCCAAATTATGGTTGTGATGGTATCTTGAATGGTGGAAATGTTGATGTTAAAAGTTTACCCCAAGTTGCGCGCGAAAGCCTGCGACTCGTCAAAGCCCTCGGACAAGGGGCATTTGGTGAGGTGTACCAAGGCCTTTATCGCCACCGCGATGGCGATACCGTCGAGATGCCTGTCGCTGTCAAAACCCTGCCCGAATTGTCGACCGGCCAAGCGGAGTCGGACTTCCTCATGGAGGCGGCAATCATGGCGAAGTTCAATCACCCGAACATTGTCCATCTCATTGGCGTGTGCTTTGACCGCCATCCGCGATTCATCGTGCTCGAGCTGCTCGCGGGCGGCGACCTGAAGAACTTTTTGCGCGAGGGCCGCAACAAACCCGAACGACCGTCGCCGCTGACGATGAAGGACTTGATTTTCTGCGCACTGGACGTCGCCAAGGGATGCCGCTACATGGAGAGCAAGCGCTTCATTCATCGTGACATCGCCGCCAGAAATTGCCTCCTCAGCAGCAAGGGTCCCGGTCGCGTCGTCAAAATTGCAGACTTCGGCATGGCGCGCGACATTTACCGGTCCGATTACTATCGCAAGGGCGGCAAGGCAATGCTGCCGATAAAATGGATGCCACCTGAGGCATTCTTGGACGGAATCTTCACGTCGAAGACAGACATCTGGAGC
TATGGCGTGCTGTTGTGGGAGGTGTTTTCGCTCGGTCTAATGCCTTACACAGGCCTACCGAATCGTGACGTCATGCAGCTCGTCACGGGAGGAGGACGACTCGATGCACCGCCCGGATGTCCGCCAGCGATCTACCGCATCATGGCCGAGTGCTGGAACCCGACTCCCGAAGCCCGTCCAACCTTCTCGGTACTGCTCGAGCGACTGACCGCGTGCACGCAAGACCCCGAAATCATGAATGCACCGCTGCCGAGTTTCTTCCGCCCACAGTCGATGGAGCGCGACGCCACGATCATGCGGCCATCGGGCAACGATGACTTCTGCCTGCAGGTACCAAACTCGTCCGACTACTTGATCCCGCTGCCCGACTCGCGCGCCATCGCTGAGCGCTTACTCAATGAGGCGACGTGCGTCACACTGCCCGACTCGCTGCCCACCTATACACTCACCAACTCGCTCAAGATGCCCGACACCAACTGCTGGGAGACGTCCTTCTCGAATCCCGCCAAGATGAGCAACTCCGACAGCCTCGACGACCGCTTGATAAGTCTGGACACGCCGACGACGATCCAGCCTCCGCAGGCCTTCTGCAACAGTCCCGTGCAGACGCTTTCTTCGAACGCCGCTAACAACAACAACAACAATGTTAGCGGAAAGAACAGCGCGATCACACTCGACCCGGCCGCCCTTCAGCAGGGCTACGCGAATGTGAAGATGATGAACGGCGGCACGGACAAGCTAGAGGGCGACATGTCAAAGTTCAACGGCGGCTCGACGATGGTGATGATGAACAATAACAACGGAACGATTCCGAATGGCAACGGCATGCTTCCGAAAGACAAGAACTCGAACCAGCCGTTCTCGATACCCATGCAGGGCTACAACGAGAGCTTCAAGGAAAACCACGCCGAGATCAGCTGC 

Protein: 1640 (aa)

 MRFLLLLLLVITTRAQRPLNTTNTSGVLTEPFFIKPTSTSTSKYHPEQRGGGNNPYFTSNDSPGKGRGSGDRLLAVQPHSNAEIFIPSLNGIVGEHGSAQSQVPAANRLHKVVKSRQNKFTQLYQLTKNPNFTLNNPNREKSQNYSYGGHRGDYQHGNIITSKKEARPEKFFSGPNDNFDTQKTSWSDGRAFQEVHKHLSNKNHMHTPSPPSYTVDDIDFDEKLGVKCTFEKPCAWTYDTDVVGTNFEVTTGANLTNANITGVMPGPSADNLKDANGHFLHLPLTPNTTTRILRSPVFSSTRERCYLEVFMHQSSMASGSIKVVIEPAASRENSWVPAEIVGDDLRKWKYHHFEVDRITNDFRILFEIVPNGLGGALRGHVSIDNLKMSNCFADPPRKDSCKLSQIKCSQSKIPVCIENNRICDLVQDCDDAEDENLNCDKIPFGGLCDFENGNCGWQNSGKAIMDWKRHLGPTPTEKTGPEFDHTFQHTNKSGHYMFVNMNQHADDPERKNLAGFASNAIINSVVFNPPPPCHSNVSSPYHNTCMARLFVHQFGHNPGSFNVSVVEMKAKENITTTLWWSSRSIGDKWERVDVVLPNITSKYYFQIEARKGMRIYSDVAIDDFSMSPECFGFNIPAEHLNNYNYWDPRIGIHKKPHSDFVDKKFYELSTCGSRGIFGPTPEACSTHYNNTEIMNHVRVSDKVPFKGVQVWKVPSEGYYTIIAKGASGGLGSGGVSSSRGAIVSSVMELHKDEEIYILVGQKGENACIKSMGISDEGCSPKFPKYNDHPKDSVSIINQLRNGLLENGAGGGGGASFVFLLNQVNKAVPLIVAGGGGGLGIGRYLDEDIQQAKGIVIERGDVSGQVEYDNGDSLVAGPGGGWRPRQDSALDSHHGASLLEGARGGIACYQTVGQDHGYGNGAFGGGGGSCQSGGGGGGYAGGNSMLNYTTNSSANPNAFSTNGEGGSSYLGLTRSIHELSFVYPGSNSGHGSVILIPAIEGCGCDYRCVALDEYRSLVACICPEGWRLKPDNLTSCEIIEEVNNDNIIIYILVVVSVFLIISLVLLMLMLYNRFQRRRQAEMRHKMLLEQEVQLSRLRHTDDSALTNFNPNYGCDGILNGGNVDVKSLPQVARESLRLVKALGQGAFGEVYQGLYRHRDGDTVEMPVAVKTLPELSTGQAESDFLMEAAIMAKFNHPNIVHLIGVCFDRHPRFIVLELLAGGDLKNFLREGRNKPERPSPLTMKDLIFCALDVAKGCRYMESKRFIHRDIAARNCLLSSKGPGRVVKIADFGMARDIYRSDYYRKGGKAMLPIKWMPPEAFLDGIFTSKTDIWSYGVLLWEVFSLGLMPYTGLPNRDVMQLVTGGGRLDAPPGCPPAIYRIMAECWNPTPEARPTFSVLLERLTACTQDPEIMNAPLPSFFRPQSMERDATIMRPSGNDDFCLQVPNSSDYLIPLPDSRAIAERLLNEATCVTLPDSLPTYTLTNSLKMPDTNCWETSFSNPAKMSNSDSLDDRLISLDTPTTIQPPQAFCNSPVQTLSSNAANNNNNNVSGKNSAITLDPAALQQGYANVKMMNGGTDKLEGDMSKFNGGSTMVMMNNNNGTIPNGNGMLPKDKNSNQPFSIPMQGYNESFKENHAEISC 
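
The transcript and protein sequences above can be cross-checked by translation. The sketch below is illustrative and is not part of the MidgeBase record: it assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene, and the helper name translate is hypothetical. It translates the first 30 and last 9 nucleotides of the transcript, which should match the first 10 and last 3 residues of the protein.

```python
# Standard genetic code (NCBI translation table 1), codon positions ordered T, C, A, G.
# Assumption: this table applies to the Pn.11881 coding sequence.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; whitespace and line breaks are ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 and last 9 nucleotides, copied from the transcript sequence above.
head = "ATGAGGTTTCTACTGTTGCTGCTGCTCGTG"
tail = "ATCAGCTGC"

print(translate(head))  # MRFLLLLLLV
print(translate(tail))  # ISC
```

Both fragments agree with the protein sequence (MRFLLLLLLV... and ...ISC). The last codon, TGC, encodes cysteine, so the 4920 bp sequence as listed does not include a stop codon, which is consistent with 4920 / 3 = 1640 residues. Because translate strips whitespace, the full transcript can be passed in directly even though it is wrapped over several lines.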
Type Start End Length
CDS 31467 31853 387
CDS 31974 32404 431
CDS 32502 32658 157
CDS 32774 32853 80
CDS 33004 33452 449
CDS 33708 33915 208
CDS 34014 34112 99
CDS 34184 34519 336
CDS 34953 35090 138
CDS 35173 35350 178
CDS 35703 36000 298
CDS 36090 36255 166
CDS 36904 37578 675
CDS 37956 38203 248
CDS 38517 39586 1070
intron 31854 31973 120
intron 32405 32501 97
intron 32659 32773 115
intron 32854 33003 150
intron 33453 33707 255
intron 33916 34013 98
intron 34113 34183 71
intron 34520 34952 433
intron 35091 35172 82
intron 35351 35702 352
intron 36001 36089 89
intron 36256 36903 648
intron 37579 37955 377
intron 38204 38516 313
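
The feature table above can be checked for internal consistency with a few lines of arithmetic. The sketch below is illustrative and not part of the MidgeBase record; the function name check_features is hypothetical. It verifies that each Length equals End - Start + 1, that CDS and intron features alternate without gaps, and compares the totals with the transcript length (4920 bp) and the gene span (31464 to 39586).

```python
# Feature table copied from the record: (type, start, end, length).
FEATURES = [
    ("CDS", 31467, 31853, 387), ("CDS", 31974, 32404, 431),
    ("CDS", 32502, 32658, 157), ("CDS", 32774, 32853, 80),
    ("CDS", 33004, 33452, 449), ("CDS", 33708, 33915, 208),
    ("CDS", 34014, 34112, 99), ("CDS", 34184, 34519, 336),
    ("CDS", 34953, 35090, 138), ("CDS", 35173, 35350, 178),
    ("CDS", 35703, 36000, 298), ("CDS", 36090, 36255, 166),
    ("CDS", 36904, 37578, 675), ("CDS", 37956, 38203, 248),
    ("CDS", 38517, 39586, 1070),
    ("intron", 31854, 31973, 120), ("intron", 32405, 32501, 97),
    ("intron", 32659, 32773, 115), ("intron", 32854, 33003, 150),
    ("intron", 33453, 33707, 255), ("intron", 33916, 34013, 98),
    ("intron", 34113, 34183, 71), ("intron", 34520, 34952, 433),
    ("intron", 35091, 35172, 82), ("intron", 35351, 35702, 352),
    ("intron", 36001, 36089, 89), ("intron", 36256, 36903, 648),
    ("intron", 37579, 37955, 377), ("intron", 38204, 38516, 313),
]


def check_features(features, gene_start, gene_end):
    """Validate the table and return summary totals."""
    # Coordinates are 1-based and inclusive, so length = end - start + 1.
    for kind, start, end, length in features:
        assert end - start + 1 == length, (kind, start, end, length)

    # Sorted by start, features must be contiguous and alternate CDS/intron.
    ordered = sorted(features, key=lambda f: f[1])
    for prev, nxt in zip(ordered, ordered[1:]):
        assert nxt[1] == prev[2] + 1, (prev, nxt)
        assert nxt[0] != prev[0], (prev, nxt)

    cds_total = sum(f[3] for f in features if f[0] == "CDS")
    intron_total = sum(f[3] for f in features if f[0] == "intron")
    gene_span = gene_end - gene_start + 1
    return {
        "cds_total": cds_total,
        "intron_total": intron_total,
        "codons": cds_total // 3,
        "gene_span": gene_span,
        "uncovered": gene_span - cds_total - intron_total,
    }


summary = check_features(FEATURES, gene_start=31464, gene_end=39586)
print(summary)
```

The CDS segments sum to 4920 bp (1640 codons), matching the transcript and protein lengths, and the introns sum to 3200 bp. The gene span is 8123 bp, which leaves 3 bp (31464 to 31466) outside the listed features. On the minus strand these lie immediately downstream of the last codon, so they may correspond to the stop codon, but that is an inference and is not stated in the record.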

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001648125 leukocyte receptor tyrosine protein kinase [Aedes aegypti] (also gb|EAT44705.1|) 0.0
InterPro IPR002172 Low-density lipoprotein (LDL) receptor class A repeat
InterPro IPR017441 Protein kinase, ATP binding site
InterPro IPR002290 Serine/threonine- / dual-specificity protein kinase, catalytic domain
InterPro IPR000998 MAM domain
InterPro IPR023415 Low-density lipoprotein (LDL) receptor class A, conserved site
InterPro IPR001245 Serine-threonine/tyrosine-protein kinase catalytic domain
InterPro IPR011009 Protein kinase-like domain
InterPro IPR002011 Tyrosine-protein kinase, receptor class II, conserved site
InterPro IPR000719 Protein kinase, catalytic domain
InterPro IPR008266 Tyrosine-protein kinase, active site
InterPro IPR020635 Tyrosine-protein kinase, catalytic domain
InterPro IPR008985 Concanavalin A-like lectin/glucanase
Gene Ontology(BP) GO:0007169 transmembrane receptor protein tyrosine kinase signaling pathway
Gene Ontology(BP) GO:0006468 protein phosphorylation
Gene Ontology(CC) GO:0016020 membrane
Gene Ontology(MF) GO:0004713 protein tyrosine kinase activity
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0005524 ATP binding
Gene Ontology(MF) GO:0004672 protein kinase activity
Gene Ontology(MF) GO:0016772 transferase activity, transferring phosphorus-containing groups
Gene Ontology(MF) GO:0004714 transmembrane receptor protein tyrosine kinase activity
Pfam PF00069.20 Protein kinase domain 3.5e-40
Pfam PF12810.2 Glycine rich protein 3.4e-16
Pfam PF00629.18 MAM domain 4.9e-60
Pfam PF07714.12 Protein tyrosine kinase 1.8e-91
Pfam PF00057.13 Low-density lipoprotein receptor domain class A 7.4e-05
Pfam PF01484.12 Nematode cuticle collagen N-terminal domain 0.017

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
H. sapiens ENSP00000263800
H. melpomene HMEL003801-PA
P. humanus PHUM583770-PA
C. quinquefasciatus CPIJ019761
D. melanogaster FBgn0040505
H. sapiens ENSP00000347293
A. mellifera GB14602-PA
C. quinquefasciatus CPIJ010280
S. invicta SI2.2.0_07727
H. sapiens ENSP00000392196
H. sapiens ENSP00000373700
M. musculus ENSMUSG00000027297
A. aegypti AAEL003969
B. mori BGIBMGA000152-TA
D. plexippus DPOGS205518PA
A. gambiae AGAP012070
N. vitripennis NV14367-PA
T. castaneum TC002114
M. musculus ENSMUSG00000055471
P. vanderplanki Pv.04037
H. sapiens ENSP00000458111