MidgeBase gene description page [Pn.01023]

Outline

Gene ID    Pn.01023
Type       Protein coding gene
Scaffold   PnScaf1029
Start      2793
End        11060
Direction  +

Sequence

Transcript: 3594 (bp)

 ATGGAGTTCTGCCTCCTCTACAACTTCTTCGCTCGCTCTCCTGATGCGGGTTGCGATTGTGGTTGCGTTGATAGCCAAACTATTATCAGCACAACGAGTACACTGAATTCATCACAAACGCAGCAACAGCAAGTCAACATGGGTGCACAATCCAACGGCTCCAAAACGGCGCATTCAAAGGTGGCGAACGGCACGAACGCTGGAACGAGCGGCAACACCAATCAGACGACGACGAGCAAGCGATCGAGCAGCGGCGCTGACGGCGACTATCAGTTGGTGCAGCACGAGGTCCTCTACTCCCAATCGAGTCAATATGAGGTCTTGGAGTTTCTTGGACGAGGAACCTTCGGTCAAGTCTGCAAGTGCTGGAAGAAGGGCACCAGCGACATTGTCGCAATCAAAATCTTGAAGAACCATCCGTCGTACGCGCGACAAGGCCAAATCGAGGTCTCCATTCTGTCACGCCTCAGCCAAGAGAATGCCGACGAGTTCAACTTTGTGCGAGCGTTCGAGTGCTTCCAGCACAAGAACCACACATGTCTGGTGTTTGAAATGCTCGAGCAGAATTTGTATGATTTCCTGAAACAAAATAAATTTTCGCCATTACCGCTGAAGTACATCAGACCGATATTGCAACAGGTTTTAACAGCGCTGCTAAAGCTGAAGCAATTGGGTCTGATTCACGCCGATTTGAAGCCGGAAAATATTATGCTTGTGGATCCCGTTCGACAACCATACAGGGTAAAAGTGATTGATTTTGGAAGTGCATCGCACGTCAGCAAGACCGTGTGCAACACATACCTGCAATCGCGTTATTATCGTGCGCCGGAAATCATTCTCGGCCTGCCTTTCTGTGAAGCAATCGATATGTGGTCGCTGGGATGCGTCGTTGCTGAGCTCTTCCTCGGATGGCCTCTCTATCCGGGCTCATCTGAATATGATCAAATCAGATACATATCACAGACGCAAGGTCTACCGACCGAGCACATGCTGAACAGCGCGAGCAAGACGGAGAAGTTCTTCTACCGCGACGAGGACTCAACATATCCCTTCTGGCGCCTCATCTCGCCCGAGGAGAACGAGATCATGACGAACGTGAAGAGTAAGGAGGCGCGCAAGTACATCTTCAACTGCCTCGACGACATCGGCCAGGTGAACGTGCAGATGGACATCGAGAGCAGCCAACTGCTCGCCGAGAAAATCGACCGGCGCGAGTTCATCGACCTCCTTAAGCGGATGCTGACGATCGACCAGGAGCGCCGCATACAGCCGGCGCAGGCCCTGCGCCATCCCTTCGTCACGCTCTCGCACCTCGTCGACTACGCGCACTGCAACAACGTCAAGGCGAGCGTGCAGATGATGGAGGTGTGCCGGCGCGACCCGGTGATGCACTCGGTGCCGCAGGCAACCGCCACGCTCGTCACCAACTTCGGGCCCAACACTGCCGACAACATGACGTTCACGATCAACAATCAGCTGACGAACCAGGTGCAGCGGCTGGTGCGCGAGCGCAACCCCGCCTCCTACGACAACGTCTACCAGTTCTACGGGACGCCGCGCAACGTCGTGCGCCAGTACGCGAACACGCGCGCCGCCGAGGTCCTGCCGCCGCAGCTCAGCTTCATCTGCCCCTACAACCCCATGCCGAGTCCGACGACGAAGCACGTGGTCGTCGGCAGTGCCGGCATGCAGCCGTCGCTGCAGGTGCCGCCGCAGCAGTACGTCAACGTCCCGGTGCCGGTGTCGATGACGATGGAACCGAACGGCCAGCGCATGCTGCTCACGAACGCCGTCCAGTCGAGCGTGGCGTGGCCGCAGGGCAGCACCCGGCAGGTGGCGATCGTGCCCAACTGGGGCCAGCACGGCACGGCGCCGCACTCACTCATCGTCGACTCGCAGTTCTACAACGTCGAGGAGATCTACGGCAAGCAGCCGCTGAGTATCCACAAGTACGAGAAGAAGGAGTCGCCGGTGCACCATCTGAACGTGCCGCGCCACG
ACAAGAAGGAGACCAACCAGCTCTCGCCGGTGAAGAAGCGCGTGAAGGAGAACACGCCGCCCAACCACCACCATCACCACAACGGCCACAGCAATGGCCACCACCACCATCAGAGTCAGAACGTCCAGCCCAACCAGACGCGCTACAACAACCACCGATCGTCGTCCTACCACCAGCACCACGTGTCGCCGCAGCAGACGACCTCGTCGTCCGTCAACCAATATCATCACAATCAGTCGGCGTTCTACGGCGGGCAGAGTCAGCCGCAGGCCGCTGCAGCCGCCGCCGGGGGCTACTACGATCCCAGCCACGCTTATCCGTCCGCCGTCGTCATCAACACTGGCGGCAGCGGCGGCGGCAGTGGTGGCGGAGGAAAGCAGCGAAACCAGACGTCGACGGCGGTCGCGCCGCACCACCAGATCGTGAACTCCACAATCACCATCAACGACACGCCGTCGCCGACGTCCGTCATCCTCATATCGGACAGCGAGGACGAGGAGGAGAACCAGAAGAAGCAGCAGCAGCCGGCGAGCAGCAAAAACGGCCGCGACGCCACCCGACAATCGACGGCCAATCAAAACTACTCGTCGACGTCCTCCGCCGTCGCCGCCACCACCGTCAACTCGTCGACGCAGTGCAGCAAGAACGAGCCGCAGAACTCGTCCGTCATCAGCAGCAATCAGCAGAGGAAGAATGTCATATCGTGTGTCACTGTGGGCGACAGCGATGGCGAGGACACGAGTCCGAAAACGATACAGCAGCAACAACACAATGTAAAATATGAACAGCATGCAACGCAGAAGAAACGCTTACTGGCGATGACACAAAACGATCCGGCAATCAACAACAACAACAACGCCTCGAGCTCATCGACGAACCAGCTGAAGCAGGAGCCGGCCGAGTTTTCATCGTCGGCCTCGCTCGCGAACTACGCCGACTACCCGCCGTTCGACCACCAGAAGCGGTCGTCGTGGGTCGGCTCGTCCTCATCCTCAGCCGCGGCGTCGGCCAACTCGTCGAACATGGTCGCGGTGCAGCCGCCGAATGCGCACCAGAGTCACCACAGCTTGTCGTCGCACAGCAAACGCGAGTCGGTTGGGTGTGGATCGAATTCGGGTCCGAGCTCACAGCAGCAACCGCCGCTGGCACACGGAAAGAATGAGGTGCCATTATCGTCAAGCTCTACTACTACCTCGTTAAACAGTACACCAATATTTCACCATCATCATCATAGCAGTCATCATCATCATCATCATCATCATCGAAGTCAGCCTCAGAATACGACGCCGCTGGGGAATTCGCCGCTCGGCGCCGGCACGACCACGGCCCTGCTGCAGACCCAGCAGCCCGACATCTACGCCCAGGCCGAGATCTACCGACGGCCGACGGTCTTTGTCTCGCAAGCATCTGCCTATGCGTACAATGCTCGCGTGATCCCGCCGCCGCCGGCTCACAATCCATCAAATCGACAGGTGCTACCCACGCATCCGCTGCCGGCGCACATTCAGTTCCCGCAGTACGGCCAGTTCGGCGCACCGCCACTGAGTCCTCAGGTGGCAGCAAATTTGAGGCCCGGAAATTTATGGTATGCTGAG 

Protein: 1198 (aa)

 MEFCLLYNFFARSPDAGCDCGCVDSQTIISTTSTLNSSQTQQQQVNMGAQSNGSKTAHSKVANGTNAGTSGNTNQTTTSKRSSSGADGDYQLVQHEVLYSQSSQYEVLEFLGRGTFGQVCKCWKKGTSDIVAIKILKNHPSYARQGQIEVSILSRLSQENADEFNFVRAFECFQHKNHTCLVFEMLEQNLYDFLKQNKFSPLPLKYIRPILQQVLTALLKLKQLGLIHADLKPENIMLVDPVRQPYRVKVIDFGSASHVSKTVCNTYLQSRYYRAPEIILGLPFCEAIDMWSLGCVVAELFLGWPLYPGSSEYDQIRYISQTQGLPTEHMLNSASKTEKFFYRDEDSTYPFWRLISPEENEIMTNVKSKEARKYIFNCLDDIGQVNVQMDIESSQLLAEKIDRREFIDLLKRMLTIDQERRIQPAQALRHPFVTLSHLVDYAHCNNVKASVQMMEVCRRDPVMHSVPQATATLVTNFGPNTADNMTFTINNQLTNQVQRLVRERNPASYDNVYQFYGTPRNVVRQYANTRAAEVLPPQLSFICPYNPMPSPTTKHVVVGSAGMQPSLQVPPQQYVNVPVPVSMTMEPNGQRMLLTNAVQSSVAWPQGSTRQVAIVPNWGQHGTAPHSLIVDSQFYNVEEIYGKQPLSIHKYEKKESPVHHLNVPRHDKKETNQLSPVKKRVKENTPPNHHHHHNGHSNGHHHHQSQNVQPNQTRYNNHRSSSYHQHHVSPQQTTSSSVNQYHHNQSAFYGGQSQPQAAAAAAGGYYDPSHAYPSAVVINTGGSGGGSGGGGKQRNQTSTAVAPHHQIVNSTITINDTPSPTSVILISDSEDEEENQKKQQQPASSKNGRDATRQSTANQNYSSTSSAVAATTVNSSTQCSKNEPQNSSVISSNQQRKNVISCVTVGDSDGEDTSPKTIQQQQHNVKYEQHATQKKRLLAMTQNDPAINNNNNASSSSTNQLKQEPAEFSSSASLANYADYPPFDHQKRSSWVGSSSSSAAASANSSNMVAVQPPNAHQSHHSLSSHSKRESVGCGSNSGPSSQQQPPLAHGKNEVPLSSSSTTTSLNSTPIFHHHHHSSHHHHHHHHRSQPQNTTPLGNSPLGAGTTTALLQTQQPDIYAQAEIYRRPTVFVSQASAYAYNARVIPPPPAHNPSNRQVLPTHPLPAHIQFPQYGQFGAPPLSPQVAANLRPGNLWYAE 

Type    Start  End    Length
CDS     2793   2832   40
CDS     3101   3138   38
CDS     3227   3466   240
CDS     5768   5790   23
CDS     6354   6651   298
CDS     7281   7381   101
CDS     7611   7820   210
CDS     7929   9751   1823
CDS     10168  10865  698
CDS     10935  11057  123
intron  2833   3100   268
intron  3139   3226   88
intron  3467   5767   2301
intron  5791   6353   563
intron  6652   7280   629
intron  7382   7610   229
intron  7821   7928   108
intron  9752   10167  416
intron  10866  10934  69
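
The coordinates in the table above can be cross-checked against the reported transcript and protein lengths. The sketch below is only an illustration: it assumes the Start/End values are 1-based and inclusive (which the Length column suggests), and the names `CDS`, `span_length` and `introns_between` are hypothetical helpers, not part of MidgeBase.

```python
# CDS coordinates copied from the table above as (start, end).
# Assumption: coordinates are 1-based and inclusive.
CDS = [
    (2793, 2832), (3101, 3138), (3227, 3466), (5768, 5790), (6354, 6651),
    (7281, 7381), (7611, 7820), (7929, 9751), (10168, 10865), (10935, 11057),
]


def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1


def introns_between(exons):
    """Derive the intron intervals lying between consecutive CDS segments."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(exons, exons[1:])
    ]


cds_total = sum(span_length(s, e) for s, e in CDS)
introns = introns_between(CDS)

print(cds_total)                             # 3594, the transcript length
print(cds_total // 3, cds_total % 3)         # 1198 0, the protein length
print(introns[0], span_length(*introns[0]))  # (2833, 3100) 268
```

The CDS segments sum to 3594 bp, which is exactly 3 x 1198, and the derived intron intervals reproduce the intron rows of the table. The gene End (11060) lies three bases past the last CDS base (11057), which would be consistent with a stop codon that is not counted in the CDS rows; that reading is an inference, not something the page states.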

Auto annotation result

Program/Analysis    Accession   Description                                  Score/Expectation
BLASTP/NCBI-nr      XP_308469   AGAP007342-PA [Anopheles gambiae str. PEST] gb|EAA04272.5| AGAP007342-PA [Anopheles gambiae str. PEST]  0.0
InterPro            IPR017441   Protein kinase, ATP binding site
InterPro            IPR000719   Protein kinase, catalytic domain
InterPro            IPR002290   Serine/threonine- / dual-specificity protein kinase, catalytic domain
InterPro            IPR020635   Tyrosine-protein kinase, catalytic domain
InterPro            IPR011009   Protein kinase-like domain
InterPro            IPR008271   Serine/threonine-protein kinase, active site
Gene Ontology (BP)  GO:0006468  protein phosphorylation
Gene Ontology (MF)  GO:0004713  protein tyrosine kinase activity
Gene Ontology (MF)  GO:0004674  protein serine/threonine kinase activity
Gene Ontology (MF)  GO:0005524  ATP binding
Gene Ontology (MF)  GO:0004672  protein kinase activity
Gene Ontology (MF)  GO:0016772  transferase activity, transferring phosphorus-containing groups
Pfam                PF00069.20  Protein kinase domain                        1.5e-54
Pfam                PF06293.9   Lipopolysaccharide kinase (Kdo/WaaP) family  0.0042
Pfam                PF01636.18  Phosphotransferase enzyme family             0.0022
Pfam                PF05445.6   Poxvirus serine/threonine protein kinase     0.021
Pfam                PF07714.12  Protein tyrosine kinase                      1.1e-22
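
The Pfam rows above differ widely in their last-column values, so it can help to separate strong hits from marginal ones. The following is a minimal sketch, assuming that the last column is an E-value (lower is better). The cutoff of 1e-5 is an illustrative assumption, not a threshold used by MidgeBase, and `PFAM_HITS` and `confident_hits` are hypothetical names.

```python
# Pfam hits copied from the annotation table: (accession, description, value).
# Assumption: the last column is an E-value, so smaller means stronger.
PFAM_HITS = [
    ("PF00069.20", "Protein kinase domain", 1.5e-54),
    ("PF06293.9", "Lipopolysaccharide kinase (Kdo/WaaP) family", 0.0042),
    ("PF01636.18", "Phosphotransferase enzyme family", 0.0022),
    ("PF05445.6", "Poxvirus serine/threonine protein kinase", 0.021),
    ("PF07714.12", "Protein tyrosine kinase", 1.1e-22),
]

# Illustrative cutoff only; choose a threshold suited to the analysis.
E_VALUE_CUTOFF = 1e-5


def confident_hits(hits, cutoff=E_VALUE_CUTOFF):
    """Return hits at or below the cutoff, strongest (smallest value) first."""
    return sorted((h for h in hits if h[2] <= cutoff), key=lambda h: h[2])


for accession, description, e_value in confident_hits(PFAM_HITS):
    print(accession, description, e_value)
```

Under this assumed cutoff only PF00069.20 (Protein kinase domain) and PF07714.12 (Protein tyrosine kinase) remain, which agrees with the protein kinase InterPro and Gene Ontology entries in the same table.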

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species              Gene ID
M. musculus          ENSMUSG00000008730
D. plexippus         DPOGS213784PA
N. vitripennis       NV17543-PA
H. sapiens           ENSP00000409673
H. sapiens           ENSP00000358571
H. sapiens           ENSP00000340956
H. sapiens           ENSP00000385571
S. invicta           SI2.2.0_14230
P. vanderplanki      Pv.17237
H. sapiens           ENSP00000358574
H. melpomene         HMEL003132-PA
H. sapiens           ENSP00000384960
H. sapiens           ENSP00000368301
H. sapiens           ENSP00000398241
H. sapiens           ENSP00000407442
H. sapiens           ENSP00000358572
H. sapiens           ENSP00000358566
P. humanus           PHUM005730-PA
H. sapiens           ENSP00000431710
M. musculus          ENSMUSG00000061436
H. sapiens           ENSP00000358568
D. melanogaster      FBgn0035142
H. sapiens           ENSP00000343108
T. castaneum         TC007452
H. sapiens           ENSP00000304226
H. sapiens           ENSP00000413724
A. mellifera         GB12884-PA
A. aegypti           AAEL003731
C. quinquefasciatus  CPIJ005449
B. mori              BGIBMGA009267-TA
H. sapiens           ENSP00000358567
H. sapiens           ENSP00000355191
A. gambiae           AGAP007342
M. musculus          ENSMUSG00000027177