MidgeBase gene description page [Pn.01023]
Outline
Gene ID | Pn.01023 |
Type | Protein coding gene |
Scaffold | PnScaf1029 |
Start | 2793 |
End | 11060 |
Direction | + |
Sequence
Transcript: 3594 (bp)
ATGGAGTTCTGCCTCCTCTACAACTTCTTCGCTCGCTCTCCTGATGCGGGTTGCGATTGTGGTTGCGTTGATAGCCAAACTATTATCAGCACAACGAGTACACTGAATTCATCACAAACGCAGCAACAGCAAGTCAACATGGGTGCACAATCCAACGGCTCCAAAACGGCGCATTCAAAGGTGGCGAACGGCACGAACGCTGGAACGAGCGGCAACACCAATCAGACGACGACGAGCAAGCGATCGAGCAGCGGCGCTGACGGCGACTATCAGTTGGTGCAGCACGAGGTCCTCTACTCCCAATCGAGTCAATATGAGGTCTTGGAGTTTCTTGGACGAGGAACCTTCGGTCAAGTCTGCAAGTGCTGGAAGAAGGGCACCAGCGACATTGTCGCAATCAAAATCTTGAAGAACCATCCGTCGTACGCGCGACAAGGCCAAATCGAGGTCTCCATTCTGTCACGCCTCAGCCAAGAGAATGCCGACGAGTTCAACTTTGTGCGAGCGTTCGAGTGCTTCCAGCACAAGAACCACACATGTCTGGTGTTTGAAATGCTCGAGCAGAATTTGTATGATTTCCTGAAACAAAATAAATTTTCGCCATTACCGCTGAAGTACATCAGACCGATATTGCAACAGGTTTTAACAGCGCTGCTAAAGCTGAAGCAATTGGGTCTGATTCACGCCGATTTGAAGCCGGAAAATATTATGCTTGTGGATCCCGTTCGACAACCATACAGGGTAAAAGTGATTGATTTTGGAAGTGCATCGCACGTCAGCAAGACCGTGTGCAACACATACCTGCAATCGCGTTATTATCGTGCGCCGGAAATCATTCTCGGCCTGCCTTTCTGTGAAGCAATCGATATGTGGTCGCTGGGATGCGTCGTTGCTGAGCTCTTCCTCGGATGGCCTCTCTATCCGGGCTCATCTGAATATGATCAAATCAGATACATATCACAGACGCAAGGTCTACCGACCGAGCACATGCTGAACAGCGCGAGCAAGACGGAGAAGTTCTTCTACCGCGACGAGGACTCAACATATCCCTTCTGGCGCCTCATCTCGCCCGAGGAGAACGAGATCATGACGAACGTGAAGAGTAAGGAGGCGCGCAAGTACATCTTCAACTGCCTCGACGACATCGGCCAGGTGAACGTGCAGATGGACATCGAGAGCAGCCAACTGCTCGCCGAGAAAATCGACCGGCGCGAGTTCATCGACCTCCTTAAGCGGATGCTGACGATCGACCAGGAGCGCCGCATACAGCCGGCGCAGGCCCTGCGCCATCCCTTCGTCACGCTCTCGCACCTCGTCGACTACGCGCACTGCAACAACGTCAAGGCGAGCGTGCAGATGATGGAGGTGTGCCGGCGCGACCCGGTGATGCACTCGGTGCCGCAGGCAACCGCCACGCTCGTCACCAACTTCGGGCCCAACACTGCCGACAACATGACGTTCACGATCAACAATCAGCTGACGAACCAGGTGCAGCGGCTGGTGCGCGAGCGCAACCCCGCCTCCTACGACAACGTCTACCAGTTCTACGGGACGCCGCGCAACGTCGTGCGCCAGTACGCGAACACGCGCGCCGCCGAGGTCCTGCCGCCGCAGCTCAGCTTCATCTGCCCCTACAACCCCATGCCGAGTCCGACGACGAAGCACGTGGTCGTCGGCAGTGCCGGCATGCAGCCGTCGCTGCAGGTGCCGCCGCAGCAGTACGTCAACGTCCCGGTGCCGGTGTCGATGACGATGGAACCGAACGGCCAGCGCATGCTGCTCACGAACGCCGTCCAGTCGAGCGTGGCGTGGCCGCAGGGCAGCACCCGGCAGGTGGCGATCGTGCCCAACTGGGGCCAGCACGGCACGGCGCCGCACTCACTCATCGTCGACTCGCAGTTCTACAACGTCGAGGAGATCTACGGCAAGCAGCCGCTGAGTATCCACAAGTACGAGAAGAAGGAGTCGCCGGTGCACCATCTGAACGTGCCGCGCCACGA
CAAGAAGGAGACCAACCAGCTCTCGCCGGTGAAGAAGCGCGTGAAGGAGAACACGCCGCCCAACCACCACCATCACCACAACGGCCACAGCAATGGCCACCACCACCATCAGAGTCAGAACGTCCAGCCCAACCAGACGCGCTACAACAACCACCGATCGTCGTCCTACCACCAGCACCACGTGTCGCCGCAGCAGACGACCTCGTCGTCCGTCAACCAATATCATCACAATCAGTCGGCGTTCTACGGCGGGCAGAGTCAGCCGCAGGCCGCTGCAGCCGCCGCCGGGGGCTACTACGATCCCAGCCACGCTTATCCGTCCGCCGTCGTCATCAACACTGGCGGCAGCGGCGGCGGCAGTGGTGGCGGAGGAAAGCAGCGAAACCAGACGTCGACGGCGGTCGCGCCGCACCACCAGATCGTGAACTCCACAATCACCATCAACGACACGCCGTCGCCGACGTCCGTCATCCTCATATCGGACAGCGAGGACGAGGAGGAGAACCAGAAGAAGCAGCAGCAGCCGGCGAGCAGCAAAAACGGCCGCGACGCCACCCGACAATCGACGGCCAATCAAAACTACTCGTCGACGTCCTCCGCCGTCGCCGCCACCACCGTCAACTCGTCGACGCAGTGCAGCAAGAACGAGCCGCAGAACTCGTCCGTCATCAGCAGCAATCAGCAGAGGAAGAATGTCATATCGTGTGTCACTGTGGGCGACAGCGATGGCGAGGACACGAGTCCGAAAACGATACAGCAGCAACAACACAATGTAAAATATGAACAGCATGCAACGCAGAAGAAACGCTTACTGGCGATGACACAAAACGATCCGGCAATCAACAACAACAACAACGCCTCGAGCTCATCGACGAACCAGCTGAAGCAGGAGCCGGCCGAGTTTTCATCGTCGGCCTCGCTCGCGAACTACGCCGACTACCCGCCGTTCGACCACCAGAAGCGGTCGTCGTGGGTCGGCTCGTCCTCATCCTCAGCCGCGGCGTCGGCCAACTCGTCGAACATGGTCGCGGTGCAGCCGCCGAATGCGCACCAGAGTCACCACAGCTTGTCGTCGCACAGCAAACGCGAGTCGGTTGGGTGTGGATCGAATTCGGGTCCGAGCTCACAGCAGCAACCGCCGCTGGCACACGGAAAGAATGAGGTGCCATTATCGTCAAGCTCTACTACTACCTCGTTAAACAGTACACCAATATTTCACCATCATCATCATAGCAGTCATCATCATCATCATCATCATCATCGAAGTCAGCCTCAGAATACGACGCCGCTGGGGAATTCGCCGCTCGGCGCCGGCACGACCACGGCCCTGCTGCAGACCCAGCAGCCCGACATCTACGCCCAGGCCGAGATCTACCGACGGCCGACGGTCTTTGTCTCGCAAGCATCTGCCTATGCGTACAATGCTCGCGTGATCCCGCCGCCGCCGGCTCACAATCCATCAAATCGACAGGTGCTACCCACGCATCCGCTGCCGGCGCACATTCAGTTCCCGCAGTACGGCCAGTTCGGCGCACCGCCACTGAGTCCTCAGGTGGCAGCAAATTTGAGGCCCGGAAATTTATGGTATGCTGAG
Protein: 1198 (aa)
MEFCLLYNFFARSPDAGCDCGCVDSQTIISTTSTLNSSQTQQQQVNMGAQSNGSKTAHSKVANGTNAGTSGNTNQTTTSKRSSSGADGDYQLVQHEVLYSQSSQYEVLEFLGRGTFGQVCKCWKKGTSDIVAIKILKNHPSYARQGQIEVSILSRLSQENADEFNFVRAFECFQHKNHTCLVFEMLEQNLYDFLKQNKFSPLPLKYIRPILQQVLTALLKLKQLGLIHADLKPENIMLVDPVRQPYRVKVIDFGSASHVSKTVCNTYLQSRYYRAPEIILGLPFCEAIDMWSLGCVVAELFLGWPLYPGSSEYDQIRYISQTQGLPTEHMLNSASKTEKFFYRDEDSTYPFWRLISPEENEIMTNVKSKEARKYIFNCLDDIGQVNVQMDIESSQLLAEKIDRREFIDLLKRMLTIDQERRIQPAQALRHPFVTLSHLVDYAHCNNVKASVQMMEVCRRDPVMHSVPQATATLVTNFGPNTADNMTFTINNQLTNQVQRLVRERNPASYDNVYQFYGTPRNVVRQYANTRAAEVLPPQLSFICPYNPMPSPTTKHVVVGSAGMQPSLQVPPQQYVNVPVPVSMTMEPNGQRMLLTNAVQSSVAWPQGSTRQVAIVPNWGQHGTAPHSLIVDSQFYNVEEIYGKQPLSIHKYEKKESPVHHLNVPRHDKKETNQLSPVKKRVKENTPPNHHHHHNGHSNGHHHHQSQNVQPNQTRYNNHRSSSYHQHHVSPQQTTSSSVNQYHHNQSAFYGGQSQPQAAAAAAGGYYDPSHAYPSAVVINTGGSGGGSGGGGKQRNQTSTAVAPHHQIVNSTITINDTPSPTSVILISDSEDEEENQKKQQQPASSKNGRDATRQSTANQNYSSTSSAVAATTVNSSTQCSKNEPQNSSVISSNQQRKNVISCVTVGDSDGEDTSPKTIQQQQHNVKYEQHATQKKRLLAMTQNDPAINNNNNASSSSTNQLKQEPAEFSSSASLANYADYPPFDHQKRSSWVGSSSSSAAASANSSNMVAVQPPNAHQSHHSLSSHSKRESVGCGSNSGPSSQQQPPLAHGKNEVPLSSSSTTTSLNSTPIFHHHHHSSHHHHHHHHRSQPQNTTPLGNSPLGAGTTTALLQTQQPDIYAQAEIYRRPTVFVSQASAYAYNARVIPPPPAHNPSNRQVLPTHPLPAHIQFPQYGQFGAPPLSPQVAANLRPGNLWYAE
Type | Start | End | Length |
CDS | 2793 | 2832 | 40 |
CDS | 3101 | 3138 | 38 |
CDS | 3227 | 3466 | 240 |
CDS | 5768 | 5790 | 23 |
CDS | 6354 | 6651 | 298 |
CDS | 7281 | 7381 | 101 |
CDS | 7611 | 7820 | 210 |
CDS | 7929 | 9751 | 1823 |
CDS | 10168 | 10865 | 698 |
CDS | 10935 | 11057 | 123 |
intron | 2833 | 3100 | 268 |
intron | 3139 | 3226 | 88 |
intron | 3467 | 5767 | 2301 |
intron | 5791 | 6353 | 563 |
intron | 6652 | 7280 | 629 |
intron | 7382 | 7610 | 229 |
intron | 7821 | 7928 | 108 |
intron | 9752 | 10167 | 416 |
intron | 10866 | 10934 | 69 |
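The coordinates in the feature table are consistent with 1-based, inclusive positions on the scaffold, so each length is End - Start + 1, and each intron fills the gap between two consecutive CDS segments. The following is a minimal sketch of that arithmetic, not part of MidgeBase itself; the names `feature_length` and `introns_between` are hypothetical helpers introduced here for illustration.

```python
# CDS (start, end) pairs copied from the feature table.
# Assumption: coordinates are 1-based and inclusive, which matches the listed lengths.
cds = [
    (2793, 2832), (3101, 3138), (3227, 3466), (5768, 5790), (6354, 6651),
    (7281, 7381), (7611, 7820), (7929, 9751), (10168, 10865), (10935, 11057),
]


def feature_length(start, end):
    """Length of a feature with inclusive 1-based coordinates."""
    return end - start + 1


def introns_between(segments):
    """Derive intron (start, end) pairs from the gaps between consecutive segments."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(segments, segments[1:])
    ]


cds_lengths = [feature_length(s, e) for s, e in cds]
introns = introns_between(cds)
intron_lengths = [feature_length(s, e) for s, e in introns]
total_cds = sum(cds_lengths)

print("CDS lengths:", cds_lengths)
print("Introns:", introns)
print("Total CDS length:", total_cds)
```

The CDS lengths sum to the 3594 bp reported for the transcript, which is exactly three times the 1198 aa protein length, so the listed transcript appears to cover the coding sequence without a stop codon.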
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_308469 | AGAP007342-PA [Anopheles gambiae str. PEST] gb|EAA04272.5| AGAP007342-PA [Anopheles gambiae str. PEST] | 0.0 |
InterPro | IPR017441 | Protein kinase, ATP binding site | |
InterPro | IPR000719 | Protein kinase, catalytic domain | |
InterPro | IPR002290 | Serine/threonine- / dual-specificity protein kinase, catalytic domain | |
InterPro | IPR020635 | Tyrosine-protein kinase, catalytic domain | |
InterPro | IPR011009 | Protein kinase-like domain | |
InterPro | IPR008271 | Serine/threonine-protein kinase, active site | |
Gene Ontology(BP) | GO:0006468 | protein phosphorylation | |
Gene Ontology(MF) | GO:0004713 | protein tyrosine kinase activity | |
Gene Ontology(MF) | GO:0004674 | protein serine/threonine kinase activity | |
Gene Ontology(MF) | GO:0005524 | ATP binding | |
Gene Ontology(MF) | GO:0004672 | protein kinase activity | |
Gene Ontology(MF) | GO:0016772 | transferase activity, transferring phosphorus-containing groups | |
Pfam | PF00069.20 | Protein kinase domain | 1.5e-54 |
Pfam | PF06293.9 | Lipopolysaccharide kinase (Kdo/WaaP) family | 0.0042 |
Pfam | PF01636.18 | Phosphotransferase enzyme family | 0.0022 |
Pfam | PF05445.6 | Poxvirus serine/threonine protein kinase | 0.021 |
Pfam | PF07714.12 | Protein tyrosine kinase | 1.1e-22 |
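The Pfam E-values listed here span many orders of magnitude, so it can help to separate strong domain matches from marginal ones. The sketch below ranks the five Pfam hits and keeps those at or below a cutoff. The cutoff of 1e-5 is an illustrative assumption, not a threshold used by MidgeBase, and `significant_hits` is a hypothetical helper.

```python
# Pfam hits copied from the annotation table: (accession, description, E-value).
pfam_hits = [
    ("PF00069.20", "Protein kinase domain", 1.5e-54),
    ("PF06293.9", "Lipopolysaccharide kinase (Kdo/WaaP) family", 0.0042),
    ("PF01636.18", "Phosphotransferase enzyme family", 0.0022),
    ("PF05445.6", "Poxvirus serine/threonine protein kinase", 0.021),
    ("PF07714.12", "Protein tyrosine kinase", 1.1e-22),
]

# Assumption: an illustrative cutoff chosen for this example only.
E_VALUE_CUTOFF = 1e-5


def significant_hits(hits, cutoff):
    """Return hits with E-value <= cutoff, ordered from strongest to weakest."""
    return sorted((h for h in hits if h[2] <= cutoff), key=lambda h: h[2])


strong = significant_hits(pfam_hits, E_VALUE_CUTOFF)
for accession, description, e_value in strong:
    print(f"{accession}\t{description}\t{e_value:.1e}")
```

Under this assumed cutoff only the two protein kinase families (PF00069.20 and PF07714.12) remain, which agrees with the InterPro and Gene Ontology kinase annotations in the same table.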
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
M. musculus | ENSMUSG00000008730 |
D. plexippus | DPOGS213784PA |
N. vitripennis | NV17543-PA |
H. sapiens | ENSP00000409673 |
H. sapiens | ENSP00000358571 |
H. sapiens | ENSP00000340956 |
H. sapiens | ENSP00000385571 |
S. invicta | SI2.2.0_14230 |
P. vanderplanki | Pv.17237 |
H. sapiens | ENSP00000358574 |
H. melpomene | HMEL003132-PA |
H. sapiens | ENSP00000384960 |
H. sapiens | ENSP00000368301 |
H. sapiens | ENSP00000398241 |
H. sapiens | ENSP00000407442 |
H. sapiens | ENSP00000358572 |
H. sapiens | ENSP00000358566 |
P. humanus | PHUM005730-PA |
H. sapiens | ENSP00000431710 |
M. musculus | ENSMUSG00000061436 |
H. sapiens | ENSP00000358568 |
D. melanogaster | FBgn0035142 |
H. sapiens | ENSP00000343108 |
T. castaneum | TC007452 |
H. sapiens | ENSP00000304226 |
H. sapiens | ENSP00000413724 |
A. mellifera | GB12884-PA |
A. aegypti | AAEL003731 |
C. quinquefasciatus | CPIJ005449 |
B. mori | BGIBMGA009267-TA |
H. sapiens | ENSP00000358567 |
H. sapiens | ENSP00000355191 |
A. gambiae | AGAP007342 |
M. musculus | ENSMUSG00000027177 |