MidgeBase gene description page [Pn.03160]
Outline
Gene ID | Pn.03160 |
Type | Protein coding gene |
Scaffold | PnScaf2667 |
Start | 38792 |
End | 53915 |
Direction | - |
Sequence
Transcript: 4470 (bp)
ATGCATCACATTGTGCGTAATATTGAAACGCTTCCTCGCATCGTAAAGCACTTAAAGGATTTACGATGTTGCGATGGCGATGCAGTTACACTAGAGTGCCATGTGGAAGGACTGCCGGAGCCGAATGTCTTCTGGGAAAAGGACGGTAAAATCCTTCACGATCGATCCACCGAGCATAGACAGAACTTCAACGGACGCAAAGCGACGCTCTCGATTCCGCGCGTATTCCCCGAGGACGAAGGTCAATATTGTCTCGTCGCGTGTAACAACATGGGACGTGTCAAGAGCTCCGCTTGCATTATCGTCGACGTGCCGGAAGAGAAGGAGAACCTTCTCAGCCGTCAGCTATCACGTCCAATGACTTTCCTCTCACCAAATTCGACGCCGCGTTCAACACCGCGCTCTACACCGGCCCGCAGCATGTCGCCGCTCTCGCTGCACCTCTCGATGATCAGCAACAACATCAACCTCACATACCGCCAGCGACGCTACCGCTTCTCGGCGCCAAAGTTCTACTCGGTGCCGCACAACCGCGTCTGTGAGGAGGGCGACACCGTGCGCTTCCAGTGCGCGATCGCGGGCCACCCGGTGCCGTGGTCGACATGGGACAAGGACGGCATCATCGTGACGCCGTCGCAGCGTTTCCTCATCCGCGAGCGCGACGACATGCGCTACTTGGAGATCGAGGAGGTGAGCTTCGAGGACGCCGGCTTGTACCGCATCACACTCGAGAACGACTACGGGCGAATTGAGGCGACCGCACGTCTGGACATCATCAAGAGCGGCAAAATCCACAGACGCGGCATACGCGCATCGAGCGCACCGAACCGCGACTCGACCAGCATGACGCGACGCATCATGGGCTACAGCACGCGCATCGGCGGCCGCATGGCGCTTGCAGCGCAACGCGCTCAGTCGATCCCGGCGAAGAAGGCGAGCGTCTACCACAACGGCTACGACATCAGCGACTCGAGCCGGCTGACGGTGACGCAAAACGAGACCGAAATTCTGCTCGAAATCGACAGCGTGAAGACGTACGATGAGGGCGAGTACACGCTGATGCTCGAGGACGAGGCAGGCATTGCGACCACCACGACCACCTTCGCCAAGGTTCACTACGTCGAGGACGAGATCTTCGAGGAGACGATTGTGCCGAAGGTGACGCGACCGTTGGCGGCGAGTTTGACGAGCATCGAGGGAATTCCGCTCGATCTCACGCTCCAAATCGACTGCGAAGTTCCCTTCGACTACGTGTGGCTGCGCAACGACGAGCCGCTCCTCAACTGCGAGGACTTTGTCTACATCGACCACGGCTGCGGAACCCTGACGCTCCGCATCCAGGATCCGTTTGTGTTCGACTCGGGTAAATATGAGTGCGTCATCACGACACTCGCCGGTGAATGCACCAGCGAATGCAGCGTGGAAATCGAGGAGCTCAACAACTCAATCAATCTTCTCGACGTGATACCGGAATTCATCAAAACGCCGCTTCCCTCGATCGCCGTGCCCGGCTGTCCCGTGTCGTTCTGCACGCGCGTCACGCCGATTGACTCACTCGTCGAGTGGAGCGTTTGCGGCTGCGAGGTGACTGACGACACAAAAGGCTTCAAGATTGAGAGATTGGATGATGGACTCAATGTGCTGCATGTGCTGAGTGTCGATCACAAGAGGAGCGGCGAAGTGAAGTGCCGTGCCATAAGTAGCAATAACCCGCAAATTTTTAACGTCTATCACACGAATTTGACTGTTCTTCCCGTGCCGATCAGCGAACGCTACGAGCCGATCGAAAGTGTCTTAGAGAGTAATAGCTATAGTCATAATAATCATAAGAGTAGTTTAGTGCACGACCTAGACTTAACGTGCTTCATCACAAAGCGACCCGAGGACAGGACCGTGCTTGTCGGCGACTCGATCGAGTTGAATGTGTCTTACATTGGACGTCCTGAGCCAACCGTCCGTTGGATGCGAGCGGAACGTCTAATTGGAAACGAACCAAATACGATAATAATTACCAAAAATGGTCACTCAAGATTGATCATCAACAATATCACGTCAGACCAAAGCGGAAAGTATAGTGTGGAAATTATGAACGAACATAGCACGGATCTTGCGTCGAGCTCCGTTGCCGTCGAGGGAGTTCCCGATGCGCCAATCTCGCTGAGCTACTCGAAGGGCTCCGATCGTGTGGCCGTTGCCTGGTCTGGACCCGCCTACGATGGCGGCTGCATGCTTACTGGATTCATCCTGGAGATGCAGCAAGACGACGGCGAGTGGGAGGAGGTGGCAACAGTTGCAGACTCGCTAGCATATACCGTAAAGAACCTAACGCCCGACAACAAATACAAGTTTCGTGTGCGAGCGCAGAACGTTCACGGCAATTCGCCGCCGAGCAAGTCGACCGAGGAAATCGTTTTAGTGAAGCCAAGTGATGCGAGCGGCAACGGCAGAGACGACGAGGACGAGGAGAGCGAGGAGGAGTATTCGAAGGAAATTCCATATGTGAAGTCGGGCGGCGACTTTAAAGCGCGCTTCGAGATCATGGAGGAGCTGGGCAAGGGACGCTTCGGGACAGTTCATCGTGTGATGGAACGTGAAACTGGTCTCATACTGGCAGCAAAAATAATTAAGTGTATAAAAGCAATGGACCGCAAAAAGGTTCAAGACGAAATCAAAATAATGAAATCACTTCAGCATCCCAAACTCCTTCAGCTTTCAGCTTCATTTGAGACGCAGAAGGAAATTATCATGGTGATGGAATACTTAAACGGCGGCGAGCTTTTCGAGCGAGTCGTTGCAGACGACTTTACACTGACTGAGCGTGACTGCATTTTATTTATGAGGCAAATTTGTGAGGGCGTCGCATACATGCACAGCAAGTATGTCGTTCATCTCGATCTTAAACCCGAGAACATCATGTGTCACACGAGAACGAGCCATCAGATAAAGATCATCGATTTCGGCTTGGCACAAAATCTCGAGCCGGGAAAGCAGGTTCGAGTTCTCTTCGGCACGCCGGAATTTTGTCCCCCAGAGATAATCAATTACGAGCCAATAGGTCTTCAATCAGACATGTGGAGCTTGGGTGTCATCTGCTACGTTCTTCTGTCGGGCTTATCGCCTTTTATGGGCGAGACGGATGTGTCGACATTTTCCAATATAACCAGAGCTTATTATGATTTCGATGACGAGGCCTTCGATGCAATCAGCGAGGAGGCGAAGGACTTCATAGCGGGACTGCTCGTCTATCGCAAAGAGAACCGAATGAATGCCAGACAGTGCCTCGAATCGAAATGGTTGTCGCAGCACTACGATGTCATGGGCAGCACTAAATTGTGCACGGATAAGTTGAAAAAGTTCATCATTCGCAGAAAGTGGCAGAAAACAGGAAATGCCATACGAGCCATCGGAAAGATGAAAAATTTAACTTTATCCGCTGCATCGCGCAAGAGTGCCAGCGGCGTCTCATCGAATAGCAGCCCGCGACCATCGATTTCGGGTCCGATTTATGCGAACGCTCAGAACTTTGTCTCCGACACTCGCATCACGAGCGTCGACGAAGAGCTCGCAGAGGTCGGTCGAAATCTCCCTACCGCGAACGGCAACGACAGCAGCAAGTCGACGAAGGATCCGCGAATCTGCAACGAGCGCAGCGACTCGGGATTCAGTGAATGCTCGAACTGCTCCACTCCGTCCGCTTCGTGTGCGTGCAATCACCACACAAACAGCCTCAGTCACGAGCCAAAGGCCGACACGATCGTCGAGGAGCGGAGTAGCGTAAGTAGCAGCAGCAGCTCCTTGGAGCCGCATGAAACGACGGCGAGCGACGCAACCCACATCGAGGAGGAGGAGGAGCGCCATGCCGACTCCGACCACGACATTCACAGCGAAATTTCATCGCTCGAGTACGACACAAGTCACTGCGACAGCATCGCAGTGTCGCTGCGCGTTCCGACTCGTCGCGAGCTCTTCCTGAACGAGCAGGGCGAGCCGATGAGCGAAATCGAGCGCCGCAAGGTCTCGCTCGAGAACAAGGCGAGAAAGGTGGCGGCGACGAGCGTGCGACAAGAGTTCTCGCTCGAGAAGCTCAAGAAGACGAGCAAGGTCGCTCTGCTGATGGAGAAGTTCGAGGGTGAGGCGACGAGTCCCGTTGTCTCGCCCAAAGTGCCCGCGCACGCCGACAGTCAATTCAAATTCAAATCGACAGTTACCATTGAGAGAGCTGCAAAGTCTCCGCGAAATTCACCAGCAACGAGTCCAGCAGCAGCAACAGCGACATTTAGGTTAAGCGATAGAGTCAGAGAGGCGACCGAGAGATTGTCCAAGCCCAAGCAGCAGCCAGTGCCGCAGCAACAAGCGACAGCAACAACAACAATGACGACGACGACGACAACAATGACGAATAAGACCACAAAAGCCAGTATTCTCAAACAAAATGCCAACTTTGTACGATCGAAGGATTTTTGGAAGAGA
Protein: 1490 (aa)
MHHIVRNIETLPRIVKHLKDLRCCDGDAVTLECHVEGLPEPNVFWEKDGKILHDRSTEHRQNFNGRKATLSIPRVFPEDEGQYCLVACNNMGRVKSSACIIVDVPEEKENLLSRQLSRPMTFLSPNSTPRSTPRSTPARSMSPLSLHLSMISNNINLTYRQRRYRFSAPKFYSVPHNRVCEEGDTVRFQCAIAGHPVPWSTWDKDGIIVTPSQRFLIRERDDMRYLEIEEVSFEDAGLYRITLENDYGRIEATARLDIIKSGKIHRRGIRASSAPNRDSTSMTRRIMGYSTRIGGRMALAAQRAQSIPAKKASVYHNGYDISDSSRLTVTQNETEILLEIDSVKTYDEGEYTLMLEDEAGIATTTTTFAKVHYVEDEIFEETIVPKVTRPLAASLTSIEGIPLDLTLQIDCEVPFDYVWLRNDEPLLNCEDFVYIDHGCGTLTLRIQDPFVFDSGKYECVITTLAGECTSECSVEIEELNNSINLLDVIPEFIKTPLPSIAVPGCPVSFCTRVTPIDSLVEWSVCGCEVTDDTKGFKIERLDDGLNVLHVLSVDHKRSGEVKCRAISSNNPQIFNVYHTNLTVLPVPISERYEPIESVLESNSYSHNNHKSSLVHDLDLTCFITKRPEDRTVLVGDSIELNVSYIGRPEPTVRWMRAERLIGNEPNTIIITKNGHSRLIINNITSDQSGKYSVEIMNEHSTDLASSSVAVEGVPDAPISLSYSKGSDRVAVAWSGPAYDGGCMLTGFILEMQQDDGEWEEVATVADSLAYTVKNLTPDNKYKFRVRAQNVHGNSPPSKSTEEIVLVKPSDASGNGRDDEDEESEEEYSKEIPYVKSGGDFKARFEIMEELGKGRFGTVHRVMERETGLILAAKIIKCIKAMDRKKVQDEIKIMKSLQHPKLLQLSASFETQKEIIMVMEYLNGGELFERVVADDFTLTERDCILFMRQICEGVAYMHSKYVVHLDLKPENIMCHTRTSHQIKIIDFGLAQNLEPGKQVRVLFGTPEFCPPEIINYEPIGLQSDMWSLGVICYVLLSGLSPFMGETDVSTFSNITRAYYDFDDEAFDAISEEAKDFIAGLLVYRKENRMNARQCLESKWLSQHYDVMGSTKLCTDKLKKFIIRRKWQKTGNAIRAIGKMKNLTLSAASRKSASGVSSNSSPRPSISGPIYANAQNFVSDTRITSVDEELAEVGRNLPTANGNDSSKSTKDPRICNERSDSGFSECSNCSTPSASCACNHHTNSLSHEPKADTIVEERSSVSSSSSSLEPHETTASDATHIEEEEERHADSDHDIHSEISSLEYDTSHCDSIAVSLRVPTRRELFLNEQGEPMSEIERRKVSLENKARKVAATSVRQEFSLEKLKKTSKVALLMEKFEGEATSPVVSPKVPAHADSQFKFKSTVTIERAAKSPRNSPATSPAAATATFRLSDRVREATERLSKPKQQPVPQQQATATTTMTTTTTTMTNKTTKASILKQNANFVRSKDFWKR
Type | Start | End | Length |
CDS |
38795 |
39045 |
251 |
CDS |
39163 |
40003 |
841 |
CDS |
40345 |
40643 |
299 |
CDS |
40773 |
40911 |
139 |
CDS |
41006 |
41186 |
181 |
CDS |
42268 |
42783 |
516 |
CDS |
43060 |
43331 |
272 |
CDS |
46699 |
47058 |
360 |
CDS |
50732 |
51044 |
313 |
CDS |
51209 |
52196 |
988 |
CDS |
52624 |
52916 |
293 |
CDS |
53899 |
53915 |
17 |
intron |
39046 |
39162 |
117 |
intron |
40004 |
40344 |
341 |
intron |
40644 |
40772 |
129 |
intron |
40912 |
41005 |
94 |
intron |
41187 |
42267 |
1081 |
intron |
42784 |
43059 |
276 |
intron |
43332 |
46698 |
3367 |
intron |
47059 |
50731 |
3673 |
intron |
51045 |
51208 |
164 |
intron |
52197 |
52623 |
427 |
intron |
52917 |
53898 |
982 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
NP_001188954 |
Stretchin-Mlck, isoform O [Drosophila melanogaster] gb|ADV37200.1| Stretchin-Mlck, isoform O [Drosophila melanogaster] |
0.0 |
InterPro |
IPR003598 |
Immunoglobulin subtype 2 |
|
InterPro |
IPR020675 |
Myosin light chain kinase-related |
|
InterPro |
IPR017441 |
Protein kinase, ATP binding site |
|
InterPro |
IPR007110 |
Immunoglobulin-like |
|
InterPro |
IPR002290 |
Serine/threonine- / dual-specificity protein kinase, catalytic domain |
|
InterPro |
IPR003599 |
Immunoglobulin subtype |
|
InterPro |
IPR003961 |
Fibronectin, type III |
|
InterPro |
IPR013098 |
Immunoglobulin I-set |
|
InterPro |
IPR008271 |
Serine/threonine-protein kinase, active site |
|
InterPro |
IPR011009 |
Protein kinase-like domain |
|
InterPro |
IPR000719 |
Protein kinase, catalytic domain |
|
InterPro |
IPR013783 |
Immunoglobulin-like fold |
|
InterPro |
IPR020635 |
Tyrosine-protein kinase, catalytic domain |
|
Gene Ontology(BP) |
GO:0006468 |
protein phosphorylation |
|
Gene Ontology(MF) |
GO:0004713 |
protein tyrosine kinase activity |
|
Gene Ontology(MF) |
GO:0004674 |
protein serine/threonine kinase activity |
|
Gene Ontology(MF) |
GO:0005524 |
ATP binding |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Gene Ontology(MF) |
GO:0004672 |
protein kinase activity |
|
Gene Ontology(MF) |
GO:0016772 |
transferase activity, transferring phosphorus-containing groups |
|
Pfam |
PF00047.20 |
Immunoglobulin domain |
2.3e-12 |
Pfam |
PF11465.3 |
Natural killer cell receptor 2B4 |
0.00019 |
Pfam |
PF07679.11 |
Immunoglobulin I-set domain |
5e-71 |
Pfam |
PF07714.12 |
Protein tyrosine kinase |
3.2e-38 |
Pfam |
PF00069.20 |
Protein kinase domain |
1.7e-65 |
Pfam |
PF06293.9 |
Lipopolysaccharide kinase (Kdo/WaaP) family |
7.2e-05 |
Pfam |
PF00041.16 |
Fibronectin type III domain |
9e-14 |
Pfam |
PF13927.1 |
Immunoglobulin domain |
1.3e-15 |
Pfam |
PF13895.1 |
Immunoglobulin domain |
6.2e-24 |
Pfam |
PF07686.12 |
Immunoglobulin V-set domain |
2e-14 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Gene ID |
Pn.01161 |
Pn.01159 |
Orthologous genes
Species |
Gene ID |
B. mori |
BGIBMGA002033-TA |
N. vitripennis |
NV17141-PA |
S. invicta |
SI2.2.0_00037 |
A. aegypti |
AAEL008057 |
H. sapiens |
ENSP00000352088 |
B. mori |
BGIBMGA002035-TA |
T. castaneum |
TC002962 |
C. quinquefasciatus |
CPIJ012863 |
H. sapiens |
ENSP00000353452 |
H. sapiens |
ENSP00000346846 |
H. sapiens |
ENSP00000418335 |
A. aegypti |
AAEL007281 |
N. vitripennis |
NV17133-PA |
H. sapiens |
ENSP00000354004 |
D. plexippus |
DPOGS203928PA |
P. vanderplanki |
Pv.08790 |
H. sapiens |
ENSP00000320622 |
H. melpomene |
HMEL013510-PA |
C. quinquefasciatus |
CPIJ015022 |
B. mori |
BGIBMGA002034-TA |
M. musculus |
ENSMUSG00000022836 |
A. mellifera |
GB15635-PA |
A. mellifera |
GB16909-PA |
P. vanderplanki |
Pv.08789 |
S. invicta |
SI2.2.0_15022 |
N. vitripennis |
NV17142-PA |
S. invicta |
SI2.2.0_00399 |
P. humanus |
PHUM541150-PA |
A. mellifera |
GB17546-PA |
A. gambiae |
AGAP002737 |
A. gambiae |
AGAP002154 |
D. melanogaster |
FBgn0013988 |