MidgeBase gene description page [Pn.03160]

Outline

Link to gbrowse

Gene ID Pn.03160
Type Protein coding gene
Scaffold PnScaf2667
Start 38792
End 53915
Direction -

Sequence

Transcript: 4470 (bp)

 ATGCATCACATTGTGCGTAATATTGAAACGCTTCCTCGCATCGTAAAGCACTTAAAGGATTTACGATGTTGCGATGGCGATGCAGTTACACTAGAGTGCCATGTGGAAGGACTGCCGGAGCCGAATGTCTTCTGGGAAAAGGACGGTAAAATCCTTCACGATCGATCCACCGAGCATAGACAGAACTTCAACGGACGCAAAGCGACGCTCTCGATTCCGCGCGTATTCCCCGAGGACGAAGGTCAATATTGTCTCGTCGCGTGTAACAACATGGGACGTGTCAAGAGCTCCGCTTGCATTATCGTCGACGTGCCGGAAGAGAAGGAGAACCTTCTCAGCCGTCAGCTATCACGTCCAATGACTTTCCTCTCACCAAATTCGACGCCGCGTTCAACACCGCGCTCTACACCGGCCCGCAGCATGTCGCCGCTCTCGCTGCACCTCTCGATGATCAGCAACAACATCAACCTCACATACCGCCAGCGACGCTACCGCTTCTCGGCGCCAAAGTTCTACTCGGTGCCGCACAACCGCGTCTGTGAGGAGGGCGACACCGTGCGCTTCCAGTGCGCGATCGCGGGCCACCCGGTGCCGTGGTCGACATGGGACAAGGACGGCATCATCGTGACGCCGTCGCAGCGTTTCCTCATCCGCGAGCGCGACGACATGCGCTACTTGGAGATCGAGGAGGTGAGCTTCGAGGACGCCGGCTTGTACCGCATCACACTCGAGAACGACTACGGGCGAATTGAGGCGACCGCACGTCTGGACATCATCAAGAGCGGCAAAATCCACAGACGCGGCATACGCGCATCGAGCGCACCGAACCGCGACTCGACCAGCATGACGCGACGCATCATGGGCTACAGCACGCGCATCGGCGGCCGCATGGCGCTTGCAGCGCAACGCGCTCAGTCGATCCCGGCGAAGAAGGCGAGCGTCTACCACAACGGCTACGACATCAGCGACTCGAGCCGGCTGACGGTGACGCAAAACGAGACCGAAATTCTGCTCGAAATCGACAGCGTGAAGACGTACGATGAGGGCGAGTACACGCTGATGCTCGAGGACGAGGCAGGCATTGCGACCACCACGACCACCTTCGCCAAGGTTCACTACGTCGAGGACGAGATCTTCGAGGAGACGATTGTGCCGAAGGTGACGCGACCGTTGGCGGCGAGTTTGACGAGCATCGAGGGAATTCCGCTCGATCTCACGCTCCAAATCGACTGCGAAGTTCCCTTCGACTACGTGTGGCTGCGCAACGACGAGCCGCTCCTCAACTGCGAGGACTTTGTCTACATCGACCACGGCTGCGGAACCCTGACGCTCCGCATCCAGGATCCGTTTGTGTTCGACTCGGGTAAATATGAGTGCGTCATCACGACACTCGCCGGTGAATGCACCAGCGAATGCAGCGTGGAAATCGAGGAGCTCAACAACTCAATCAATCTTCTCGACGTGATACCGGAATTCATCAAAACGCCGCTTCCCTCGATCGCCGTGCCCGGCTGTCCCGTGTCGTTCTGCACGCGCGTCACGCCGATTGACTCACTCGTCGAGTGGAGCGTTTGCGGCTGCGAGGTGACTGACGACACAAAAGGCTTCAAGATTGAGAGATTGGATGATGGACTCAATGTGCTGCATGTGCTGAGTGTCGATCACAAGAGGAGCGGCGAAGTGAAGTGCCGTGCCATAAGTAGCAATAACCCGCAAATTTTTAACGTCTATCACACGAATTTGACTGTTCTTCCCGTGCCGATCAGCGAACGCTACGAGCCGATCGAAAGTGTCTTAGAGAGTAATAGCTATAGTCATAATAATCATAAGAGTAGTTTAGTGCACGACCTAGACTTAACGTGCTTCATCACAAAGCGACCCGAGGACAGGACCGTGCTTGTCGGCGACTCGATCGAGTTGAATGTGTCTTACATTGGACGTCCTGAGCCAACCGTCCGTTGGATGCGAGCGGAACGTCTAATTGGAAACGAACCAAATACGATAATAATTACCAAAAATGGTCACTCAAGATTGATCATCAACAATATCACGTCAGACCAAAGCGGAAAGTATAGTGTGGAAATTATGAACGAACATAGCACGGATCTTGCGTCGAGCTCCGTTGCCGTCGAGGGAGTTCCCGATGCGCCAATCTCGCTGAGCTACTCGAAGGGCTCCGATCGTGTGGCCGTTGCCTGGTCTGGACCCGCCTACGATGGCGGCTGCATGCTTACTGGATTCATCCTGGAGATGCAGCAAGACGACGGCGAGTGGGAGGAGGTGGCAACAGTTGCAGACTCGCTAGCATATACCGTAAAGAACCTAACGCCCGACAACAAATACAAGTTTCGTGTGCGAGCGCAGAACGTTCACGGCAATTCGCCGCCGAGCAAGTCGACCGAGGAAATCGTTTTAGTGAAGCCAAGTGATGCGAGCGGCAACGGCAGAGACGACGAGGACGAGGAGAGCGAGGAGGAGTATTCGAAGGAAATTCCATATGTGAAGTCGGGCGGCGACTTTAAAGCGCGCTTCGAGATCATGGAGGAGCTGGGCAAGGGACGCTTCGGGACAGTTCATCGTGTGATGGAACGTGAAACTGGTCTCATACTGGCAGCAAAAATAATTAAGTGTATAAAAGCAATGGACCGCAAAAAGGTTCAAGACGAAATCAAAATAATGAAATCACTTCAGCATCCCAAACTCCTTCAGCTTTCAGCTTCATTTGAGACGCAGAAGGAAATTATCATGGTGATGGAATACTTAAACGGCGGCGAGCTTTTCGAGCGAGTCGTTGCAGACGACTTTACACTGACTGAGCGTGACTGCATTTTATTTATGAGGCAAATTTGTGAGGGCGTCGCATACATGCACAGCAAGTATGTCGTTCATCTCGATCTTAAACCCGAGAACATCATGTGTCACACGAGAACGAGCCATCAGATAAAGATCATCGATTTCGGCTTGGCACAAAATCTCGAGCCGGGAAAGCAGGTTCGAGTTCTCTTCGGCACGCCGGAATTTTGTCCCCCAGAGATAATCAATTACGAGCCAATAGGTCTTCAATCAGACATGTGGAGCTTGGGTGTCATCTGCTACGTTCTTCTGTCGGGCTTATCGCCTTTTATGGGCGAGACGGATGTGTCGACATTTTCCAATATAACCAGAGCTTATTATGATTTCGATGACGAGGCCTTCGATGCAATCAGCGAGGAGGCGAAGGACTTCATAGCGGGACTGCTCGTCTATCGCAAAGAGAACCGAATGAATGCCAGACAGTGCCTCGAATCGAAATGGTTGTCGCAGCACTACGATGTCATGGGCAGCACTAAATTGTGCACGGATAAGTTGAAAAAGTTCATCATTCGCAGAAAGTGGCAGAAAACAGGAAATGCCATACGAGCCATCGGAAAGATGAAAAATTTAACTTTATCCGCTGCATCGCGCAAGAGTGCCAGCGGCGTCTCATCGAATAGCAGCCCGCGACCATCGATTTCGGGTCCGATTTATGCGAACGCTCAGAACTTTGTCTCCGACACTCGCATCACGAGCGTCGACGAAGAGCTCGCAGAGGTCGGTCGAAATCTCCCTACCGCGAACGGCAACGACAGCAGCAAGTCGACGAAGGATCCGCGAATCTGCAACGAGCGCAGCGACTCGGGATTCAGTGAATGCTCGAACTGCTCCACTCCGTCCGCTTCGTGTGCGTGCAATCACCACACAAACAGCCTCAGTCACGAGCCAAAGGCCGACACGATCGTCGAGGAGCGGAGTAGCGTAAGTAGCAGCAGCAGCTCCTTGGAGCCGCATGAAACGACGGCGAGCGACGCAACCCACATCGAGGAGGAGGAGGAGCGCCATGCCGACTCCGACCACGACATTCACAGCGAAATTTCATCGCTCGAGTACGACACAAGTCACTGCGACAGCATCGCAGTGTCGCTGCGCGTTCCGACTCGTCGCGAGCTCTTCCTGAACGAGCAGGGCGAGCCGATGAGCGAAATCGAGCGCCGCAAGGTCTCGCTCGAGAACAAGGCGAGAAAGGTGGCGGCGACGAGCGTGCGACAAGAGTTCTCGCTCGAGAAGCTCAAGAAGACGAGCAAGGTCGCTCTGCTGATGGAGAAGTTCGAGGGTGAGGCGACGAGTCCCGTTGTCTCGCCCAAAGTGCCCGCGCACGCCGACAGTCAATTCAAATTCAAATCGACAGTTACCATTGAGAGAGCTGCAAAGTCTCCGCGAAATTCACCAGCAACGAGTCCAGCAGCAGCAACAGCGACATTTAGGTTAAGCGATAGAGTCAGAGAGGCGACCGAGAGATTGTCCAAGCCCAAGCAGCAGCCAGTGCCGCAGCAACAAGCGACAGCAACAACAACAATGACGACGACGACGACAACAATGACGAATAAGACCACAAAAGCCAGTATTCTCAAACAAAATGCCAACTTTGTACGATCGAAGGATTTTTGGAAGAGA 

Protein: 1490 (aa)

 MHHIVRNIETLPRIVKHLKDLRCCDGDAVTLECHVEGLPEPNVFWEKDGKILHDRSTEHRQNFNGRKATLSIPRVFPEDEGQYCLVACNNMGRVKSSACIIVDVPEEKENLLSRQLSRPMTFLSPNSTPRSTPRSTPARSMSPLSLHLSMISNNINLTYRQRRYRFSAPKFYSVPHNRVCEEGDTVRFQCAIAGHPVPWSTWDKDGIIVTPSQRFLIRERDDMRYLEIEEVSFEDAGLYRITLENDYGRIEATARLDIIKSGKIHRRGIRASSAPNRDSTSMTRRIMGYSTRIGGRMALAAQRAQSIPAKKASVYHNGYDISDSSRLTVTQNETEILLEIDSVKTYDEGEYTLMLEDEAGIATTTTTFAKVHYVEDEIFEETIVPKVTRPLAASLTSIEGIPLDLTLQIDCEVPFDYVWLRNDEPLLNCEDFVYIDHGCGTLTLRIQDPFVFDSGKYECVITTLAGECTSECSVEIEELNNSINLLDVIPEFIKTPLPSIAVPGCPVSFCTRVTPIDSLVEWSVCGCEVTDDTKGFKIERLDDGLNVLHVLSVDHKRSGEVKCRAISSNNPQIFNVYHTNLTVLPVPISERYEPIESVLESNSYSHNNHKSSLVHDLDLTCFITKRPEDRTVLVGDSIELNVSYIGRPEPTVRWMRAERLIGNEPNTIIITKNGHSRLIINNITSDQSGKYSVEIMNEHSTDLASSSVAVEGVPDAPISLSYSKGSDRVAVAWSGPAYDGGCMLTGFILEMQQDDGEWEEVATVADSLAYTVKNLTPDNKYKFRVRAQNVHGNSPPSKSTEEIVLVKPSDASGNGRDDEDEESEEEYSKEIPYVKSGGDFKARFEIMEELGKGRFGTVHRVMERETGLILAAKIIKCIKAMDRKKVQDEIKIMKSLQHPKLLQLSASFETQKEIIMVMEYLNGGELFERVVADDFTLTERDCILFMRQICEGVAYMHSKYVVHLDLKPENIMCHTRTSHQIKIIDFGLAQNLEPGKQVRVLFGTPEFCPPEIINYEPIGLQSDMWSLGVICYVLLSGLSPFMGETDVSTFSNITRAYYDFDDEAFDAISEEAKDFIAGLLVYRKENRMNARQCLESKWLSQHYDVMGSTKLCTDKLKKFIIRRKWQKTGNAIRAIGKMKNLTLSAASRKSASGVSSNSSPRPSISGPIYANAQNFVSDTRITSVDEELAEVGRNLPTANGNDSSKSTKDPRICNERSDSGFSECSNCSTPSASCACNHHTNSLSHEPKADTIVEERSSVSSSSSSLEPHETTASDATHIEEEEERHADSDHDIHSEISSLEYDTSHCDSIAVSLRVPTRRELFLNEQGEPMSEIERRKVSLENKARKVAATSVRQEFSLEKLKKTSKVALLMEKFEGEATSPVVSPKVPAHADSQFKFKSTVTIERAAKSPRNSPATSPAAATATFRLSDRVREATERLSKPKQQPVPQQQATATTTMTTTTTTMTNKTTKASILKQNANFVRSKDFWKR 
Type Start End Length
CDS 38795 39045 251
CDS 39163 40003 841
CDS 40345 40643 299
CDS 40773 40911 139
CDS 41006 41186 181
CDS 42268 42783 516
CDS 43060 43331 272
CDS 46699 47058 360
CDS 50732 51044 313
CDS 51209 52196 988
CDS 52624 52916 293
CDS 53899 53915 17
intron 39046 39162 117
intron 40004 40344 341
intron 40644 40772 129
intron 40912 41005 94
intron 41187 42267 1081
intron 42784 43059 276
intron 43332 46698 3367
intron 47059 50731 3673
intron 51045 51208 164
intron 52197 52623 427
intron 52917 53898 982

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr NP_001188954 Stretchin-Mlck, isoform O [Drosophila melanogaster] gb|ADV37200.1| Stretchin-Mlck, isoform O [Drosophila melanogaster] 0.0
InterPro IPR003598 Immunoglobulin subtype 2
InterPro IPR020675 Myosin light chain kinase-related
InterPro IPR017441 Protein kinase, ATP binding site
InterPro IPR007110 Immunoglobulin-like
InterPro IPR002290 Serine/threonine- / dual-specificity protein kinase, catalytic domain
InterPro IPR003599 Immunoglobulin subtype
InterPro IPR003961 Fibronectin, type III
InterPro IPR013098 Immunoglobulin I-set
InterPro IPR008271 Serine/threonine-protein kinase, active site
InterPro IPR011009 Protein kinase-like domain
InterPro IPR000719 Protein kinase, catalytic domain
InterPro IPR013783 Immunoglobulin-like fold
InterPro IPR020635 Tyrosine-protein kinase, catalytic domain
Gene Ontology(BP) GO:0006468 protein phosphorylation
Gene Ontology(MF) GO:0004713 protein tyrosine kinase activity
Gene Ontology(MF) GO:0004674 protein serine/threonine kinase activity
Gene Ontology(MF) GO:0005524 ATP binding
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0004672 protein kinase activity
Gene Ontology(MF) GO:0016772 transferase activity, transferring phosphorus-containing groups
Pfam PF00047.20 Immunoglobulin domain 2.3e-12
Pfam PF11465.3 Natural killer cell receptor 2B4 0.00019
Pfam PF07679.11 Immunoglobulin I-set domain 5e-71
Pfam PF07714.12 Protein tyrosine kinase 3.2e-38
Pfam PF00069.20 Protein kinase domain 1.7e-65
Pfam PF06293.9 Lipopolysaccharide kinase (Kdo/WaaP) family 7.2e-05
Pfam PF00041.16 Fibronectin type III domain 9e-14
Pfam PF13927.1 Immunoglobulin domain 1.3e-15
Pfam PF13895.1 Immunoglobulin domain 6.2e-24
Pfam PF07686.12 Immunoglobulin V-set domain 2e-14

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.01161
Pn.01159

Orthologous genes

Species Gene ID
B. mori BGIBMGA002033-TA
N. vitripennis NV17141-PA
S. invicta SI2.2.0_00037
A. aegypti AAEL008057
H. sapiens ENSP00000352088
B. mori BGIBMGA002035-TA
T. castaneum TC002962
C. quinquefasciatus CPIJ012863
H. sapiens ENSP00000353452
H. sapiens ENSP00000346846
H. sapiens ENSP00000418335
A. aegypti AAEL007281
N. vitripennis NV17133-PA
H. sapiens ENSP00000354004
D. plexippus DPOGS203928PA
P. vanderplanki Pv.08790
H. sapiens ENSP00000320622
H. melpomene HMEL013510-PA
C. quinquefasciatus CPIJ015022
B. mori BGIBMGA002034-TA
M. musculus ENSMUSG00000022836
A. mellifera GB15635-PA
A. mellifera GB16909-PA
P. vanderplanki Pv.08789
S. invicta SI2.2.0_15022
N. vitripennis NV17142-PA
S. invicta SI2.2.0_00399
P. humanus PHUM541150-PA
A. mellifera GB17546-PA
A. gambiae AGAP002737
A. gambiae AGAP002154
D. melanogaster FBgn0013988