MidgeBase gene description page [Pn.12015]

Outline

Link to gbrowse

Gene ID Pn.12015
Type Protein coding gene
Scaffold PnScaf15238
Start 35995
End 46444
Direction -

Sequence

Transcript: 5697 (bp)

 ATGGATCAGTCGCACATGTCGGACCACTCGTCGGCGCAGTCGTCGCCCATCCACATGCAGAAGATGCAGTTCAAGCTGCAGCAGAGGCTGGCCGAGCAGCAGCAGCAGCAGCAGCAGCAGCAGATGTCGCCGATGATGGACAGACGGAAGATGGGCGCGAAGGCCGCGTCCGCGCTCGATCTCTCCAACGGCCAGCCCGCCTCCATGGACGCGCAGCTCCTCTCGCACCTCCAGCACCAGCAGCAACAACAACAACAGCAGCACTCGCTGATGCTGCACCAGCAGCAGAAGTTCCCGCACCAGAACCACCCGATCGAGCAGCAGATGTACCAGCAGTTCCACCAGCACAAGCCGATGATCACGCAGCTGATGAAGGTGCAGGGCAACAACGCGTCGCTCGCCGACTTCCGCTCGATCGTGCACGGCGCGGCCGGCGGCAGCGCCGCCCAGGTGGCCGACCTCATCGCGAGCAAGGAGCAGCTGAACGCGATCGTGAACGCGCAAAAGCACTCGGACTACATCGTCGAGGACTACATGGACAAGATCCAGACGCGCATCGCGCTGCTCGAGACGGAGCTGAAGTTCGCCGACCGCAAGCTGCACGTGCTGTACGGCGAGTACAACGACATGCTCGCCAAGATCGACAAGCTCGAGAACCTCACGATCGCGCAGCAGTCCGTGCTCGCCAACCTCCTCGACCTGTGCAGCAACCAGTCGCACGAGGTGCACGCGGCGAAGGCGATCCAGGCGAAGGCGGCGCAGCTGTTCGGCGTCGACCCGCGCGCGCTCCTCACGATGGGCGCGGCCGAGGCGGCGTTCGAGGGCGAGGCGGCGTTCCCGAGCGAGGCGCTCGACTCGACGGCCGCCGAGTTCAGCGAGCTGCTCGAGGACCTCAAGAACGACGCCATCATCGATGGGCTGCGCGGGCAGCAGCTGCAGCAGTTCGCCGCCGACTACCAGGGCGACGCCGACTACGAGGCGCTCATCGAGCTGCTCGGCAAGCAGATGAAGCTGCCGCCCGCCGACGCCGCCTTCTCGTTCTTCGGGCGCGGCGAGCAGGCCGAGGAGTTCCTCAAGGCGCAGGACGAGATGAAGCAGCACGCGCTGCAGCAGCAGCAGCACAACAGCAGCATGGAGCTGCTCAAGAACCTCGCCGACGGCCAGCTCAAGATGGAGATGGAGTTTTTTCGCAGCGGCGAGATCAACAAGAGCCTCTTCGAGACGCTCCAGGCGAGCGTCCAGCCGACGACCGCGCCGCCCACAGCCGCCACCGCCGCCGCCGCCACCCTCGACATGATCTACGAGGACAATGAGGAGCGCGCCAACGAAAACGGCGGCGCCGGCGGCGGCGTGAGGGAGGCGGACGCGAAGGCGCACGAGTCGTCGGCGGAGGAGAAGTCGTCGGGGTCGCGCAAGCCCAAGTCGAAGAAGAAGAAGCACCACCAGCTGCAGCTGCAGCAGCAGCAGTTCCAGCTGCAGAGCGACGACCAGCTGGTGCACGAGATCATCGAGGAGATCCTGCGGCTCGAGAGCCTCACGCACCTGCTCAGCAAGAACCAGCACGACGAGCTGAAGGCGCTCGTGAAGGGCGAGATCAAGGTGCTGCAGAGCATGCGCAAGCTCGACCTCAACTTCGTCGCGCTCCTCGTCAGCCCGACGGCGCAGCTCGCCTCGCCCTCCGCGCCCAGCTCCATGAGCCTCGAGGAGGAGGACCAGAAGTTCGACGCGCTCATGCGCAAGCTGCGCCGGAACCTCGACGCGCTGCGCACGATCGAGCCGACGGCGAGCGAGCCGACCAGCTCCAAGGCCGCGCTCTACAACGACGACGAGTACTTGCAGTCGCTACGCAAGTCCCTCGACCGCCACAACTCCATGCAACTCCTCCTGCAGCTGCAGAACCCCAACCTCAACTCCTCCTTGTCCTCCGCTGGCGGCGGCGGCGGCGGCGTCGGCTCCAAAATCAACGCCGGCGATTTTTTATCCGACGATCAGCTGGACGGCGGCGAGAGCAGCAGCCCGCCGCCGCCCGCACCCAACGGCGACCTCTACTACCCGAGCGGCGAGGACGAGCGGTTCGCGATGACGGCGATGGCGACGGCCGCGGAGCAGCAGGAGTGGAACCCCTTCCACGTCGACATCCTCATGATGAAGCAGCAGCAGCAGAGCCAGCCGCGCCAGCTGTCGCCCTACCGGCGCAACACGTCGCCCAAGAAGAGCGACTCGGGCCTGAGCTCCATGTCGGGGTTCTCGAGCTTTGAGAAGTCGCCCAACTCGCCGTCGTACTCGCTGCCCTACGATGGGGCCAGTGGTGGCGGCGGCGGTTGTGGGGGGAGCTACCGGGCGGCGACCATCCGCTCGCAGGACTACGTGAAAATGCTCAACACCTACCAGCAGCTTAAGGTGTCGTCGGCCGCCTCGCACGAGGCTGCTGTTGCCGGTGGTGGTGGTGGGATTGCGAATGTGAGTGCTGCTTTTCAAGAGCCCCAGTCGTCGGTGTCGCCCTTTCTCTTCTCCGACGTTCCCAGCAGCGGCAGCGCCCTCCTACAACAACAACAGCAGCAGCAGCAGCAACAACAAAGTCAGCAGCAAGGGCAGCTGGCGCCGCCGGAGGGCATCTACGGCGAGGAGAACCTGAACTACATAAAGGAGCTGTCGCAGAACGTGCCGATCTGCTCGATCTACGAGAACAAGTCGATTTTTGACAACGTGAGCGTCATCAAGCCAGCCTCAGCGTGGGAGATGTACGTGAAGAACCAGAGCGCTGAGACGTCGGCCGGCGGCCAGCCGGTGCCCTACCCGGACCTCCTCAACGTGAGCCAGGAGGCGAGCGGCGAGGCGCTGAAGAGCCTGCAGCAGCAGCAGCAGCTCGAGGAGCTGATGGAGCAGCGGCGCAAGCTCGACGCCCAGAAGCAGCAGCTCAAGCAGGAGCAGCAGCTGCTCATGCAGCGCAAGCTGCAGAAGCGCTCGCAGAACCTCACCGACCACCTCGTGTACTACCCCTCGCAGCCCTCCAACGACTACCAGCGCGACTACCATCCGCAGCACCACCAGAACCTCCAGCACGACGAGTTCATGCAGCACGCGCACGCCCAGCACCAGGTGCACCTGCAGCGCCAGATGCGTCTCACTGAGGAGGGAAAAAACGGCGGCGTCGCCGTCGGAGGAGCGGGCCCTGAAGGTGGGAGCGGTAGCGGGGGCAGCCATGGAGGGAGGGGCGACAAGAAGCGGCACGCGTACTTGAACAAGAGCCTCAACAAGGTGCAGAACTGGCTGCCCGAGATCAAGCTCAAGAAGATGTCGAAGCGCCACCGCAGCCACAGCCTCCCCGGGCAGGTCGACAGCGACGACGTGTACGAGCCGCAGCACTACAAGATGAACCTGAAGAAGGGCGGCGGCGGCGGCGGGGGCGGAGGTGGCGGCAGCGGCGCGAGGGGCGACAAGAAGCAGGGCGAGGTGTACGTGATGAAGAGCTACATGAAGGGCAAGAAGAAGGACCTCGTGCGCACGATGTCGTCGATCATGCACAAGGCGCAGAAGACCTACCGCCGGCACAGCTTCTCGCACCACCACCTCTCGGACGAGGAGGGCGGCGGCGGCGGCGGCGGGGGCGAGGAGCGCTCGCCGCAGTTCCAGCGTGTGCGCGGCGCGCGCAGCATGTCGTCGGCGGCGACGGCGAGCAAAAGCCGCGCCTACTCGGACACGGAGACGGACATCAGCTCGATGTTCAGCGACAGCGAGGAGCACTCGATGCCGCCCATTTTCGCGACCGTCGGCGATTCAAAGTCAAAACATTCTGATGCTGGTGATTCAGACCGAAATGCTGTGAGCGACTCGTCCAACGGCAGCCGCCAGCAGCACGCGTCCGAGCAGCAGCCGCAGTCGCAGGACAACAATTTCAACCTGAACTTCACCAGCACCAGCATGGAGTTTGCAGCGAGTCGCAAGGTCGGCATTTTCCGCAAGAAGTCGTCCACGAACGACGACTTGGGCGGGAGCTGTGGCGGCGGCGGCGGCAGTGCCAGCGACTCGCACAGTCCCACCAACAACGAGTCCGTCAACGACATCTTCGAGCGCACGAAGGCGGCGCAGCCGGCGGCAACGGCGACGGCGACGGCGCAAAAGCTCACCAAAACCCACTCGATTTTCGTCGACTCCATCGACGACGACGGCGAGGTGCGCGCGAAGCCGGTGGCGGCGCCGCGCAACGAGAGCCTCGACCCTTCACCGGCGACGGCTGTCACGGCGGCTCCGACGCGCCCGGTGCCGAGCCCGCGCTTCGAGCACTCGCGCTCGAACAGCATCAAGAACAGCCTGGACGTGCCGGGCGGCGGCGGCGGCAAGGAGGAGGAGGACAGCCGCTCGCAGCACTCGTTCCGCACGTCGATCTCGAGCCGGCGGCAGAGCACGGAGGACTCGATCGACACCGACGACGAGTACTTCTACTACGAGATGCGCAACCTCGAGGAGCTCGAGCGCAACAGCCACATGGAGTCGCTGCTGCAGGACAACAAGAGCGAGCTCATCAACAACATCATCCGGCTGGAGAGCTCCACGAACGCCGTCGAGCCGGACGACTGCGTGCGCCGCAACATGGCGACCGTGCTGCACGAGCTGCGCGAGACCGTGCGCCTCCGCGAGCCCTTCGACCAGGTCGCGCAGGACGCGCTCAACAACAACAACAACAAACAAAATGGCCGCCACAGGAAGGGCATCTACGACAAGTTCACGCTGGCGTCGACGAACGTCGCCGACCTGCCGTGGAACCGCGACAGCGACGACGACTACAACGAGGACGACGAGCTGATGACGCAGCTGACGCAGTTCGAGTCGGAGATCCGCGACGCGAGCTACGGCAAGGTGAAGAAGAAACGGAAAAAGCCGAAGCGCGCCGAGCGCCACGCCGCCGGCCAGTCGTCGTCGTCGTCGTCGAGTGCGGACGAGGAGCACGGAGGCGACGGCGGCGGCGACGTCGACGAGATGGCGGCGCCCAAGTACGAGCGGCCGCACTCGCAGTCGTCGGGCGTGACGTCCGGGCCGGACTCGCCGATCGCGAGTGACGGCGAGGCGGAGGAGGACGAGGCGGGGGCGCCGCGCGAGCGCTACGACGAGTTCAAGCAGAGGCAAAAAAATGAGCGGCTCCTCGAGCATCCACAGCAGCCGCTTCAACATCCGAGCGATGATCGAGGAGCGAACCGCGACGAAGGACGCACGCCGTCCGGTAGTGCTGCTGCCGAGACGGAACTAAACACACACGACAAACAACAACAACAGCAGGTCCAGGAGGGGGAGGCGCGCAAGGCGTCGCCGATGAGGAAACTCATGCAGCTGTCGTCGACGGTGTCGAGCGACTCGACCAACCAGGACAGCGGCATCAGCGACACGAGCGGCGGCGCCGGCGGCATGAACAGCAAGTGGAAGCTGCTGAAGACGCTGAAGGAGCGCAAGGAGATGAACAATCAAGTGAAGATCAAGGAGGAGGAGGAGAGCACCGCCAAGGAGGGAAAGACGGGCGGCGGCGATGGAGTGCGTGGCAATCACTCAAACGACAATCCATTCTACAGCAACATTGACAGCATGCCCGACATTCGGCCGCGCAGAAAATCAATTCCGCTCGTGTCGGAGCTGGGTTGGTATTCGAATGCCTACCCGAGCAACAACAGCAGCCAATTGGCATACTTTGAGTATGCCACAGGTTTT 

Protein: 1899 (aa)

 MDQSHMSDHSSAQSSPIHMQKMQFKLQQRLAEQQQQQQQQQMSPMMDRRKMGAKAASALDLSNGQPASMDAQLLSHLQHQQQQQQQQHSLMLHQQQKFPHQNHPIEQQMYQQFHQHKPMITQLMKVQGNNASLADFRSIVHGAAGGSAAQVADLIASKEQLNAIVNAQKHSDYIVEDYMDKIQTRIALLETELKFADRKLHVLYGEYNDMLAKIDKLENLTIAQQSVLANLLDLCSNQSHEVHAAKAIQAKAAQLFGVDPRALLTMGAAEAAFEGEAAFPSEALDSTAAEFSELLEDLKNDAIIDGLRGQQLQQFAADYQGDADYEALIELLGKQMKLPPADAAFSFFGRGEQAEEFLKAQDEMKQHALQQQQHNSSMELLKNLADGQLKMEMEFFRSGEINKSLFETLQASVQPTTAPPTAATAAAATLDMIYEDNEERANENGGAGGGVREADAKAHESSAEEKSSGSRKPKSKKKKHHQLQLQQQQFQLQSDDQLVHEIIEEILRLESLTHLLSKNQHDELKALVKGEIKVLQSMRKLDLNFVALLVSPTAQLASPSAPSSMSLEEEDQKFDALMRKLRRNLDALRTIEPTASEPTSSKAALYNDDEYLQSLRKSLDRHNSMQLLLQLQNPNLNSSLSSAGGGGGGVGSKINAGDFLSDDQLDGGESSSPPPPAPNGDLYYPSGEDERFAMTAMATAAEQQEWNPFHVDILMMKQQQQSQPRQLSPYRRNTSPKKSDSGLSSMSGFSSFEKSPNSPSYSLPYDGASGGGGGCGGSYRAATIRSQDYVKMLNTYQQLKVSSAASHEAAVAGGGGGIANVSAAFQEPQSSVSPFLFSDVPSSGSALLQQQQQQQQQQQSQQQGQLAPPEGIYGEENLNYIKELSQNVPICSIYENKSIFDNVSVIKPASAWEMYVKNQSAETSAGGQPVPYPDLLNVSQEASGEALKSLQQQQQLEELMEQRRKLDAQKQQLKQEQQLLMQRKLQKRSQNLTDHLVYYPSQPSNDYQRDYHPQHHQNLQHDEFMQHAHAQHQVHLQRQMRLTEEGKNGGVAVGGAGPEGGSGSGGSHGGRGDKKRHAYLNKSLNKVQNWLPEIKLKKMSKRHRSHSLPGQVDSDDVYEPQHYKMNLKKGGGGGGGGGGGSGARGDKKQGEVYVMKSYMKGKKKDLVRTMSSIMHKAQKTYRRHSFSHHHLSDEEGGGGGGGGEERSPQFQRVRGARSMSSAATASKSRAYSDTETDISSMFSDSEEHSMPPIFATVGDSKSKHSDAGDSDRNAVSDSSNGSRQQHASEQQPQSQDNNFNLNFTSTSMEFAASRKVGIFRKKSSTNDDLGGSCGGGGGSASDSHSPTNNESVNDIFERTKAAQPAATATATAQKLTKTHSIFVDSIDDDGEVRAKPVAAPRNESLDPSPATAVTAAPTRPVPSPRFEHSRSNSIKNSLDVPGGGGGKEEEDSRSQHSFRTSISSRRQSTEDSIDTDDEYFYYEMRNLEELERNSHMESLLQDNKSELINNIIRLESSTNAVEPDDCVRRNMATVLHELRETVRLREPFDQVAQDALNNNNNKQNGRHRKGIYDKFTLASTNVADLPWNRDSDDDYNEDDELMTQLTQFESEIRDASYGKVKKKRKKPKRAERHAAGQSSSSSSSADEEHGGDGGGDVDEMAAPKYERPHSQSSGVTSGPDSPIASDGEAEEDEAGAPRERYDEFKQRQKNERLLEHPQQPLQHPSDDRGANRDEGRTPSGSAAAETELNTHDKQQQQQVQEGEARKASPMRKLMQLSSTVSSDSTNQDSGISDTSGGAGGMNSKWKLLKTLKERKEMNNQVKIKEEEESTAKEGKTGGGDGVRGNHSNDNPFYSNIDSMPDIRPRRKSIPLVSELGWYSNAYPSNNSSQLAYFEYATGF 
Type Start End Length
CDS 35998 36069 72
CDS 36401 36517 117
CDS 40489 43526 3038
CDS 43975 46444 2470
intron 36070 36400 331
intron 36518 40488 3971
intron 43527 43974 448

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_002099624 GE14499 [Drosophila yakuba] gb|EDW99336.1| GE14499 [Drosophila yakuba] 4e-40

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.12014

Orthologous genes

Species Gene ID
P. humanus PHUM432180-PA
H. sapiens ENSP00000442569
A. gambiae AGAP000065
H. sapiens ENSP00000367757
H. sapiens ENSP00000438156
D. melanogaster FBgn0025726
S. invicta SI2.2.0_02136
H. sapiens ENSP00000447572
N. vitripennis NV21516-PA
S. invicta SI2.2.0_10367
H. sapiens ENSP00000252773
D. plexippus DPOGS204827PA
P. humanus PHUM401000-PA
N. vitripennis NV30665-PA
M. musculus ENSMUSG00000062151
H. melpomene HMEL014388-PA
H. sapiens ENSP00000446831
H. sapiens ENSP00000380006
C. quinquefasciatus CPIJ006394
H. sapiens ENSP00000429562
T. castaneum TC004071
A. mellifera GB12196-PA
A. aegypti AAEL002357
P. vanderplanki Pv.06447
H. sapiens ENSP00000367756
S. invicta SI2.2.0_11730
H. sapiens ENSP00000447236
B. mori BGIBMGA001566-TA
H. sapiens ENSP00000260323
H. sapiens ENSP00000400409
D. plexippus DPOGS204826PA