MidgeBase gene description page [Pn.04036]

Outline

Link to gbrowse

Gene ID Pn.04036
Type Protein coding gene
Scaffold PnScaf3332
Start 5641
End 31744
Direction +

Sequence

Transcript: 7095 (bp)

 ATGGCATCACAACCATCAGAGCACTTACAAAATGATACTGGTAATAATCAAGGAATAAAAGTGAACGAATATCAACAGCAGCAGCAGCAGGAGATTCTCACAAAGCTACAGAATGGTGGTCAAACGAACCAACCGCCTCCCGTTTCTGCAGTTACTAGTGATCAACAGCAGCAGCATGGAAAAGGCATGAAATCTCCTCCGAGTAACGGAAACAGTGGTCACACCAACCCAAACGAAATGATGCCAAACGTCCCCGGCTACCACCATCAGGACTCGTCGAGTTTGGGCATGCCTCCTCAGCAGCAGCAGCATCCCGGACAGCAGCACATGCATCACCATCCAAGTGCCGGCAACGAGAAAGATGACATACAGCAACAGCAACAACAGCAGCCACCTCATCATCAACAGCAACACCCGATGTATGGTCATCAGCCGATGCATCCGATGCACGCTCCGCAGCAGCATATGCCGGGACATCACTTGCCGATGCATCCGCACCAGCAAAGATACCCCCATCATCCACACGCTCCGCCACCAATGCAGCACCCGATGCATTCGCACGATCCGAGTCAACAGCAGCAGCAAATGGATCCGTATGCGCACTATCGCGGAATGATGGGACCGCGGCCGCCGCAGCGCTACGGACTGGCGCCTGCACCTGGTCCCCAAGGTCCGATGCCGAACAACGCTCAGAACGTCCCGGCAAACGCAGGCGGAGGCCAACAGGGTCCAACGCCGACGCTTAACTCACTACTACAATCACAGTCATCGCCATCACCTCAGCAATCGCCACAGAGCGGACCGCCGCCACACCGCTATGGACCGTACGATCCATACGCGCAGCCAGTGAATGCTGCTGCTGCCGCTGCCGCCGCCGCTGCTGCCTCGGCCCAACAAGGTCCACCTCCCCCCAACAGTGGATCGTCGTCGCCAATGCCGCCGACTCACACTCAGTCGCAGCCGCCGCCGCAGCAGCAGCAACAGCAGGGCTGGGCGCCGCCACCTCGTCCCTACAGCCCTCATCAGCAGTATCGAGGACCACCGCCACCAACGGGAAACTCGTCGAGAGGGCAGTCGCCGTATCCACCGCCGCCGACTTCTGGCGTTCAGAGTCCTGGTCCGTATCCCGGTCCGAATCAGCAGCAAGGAACTCAGCCACCACCTGGTCCACCACAACAATATCAGTATCCTCAGCGCTATCCGACGCCGCCAGGGCCGGGCGGCCAGCAACAATCAATGGGTCCGCAGAATCATCGACCACCATACTCACAATGGCCATCACCGGCTGCTAGTCCAGGTCCTCACTTAGGTCCGCCGCCACCATCGCAATCACCTCATGCGCCTCCTCAGAGTCCTGGACCACAGCAGCAGACGCCGCTGCAGCAGCACAGTGCTCCAAGTCCGTCCTCACAACAACAACCTCAGTCACCACATCAGTATTTAAATCGACCGAGTCAACCCTCAACACCGAGTGCAAACGACAATGAAATGATTGGGGTAAATCAAAAGAAAAAGAACAGCAGTGCAAAGAAGAAAACGAAGAAGCAACAAAAGCAGGAAGCAGCAGCTGCAGCAGCCGCCCAGAACAATTCCAATGAAAATTCTTCCAATCTGGCCGTGTCAAGTCAACAGCAGGCACCGCCATCGCAGCCACCGTCATCAAATAATGCTTCGCAACAGCCGGGCTCACAAATGAGGCCAATCTCATCGCCAAACAGCTCCTCGTCTGGTTCTCGATCGATGTCTCCCGCTGTAGTTGGACAACAAAATTTGCCGATGCCTCCGAGACCGTCGAGCAGTCATTCGCAAGGTATGTCGATGCCAAACCAGCAACAGCAGCAGCAAGGAAATCAAAGCCTCAACATTCCCAATTCCGCGAACCCACAAATGGAGGGCGCACAAATGCCCGGCGCTCCGCAAGGCATGCCGCAAGGATATGGCGGAGGGCCCAAAATGCCGCACGGCTCCTACAACATGTCCCCATATCCGCCGCAATCGCAGTATTCACAGGGAAGCTACTCGCCGCGATATCCCGGCTACGGACCATCCAGTCAGCACCATCCGCCGCCACCCAACAGCCCGAGCCAGTACCGGCCCATGCAGAACCACGTGAATCCCGCCGGACATCCGCAGTATCCTCCGCACGCGCCCTACCACCATCAAGTGTGGCCGCCGCCGCCACAAAACAGCGGCAACAATCCCGGTGCAATGAGCAATCACATTCAGGGCAAGAACATTGGTCCACCTCCACCACAGTCGCCGCAACAGCAGCAGCAACAACAGCCCATTCCCGGCCAGCCTCCGACACCGTCGCAGCAGCAACAACAACAACAGCAGCAGCATCAACATCAACAACAACAGGGTCAACAGCCGCAGCAGCCCGGTCCAGTCGGGTCTCCGCGCCCACTCAACTATTTGAAGCAGCACTTGCAGCACAAGAGTGGCTATCCGCAGGGGTCACCACCGCCGCAGCCTCAAGGCTACGGAAACGGTCCGGGCATGCATCCGATGGGTCCGCCGTCGCACCACATGGGTCCGCCTTCGCAAGGCGCAATGGGCCCACCGCCGGCCTCGACCGGAACACCTCCGGGCCATATGCAAGAGCCCGGCATGGCCATTCCGAGCATCCATCATCCCGAAGGTCAGGACAACGGCATGTCGCAGGGATCGCATCCCGCTACGTCGATAATCACCACGGGACCGGACGGCGTCGGCCTCGACGAAGCAAGTCAGCAAAGTACTCTATCAAACACGTCTATCGCGTCAGTAGAGGACCCGCAGTCAACTCCCAAGTCGCGCAAGACAAACGAGATGATGTATCCGGGCCTCGCCGGTCAGGCAAATGTCTCGCCGAGCACGAGTGGTGGCATGCCACACAGCGAGGACTTTGAAATGGGATCTCCTCCGTGGCCTCGAACGCCGGCCAGCCCGGTGTTCAATAGTCATGCTCCGCCACCGGTGTCAGCACAAGATTCGTTTAGATCATCGAAAATAACGGTTACCAAAAGTAAGACATGGAAAGACGTCGCCGGTTTGCTAGGAATCGGCGCCAGCAGTAGCGCAGCATACACGTTAAGGAAACACTACATTAAAAGTATTTTGCCCTTTGAATGCCAATTTGATCGGGGCGGTATCGACCCGGGTCCCATTATCCAGTCGGTCGAGGTCGGATCAAAGAAAAAGACACAGAAAGCGACATCGGTTCCATCGCCCGGCTCGAGCAACTCGCAAGACTCGTTTCCGGCTCCGCCAAGCAGCAGCCTCGATGCTTACTATGGTTCAGCACCCGGACAATATCCACCAGCACCGCCTTCTGGCCAGCAAGAATATGGTCCCGCAATGCCACGACCACCGTCTCAATCAAGCACTCAGCCAGGATCTGGTAATGCACCACCACCTACTAACGATAACATAAGTGTGAGCAATCCTTTTGAAGATTCCGTTGCATCGCGTCCCCCTTATCAACAGCCACAGCAGCAACACACTGGTGCTCAATATCCTAGACCTCCAGGTCCCTATCAAGGTCAATATGGTCAGCCGCCGTTTGGTCCGGGTCCAGAGCAGCAGCAGCAACCTTACCCGCATGGACCTCACAATTCGGGTCCAGGACCAAACCAATATCCACCATCGCAGAATCAATATCCAAACAACAGACAAATGTACGGCCCATACGGTCCCGAAGATTCTAATTTCAGAAACTCGGCTCCGCAAAGTGACGCCTATCGAGGCTACGGTCACGGTCCTCACTATCCACCACCACAAGGGCCGGGAAGCCAACAAGGCCCACGACCACCGTTCGGTCCACAGCAGCCACAGCAACAGCCTTCATCTGCGTCACCTTCGCCGCAAGCATCGGTCGCTTCTTCGGTTCCCTCGGTAACGAGCGGCTCGCAAGGACCGCATGCAACTCCCTCAGTGACGCCATCATCTGTAAATAGTAATGCACCTTCGAACCAAAACCAGTCACCTTCGGTTCCGCCACCGCCAACACAGAACTTTGGACCACCCAACAGTCAAGAGTACTACAATCGACCAGAACAGAATGCACCGCCAAGGCGGCATCCTGACTTTACAAAAGATCCGAATCAGCAACCGTATTCACCGTATGGCGCACAACGTCCACAGCAGATGTATGGAGGATGGCCACCAAACAGCAGCGGTTCGCAGTTTAGACCGCAGTATCCGCCACAAGGGCCACCGAATCAGTGGCCAAACCAAGGACCTCCACGACCTCAAGGTTCAAATCAATGGGATCAGCAGAACCGCTATCCGATGAACCAACAGTACGGGCCGAATCAGCCGTGGCAGCAAGGCCCCATGAGACCCCCGCAGCGAGGTGGAAAGCCCTTCTCGATGCCTCCGCCGCCACAAGGACCACAACAAATCAATAAAATGCCAAACCAATACGGACCGCCGCACATGCATCATCCGGGAATGCAGGGCGTAGGAATGCAAGGTCCCGGTGCGCAGCACGGACCACCGCAGCAAGGAGCACAAGTGAAGAGAGACATCGTATTTCCGTCGGACGCCGTCGAAGCGACGCAGCCTCTGCTCTATCGAAGAAAGCGAATGACGAAGCACGACGTGAGCCCCGTCGACCCGTGGAGAATCTTCATGTCACTCCGGTCTGGTCTCTTGTCCGAGAGCACTTGGGCGATAGACGTACTGAACGTTCTACTCTTCGACGACACGACGGTCGTCTACTTTGGCTTGACTCACTTGCCGGGACTGTTGAATCTTTTGCTGGAGCACTTTCAAAAGAGCCTCGCCGACACATTCGAGACAAAGTCCATATCGAGTGTGCCTACCATTGCCACCGTTGCCGCCATAAAGGATTCAAACGAGAGCGACGGCACCAAGAATGATGACGAGTGTAACAAAACTAATGGTGGTGATAATAAGTTAATAAGTAGCAATTTAGTTAGTAATAGTAATAGTGATACTGGTGATACTAGGTGTATTAAGAAACAAATTCAAATGCCAGCGAATCGCGAAGAACAAGACTCTTCAGTTGATTTGGGTTCAGTTACCGAAAACGATCTACCAAATCCAAACGAGCATATTGTGGTATTAAAGTCTTATAATTATACAATGCAATCGAGAAAGGGCGTTGCGGTCAAACTTCAAGATTCGAGCAACGACATTTTCATCATGGACAGCCAGCGGATGTGGGACAAAGTTTGCAATCGCGACTACTTCCTCAAGGCCACCGTTGAGGATGATCCGTTCAACGTTGGAAAGGAGCCGAACGACATCGAGTACATTATGGATTGCTTCAAAGCCGAATTTGCACACATTCCATTTTCGCGCTGCATCAAGAGTTCAAAGAGTGCCGAGTCAATCGAAAAGACGCGCAAAGTCGCTGAAGTGAAGCCAAAGCGCATGAGACTGTCTTCGGAGGAGGAGCGCGAGGTCGAGGAGTTGACGAGAAAGCTGTACAAGGCTCCACTCAAGCAACCGACCGTCAACAATGCTGTGTACAAGAAGGATTCAAACTCCTCGGACGCTGATTGTCGACAGGTCGACATGGAGATTGAAAAGGTTCCAAACGGTCCAGTGAATTCTGACGCAGCGCTTGACGGTGCAGAAGCCGAAAAGGCCGGCGAAGAGAACTCCACGACGAAGGAGGAAAGCGCAGAAACCAAGACTGAATTTGATATTAAAAGTACTGTTCGTGATGTGGCAAAGTGCTTGAAGCGAAGGCGCATGAGCGACTACGAGGACGAGTGTTACACGCGCGACGAGGCGAGTTTGCACTTGATCAGCGACAGCCAAGACTCTCTCGCTCGACGATGCATCTGCCTCTCGACGATCCTCCGAAATCTCACATTCGTTCCCGGCAACGAGCTCGAATTTTCCCGATCAACGACCTTCTTATCGATTCTCGGAAAACTCCTGCTTCTCCATCACGAGCATCCGATTCGAACGAAAAAGCAGCGAAACTATGATCGCGAAGAGGACGCAGACTTCTCCGACTCGTGCAGCAGCCTGCAGGGAGAGCACGAGTGGTGGTGGGACTTTTTAGTGCAAATTCGCGAGAACATGCTCGTGGCGTTGACGAACATTTCGGGATACTTGGACTTGTCGGCCTACGATGAGCCGATTTCGAGGCCCATTCTCGACGGACTTCTCCATTGGGCCGTGTGTCCGTCTGCGCACGGACAAGATCCCTTCACAACGCTCGCTAACAACAGTCCGATCAGTCCACAGCGGCTGGCATTGGAAGCTCTGTGCAAACTGTGCGTAACAGATGCGAACGTGGATTTGGTGATTGCAACACCACCGTTTTCACGACTCGAGAAGCTCTGTGCCGTTCTCACGAAACATCTGTGCAAAAACGAGGATCAGGTGCTACGAGAGTTCTCAGTGAATTTGCTGCACTACTTAGCGAGTGCCGATAGCGTCATGGCACGCGTCATTGCGAAGCAAAGTCCTTGCGTGTCCTACCTGGTGGCGTTCATCGAGCAGGCCGAGCAGACTGCCTTAGGCGTTGCCAACCAGCACGGCATCAACTTCCTGCGCGAAAATCCTGACTCGATGGGAACGAGCCTGGACATGCTGAGACGGGCGGCCGGGACACTGCTGCATCTGGCGAAGCATCCTGACAACCGACCGCTCTTCATGCAGCAAGAGCAGCGGCTTCTCGGTCTCGTGATGAGTCACATTTTGGACCAGCAAGTCGCACTGATAATCTCGCGAGTTCTCTTTCAAACTTCACGGGGCAGCGGCCCATTGACAACGACGCAGCAGAACGAGACTCAGAGCGACGAAGCACCGCAAAAAATACTTCCAGAAAGCCAGAAGAATTCTTCACAAAACACATCACAGATTGCAAACTTCGAGCAGAGCCAGAAAACTCCAGCTGTTGTTCACAACTCAACGCATTCGAGTGTACAATCGCAAAACTTTCCACCCTTGATGAAAACACCTCAACAATTTCTATCAACTCAAACCAAATTGTTAAATAGTAGTAACAGCAATGACGAAAGCATGAAATCGAAGTTCAACACGTTAAACAAACAAATTACAGCGAATGCGACTTCCGCTCAGTCATCGTCACCTCCTTCAATTCCTCAAACGGTTACGGCTTCATCT 

Protein: 2365 (aa)

 MASQPSEHLQNDTGNNQGIKVNEYQQQQQQEILTKLQNGGQTNQPPPVSAVTSDQQQQHGKGMKSPPSNGNSGHTNPNEMMPNVPGYHHQDSSSLGMPPQQQQHPGQQHMHHHPSAGNEKDDIQQQQQQQPPHHQQQHPMYGHQPMHPMHAPQQHMPGHHLPMHPHQQRYPHHPHAPPPMQHPMHSHDPSQQQQQMDPYAHYRGMMGPRPPQRYGLAPAPGPQGPMPNNAQNVPANAGGGQQGPTPTLNSLLQSQSSPSPQQSPQSGPPPHRYGPYDPYAQPVNAAAAAAAAAAASAQQGPPPPNSGSSSPMPPTHTQSQPPPQQQQQQGWAPPPRPYSPHQQYRGPPPPTGNSSRGQSPYPPPPTSGVQSPGPYPGPNQQQGTQPPPGPPQQYQYPQRYPTPPGPGGQQQSMGPQNHRPPYSQWPSPAASPGPHLGPPPPSQSPHAPPQSPGPQQQTPLQQHSAPSPSSQQQPQSPHQYLNRPSQPSTPSANDNEMIGVNQKKKNSSAKKKTKKQQKQEAAAAAAAQNNSNENSSNLAVSSQQQAPPSQPPSSNNASQQPGSQMRPISSPNSSSSGSRSMSPAVVGQQNLPMPPRPSSSHSQGMSMPNQQQQQQGNQSLNIPNSANPQMEGAQMPGAPQGMPQGYGGGPKMPHGSYNMSPYPPQSQYSQGSYSPRYPGYGPSSQHHPPPPNSPSQYRPMQNHVNPAGHPQYPPHAPYHHQVWPPPPQNSGNNPGAMSNHIQGKNIGPPPPQSPQQQQQQQPIPGQPPTPSQQQQQQQQQHQHQQQQGQQPQQPGPVGSPRPLNYLKQHLQHKSGYPQGSPPPQPQGYGNGPGMHPMGPPSHHMGPPSQGAMGPPPASTGTPPGHMQEPGMAIPSIHHPEGQDNGMSQGSHPATSIITTGPDGVGLDEASQQSTLSNTSIASVEDPQSTPKSRKTNEMMYPGLAGQANVSPSTSGGMPHSEDFEMGSPPWPRTPASPVFNSHAPPPVSAQDSFRSSKITVTKSKTWKDVAGLLGIGASSSAAYTLRKHYIKSILPFECQFDRGGIDPGPIIQSVEVGSKKKTQKATSVPSPGSSNSQDSFPAPPSSSLDAYYGSAPGQYPPAPPSGQQEYGPAMPRPPSQSSTQPGSGNAPPPTNDNISVSNPFEDSVASRPPYQQPQQQHTGAQYPRPPGPYQGQYGQPPFGPGPEQQQQPYPHGPHNSGPGPNQYPPSQNQYPNNRQMYGPYGPEDSNFRNSAPQSDAYRGYGHGPHYPPPQGPGSQQGPRPPFGPQQPQQQPSSASPSPQASVASSVPSVTSGSQGPHATPSVTPSSVNSNAPSNQNQSPSVPPPPTQNFGPPNSQEYYNRPEQNAPPRRHPDFTKDPNQQPYSPYGAQRPQQMYGGWPPNSSGSQFRPQYPPQGPPNQWPNQGPPRPQGSNQWDQQNRYPMNQQYGPNQPWQQGPMRPPQRGGKPFSMPPPPQGPQQINKMPNQYGPPHMHHPGMQGVGMQGPGAQHGPPQQGAQVKRDIVFPSDAVEATQPLLYRRKRMTKHDVSPVDPWRIFMSLRSGLLSESTWAIDVLNVLLFDDTTVVYFGLTHLPGLLNLLLEHFQKSLADTFETKSISSVPTIATVAAIKDSNESDGTKNDDECNKTNGGDNKLISSNLVSNSNSDTGDTRCIKKQIQMPANREEQDSSVDLGSVTENDLPNPNEHIVVLKSYNYTMQSRKGVAVKLQDSSNDIFIMDSQRMWDKVCNRDYFLKATVEDDPFNVGKEPNDIEYIMDCFKAEFAHIPFSRCIKSSKSAESIEKTRKVAEVKPKRMRLSSEEEREVEELTRKLYKAPLKQPTVNNAVYKKDSNSSDADCRQVDMEIEKVPNGPVNSDAALDGAEAEKAGEENSTTKEESAETKTEFDIKSTVRDVAKCLKRRRMSDYEDECYTRDEASLHLISDSQDSLARRCICLSTILRNLTFVPGNELEFSRSTTFLSILGKLLLLHHEHPIRTKKQRNYDREEDADFSDSCSSLQGEHEWWWDFLVQIRENMLVALTNISGYLDLSAYDEPISRPILDGLLHWAVCPSAHGQDPFTTLANNSPISPQRLALEALCKLCVTDANVDLVIATPPFSRLEKLCAVLTKHLCKNEDQVLREFSVNLLHYLASADSVMARVIAKQSPCVSYLVAFIEQAEQTALGVANQHGINFLRENPDSMGTSLDMLRRAAGTLLHLAKHPDNRPLFMQQEQRLLGLVMSHILDQQVALIISRVLFQTSRGSGPLTTTQQNETQSDEAPQKILPESQKNSSQNTSQIANFEQSQKTPAVVHNSTHSSVQSQNFPPLMKTPQQFLSTQTKLLNSSNSNDESMKSKFNTLNKQITANATSAQSSSPPSIPQTVTASS 
Type Start End Length
CDS 5641 6693 1053
CDS 11587 11805 219
CDS 11882 12046 165
CDS 12652 12711 60
CDS 16170 16226 57
CDS 22355 22559 205
CDS 24339 25340 1002
CDS 25422 25657 236
CDS 26048 26432 385
CDS 27033 27161 129
CDS 27631 27805 175
CDS 27879 28233 355
CDS 28560 28653 94
CDS 28718 28875 158
CDS 28940 31741 2802
intron 6694 11586 4893
intron 11806 11881 76
intron 12047 12651 605
intron 12712 16169 3458
intron 16227 22354 6128
intron 22560 24338 1779
intron 25341 25421 81
intron 25658 26047 390
intron 26433 27032 600
intron 27162 27630 469
intron 27806 27878 73
intron 28234 28559 326
intron 28654 28717 64
intron 28876 28939 64

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001854080 conserved hypothetical protein [Culex quinquefasciatus] gb|EDS34417.1| conserved hypothetical protein [Culex quinquefasciatus] 0.0
InterPro IPR016024 Armadillo-type fold
InterPro IPR021906 Protein of unknown function DUF3518
InterPro IPR001606 ARID/BRIGHT DNA-binding domain
Gene Ontology(CC) GO:0005622 intracellular
Gene Ontology(MF) GO:0003677 DNA binding
Gene Ontology(MF) GO:0005488 binding
Pfam PF12031.3 Domain of unknown function (DUF3518) 5.4e-121
Pfam PF01388.16 ARID/BRIGHT DNA binding domain 8.7e-05

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
H. sapiens ENSP00000387636
A. aegypti AAEL017280
H. sapiens ENSP00000320485
B. mori BGIBMGA010273-TA
C. quinquefasciatus CPIJ010152
M. musculus ENSMUSG00000007880
N. vitripennis NV17809-PA
H. sapiens ENSP00000363267
D. plexippus DPOGS207345PA
H. sapiens ENSP00000275248
H. sapiens ENSP00000313006
P. humanus PHUM080380-PA
H. sapiens ENSP00000442437
P. vanderplanki Pv.17090
D. melanogaster FBgn0261885
H. sapiens ENSP00000344546
A. gambiae AGAP001786
T. castaneum TC013586
H. sapiens ENSP00000390317
H. sapiens ENSP00000412835
H. sapiens ENSP00000055163
H. sapiens ENSP00000356116
A. mellifera GB17648-PA
S. invicta SI2.2.0_16128
M. musculus ENSMUSG00000069729