MidgeBase gene description page [Pn.01035]

Outline

Link to gbrowse

Gene ID Pn.01035
Type Protein coding gene
Scaffold PnScaf1035
Start 27101
End 34852
Direction -

Sequence

Transcript: 4842 (bp)

 ATGAAGAAAAGAGTGGATGAAATACAAACTTTTCATAGTACAGCACGTGTTACTGGAGGTACGGCTCGAATATTAACACCGCCACAGCATAACAGCAGCCCAAATGCTACTCAGAGTCAGTTTCAAATTTTGAATGTTCTCCGTGACTCTGCCCCAGCTCAAAGCGTACAATACACATATACTTCAAACATTCAGCAAAGAAATAATGTAGCACAGCAACCACCACCACAGAGTCAACCGGTGCAAGTTGTTAATGCGTCTGCAATTAACAATGCTGTTCGTCCAAAGCCGACATCTGTAACAACGGTGGTAACAGCTTCTGGAGCCGTTGTAGGAGGCGGAGCTGGGAGTGGTACAACCGCCGCAGTAGTAGCAAGTGGAGGAAGCGGAGGAAGCATAACTCCTACTCCTGTTTCAGTTACAAGTCCTACTGGCAATTCTATTCCTGCTCAGAACGTTTCTGGGCAAAATTTTCCACGACTTAAAGTGGAGGATGCCTTGAGTTATCTCGATCAGGTCAAATATAAATTTGGAAATCAACCGCAAGTCTACAATGACTTTTTGGATATTATGAAAGAATTCAAATCTCAAAGCATCGATACGCCCGGTGTTATTCAACGTGTTTCGAATCTTTTCAAAGGACATCCTGAATTGATTGTCGGATTCAATACGTTCCTTCCGCCAGGTTATAAAATAGAGGTTCAAGCAAATGATCAAGGCTTTGCATATCAAGTCTCAGTTTCTGTGCCATCTCCATCGGGCAACCAAACGATTAGTTCTCAACATTCGCCACCATCGAAATTTATACAAGGCACACACATTATCCAACCGCCCGTTAATCTCATTACGCATACCGGTCATACACTTCATGTTCAACCTCAGCCGACACAAACGAATCAACAAGATCAACGATCAATTCAGAATCAACATATGACCACCACTAATATAAATATTGCACAAAGTTTTTCTCGAGAACGAACGCTGCCAGCCGTTTCAAATTCTAGTGGAAGTGTACAATCAAATCAGCAGCAGCAACAACAACAACAACAATCATCAACAGTTTCTCAGCAACCATCACAGCAACATCACTCTCAAACTAATCAAATTCAATCGACAATTAATGAAACTTCGAGTTTGCATAGGATGCAACCAATTTTTCAAAATGATAATCAACCAGTACAGTTTAATCAAGCAATTGTTTATGTCAATAAAATTAAGTGTCGCTTCCAAGAGCAACCTGAAAAATACAAACATTTTCTTCGAATCTTACACAAATATCAAAATGAGCAGCGAAGAAAGGATAACGCAGGCAAGACAGGATCACCACTTACAGAGACAGAAGTTTTTAAGCAGGTTGCGCATTTATTCGATAATCAAGAAGATCTCCTACGCGAGTTTGGTCAATTCTTGCCTGATGCATCGCCGGGCGCCACAAACGTATTGACTGGTAAAGGATCTGCTTCGGGACACAGTGAGTCTAACGATAAATCAAGCTCGGTCATCGGCAATCAAAATACAAATGCAAAACAATTGGTCAATAACAATTCTCGGAATAGTATTATCAATCCTTTGGACATGTTTCCAGCAAGCAATAATGATAAAGGTGACTTTCATGGAACTTCATATGGTGCAATCAACCGAGAAAAAGATTCGAATAGAAATCACATAAACTCTAATAATCAAAAGTTCAGTAGTATGAGTACATCTGGGAGCGGATTAAAGCGTTCACCTGCAATGTCCATAAACCACGTGAATCGTGGACACGACCGAAACGAACCGCCCATAAAACGCTACAAGCCAGTTTTTCGTGATGTTTCTTATGCTGATGCTGCTCGTTCAGGTACACTACAAGATTATGCCTTCTTCGATAAAGTGCGAGAAGCATTAAAAACGCCAGACGTTTACGATAACTTTCTTCGGTGTCTAACACTGTACAATCAGGAAATTGTGAGCAAGTCAGAACTTATAACTCTAGTCGCACCGTTCTTGAATAAAGAACCAGAACTCCTGAAGCGTTTCCAAGAGTTTCTTAAGTTCTCGGCATCTTCCGAAACGCTTCCCTTGTCGGTTGCACAACGTCAAGAACATCGTGTGCAAATTGATACTGCAACTGAAATTGATGTGACTCAATGTAAACGATTAGGAACGAGTTATTGTGCCATTCCCAAATCAAGTGAGCCCAAGAAGTGTAGTGGTCGAACAGCTCTCTGCAAAGAGGTGCTTAACGATACGTGGGTATCATTCCCGTCGTGGTCTGAGGATTCAACATTCAACACTTCTCGCAAAACACAATATGAAGAGTTCATATATCGCTGTGAAGACGAGCGCTTTGAACTTGATGTTGTTATCGAGACGAATAGTGCAACAATTCGTGTACTTGAAGGTGTACAAAAGAAAATGTCAAAAATGGCAGCTGATGAGCTTGCACGATTCAAGCTAGATGATTCATTAGGCGGAACTTCACAATCAATACATCAACGAGCTTTACGCCGCATTTATGGGGACAAAGCCCCCGACATTATCGAAGGACTCAAGAAGAATCCAAACGTTGCCGTGCCTGTTGTTCTTCGAAGATTGAAGGCTAAAGAGGAGGAGTGGAGAGAGGCGCAGAAAGGTTTTAATCAGCAATGGCGAGAACAGAACGAAAAATACTATCTCAAATCACTTGATCATCAAGGCATCAATTTCAAACACGCTGACACTAAAGCGCTTAGATCTAAAAGTCTAATGAATCAAATTGAGTCGGCCTATGAAGAGCGCAACGAAGGTAATAATGGAGAAGCAATTCCTGGACCTCATCTTGTTCTTCACTACAAAGACAAGTCTATACTAGAAGATGCTGCAAACTTACTTATACATCACGTAAAACGCCAGACAGGTATTCAAAAACTAGAAAAGGCCAGAATTAAGCACATCTTGAGGCAATTTGTTCCCGAATTATTCTTTGCTCCTCGGCAGCCCTTAAGTGATGATGAAAGAGAAGACGTATTTCCCTTTTCAGTAGAAGACAAAGAATCAAATATTTCTGAAAATGATGGAAAATCAAAATCAAATAGCAAAAATAACAGTCCATCAAGAAGTGAAAGTGTGCCTGTAAATAAAATACCTACAACAAATTGTGAAAATTATTCAAAGAATATATCAATATCCACGAATGATGAACAAATACAACAACAGTTGCAACCAAGTACAAAGAATTCTGCTGACCAGGCATCAGAAACTCCTGCAGATGGTGATATTAAAGTTGAAATTAAGGCTGATCCGGATGCTGTTAAACAAGTTCCGAACAACGCTGTTCAACCGGCGCCTGGAAGTAATCTCCTACCGCCTCATGCAGCAAATGCGAAGCATCAGGACGAGGCATATACATTGTTTTTCTCAAATAGCAATTGGTACTACTTTCTGCGACTTCATGCTATTTTGTGCGAGCGTTTAAGAACAATGTATGATCGAACACAAATTTTGGCTGCAGAGGAGGATAAATACAAAGTGAACAGGCGGGAAAGTGTTGCTATTGCATTGAGATTGAAGCCACAGAACGAATTCGAAATTCCCGATTACTATCCAGCTTTCTTGGACATGCTTAAGAGTCTTTTGGACGGTAACATGGACGCAACAACGTATGAAGATAAATTAAGAGACATGTATGGTATTCATGCATACATTGCGTTCACTCTGGACCGGGTTGTATCCAACGCTGTCCGCCAACTTCAATTTTGTGTGACTGAACGAAATGCTCTCGAATGTTTTGAATTATATCAACTGGAGAGCAAGAATAATGCCACCGGTGGACTTTGTTCAACTGCATTCAAAAGAACGGCTGCCGAATTGGCATATCAACGGAAAACAGAGTCGAACTTGCAAGAAGAAATGTATTTTAAAGTGATAATTTATAAAATAGACTGCCGTGTTACGATTGAAATGTTAGAAAACGATAATGAAGACACGTCGACAACAAATTTCGAGCAGATTCAAACGACAAGTAAATATATTGAACGTTACACGAATCCTTCGGCTTCGGGAGGAGGCAATGGCAAAACCAGCAGAAATAATTCTGTAAATGGTGCTTCCTTTAGCAACCTTGATATAAAGACAGAAAAATCAGAGGAAGAAATGAAAAAAAATCGTAAGCCCTTATTTTTGCATAGAAACATCAGGAAACTGAATCGAAGAATGTCAAATAGTATAAAATTTGAAAATGGAGAAGGTTCAGAGAATTGTGTAGAAGTTAGCTCCACTACTGCACCTCCAATAAGCACTACGACTACGACTACAAGCACAACAAGTTCTTCCTCAACAGTTCAAATTACAAAAGGCAATGAATCATCAATATTCTCAGCAATTGCCACTACTTCCTCATGTACCTTAACAACCTCAACTTCTTCAGTTCCACCACCGTTAATAATTTTACCGGAATTCACCAGCAAGAGTGCACCCCCATCGTCTAGAATCGGGCAATCCAATTCGTTTGGCGACTTTTTCGTTGATGATCAAGAACAAGTGAAATTCAATTTCCAAACCTACAAGACGGTTTTCTGCAACACTACTAATAAAGGCTACATGCTCTATCGCTACAATTCATTGAAACGAGCGAAAGAGACCCATATGAAAGTTACAAAACAAATGGACTTACGTTTTAATGAATATGTACAAAAATGGCTCGTAAGAAACGTCAGCGACTTACAGCGAATAACTATTAATGATTGGCTTCTAGGACGAAATCAAGCTGATTTTGTTTCGTGTAGAACAACAATTAAAAAAGATAATAACATTACAGAAACGCCATATTGCATCTTTAATCGCTATAAAGTTGAATACATTTCGGCAAGTACCGATAAGTGC 

Protein: 1614 (aa)

 MKKRVDEIQTFHSTARVTGGTARILTPPQHNSSPNATQSQFQILNVLRDSAPAQSVQYTYTSNIQQRNNVAQQPPPQSQPVQVVNASAINNAVRPKPTSVTTVVTASGAVVGGGAGSGTTAAVVASGGSGGSITPTPVSVTSPTGNSIPAQNVSGQNFPRLKVEDALSYLDQVKYKFGNQPQVYNDFLDIMKEFKSQSIDTPGVIQRVSNLFKGHPELIVGFNTFLPPGYKIEVQANDQGFAYQVSVSVPSPSGNQTISSQHSPPSKFIQGTHIIQPPVNLITHTGHTLHVQPQPTQTNQQDQRSIQNQHMTTTNINIAQSFSRERTLPAVSNSSGSVQSNQQQQQQQQQSSTVSQQPSQQHHSQTNQIQSTINETSSLHRMQPIFQNDNQPVQFNQAIVYVNKIKCRFQEQPEKYKHFLRILHKYQNEQRRKDNAGKTGSPLTETEVFKQVAHLFDNQEDLLREFGQFLPDASPGATNVLTGKGSASGHSESNDKSSSVIGNQNTNAKQLVNNNSRNSIINPLDMFPASNNDKGDFHGTSYGAINREKDSNRNHINSNNQKFSSMSTSGSGLKRSPAMSINHVNRGHDRNEPPIKRYKPVFRDVSYADAARSGTLQDYAFFDKVREALKTPDVYDNFLRCLTLYNQEIVSKSELITLVAPFLNKEPELLKRFQEFLKFSASSETLPLSVAQRQEHRVQIDTATEIDVTQCKRLGTSYCAIPKSSEPKKCSGRTALCKEVLNDTWVSFPSWSEDSTFNTSRKTQYEEFIYRCEDERFELDVVIETNSATIRVLEGVQKKMSKMAADELARFKLDDSLGGTSQSIHQRALRRIYGDKAPDIIEGLKKNPNVAVPVVLRRLKAKEEEWREAQKGFNQQWREQNEKYYLKSLDHQGINFKHADTKALRSKSLMNQIESAYEERNEGNNGEAIPGPHLVLHYKDKSILEDAANLLIHHVKRQTGIQKLEKARIKHILRQFVPELFFAPRQPLSDDEREDVFPFSVEDKESNISENDGKSKSNSKNNSPSRSESVPVNKIPTTNCENYSKNISISTNDEQIQQQLQPSTKNSADQASETPADGDIKVEIKADPDAVKQVPNNAVQPAPGSNLLPPHAANAKHQDEAYTLFFSNSNWYYFLRLHAILCERLRTMYDRTQILAAEEDKYKVNRRESVAIALRLKPQNEFEIPDYYPAFLDMLKSLLDGNMDATTYEDKLRDMYGIHAYIAFTLDRVVSNAVRQLQFCVTERNALECFELYQLESKNNATGGLCSTAFKRTAAELAYQRKTESNLQEEMYFKVIIYKIDCRVTIEMLENDNEDTSTTNFEQIQTTSKYIERYTNPSASGGGNGKTSRNNSVNGASFSNLDIKTEKSEEEMKKNRKPLFLHRNIRKLNRRMSNSIKFENGEGSENCVEVSSTTAPPISTTTTTTSTTSSSSTVQITKGNESSIFSAIATTSSCTLTTSTSSVPPPLIILPEFTSKSAPPSSRIGQSNSFGDFFVDDQEQVKFNFQTYKTVFCNTTNKGYMLYRYNSLKRAKETHMKVTKQMDLRFNEYVQKWLVRNVSDLQRITINDWLLGRNQADFVSCRTTIKKDNNITETPYCIFNRYKVEYISASTDKC 
Type Start End Length
CDS 27104 27346 243
CDS 27434 27916 483
CDS 29011 29235 225
CDS 29306 29512 207
CDS 29588 29917 330
CDS 30464 30831 368
CDS 31682 31910 229
CDS 32006 33544 1539
CDS 33635 34852 1218
intron 27347 27433 87
intron 27917 29010 1094
intron 29236 29305 70
intron 29513 29587 75
intron 29918 30463 546
intron 30832 31681 850
intron 31911 32005 95
intron 33545 33634 90

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_003400052 PREDICTED: paired amphipathic helix protein Sin3b-like [Bombus terrestris] 0.0
InterPro IPR013194 Histone deacetylase interacting
InterPro IPR003822 Paired amphipathic helix
Gene Ontology(BP) GO:0006355 regulation of transcription, DNA-dependent
Gene Ontology(CC) GO:0005634 nucleus
Pfam PF02671.16 Paired amphipathic helix repeat 9.4e-48
Pfam PF08295.7 Sin3 family co-repressor 4.2e-42

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
N. vitripennis NV15662-PA
H. sapiens ENSP00000353622
H. sapiens ENSP00000248054
H. sapiens ENSP00000378402
H. sapiens ENSP00000378403
A. aegypti AAEL014491
H. sapiens ENSP00000369131
D. melanogaster FBgn0022764
T. castaneum TC009311
S. invicta SI2.2.0_15534
P. humanus PHUM089280-PA
D. plexippus DPOGS214927PA
P. vanderplanki Pv.01140
B. mori BGIBMGA000204-TA
A. gambiae AGAP007892
M. musculus ENSMUSG00000042557
H. melpomene HMEL016134-PA
A. aegypti AAEL014711
P. vanderplanki Pv.14357
C. quinquefasciatus CPIJ802190
A. mellifera GB15858-PA