MidgeBase gene description page [Pn.01035]
Outline
Gene ID | Pn.01035 |
Type | Protein coding gene |
Scaffold | PnScaf1035 |
Start | 27101 |
End | 34852 |
Direction | - |
Sequence
Transcript: 4842 (bp)
ATGAAGAAAAGAGTGGATGAAATACAAACTTTTCATAGTACAGCACGTGTTACTGGAGGTACGGCTCGAATATTAACACCGCCACAGCATAACAGCAGCCCAAATGCTACTCAGAGTCAGTTTCAAATTTTGAATGTTCTCCGTGACTCTGCCCCAGCTCAAAGCGTACAATACACATATACTTCAAACATTCAGCAAAGAAATAATGTAGCACAGCAACCACCACCACAGAGTCAACCGGTGCAAGTTGTTAATGCGTCTGCAATTAACAATGCTGTTCGTCCAAAGCCGACATCTGTAACAACGGTGGTAACAGCTTCTGGAGCCGTTGTAGGAGGCGGAGCTGGGAGTGGTACAACCGCCGCAGTAGTAGCAAGTGGAGGAAGCGGAGGAAGCATAACTCCTACTCCTGTTTCAGTTACAAGTCCTACTGGCAATTCTATTCCTGCTCAGAACGTTTCTGGGCAAAATTTTCCACGACTTAAAGTGGAGGATGCCTTGAGTTATCTCGATCAGGTCAAATATAAATTTGGAAATCAACCGCAAGTCTACAATGACTTTTTGGATATTATGAAAGAATTCAAATCTCAAAGCATCGATACGCCCGGTGTTATTCAACGTGTTTCGAATCTTTTCAAAGGACATCCTGAATTGATTGTCGGATTCAATACGTTCCTTCCGCCAGGTTATAAAATAGAGGTTCAAGCAAATGATCAAGGCTTTGCATATCAAGTCTCAGTTTCTGTGCCATCTCCATCGGGCAACCAAACGATTAGTTCTCAACATTCGCCACCATCGAAATTTATACAAGGCACACACATTATCCAACCGCCCGTTAATCTCATTACGCATACCGGTCATACACTTCATGTTCAACCTCAGCCGACACAAACGAATCAACAAGATCAACGATCAATTCAGAATCAACATATGACCACCACTAATATAAATATTGCACAAAGTTTTTCTCGAGAACGAACGCTGCCAGCCGTTTCAAATTCTAGTGGAAGTGTACAATCAAATCAGCAGCAGCAACAACAACAACAACAATCATCAACAGTTTCTCAGCAACCATCACAGCAACATCACTCTCAAACTAATCAAATTCAATCGACAATTAATGAAACTTCGAGTTTGCATAGGATGCAACCAATTTTTCAAAATGATAATCAACCAGTACAGTTTAATCAAGCAATTGTTTATGTCAATAAAATTAAGTGTCGCTTCCAAGAGCAACCTGAAAAATACAAACATTTTCTTCGAATCTTACACAAATATCAAAATGAGCAGCGAAGAAAGGATAACGCAGGCAAGACAGGATCACCACTTACAGAGACAGAAGTTTTTAAGCAGGTTGCGCATTTATTCGATAATCAAGAAGATCTCCTACGCGAGTTTGGTCAATTCTTGCCTGATGCATCGCCGGGCGCCACAAACGTATTGACTGGTAAAGGATCTGCTTCGGGACACAGTGAGTCTAACGATAAATCAAGCTCGGTCATCGGCAATCAAAATACAAATGCAAAACAATTGGTCAATAACAATTCTCGGAATAGTATTATCAATCCTTTGGACATGTTTCCAGCAAGCAATAATGATAAAGGTGACTTTCATGGAACTTCATATGGTGCAATCAACCGAGAAAAAGATTCGAATAGAAATCACATAAACTCTAATAATCAAAAGTTCAGTAGTATGAGTACATCTGGGAGCGGATTAAAGCGTTCACCTGCAATGTCCATAAACCACGTGAATCGTGGACACGACCGAAACGAACCGCCCATAAAACGCTACAAGCCAGTTTTTCGTGATGTTTCTTATGCTGATGCTGCTCGTTCAGGTACACTACAAGATTATGCCTTCTTCGATAAAGTGCGAGAAGCATTAAAAACGCCAGACGTTTACGATAACTTTCTTCGGTGTCTAACACTGTACAATCAGGAAATTGTGAGCAAGTCAGAACTTATAACTCTAGTCGCACCGTTCTTGAATAAAGAACCAGAACTCCTGAAGCGTTTCCAAGAGTTTCTTAAGTTCTCGGCATCTTCCGAAACGCTTCCCTTGTCGGTTGCACAACGTCAAGAACATCGTGTGCAAATTGATACTGCAACTGAAATTGATGTGACTCAATGTAAACGATTAGGAACGAGTTATTGTGCCATTCCCAAATCAAGTGAGCCCAAGAAGTGTAGTGGTCGAACAGCTCTCTGCAAAGAGGTGCTTAACGATACGTGGGTATCATTCCCGTCGTGGTCTGAGGATTCAACATTCAACACTTCTCGCAAAACACAATATGAAGAGTTCATATATCGCTGTGAAGACGAGCGCTTTGAACTTGATGTTGTTATCGAGACGAATAGTGCAACAATTCGTGTACTTGAAGGTGTACAAAAGAAAATGTCAAAAATGGCAGCTGATGAGCTTGCACGATTCAAGCTAGATGATTCATTAGGCGGAACTTCACAATCAATACATCAACGAGCTTTACGCCGCATTTATGGGGACAAAGCCCCCGACATTATCGAAGGACTCAAGAAGAATCCAAACGTTGCCGTGCCTGTTGTTCTTCGAAGATTGAAGGCTAAAGAGGAGGAGTGGAGAGAGGCGCAGAAAGGTTTTAATCAGCAATGGCGAGAACAGAACGAAAAATACTATCTCAAATCACTTGATCATCAAGGCATCAATTTCAAACACGCTGACACTAAAGCGCTTAGATCTAAAAGTCTAATGAATCAAATTGAGTCGGCCTATGAAGAGCGCAACGAAGGTAATAATGGAGAAGCAATTCCTGGACCTCATCTTGTTCTTCACTACAAAGACAAGTCTATACTAGAAGATGCTGCAAACTTACTTATACATCACGTAAAACGCCAGACAGGTATTCAAAAACTAGAAAAGGCCAGAATTAAGCACATCTTGAGGCAATTTGTTCCCGAATTATTCTTTGCTCCTCGGCAGCCCTTAAGTGATGATGAAAGAGAAGACGTATTTCCCTTTTCAGTAGAAGACAAAGAATCAAATATTTCTGAAAATGATGGAAAATCAAAATCAAATAGCAAAAATAACAGTCCATCAAGAAGTGAAAGTGTGCCTGTAAATAAAATACCTACAACAAATTGTGAAAATTATTCAAAGAATATATCAATATCCACGAATGATGAACAAATACAACAACAGTTGCAACCAAGTACAAAGAATTCTGCTGACCAGGCATCAGAAACTCCTGCAGATGGTGATATTAAAGTTGAAATTAAGGCTGATCCGGATGCTGTTAAACAAGTTCCGAACAACGCTGTTCAACCGGCGCCTGGAAGTAATCTCCTACCGCCTCATGCAGCAAATGCGAAGCATCAGGACGAGGCATATACATTGTTTTTCTCAAATAGCAATTGGTACTACTTTCTGCGACTTCATGCTATTTTGTGCGAGCGTTTAAGAACAATGTATGATCGAACACAAATTTTGGCTGCAGAGGAGGATAAATACAAAGTGAACAGGCGGGAAAGTGTTGCTATTGCATTGAGATTGAAGCCACAGAACGAATTCGAAATTCCCGATTACTATCCAGCTTTCTTGGACATGCTTAAGAGTCTTTTGGACGGTAACATGGACGCAACAACGTATGAAGATAAATTAAGAGACATGTATGGTATTCATGCATACATTGCGTTCACTCTGGACCGGGTTGTATCCAACGCTGTCCGCCAACTTCAATTTTGTGTGACTGAACGAAATGCTCTCGAATGTTTTGAATTATATCAACTGGAGAGCAAGAATAATGCCACCGGTGGACTTTGTTCAACTGCATTCAAAAGAACGGCTGCCGAATTGGCATATCAACGGAAAACAGAGTCGAACTTGCAAGAAGAAATGTATTTTAAAGTGATAATTTATAAAATAGACTGCCGTGTTACGATTGAAATGTTAGAAAACGATAATGAAGACACGTCGACAACAAATTTCGAGCAGATTCAAACGACAAGTAAATATATTGAACGTTACACGAATCCTTCGGCTTCGGGAGGAGGCAATGGCAAAACCAGCAGAAATAATTCTGTAAATGGTGCTTCCTTTAGCAACCTTGATATAAAGACAGAAAAATCAGAGGAAGAAATGAAAAAAAATCGTAAGCCCTTATTTTTGCATAGAAACATCAGGAAACTGAATCGAAGAATGTCAAATAGTATAAAATTTGAAAATGGAGAAGGTTCAGAGAATTGTGTAGAAGTTAGCTCCACTACTGCACCTCCAATAAGCACTACGACTACGACTACAAGCACAACAAGTTCTTCCTCAACAGTTCAAATTACAAAAGGCAATGAATCATCAATATTCTCAGCAATTGCCACTACTTCCTCATGTACCTTAACAACCTCAACTTCTTCAGTTCCACCACCGTTAATAATTTTACCGGAATTCACCAGCAAGAGTGCACCCCCATCGTCTAGAATCGGGCAATCCAATTCGTTTGGCGACTTTTTCGTTGATGATCAAGAACAAGTGAAATTCAATTTCCAAACCTACAAGACGGTTTTCTGCAACACTACTAATAAAGGCTACATGCTCTATCGCTACAATTCATTGAAACGAGCGAAAGAGACCCATATGAAAGTTACAAAACAAATGGACTTACGTTTTAATGAATATGTACAAAAATGGCTCGTAAGAAACGTCAGCGACTTACAGCGAATAACTATTAATGATTGGCTTCTAGGACGAAATCAAGCTGATTTTGTTTCGTGTAGAACAACAATTAAAAAAGATAATAACATTACAGAAACGCCATATTGCATCTTTAATCGCTATAAAGTTGAATACATTTCGGCAAGTACCGATAAGTGC
Protein: 1614 (aa)
MKKRVDEIQTFHSTARVTGGTARILTPPQHNSSPNATQSQFQILNVLRDSAPAQSVQYTYTSNIQQRNNVAQQPPPQSQPVQVVNASAINNAVRPKPTSVTTVVTASGAVVGGGAGSGTTAAVVASGGSGGSITPTPVSVTSPTGNSIPAQNVSGQNFPRLKVEDALSYLDQVKYKFGNQPQVYNDFLDIMKEFKSQSIDTPGVIQRVSNLFKGHPELIVGFNTFLPPGYKIEVQANDQGFAYQVSVSVPSPSGNQTISSQHSPPSKFIQGTHIIQPPVNLITHTGHTLHVQPQPTQTNQQDQRSIQNQHMTTTNINIAQSFSRERTLPAVSNSSGSVQSNQQQQQQQQQSSTVSQQPSQQHHSQTNQIQSTINETSSLHRMQPIFQNDNQPVQFNQAIVYVNKIKCRFQEQPEKYKHFLRILHKYQNEQRRKDNAGKTGSPLTETEVFKQVAHLFDNQEDLLREFGQFLPDASPGATNVLTGKGSASGHSESNDKSSSVIGNQNTNAKQLVNNNSRNSIINPLDMFPASNNDKGDFHGTSYGAINREKDSNRNHINSNNQKFSSMSTSGSGLKRSPAMSINHVNRGHDRNEPPIKRYKPVFRDVSYADAARSGTLQDYAFFDKVREALKTPDVYDNFLRCLTLYNQEIVSKSELITLVAPFLNKEPELLKRFQEFLKFSASSETLPLSVAQRQEHRVQIDTATEIDVTQCKRLGTSYCAIPKSSEPKKCSGRTALCKEVLNDTWVSFPSWSEDSTFNTSRKTQYEEFIYRCEDERFELDVVIETNSATIRVLEGVQKKMSKMAADELARFKLDDSLGGTSQSIHQRALRRIYGDKAPDIIEGLKKNPNVAVPVVLRRLKAKEEEWREAQKGFNQQWREQNEKYYLKSLDHQGINFKHADTKALRSKSLMNQIESAYEERNEGNNGEAIPGPHLVLHYKDKSILEDAANLLIHHVKRQTGIQKLEKARIKHILRQFVPELFFAPRQPLSDDEREDVFPFSVEDKESNISENDGKSKSNSKNNSPSRSESVPVNKIPTTNCENYSKNISISTNDEQIQQQLQPSTKNSADQASETPADGDIKVEIKADPDAVKQVPNNAVQPAPGSNLLPPHAANAKHQDEAYTLFFSNSNWYYFLRLHAILCERLRTMYDRTQILAAEEDKYKVNRRESVAIALRLKPQNEFEIPDYYPAFLDMLKSLLDGNMDATTYEDKLRDMYGIHAYIAFTLDRVVSNAVRQLQFCVTERNALECFELYQLESKNNATGGLCSTAFKRTAAELAYQRKTESNLQEEMYFKVIIYKIDCRVTIEMLENDNEDTSTTNFEQIQTTSKYIERYTNPSASGGGNGKTSRNNSVNGASFSNLDIKTEKSEEEMKKNRKPLFLHRNIRKLNRRMSNSIKFENGEGSENCVEVSSTTAPPISTTTTTTSTTSSSSTVQITKGNESSIFSAIATTSSCTLTTSTSSVPPPLIILPEFTSKSAPPSSRIGQSNSFGDFFVDDQEQVKFNFQTYKTVFCNTTNKGYMLYRYNSLKRAKETHMKVTKQMDLRFNEYVQKWLVRNVSDLQRITINDWLLGRNQADFVSCRTTIKKDNNITETPYCIFNRYKVEYISASTDKC
Type | Start | End | Length |
CDS |
27104 |
27346 |
243 |
CDS |
27434 |
27916 |
483 |
CDS |
29011 |
29235 |
225 |
CDS |
29306 |
29512 |
207 |
CDS |
29588 |
29917 |
330 |
CDS |
30464 |
30831 |
368 |
CDS |
31682 |
31910 |
229 |
CDS |
32006 |
33544 |
1539 |
CDS |
33635 |
34852 |
1218 |
intron |
27347 |
27433 |
87 |
intron |
27917 |
29010 |
1094 |
intron |
29236 |
29305 |
70 |
intron |
29513 |
29587 |
75 |
intron |
29918 |
30463 |
546 |
intron |
30832 |
31681 |
850 |
intron |
31911 |
32005 |
95 |
intron |
33545 |
33634 |
90 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_003400052 |
PREDICTED: paired amphipathic helix protein Sin3b-like [Bombus terrestris] |
0.0 |
InterPro |
IPR013194 |
Histone deacetylase interacting |
|
InterPro |
IPR003822 |
Paired amphipathic helix |
|
Gene Ontology(BP) |
GO:0006355 |
regulation of transcription, DNA-dependent |
|
Gene Ontology(CC) |
GO:0005634 |
nucleus |
|
Pfam |
PF02671.16 |
Paired amphipathic helix repeat |
9.4e-48 |
Pfam |
PF08295.7 |
Sin3 family co-repressor |
4.2e-42 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
N. vitripennis |
NV15662-PA |
H. sapiens |
ENSP00000353622 |
H. sapiens |
ENSP00000248054 |
H. sapiens |
ENSP00000378402 |
H. sapiens |
ENSP00000378403 |
A. aegypti |
AAEL014491 |
H. sapiens |
ENSP00000369131 |
D. melanogaster |
FBgn0022764 |
T. castaneum |
TC009311 |
S. invicta |
SI2.2.0_15534 |
P. humanus |
PHUM089280-PA |
D. plexippus |
DPOGS214927PA |
P. vanderplanki |
Pv.01140 |
B. mori |
BGIBMGA000204-TA |
A. gambiae |
AGAP007892 |
M. musculus |
ENSMUSG00000042557 |
H. melpomene |
HMEL016134-PA |
A. aegypti |
AAEL014711 |
P. vanderplanki |
Pv.14357 |
C. quinquefasciatus |
CPIJ802190 |
A. mellifera |
GB15858-PA |