MidgeBase gene description page [Pn.09838]
Outline
Gene ID | Pn.09838 |
Type | Protein coding gene |
Scaffold | PnScaf10232 |
Start | 33298 |
End | 37576 |
Direction | + |
Sequence
Transcript: 3810 (bp)
ATGCCGAGAAAGCGAAAGGTAGCGGTGCCAAATCACGAAAATGTGCAGCCAGTGGAGAAAGTTGTCGAAAAGCCGAAGAAAAATGATAATGCCGAGAAGCCAGCAAAGAAACAAAAGCGCGAGCCTGAATATATGATTGGCGATGGGCTGAGTGTCGTAAAAGCCATCCTATCTGTACAAAAGACGGATTGCAACAGCAGCAAGAGTCTCAATGAGCTTCAAAAACTGTACAAAAAGATGACACACTCATTATTCATGAAATCCTTCATGGCCGCACTCCAAACGTTCCTAATTCGTGACGACGGCGAAGAGTACGCTTCAAGAGTGCTGAAATTCATGGGAGTTTTTGTCGCCTCTTATGGCGAGGAAGTTACAGAATCGGGCGCTTCTCACCCGATCATCGACACTTTCTTCAAGGAGATTCTCGGGATTACGGCTAATTTGGTGCATGTAAGGACGAGAATTTGCCTGCTGGTCACCAACACCATGTCATCGTTCTCGCAGAAGGCCGAACTGGATGAGGCCATCATCGAAAAAATAACCGAACGAATGCTTTTCTTCATGAAAGACGTTTCACCTCTGGTTCGAATGCAGGCGGTGTTCGCACTGCAGCGCTTGCAAGATCCAGAAAACACTAACGAAGATCCAGTCACGAAAGCCTACATTTTCCACATGGAATCCGATCCGGCGCCGAAAGTTCGTCAGGCCACCATCACGGCTATTGCCAAGAAAATTCATAATATTCCAGCGATTCTCGAGAGACTTCACGACGTCGACGAGAAGGTCCGTCGTCACACCTACTTGCAAATGTCGAGCTACTCCGTCAAGAGCTACAAAATCGCTGATCGCATAAACATCCTTACTGCCGGCCTCAACGATCGCTCGGATGTGGTCAAGAAGGCAGTCACAAATTTGCTGCTGTCCAACTGGATTGGCGTTTACGACAACGACTATGCCGAATTTATTCGCGCCATCAAACTCGACTCGGACGAAAATGAACTGATCAAGTTCCGCAATCTCGCCGAAATGGCCCTGTCGGAGATTTTCAAGAAGCGAAAGATCGCTGATTTGGTCGCCTACCTCAACTTGCCGGCGTCCAAGGACTTCAAGAACTGTCTGCCATTGGAGAAGTCGACACTGGAAATGCTGATTGTTTGGAAGATGATTTCAAAGTGCTATCAGGACTACATCAGTGGCAAACGCAACGGCAACGACATCAAAGACGATGTCGGCAGCGAGGACGAAGAGGAGTCGGCAGTTGCCAACCAAACCATCTCAAATGTCGACATTTTTCCCGAACTGTCCGTCCTCTGCGACTACATGGAGAAGTTCGTCACCGACTTTAGCGGCGAAATGGAGACGAAAAATCAGAAAATTTACTTCAGCCAGTGCATCGTCGCCCTCCTCGAGATCGTGCAGCTGAACGACCTGGACGACGAGGTCGGCAAGGAGCGCTTGAAGAAGCTGCTGAAGACAATCTTGCTTACGTACGACATCTCGGAGTTTGCCATCCAGGAAATTGCGCACGTCGTCGAGAAAATCATTCCCGACGTCAACACGCGTCTCGCGTTCTTTAACCAGGTCGTCACCGAGATGATCAAGCCGGGAGCGCCGTCCGAGTACAGCCGCCAGACGATCATCGACGATCTCATCGACCGTTCGGACATCGACAGAAAAGTGCAGGCGAAGCAAATCAAGCTGCGCATGATGGACCTGAAGGAGCAGGAGAGCTCCTTCGCCCAGCAGAAGCAATATGCCGAGTGCCAGAAGGTGTCGGAGGAGTACAACAAGCTGAATGGCGAGCTGATTGAGCTCCTCAAGCCGGTCGCCTCGCAAACATCGTCGGAATCGACGCAGTCGCTTCTCGAGAACCTTTCGAGTGTCGTGACGTCGAAGAAAATCACGCAGAGCGAAATTCTCAAGAACTTGCGCATCTGCTACTTCGCGATTGTGTCGCGCGGCGTCAAAGTCATCACGCACGAGATCCTCCAGATCTACAAGGAGTTCGTGCGCTATCACTTGGAGTCGGCGTACGTGGAGACGCGCATCTGGGCGCTGAAGACGGCGACCGCTTACAGTCTCCTCTACGAGTCGATTGCAAAGGAGATCTACATCATCATCAAGTCGCAGGTCTTCCGCTTCACGCACGCGGTGCTCTGGGAGTGCTCGATTCAGTGCATCTTCGATCTGCTGCTGCGCTACGGCATCGAGAAGATGGACGGCGACAGCAACGAGACCAGCGTGAGCATGTCGATGACAAATCGCAGCAAGCGCGGCGGCCGCACTCTCTACACGGACGTCGAGGACGAGGAGGACGAGCCGGACGAGCTGAACATCGCGAAGACGCTCGACGTGATGCAGATGCTGCTGCACCTGCTCGAGCAGAGCAACGACGTGAAGATCACGAAGGTGCTGATCCGCGGCTTCTGCAAGCTCATCGTTCACAGCGTCTACTGCACACGCGAGCTCATGTCAAAGTTCCTGCTCATGTACTTCAACCCGGCGACGTCGGCCGAAATCAGCCAAATACTCGGCATCTTCCTCGAGAACATCATCAAGCGCAAGAAGCAGGAGTACTTGCACAACGCCCTCACGCCCACCATCGTCACGCTGGTCGAGGCGCCCTACGACTCGCCGCTGCGTGATGTGAAGCTCGACACGGTGTTGAAGTATATCGACAATCCCGACAACAAGGAGATCTTGAAGATATTCGCGAAGGAGCTGTTCGAGCTCGAAATCGGCGAGGATCCGCTGCTCAAGCGGGACATGGCGAAGCAGGTTGAGATTCTGAGCGGAACTTATCGAGCGCCGCTGACGTTCTCGAGCTTGGCAAAGGCTCCGAACAGCGGCGACGAAAACGAGCTGGAGGAGGAGGAGAACGAAGAGGAAGCGGCAAACGAGGAGGAAAAATCTTCGAACGACACGAAAGAGATCGAAACCGAAAAAAGCTTTGAAGTGAAAATCGAGAAAATCGACATTCCCGCCGACGGACTGATCGATTTTACGGAAGGCGAATCGCCGAAAGCCATTGAAACGCCGGCTGCTGCTGCTGCTGACGACGAAGAATGCTCGATGAACGACGAGGACATTCCGCAATCGCAGGAACCTATCGAACTGCCCGCCACCCAAGAGATTCATGTCGAATTGCCCGCAACACAAGACTTTTCGTTCGGCGAGTCGTCAGTCAATGCGAGCGTCGAAGATGCGAGCATCATTTCGAGCGACGTGCCTTCAACGCCTTCGACGCCGATGACTGTCGTAAAGAAACCGGCTCTAAAGAAGCAATCGGCCAAAGCCCCAATGTCTGCCATCGATGAGAGCGGCGAGGAGTCGATTGTGATTCCCGAAACGCCTGAAGTTCGATCGCGTCGAAGCGCACTGTCGAACGCCAAACGACAGTTGAACATCTCACAATCGACGCCCGCGACGCCCTCTCAGAGCTCGCCTTTCCGCAAGCTACCGCGACGGGCCGCAGCGACGCCGAAGTCGGCACTCGCCTCGCCACTTGCCACTCAATCTCCCCGAACGCCCACTTCGCGTTTGTCAACGTCGCTGAATGCGACGACCAGCACTCCAAACACGAGTCGCTTAACAACGAGGCAACAGTCGCGCGTCGAAGTTGCACAGAAAACGACACTCACGCGCTCGGCTTCGAAGAATCTCAAAGTGAAGCCATCAACCATCGTGTCGAAGCTGACGGAGAGCGAGAAGAGCCAGAAGGAGGTGAAGAAGCCGGAGAACCCAAAGGGCACCGATGCGCGCCCGGTCAGAGCCACGAGGAGTCAAAACCCGTCGCGACCTCCATGGAAG
Protein: 1270 (aa)
MPRKRKVAVPNHENVQPVEKVVEKPKKNDNAEKPAKKQKREPEYMIGDGLSVVKAILSVQKTDCNSSKSLNELQKLYKKMTHSLFMKSFMAALQTFLIRDDGEEYASRVLKFMGVFVASYGEEVTESGASHPIIDTFFKEILGITANLVHVRTRICLLVTNTMSSFSQKAELDEAIIEKITERMLFFMKDVSPLVRMQAVFALQRLQDPENTNEDPVTKAYIFHMESDPAPKVRQATITAIAKKIHNIPAILERLHDVDEKVRRHTYLQMSSYSVKSYKIADRINILTAGLNDRSDVVKKAVTNLLLSNWIGVYDNDYAEFIRAIKLDSDENELIKFRNLAEMALSEIFKKRKIADLVAYLNLPASKDFKNCLPLEKSTLEMLIVWKMISKCYQDYISGKRNGNDIKDDVGSEDEEESAVANQTISNVDIFPELSVLCDYMEKFVTDFSGEMETKNQKIYFSQCIVALLEIVQLNDLDDEVGKERLKKLLKTILLTYDISEFAIQEIAHVVEKIIPDVNTRLAFFNQVVTEMIKPGAPSEYSRQTIIDDLIDRSDIDRKVQAKQIKLRMMDLKEQESSFAQQKQYAECQKVSEEYNKLNGELIELLKPVASQTSSESTQSLLENLSSVVTSKKITQSEILKNLRICYFAIVSRGVKVITHEILQIYKEFVRYHLESAYVETRIWALKTATAYSLLYESIAKEIYIIIKSQVFRFTHAVLWECSIQCIFDLLLRYGIEKMDGDSNETSVSMSMTNRSKRGGRTLYTDVEDEEDEPDELNIAKTLDVMQMLLHLLEQSNDVKITKVLIRGFCKLIVHSVYCTRELMSKFLLMYFNPATSAEISQILGIFLENIIKRKKQEYLHNALTPTIVTLVEAPYDSPLRDVKLDTVLKYIDNPDNKEILKIFAKELFELEIGEDPLLKRDMAKQVEILSGTYRAPLTFSSLAKAPNSGDENELEEEENEEEAANEEEKSSNDTKEIETEKSFEVKIEKIDIPADGLIDFTEGESPKAIETPAAAAADDEECSMNDEDIPQSQEPIELPATQEIHVELPATQDFSFGESSVNASVEDASIISSDVPSTPSTPMTVVKKPALKKQSAKAPMSAIDESGEESIVIPETPEVRSRRSALSNAKRQLNISQSTPATPSQSSPFRKLPRRAAATPKSALASPLATQSPRTPTSRLSTSLNATTSTPNTSRLTTRQQSRVEVAQKTTLTRSASKNLKVKPSTIVSKLTESEKSQKEVKKPENPKGTDARPVRATRSQNPSRPPWK
Type | Start | End | Length |
CDS |
33298 |
33534 |
237 |
CDS |
33605 |
33796 |
192 |
CDS |
33857 |
34476 |
620 |
CDS |
34543 |
36169 |
1627 |
CDS |
36254 |
36361 |
108 |
CDS |
36425 |
36586 |
162 |
CDS |
36644 |
37435 |
792 |
CDS |
37502 |
37573 |
72 |
intron |
33535 |
33604 |
70 |
intron |
33797 |
33856 |
60 |
intron |
34477 |
34542 |
66 |
intron |
36170 |
36253 |
84 |
intron |
36362 |
36424 |
63 |
intron |
36587 |
36643 |
57 |
intron |
37436 |
37501 |
66 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001661330 |
condensin, XCAP-G'-subunit, putative [Aedes aegypti] gb|EAT36904.1| condensin, XCAP-G'-subunit, putative [Aedes aegypti] |
1e-170 |
InterPro |
IPR011989 |
Armadillo-like helical |
|
InterPro |
IPR016024 |
Armadillo-type fold |
|
InterPro |
IPR025977 |
Nuclear condensin complex subunit 3, C-terminal domain |
|
Gene Ontology(MF) |
GO:0005488 |
binding |
|
Pfam |
PF12717.2 |
non-SMC mitotic condensation complex subunit 1 |
0.0058 |
Pfam |
PF12719.2 |
Nuclear condensing complex subunits, C-term domain |
1.8e-34 |
Pfam |
PF02985.17 |
HEAT repeat |
2e-08 |
Pfam |
PF13646.1 |
HEAT repeats |
1.8e-09 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
M. musculus |
ENSMUSG00000015880 |
A. gambiae |
AGAP007568 |
H. sapiens |
ENSP00000425625 |
H. melpomene |
HMEL009419-PA |
P. vanderplanki |
Pv.13019 |
D. plexippus |
DPOGS207031PA |
H. sapiens |
ENSP00000251496 |
A. mellifera |
GB17574-PA |
P. humanus |
PHUM527880-PA |
S. invicta |
SI2.2.0_15411 |
P. vanderplanki |
Pv.12273 |
A. aegypti |
AAEL011049 |
B. mori |
BGIBMGA012974-TA |
T. castaneum |
TC007921 |
T. castaneum |
TC009926 |
C. quinquefasciatus |
CPIJ006884 |
D. melanogaster |
FBgn0259876 |
N. vitripennis |
NV15731-PA |