MidgeBase gene description page [Pn.00334]
Outline
Gene ID | Pn.00334 |
Type | Protein coding gene |
Scaffold | PnScaf390 |
Start | 8419 |
End | 12863 |
Direction | - |
Sequence
Transcript: 4056 (bp)
ATGCCTTGTACGTGCGTCGCAAGCTATAAAGAAAAGGTATCACACCTCCAAAAGCTTCTTGAGGATGAGAGGAAGAAAAGTGAAGATCTTCAGTTTACAATCGACGAAGCTCAAACAAATGCTGACGAAAATGAGAGTACTCTTCGTAACCGAATCAAGCAACTTGAGCAAGCTTTAAGCGAAGGCAATCATGCTTCAGCAGGCGGTGACGCTGTGCAAAATGTTGAGGTCATTAACAGCTTGCGCAGCAACGTGGAAGAACTCACCAAGAAGCTACTACTCGCAGAATCGTTGAGGTCCACGCTGGAAAATGAAGTGCAGTCGTTGAATGAGAAGATCACTGAATATACTGGCGAATTGGATTTGAAATCTTCAGCTAACGAGGCACTGGAAAGGGTGTTTAAAGAAGAAATTAACTACTTGCAAGGTCGTGTGAATGATTTGGAAAAGGAAATCACTGAAAAGTCATCTGAAATAGAGAAATTCGAGAGCAAGCTTGCAGAGACCGATGTATCTCGCAACGATGAACTGACCATGCTCGAAAATCAGATGAAGGAGCGAGTGGCTGCGCTAGTCAGCAGTGAAAATTCCCTCGTGCAGCAACTCAACGAGAAAAATTCAGAGATTGAGCAGCTCAAGTTAGACATTGAGAAGCTCACAAGCTCAGACGCTGGTGCTCAAGAAATCATCGACAAGAAGGAATCGGAGATTCTCAAACTGACTGAAACAATAAAGTGCCGCGATGATTCGATTAAGGAATTTTCAACAGAGTTGACCAAAAAAATTGATGAAGTCGCTGTGCACCTCAATGAAATCGAAAGTCTGCAAAAATCAATTGAAGCATTGCAAATTGAACAAAGTAACAGCAAAGAGACTTGCTCTCAATATTTGGCAGAGAAAGAAAAGAACTTGGCAGAAATTTCGCACCAAAAGTCTGAAATTGAGGCTCAAGCTGCAGAGATTGCGTCCCTTAAAGCAGACATTCTGAGATTGGAAACATCGCTGAAAGAGTTGAATGAAACGGTAGCTAAATCGCAGGCAGCAGAGACAGATTTGACCACCAAAGTTACTGACATGTCCGAGTCTAAAGTAAAACTCGAAAACATTCTCGCTCTGAAGGATAAAGAAATGGTGGAGTTGAACGATTCGAAAGAGAAACTCATAAAGGAAATTGAAGAAATCAAAAAGTGCATGAGTGGCCTTAATGCGGAAGCAGAAGGAACGGTTAGTCTCCTAAATGAAAAGGACTCTCTTATCGAGAAATTGACGAAAGACTTCAACGAGCTGAAGGAAGGAAGTCAAAGAGAAATTTTGCAACTCAAAGAAGCTAAAAATTTGCTCGAAAATAAATTGCTTGAGGAACAAAAGTCATTTGAGGAGAAGTTGAATGGAGAAAGAAAAGAAAATGAGTCTGAGAAAGTTAAACTAAAGTCCACCATTGAGGAATTTGAAAAGATTTCGAAGGAAAAGGATGCACGCCTTGAGGCGATGGAGAAGAATTGCGCGCTGCTCAGCGAAACCAAGGGAAATCTAGAAAGGTCCATTGCTGACAACGAATCAAAACTCAACGAAACCTTGACCAAATTAAATAAATTAACGACTGAATACGAGAAAAAGGTGTCGCAATACGAGAAGCTTCAGTCAGCCCATGAAGACATTTTTAATAAGTTCAAGGATGCTGAAGTGACAGCAAGGAACTTGCAAGATGACAAAAAGACACTCGAGGTAGCCAACAAGGACCTAAATCGCAAGCTCGGTGCTTTGGATGAAAAAATTGCACAAGTAAATGATCAAAAGACAAAGCTTCAGGAGGAGCTCAATGCGCTCAGTACCTCATCGTTTGATGCCAATAGCGAGCTTAAGAAGCTTCACGATGACCTGAGAGAAAAACAAGCCGCTTTCGAGTCCTACCAAAGCGAGGCGGACAAGAAAAGCTTTGAGCTGCAGCGTCGGGTCGATGAGCTGGAAAATCAGCGAGATGCAGCCAATCAGATGTGCACCAAATTGAAGAATGAGATGGATGTCCTTAATCAGCAAAAAATTGAGAATGAGAATAGCCTGAATACAGATTTGCAAAACATTAAGGCGAGAACGGAGGAGGAGAGAACGACTTTGGAAAATGAAATTAGCAGCCTGCGTGCTGCCTTTGAAAACGAACGTGCAGAGTTGAAGAGAGCAAATATTGAGCTAACAGAGGCTGCTGAAAGATGCAAGAATGAGTTGGAGGCAAAAATAAAGAGCCTCAACGAGAATCTAGAAGACTTGAGAAAAGCTACCGAAGAGGCACGCAACGACACCAAAAACTCCGAATCACGTTTCGGTGCTGCTATCGAGGAGCTTAAATCTAAGGAACTGCAGCTAAATGAAGATTTGTCAAAGGAGCGCGAGGAAACGGACAAATTGAAACATAAATTAGAAGAACTCGAAGCATCAAAAGACATCCAAAGGAATGAGTATGAGGAAAAATTGGCCATTGAAAAGGCAAAAATTTCCAAGCTCGAGAGTGATTTGGCTCTGCAAATGGAGCAGCATAAGCTAGCATCGGCTGGCAATGACGAAAAGGTTAAAATTCTTGATGAGATTCAAGCCAAATGTTTCGAATATGAGCAAAAAGTCTCAGATCTTAGCAACCAAATTCAGAATGAAATCTCATCGAAGACTAAGCTTGAGGAGAGCATTAAAGTCCTCCAAGAAAAGCTCCGCGAGATAGAGGAGGAACAAGTCGATTTGGTCACTAGGAAGGAGGAATACAAAAACCAGACTATCACGCTTGAAGAGCAGATAAAAGATCTCAACCAACGGCGTGATAGCTTAGATGAAAGAATTTCACTCGAAAGGAAGGAGTCTGAACTGTTTAAAACAGAGTCTGACATCAAAATTAAAGAGATGCAGCAAAAATTGAATGAACTTCATCAAACGATAGCCGCCAAGGACTACGAGCTGGAGGAATACATATCGACCTTGAAGCAACGTGACGATAAACTGAATCAGTTGAGTGAAGCTATCAAAGAGCAGGATGTAAAGCTTAAGACTGAAATCGAAAGCGTTCACAAAGAAATGAAGCTAAAGGACAATGAACTGGCAAAAATGGCAGAAGAGTGCAGTGTCAAGGAGAATTTACTAGCAGACTTGAGAAACACTGTCGCCAATCTCAAGACGTGCCTTAACTCAATGAATAGTGAAAAGACATCTAGCAGTAGTGCCGTTGAAGAGCTAAACAAGTCAATCCACGTTCGCGATGAAAAAATCAACGAGTTAACATCCAAATTGTCGTTACTTGAAGCAACTGTCAACGAGAAAGACGATCAACTGAAAAATATTTACAGCATCAAGAGTAAGGTCGAAAGTGAGAGTAAAATGCGAATTGACGACTTACTGGAGCAGCTGACTCTACTCGATGAAGTTAAGAGTAAGGAAGTTGCCGAGATTCAGGATAAGCTGACACAGCTGTCGAGTCAGATGACAATTTATCAAAGTGAATCTGAAACATCGCGAATATCCGAAAAAGACGTTGAACGAGAACGACAGGAGTATCTCGCACGCATCCGAGATTTGGAAATATCGGAAACTGAATTGCAAATTTCAAATAAATCTCTTCAAAAGCAATTGGAGGAAGCTCAAAGCAGCTCAAATATTATTCCAAAGCCCGATGCCGCAGCAGATGGCCAAGACTTATTGGAGCACATTGAATTTTTGAACTCGATCATTGCGGACATGCACAAGAAAGACTTGAAGCTGGTCAAACGAGTACAAGCTCTTGAAAGTGAAATATCGAAATCAAGCAATTCGTCGTTCAAGGACATTGACTTTGACACGAAATTTATGGATAAGAAATTGCCACCGCCGAGAATGTACTGCGACATTTGCGAGGAATTTGACGCGCACGAAACAGAAGACTGTCCTACACAGTGCTCTGGAACCGATCCAGATGCTCCAGCCGTGCGACGAGACGAGAAAAAAGAGCGCAAGAAGCCACCGCCGAGGAAATATTGTGACTTTTGTGAAGTCTTTGACGCCCATGAGACGGAAGAGTGTCCAAATAGCGACGAAACATTT
Protein: 1352 (aa)
MPCTCVASYKEKVSHLQKLLEDERKKSEDLQFTIDEAQTNADENESTLRNRIKQLEQALSEGNHASAGGDAVQNVEVINSLRSNVEELTKKLLLAESLRSTLENEVQSLNEKITEYTGELDLKSSANEALERVFKEEINYLQGRVNDLEKEITEKSSEIEKFESKLAETDVSRNDELTMLENQMKERVAALVSSENSLVQQLNEKNSEIEQLKLDIEKLTSSDAGAQEIIDKKESEILKLTETIKCRDDSIKEFSTELTKKIDEVAVHLNEIESLQKSIEALQIEQSNSKETCSQYLAEKEKNLAEISHQKSEIEAQAAEIASLKADILRLETSLKELNETVAKSQAAETDLTTKVTDMSESKVKLENILALKDKEMVELNDSKEKLIKEIEEIKKCMSGLNAEAEGTVSLLNEKDSLIEKLTKDFNELKEGSQREILQLKEAKNLLENKLLEEQKSFEEKLNGERKENESEKVKLKSTIEEFEKISKEKDARLEAMEKNCALLSETKGNLERSIADNESKLNETLTKLNKLTTEYEKKVSQYEKLQSAHEDIFNKFKDAEVTARNLQDDKKTLEVANKDLNRKLGALDEKIAQVNDQKTKLQEELNALSTSSFDANSELKKLHDDLREKQAAFESYQSEADKKSFELQRRVDELENQRDAANQMCTKLKNEMDVLNQQKIENENSLNTDLQNIKARTEEERTTLENEISSLRAAFENERAELKRANIELTEAAERCKNELEAKIKSLNENLEDLRKATEEARNDTKNSESRFGAAIEELKSKELQLNEDLSKEREETDKLKHKLEELEASKDIQRNEYEEKLAIEKAKISKLESDLALQMEQHKLASAGNDEKVKILDEIQAKCFEYEQKVSDLSNQIQNEISSKTKLEESIKVLQEKLREIEEEQVDLVTRKEEYKNQTITLEEQIKDLNQRRDSLDERISLERKESELFKTESDIKIKEMQQKLNELHQTIAAKDYELEEYISTLKQRDDKLNQLSEAIKEQDVKLKTEIESVHKEMKLKDNELAKMAEECSVKENLLADLRNTVANLKTCLNSMNSEKTSSSSAVEELNKSIHVRDEKINELTSKLSLLEATVNEKDDQLKNIYSIKSKVESESKMRIDDLLEQLTLLDEVKSKEVAEIQDKLTQLSSQMTIYQSESETSRISEKDVERERQEYLARIRDLEISETELQISNKSLQKQLEEAQSSSNIIPKPDAAADGQDLLEHIEFLNSIIADMHKKDLKLVKRVQALESEISKSSNSSFKDIDFDTKFMDKKLPPPRMYCDICEEFDAHETEDCPTQCSGTDPDAPAVRRDEKKERKKPPPRKYCDFCEVFDAHETEECPNSDETF
Type | Start | End | Length |
CDS |
8422 |
8677 |
256 |
CDS |
8734 |
12398 |
3665 |
CDS |
12667 |
12777 |
111 |
CDS |
12840 |
12863 |
24 |
intron |
8678 |
8733 |
56 |
intron |
12399 |
12666 |
268 |
intron |
12778 |
12839 |
62 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001846683 |
condensin, SMC5-subunit [Culex quinquefasciatus] gb|EDS44350.1| condensin, SMC5-subunit [Culex quinquefasciatus] |
1e-104 |
InterPro |
IPR009053 |
Prefoldin |
|
Pfam |
PF04582.7 |
Reovirus sigma C capsid protein |
0.00036 |
Pfam |
PF05465.8 |
Halobacterial gas vesicle protein C (GVPC) repeat |
0.86 |
Pfam |
PF13696.1 |
Zinc knuckle |
1.1 |
Pfam |
PF06220.7 |
U1 zinc finger |
0.021 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
A. gambiae |
AGAP009210 |
C. quinquefasciatus |
CPIJ005305 |
H. sapiens |
ENSP00000223398 |
H. sapiens |
ENSP00000355151 |
H. sapiens |
ENSP00000446379 |
H. sapiens |
ENSP00000437786 |
D. melanogaster |
FBgn0020503 |
H. sapiens |
ENSP00000460103 |
H. sapiens |
ENSP00000355314 |
H. sapiens |
ENSP00000303585 |
H. sapiens |
ENSP00000378500 |
P. vanderplanki |
Pv.13818 |
S. invicta |
SI2.2.0_05857 |
H. sapiens |
ENSP00000351665 |
H. sapiens |
ENSP00000460322 |
H. sapiens |
ENSP00000445531 |
D. plexippus |
DPOGS212139PA |
A. aegypti |
AAEL013698 |
H. sapiens |
ENSP00000461219 |
H. sapiens |
ENSP00000441409 |
A. mellifera |
GB10948-PA |
H. sapiens |
ENSP00000438743 |
M. musculus |
ENSMUSG00000063146 |
A. aegypti |
AAEL013697 |
P. vanderplanki |
Pv.06642 |
B. mori |
BGIBMGA006589-TA |
C. quinquefasciatus |
CPIJ005306 |
T. castaneum |
TC011966 |
H. sapiens |
ENSP00000439093 |
P. humanus |
PHUM291650-PA |
A. aegypti |
AAEL015374 |
A. mellifera |
GB14183-PA |
H. melpomene |
HMEL007733-PA |
M. musculus |
ENSMUSG00000049550 |
T. castaneum |
TC011965 |
N. vitripennis |
NV11205-PA |