MidgeBase gene description page [Pn.00334]

Outline

Gene ID Pn.00334
Type Protein coding gene
Scaffold PnScaf390
Start 8419
End 12863
Direction -

Sequence

Transcript: 4056 (bp)

 ATGCCTTGTACGTGCGTCGCAAGCTATAAAGAAAAGGTATCACACCTCCAAAAGCTTCTTGAGGATGAGAGGAAGAAAAGTGAAGATCTTCAGTTTACAATCGACGAAGCTCAAACAAATGCTGACGAAAATGAGAGTACTCTTCGTAACCGAATCAAGCAACTTGAGCAAGCTTTAAGCGAAGGCAATCATGCTTCAGCAGGCGGTGACGCTGTGCAAAATGTTGAGGTCATTAACAGCTTGCGCAGCAACGTGGAAGAACTCACCAAGAAGCTACTACTCGCAGAATCGTTGAGGTCCACGCTGGAAAATGAAGTGCAGTCGTTGAATGAGAAGATCACTGAATATACTGGCGAATTGGATTTGAAATCTTCAGCTAACGAGGCACTGGAAAGGGTGTTTAAAGAAGAAATTAACTACTTGCAAGGTCGTGTGAATGATTTGGAAAAGGAAATCACTGAAAAGTCATCTGAAATAGAGAAATTCGAGAGCAAGCTTGCAGAGACCGATGTATCTCGCAACGATGAACTGACCATGCTCGAAAATCAGATGAAGGAGCGAGTGGCTGCGCTAGTCAGCAGTGAAAATTCCCTCGTGCAGCAACTCAACGAGAAAAATTCAGAGATTGAGCAGCTCAAGTTAGACATTGAGAAGCTCACAAGCTCAGACGCTGGTGCTCAAGAAATCATCGACAAGAAGGAATCGGAGATTCTCAAACTGACTGAAACAATAAAGTGCCGCGATGATTCGATTAAGGAATTTTCAACAGAGTTGACCAAAAAAATTGATGAAGTCGCTGTGCACCTCAATGAAATCGAAAGTCTGCAAAAATCAATTGAAGCATTGCAAATTGAACAAAGTAACAGCAAAGAGACTTGCTCTCAATATTTGGCAGAGAAAGAAAAGAACTTGGCAGAAATTTCGCACCAAAAGTCTGAAATTGAGGCTCAAGCTGCAGAGATTGCGTCCCTTAAAGCAGACATTCTGAGATTGGAAACATCGCTGAAAGAGTTGAATGAAACGGTAGCTAAATCGCAGGCAGCAGAGACAGATTTGACCACCAAAGTTACTGACATGTCCGAGTCTAAAGTAAAACTCGAAAACATTCTCGCTCTGAAGGATAAAGAAATGGTGGAGTTGAACGATTCGAAAGAGAAACTCATAAAGGAAATTGAAGAAATCAAAAAGTGCATGAGTGGCCTTAATGCGGAAGCAGAAGGAACGGTTAGTCTCCTAAATGAAAAGGACTCTCTTATCGAGAAATTGACGAAAGACTTCAACGAGCTGAAGGAAGGAAGTCAAAGAGAAATTTTGCAACTCAAAGAAGCTAAAAATTTGCTCGAAAATAAATTGCTTGAGGAACAAAAGTCATTTGAGGAGAAGTTGAATGGAGAAAGAAAAGAAAATGAGTCTGAGAAAGTTAAACTAAAGTCCACCATTGAGGAATTTGAAAAGATTTCGAAGGAAAAGGATGCACGCCTTGAGGCGATGGAGAAGAATTGCGCGCTGCTCAGCGAAACCAAGGGAAATCTAGAAAGGTCCATTGCTGACAACGAATCAAAACTCAACGAAACCTTGACCAAATTAAATAAATTAACGACTGAATACGAGAAAAAGGTGTCGCAATACGAGAAGCTTCAGTCAGCCCATGAAGACATTTTTAATAAGTTCAAGGATGCTGAAGTGACAGCAAGGAACTTGCAAGATGACAAAAAGACACTCGAGGTAGCCAACAAGGACCTAAATCGCAAGCTCGGTGCTTTGGATGAAAAAATTGCACAAGTAAATGATCAAAAGACAAAGCTTCAGGAGGAGCTCAATGCGCTCAGTACCTCATCGTTTGATGCCAATAGCGAGCTTAAGAAGCTTCACGATGACCTGAGAGAAAAACAAGCCGCTTTCGAGTCCTACCAAAGCGAGGCGGACAAGAAAAGCTTTGAGCTGCAGCGTCGGGTCGATGAGCTGGAAAATCAGCGAGATGCAGCCAATCAGATGTGCA
CCAAATTGAAGAATGAGATGGATGTCCTTAATCAGCAAAAAATTGAGAATGAGAATAGCCTGAATACAGATTTGCAAAACATTAAGGCGAGAACGGAGGAGGAGAGAACGACTTTGGAAAATGAAATTAGCAGCCTGCGTGCTGCCTTTGAAAACGAACGTGCAGAGTTGAAGAGAGCAAATATTGAGCTAACAGAGGCTGCTGAAAGATGCAAGAATGAGTTGGAGGCAAAAATAAAGAGCCTCAACGAGAATCTAGAAGACTTGAGAAAAGCTACCGAAGAGGCACGCAACGACACCAAAAACTCCGAATCACGTTTCGGTGCTGCTATCGAGGAGCTTAAATCTAAGGAACTGCAGCTAAATGAAGATTTGTCAAAGGAGCGCGAGGAAACGGACAAATTGAAACATAAATTAGAAGAACTCGAAGCATCAAAAGACATCCAAAGGAATGAGTATGAGGAAAAATTGGCCATTGAAAAGGCAAAAATTTCCAAGCTCGAGAGTGATTTGGCTCTGCAAATGGAGCAGCATAAGCTAGCATCGGCTGGCAATGACGAAAAGGTTAAAATTCTTGATGAGATTCAAGCCAAATGTTTCGAATATGAGCAAAAAGTCTCAGATCTTAGCAACCAAATTCAGAATGAAATCTCATCGAAGACTAAGCTTGAGGAGAGCATTAAAGTCCTCCAAGAAAAGCTCCGCGAGATAGAGGAGGAACAAGTCGATTTGGTCACTAGGAAGGAGGAATACAAAAACCAGACTATCACGCTTGAAGAGCAGATAAAAGATCTCAACCAACGGCGTGATAGCTTAGATGAAAGAATTTCACTCGAAAGGAAGGAGTCTGAACTGTTTAAAACAGAGTCTGACATCAAAATTAAAGAGATGCAGCAAAAATTGAATGAACTTCATCAAACGATAGCCGCCAAGGACTACGAGCTGGAGGAATACATATCGACCTTGAAGCAACGTGACGATAAACTGAATCAGTTGAGTGAAGCTATCAAAGAGCAGGATGTAAAGCTTAAGACTGAAATCGAAAGCGTTCACAAAGAAATGAAGCTAAAGGACAATGAACTGGCAAAAATGGCAGAAGAGTGCAGTGTCAAGGAGAATTTACTAGCAGACTTGAGAAACACTGTCGCCAATCTCAAGACGTGCCTTAACTCAATGAATAGTGAAAAGACATCTAGCAGTAGTGCCGTTGAAGAGCTAAACAAGTCAATCCACGTTCGCGATGAAAAAATCAACGAGTTAACATCCAAATTGTCGTTACTTGAAGCAACTGTCAACGAGAAAGACGATCAACTGAAAAATATTTACAGCATCAAGAGTAAGGTCGAAAGTGAGAGTAAAATGCGAATTGACGACTTACTGGAGCAGCTGACTCTACTCGATGAAGTTAAGAGTAAGGAAGTTGCCGAGATTCAGGATAAGCTGACACAGCTGTCGAGTCAGATGACAATTTATCAAAGTGAATCTGAAACATCGCGAATATCCGAAAAAGACGTTGAACGAGAACGACAGGAGTATCTCGCACGCATCCGAGATTTGGAAATATCGGAAACTGAATTGCAAATTTCAAATAAATCTCTTCAAAAGCAATTGGAGGAAGCTCAAAGCAGCTCAAATATTATTCCAAAGCCCGATGCCGCAGCAGATGGCCAAGACTTATTGGAGCACATTGAATTTTTGAACTCGATCATTGCGGACATGCACAAGAAAGACTTGAAGCTGGTCAAACGAGTACAAGCTCTTGAAAGTGAAATATCGAAATCAAGCAATTCGTCGTTCAAGGACATTGACTTTGACACGAAATTTATGGATAAGAAATTGCCACCGCCGAGAATGTACTGCGACATTTGCGAGGAATTTGACGCGCACGAAACAGAAGACTGTCCTACACAGTGCTCTGGAACCGATCCAGATGCTCCAGCCGTGCGACGAGACGAGAAAAAAGAGCGCAAGAAGCCACCGCCGAGGAAATATTGTGACTTT
TGTGAAGTCTTTGACGCCCATGAGACGGAAGAGTGTCCAAATAGCGACGAAACATTT 

Protein: 1352 (aa)

 MPCTCVASYKEKVSHLQKLLEDERKKSEDLQFTIDEAQTNADENESTLRNRIKQLEQALSEGNHASAGGDAVQNVEVINSLRSNVEELTKKLLLAESLRSTLENEVQSLNEKITEYTGELDLKSSANEALERVFKEEINYLQGRVNDLEKEITEKSSEIEKFESKLAETDVSRNDELTMLENQMKERVAALVSSENSLVQQLNEKNSEIEQLKLDIEKLTSSDAGAQEIIDKKESEILKLTETIKCRDDSIKEFSTELTKKIDEVAVHLNEIESLQKSIEALQIEQSNSKETCSQYLAEKEKNLAEISHQKSEIEAQAAEIASLKADILRLETSLKELNETVAKSQAAETDLTTKVTDMSESKVKLENILALKDKEMVELNDSKEKLIKEIEEIKKCMSGLNAEAEGTVSLLNEKDSLIEKLTKDFNELKEGSQREILQLKEAKNLLENKLLEEQKSFEEKLNGERKENESEKVKLKSTIEEFEKISKEKDARLEAMEKNCALLSETKGNLERSIADNESKLNETLTKLNKLTTEYEKKVSQYEKLQSAHEDIFNKFKDAEVTARNLQDDKKTLEVANKDLNRKLGALDEKIAQVNDQKTKLQEELNALSTSSFDANSELKKLHDDLREKQAAFESYQSEADKKSFELQRRVDELENQRDAANQMCTKLKNEMDVLNQQKIENENSLNTDLQNIKARTEEERTTLENEISSLRAAFENERAELKRANIELTEAAERCKNELEAKIKSLNENLEDLRKATEEARNDTKNSESRFGAAIEELKSKELQLNEDLSKEREETDKLKHKLEELEASKDIQRNEYEEKLAIEKAKISKLESDLALQMEQHKLASAGNDEKVKILDEIQAKCFEYEQKVSDLSNQIQNEISSKTKLEESIKVLQEKLREIEEEQVDLVTRKEEYKNQTITLEEQIKDLNQRRDSLDERISLERKESELFKTESDIKIKEMQQKLNELHQTIAAKDYELEEYISTLKQRDDKLNQLSEAIKEQDVKLKTEIESVHKEMKLKDNELAKMAEECSVKENLLADLRNTVANLKTCLNSMNSEKTSSSSAVEELNKSIHVRDEKINELTSKLSLLEATVNEKDDQLKNIYSIKSKVESESKMRIDDLLEQLTLLDEVKSKEVAEIQDKLTQLSSQMTIYQSESETSRISEKDVERERQEYLARIRDLEISETELQISNKSLQKQLEEAQSSSNIIPKPDAAADGQDLLEHIEFLNSIIADMHKKDLKLVKRVQALESEISKSSNSSFKDIDFDTKFMDKKLPPPRMYCDICEEFDAHETEDCPTQCSGTDPDAPAVRRDEKKERKKPPPRKYCDFCEVFDAHETEECPNSDETF 
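
The transcript and protein listed above can be cross-checked by translating the transcript in frame. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this gene; the page itself does not state which table was used. The function name `translate` and the two excerpt variables are illustrative only. For brevity it translates only the first 30 and last 21 nucleotides of the transcript rather than all 4056.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Excerpts copied from the Pn.00334 transcript shown above.
first_30 = "ATGCCTTGTACGTGCGTCGCAAGCTATAAA"
last_21 = "CCAAATAGCGACGAAACATTT"

print(translate(first_30))  # MPCTCVASYK
print(translate(last_21))   # PNSDETF
print(4056 // 3)            # 1352
```

Both excerpts match the start (`MPCTCVASYK`) and end (`PNSDETF`) of the listed protein, and 4056 bp divided by 3 gives the listed 1352 aa, so the listed transcript carries no terminal stop codon.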

Type    Start  End    Length (bp)
CDS     8422   8677   256
CDS     8734   12398  3665
CDS     12667  12777  111
CDS     12840  12863  24
intron  8678   8733   56
intron  12399  12666  268
intron  12778  12839  62
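
The feature table above can be checked for internal consistency. The following is a rough sketch, assuming the coordinates are 1-based and inclusive, so that length = end - start + 1; this assumption reproduces every value in the Length column. The names `FEATURES`, `span`, and `is_contiguous` are illustrative and not part of MidgeBase.

```python
# Features of Pn.00334 copied from the table above: (type, start, end).
FEATURES = [
    ("CDS", 8422, 8677),
    ("CDS", 8734, 12398),
    ("CDS", 12667, 12777),
    ("CDS", 12840, 12863),
    ("intron", 8678, 8733),
    ("intron", 12399, 12666),
    ("intron", 12778, 12839),
]
GENE_START, GENE_END = 8419, 12863   # from the gene summary
TRANSCRIPT_BP, PROTEIN_AA = 4056, 1352


def span(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


def is_contiguous(features) -> bool:
    """True if the features tile the region with no gaps or overlaps."""
    ordered = sorted(features, key=lambda f: f[1])
    return all(nxt[1] == cur[2] + 1 for cur, nxt in zip(ordered, ordered[1:]))


cds_total = sum(span(s, e) for kind, s, e in FEATURES if kind == "CDS")
intron_total = sum(span(s, e) for kind, s, e in FEATURES if kind == "intron")
gene_span = span(GENE_START, GENE_END)

print(cds_total)                             # 4056
print(cds_total == TRANSCRIPT_BP)            # True
print(cds_total == 3 * PROTEIN_AA)           # True
print(intron_total)                          # 386
print(is_contiguous(FEATURES))               # True
print(gene_span - cds_total - intron_total)  # 3
```

The CDS segments sum to the listed transcript length, and the CDS and intron features tile positions 8422 to 12863 without gaps. The remaining 3 bp (8419 to 8421) are not covered by any listed feature. On the minus strand they lie at the 3' end of the gene, which would be consistent with a stop codon excluded from the CDS, but the page does not state this.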

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001846683 condensin, SMC5-subunit [Culex quinquefasciatus] gb|EDS44350.1| condensin, SMC5-subunit [Culex quinquefasciatus] 1e-104
InterPro IPR009053 Prefoldin
Pfam PF04582.7 Reovirus sigma C capsid protein 0.00036
Pfam PF05465.8 Halobacterial gas vesicle protein C (GVPC) repeat 0.86
Pfam PF13696.1 Zinc knuckle 1.1
Pfam PF06220.7 U1 zinc finger 0.021
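
The scores above span many orders of magnitude, so it can help to separate strong hits from marginal ones. The following is an illustrative sketch, assuming the last column is an E-value (lower is better) for both the BLASTP and Pfam rows. The cutoff of 1e-3 is an arbitrary example value, not a MidgeBase setting, and the names `HITS` and `significant` are hypothetical.

```python
# (program, accession, score) copied from the table above.
# The InterPro row has no score on the page, so it is recorded as None.
HITS = [
    ("BLASTP/NCBI-nr", "XP_001846683", 1e-104),
    ("InterPro", "IPR009053", None),
    ("Pfam", "PF04582.7", 0.00036),
    ("Pfam", "PF05465.8", 0.86),
    ("Pfam", "PF13696.1", 1.1),
    ("Pfam", "PF06220.7", 0.021),
]


def significant(hits, cutoff):
    """Return hits whose score is present and at or below the cutoff."""
    return [h for h in hits if h[2] is not None and h[2] <= cutoff]


# Example cutoff only (an assumption); choose one suited to the analysis.
for program, accession, score in significant(HITS, cutoff=1e-3):
    print(program, accession, score)
```

With this example cutoff only the BLASTP hit and PF04582.7 are retained; the three remaining Pfam hits (0.021, 0.86, 1.1) fall above it.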

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.00335

Orthologous genes

Species Gene ID
A. gambiae AGAP009210
C. quinquefasciatus CPIJ005305
H. sapiens ENSP00000223398
H. sapiens ENSP00000355151
H. sapiens ENSP00000446379
H. sapiens ENSP00000437786
D. melanogaster FBgn0020503
H. sapiens ENSP00000460103
H. sapiens ENSP00000355314
H. sapiens ENSP00000303585
H. sapiens ENSP00000378500
P. vanderplanki Pv.13818
S. invicta SI2.2.0_05857
H. sapiens ENSP00000351665
H. sapiens ENSP00000460322
H. sapiens ENSP00000445531
D. plexippus DPOGS212139PA
A. aegypti AAEL013698
H. sapiens ENSP00000461219
H. sapiens ENSP00000441409
A. mellifera GB10948-PA
H. sapiens ENSP00000438743
M. musculus ENSMUSG00000063146
A. aegypti AAEL013697
P. vanderplanki Pv.06642
B. mori BGIBMGA006589-TA
C. quinquefasciatus CPIJ005306
T. castaneum TC011966
H. sapiens ENSP00000439093
P. humanus PHUM291650-PA
A. aegypti AAEL015374
A. mellifera GB14183-PA
H. melpomene HMEL007733-PA
M. musculus ENSMUSG00000049550
T. castaneum TC011965
N. vitripennis NV11205-PA