MidgeBase gene description page [Pn.00294]

Outline

Gene ID Pn.00294
Type Protein coding gene
Scaffold PnScaf352
Start 74358
End 78996
Direction +

Sequence

Transcript: 4515 (bp)

 ATGGAGAATGTTTTGGCTGTGAAAATCATTACGTGCGATTATTATAACGCTCAGCCAATACCGGGACTAGACGTGACACATTCAGAATTTAGAGAAATTGCTGTGAAATATGTTCCTGTGCTCAGAATTTTCGGAATCACCAAAAATAAGCAAAAGATTTGCTGCAATGTTCATAATGTGATGCCGTATTTGTTTGTGCCATGTCCAGAATCAAATCCGGAAAAGATTGACTCGTTAATGAAGGCAATGGCAGTTGAAATCGACCAAAGGATAAACGTTTCATTTGGTCAATCGAATAGCTCACAGCAACACGTTTACAGAATATCTCTAGCAAAAGGAAGATTCTTCTATGGATATCATCACAACGAGAATATGCTCTTACGAATCGAGTTATACAATCCAAACTTACTTAAACGAACAGCTGGGCTTTTACAGAGCGGCTGCATACTAGGACGAATTTTCCAGAGCTATGAAAGTCACATTCCGTATGCCATGAGATTCTTTATTGACTTTAATTTGTTTGGCATGAGTTATCTTCATGTGCCTCTAAACAAGGTGCATGCACGATTGTCGACCAGCGAAGGAGCTCACAAGAAGAAATCTGTTTCCCATTTAGAAGTGGACTTTAACGCAATTCACATCATTAATCGAACAGTATTAAGCGAGAGCGAAGAGACTGATAAAGCGGCAAACATAGGAATAGAATTTCTTTGGGAAGACGAAAGACGAAGGAGAGAGAGCCACTTTAGTTCACAGGATATGCCGGAATTGAAGCCAGCAGAAATCGAATTCAGACCGCTCTGCACAATCGCTTCCAGTGATACATTTTTTCTGTGCGCATTGAGGGATAAATTGAAGTCGCTCTTTCATGATAAAAACAACTGTGTAGAAGCAGGACCATCGTCGGACAATAAAATCAATAAAAAGACCTTTGACTTAAAACAATTTCTCGATTCATCAACATATGGAGTGGAATTTTCGCAGAGTTCCTCAGAAGACTTTGAAGCGATTAATGAGAGTATATCAATAGAAGAATTGGAAAAGGTTCTTTTAGACTCTGAACCAATGGATGAAGATCAAAAAGAAATTCTCGAAATACAAAAACTGCTGGAGAATGAAAGCAGTGATTCAGACAATGACAGCATTCTGGCTCCGTTGTCACAGCAACAGTTCCCCGATAGCACTTTAAAAAAAGTTCAACAAGCGGTAAAAGCAGACTTCAGTAAAAGTCTGTTGATTAATGAAGAGAGCAACTTCGAAGAGGATGACGACGATGATTATGATGCATTCAACATGACACTCGCGGACATGGAAGCAGAAATATTTGGTAAAGGCGATGAGACCATTCCACAGTTAGATGGCGCTATCGATGAACCTGTACCAAGCACATCAGCCGCCTCAATTTTATCAAGTCATGATATTCAAGAAAGTCTCCAAAATTTAGCTAGCTTTTTTACTGCGTCTCAAGATAGTTCGAGAAATGGAAGCGAGATTGAAGCAGATGAGGAGATGGACGTTAATGATGAGAATGAAGACGAAAAGATGAAAAGCTTTTACAATGAGTCTCAATTTTATGATTTTGAAGATTTGGAAATAAGCAGCCCTGAAGAGTCTGTGAAAATTAAGGAGGAAATTACTGAAGACGATTTGTCGAATTTTAAAAAGTGTGTGATCACACCAAAATCAGAGCCCCCTCATCCCTCCAGTGTGGTTGAAAATTTAAAAGGTTACAATATACCTGAGACGGTCAACATGTCTGCATTCTATAGTAATCCAGCGGATTTAACGGAAAAGAAAGAGATTGGAAACACAATTTTGGATGTACGAAGCGATCGTCTCAATGACTTCGATGACTTCAAAAGCGTTTTATTCGATAAAAATCAATTCCATCTTCAGCAAAACCAGAAAATTTCATACAACCTGGGATTTGCACAGAAAAATAACAAGGTAATTTATGATATTATTGACGATCGGAGGGATGTCGTCATTCATCCAATAA
ATGTACCACCAACAAACAAAGAAGTTAAATTATGGATTAAATCAACAGCAGCTCCTGAACGTGAATCGCAACAAATAGGCATTGAACAAGATAGTCCGGTAAAAGTTAAACGGGAAAAGACAATAATGGTCCTCGAATCTGATGAAATTGACCCATTCAACGACATCACTCTTCTAGATCTTGATAAAACACTGGTGCCAGAGACCGGAACTCCCAAAGATTACAATACTGTCACGCTAATACCAAACTCAAATGAGAAGGAAACACCGTCTTTAACGGAATTTGTGGAGGAAGGAAAATATTTGAGTTATAGTGCGCGGAAAAAGCGGAAACGAAAAATGAAACAAAGTTTTAGTAAACGATTTCAAGAGATAATGAAGGCAAAGGTAGTTGCCGGTAAAGATGATTCAGTTCCCAATCCTCAAAATTCTCCTCTGACCGGTACATCGGACTCATCAGAAGATTTTCTGCAACAGACTGCCGATAAATGTTCAGACAGCACATCTATCAGCCCTTCTTTCTTTCAAGATGCAAACATCCCGCTAGGCGATAGTCACGATTCATCGACAATAAACAGTTCCTTCGGGTTCAAGGTTAAACTAGAAAGTCTTCACACGAACGACGAACACATTGATCTCACAATTCTGTCGATGGAACTTCATGTACAAACAAAAGGCGAATTCAAACCGAATCCTGAGACAGACGATATCTCTGCCATTTTCTACTCAGTCGAAGGATATTATGTCGATGGTATCGCTACATTCTTAAGCGGGATTATAGTCGTAGAATGTGACGACAACTTGGGTTTCTTTAAGGATGATATTGAGGTGATTCGAGTAAAAAATGAAATGGAGTTGCTTGATAATTTTTTTCAAAAAATTCGACTCTTTGATCCAGACATTTTTGCTGGCTACGAAATTGAGTTAAACTCTTGGGGATTCTTAATTGAGCGAGGCTACGTACTAAGCATGAATTTTTGTAACATTCTATCACGAATGCCACTTGAAAAGGAATACAAGCCTAAAAATACAAATCGTGATGACGACCAAGATTTCGACCAAGGAGATTATCAGACCGAACAAAAAATACCTGGAAGAATATTATTGGACGTGTGGCGGCTTATGAGGCACGAAATTGCTCTCACATCATACACCTTTGAAAATATTGCTTATCATATTCTGCACAGAAGATATCCGAAGCACTCGTACAGCTATCTTACTAAAATGTGGCAGGAGCCATTGAGAAGGTGGATTGTCCTTGAATACTACACGATTCGGTCGAAGACACTCTTAGAAATTTTGACACAGCTCGATTTGGTCGGAAGAACATGTGAATTGGCAAAGCTTTTTGGGATTCAGTTCTTTGAAGTTCTTTCACGTGGTTCACAATTTAGAGTCGAAAGCATGATGTTGCGTATAGCTAAAAGAAGAAATTACGTTGCGGTGTCACCGAGTGTTCAGCAGAGAGCTCATCAACGTGCGCCAGAATATGTGCCGCTCATCTTAGAGCCGGAATCTCGCTTCTACACCGATCCTGTCATCGTTCTCGATTTTCAAAGCTTATATCCGAGCATGATTATTGCGTATAATTATTGCTTTTCGACTTGTCTCGGAAGAATTGAACATTTGCTTAAGGGCTCATCGCAGCCATTCGAGTTTGGGGCATACCAGCTAAAAGTACCGCCAGAGAGACTTAAATTCTTCCTTGATAATGATCTGCTAACTGTTTCACCTGCTGGGATAGCGTTTGTTAAATCATCAGTGAGAGAAGGTGTTTTGCCCAGAATGCTCAAAGAAATTCTTGATACCCGGCTCATGGTGAAGCAGTCGATGAAATTATACAAAAATAACACCGCACTCCAACGAATTCTTCACTCAAGACAATTGGGATTAAAACTGATTGCGAATGTTACCTACGGCTACACAGCAGCCAATTTCAGCGGCAGGATGCCGTCCATTGAGGTCGGTGATTCAGTCGTCAGCAAAGGGAGAGAGACACTT
GAAAGAGCAATAAATATCGTTGAGCAAAATAAAAACTGGAATTGTAAAGTTTGCTATGGAGACACCGATTCAATGTTTGTATTGGTACCTGGGAGAACACGTGAAGAGGCATTCCGAATAGGCTCTGAAATCGCAGATGTGATCACTAACGACAACCCGTATCCGGTGAAGCTGAAATTGGAAAAGGTGTACCAGCCGTGCATATTGCAAACCAAAAAACGGTATGTAGGCAACATGTATGAGTCAGTCGATCAAAAAGAACCGATATTTGAGGCCAAAGGAATTGAAACAATAAGGAGAGATGGATGTACAGCCTCTTCAAAAATTTTACAGAAATCGCTAAAAATTCTCTTCGAAACATACGATATTAGTCGAGTAAAGGAGTACGTGTGCAGGCAATTTACGAAGATTCTCGAAGGTAGATACAGCATTCAAGATTTAATTATCGCAAAGGAATTTCGAGGAGTTCAAGGCTACAAGGAAAAGGCAGTGGTTCCTGCTCTCACATTAACCAAG 

Protein: 1505 (aa)

 MENVLAVKIITCDYYNAQPIPGLDVTHSEFREIAVKYVPVLRIFGITKNKQKICCNVHNVMPYLFVPCPESNPEKIDSLMKAMAVEIDQRINVSFGQSNSSQQHVYRISLAKGRFFYGYHHNENMLLRIELYNPNLLKRTAGLLQSGCILGRIFQSYESHIPYAMRFFIDFNLFGMSYLHVPLNKVHARLSTSEGAHKKKSVSHLEVDFNAIHIINRTVLSESEETDKAANIGIEFLWEDERRRRESHFSSQDMPELKPAEIEFRPLCTIASSDTFFLCALRDKLKSLFHDKNNCVEAGPSSDNKINKKTFDLKQFLDSSTYGVEFSQSSSEDFEAINESISIEELEKVLLDSEPMDEDQKEILEIQKLLENESSDSDNDSILAPLSQQQFPDSTLKKVQQAVKADFSKSLLINEESNFEEDDDDDYDAFNMTLADMEAEIFGKGDETIPQLDGAIDEPVPSTSAASILSSHDIQESLQNLASFFTASQDSSRNGSEIEADEEMDVNDENEDEKMKSFYNESQFYDFEDLEISSPEESVKIKEEITEDDLSNFKKCVITPKSEPPHPSSVVENLKGYNIPETVNMSAFYSNPADLTEKKEIGNTILDVRSDRLNDFDDFKSVLFDKNQFHLQQNQKISYNLGFAQKNNKVIYDIIDDRRDVVIHPINVPPTNKEVKLWIKSTAAPERESQQIGIEQDSPVKVKREKTIMVLESDEIDPFNDITLLDLDKTLVPETGTPKDYNTVTLIPNSNEKETPSLTEFVEEGKYLSYSARKKRKRKMKQSFSKRFQEIMKAKVVAGKDDSVPNPQNSPLTGTSDSSEDFLQQTADKCSDSTSISPSFFQDANIPLGDSHDSSTINSSFGFKVKLESLHTNDEHIDLTILSMELHVQTKGEFKPNPETDDISAIFYSVEGYYVDGIATFLSGIIVVECDDNLGFFKDDIEVIRVKNEMELLDNFFQKIRLFDPDIFAGYEIELNSWGFLIERGYVLSMNFCNILSRMPLEKEYKPKNTNRDDDQDFDQGDYQTEQKIPGRILLDVWRLMRHEIALTSYTFENIAYHILHRRYPKHSYSYLTKMWQEPLRRWIVLEYYTIRSKTLLEILTQLDLVGRTCELAKLFGIQFFEVLSRGSQFRVESMMLRIAKRRNYVAVSPSVQQRAHQRAPEYVPLILEPESRFYTDPVIVLDFQSLYPSMIIAYNYCFSTCLGRIEHLLKGSSQPFEFGAYQLKVPPERLKFFLDNDLLTVSPAGIAFVKSSVREGVLPRMLKEILDTRLMVKQSMKLYKNNTALQRILHSRQLGLKLIANVTYGYTAANFSGRMPSIEVGDSVVSKGRETLERAINIVEQNKNWNCKVCYGDTDSMFVLVPGRTREEAFRIGSEIADVITNDNPYPVKLKLEKVYQPCILQTKKRYVGNMYESVDQKEPIFEAKGIETIRRDGCTASSKILQKSLKILFETYDISRVKEYVCRQFTKILEGRYSIQDLIIAKEFRGVQGYKEKAVVPALTLTK 
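As a quick sanity check on the two sequences above, the following minimal Python sketch translates the first 24 bases of the transcript and compares the result with the start of the listed protein. It assumes the standard genetic code (the page does not state which translation table was used), and the names `CODON_TABLE` and `translate` are illustrative helpers, not part of MidgeBase.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the page does not say which translation table the annotation used).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 24 bases of the transcript listed above.
transcript_prefix = "ATGGAGAATGTTTTGGCTGTGAAA"
protein_prefix = translate(transcript_prefix)
print(protein_prefix)  # MENVLAVK, matching the first eight residues of the protein
```

The same function can be applied to the full transcript once its wrapped lines are joined into a single string with whitespace removed.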
Type    Start  End    Length (bp)
CDS     74358  74534  177
CDS     74598  74675  78
CDS     74734  78993  4260
intron  74535  74597  63
intron  74676  74733  58
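The coordinates in this table can be cross-checked against the transcript and protein lengths given above. The sketch below assumes 1-based, inclusive coordinates, which is consistent with every Length value in the table (End - Start + 1). The names `FEATURES`, `feature_length`, and `total_length` are illustrative only.

```python
# Feature coordinates copied from the table above.
# Assumption: coordinates are 1-based and inclusive.
FEATURES = [
    ("CDS", 74358, 74534),
    ("CDS", 74598, 74675),
    ("CDS", 74734, 78993),
    ("intron", 74535, 74597),
    ("intron", 74676, 74733),
]


def feature_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def total_length(features, kind: str) -> int:
    """Sum the lengths of all features of the given type."""
    return sum(feature_length(s, e) for k, s, e in features if k == kind)


cds_total = total_length(FEATURES, "CDS")        # 177 + 78 + 4260
intron_total = total_length(FEATURES, "intron")  # 63 + 58
gene_span = feature_length(74358, 78996)         # Start/End from the Outline

print(cds_total)                             # 4515, the transcript length in bp
print(cds_total // 3)                        # 1505, the protein length in aa
print(gene_span - cds_total - intron_total)  # 3
```

The three CDS segments sum to the 4515 bp transcript, which divides evenly into 1505 codons. The 3 bp left over between the last CDS end (78993) and the gene End (78996) would be consistent with a stop codon excluded from the CDS, although the page does not say so explicitly.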

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001660448 DNA polymerase zeta catalytic subunit [Aedes aegypti] (gb|EAT38236.1) 0.0
InterPro IPR023211 DNA polymerase, palm domain
InterPro IPR006133 DNA-directed DNA polymerase, family B, exonuclease domain
InterPro IPR012337 Ribonuclease H-like domain
InterPro IPR017964 DNA-directed DNA polymerase, family B, conserved site
InterPro IPR006172 DNA-directed DNA polymerase, family B
InterPro IPR006134 DNA-directed DNA polymerase, family B, multifunctional domain
Gene Ontology(BP) GO:0006260 DNA replication
Gene Ontology(BP) GO:0006139 nucleobase-containing compound metabolic process
Gene Ontology(MF) GO:0003887 DNA-directed DNA polymerase activity
Gene Ontology(MF) GO:0003677 DNA binding
Gene Ontology(MF) GO:0000166 nucleotide binding
Gene Ontology(MF) GO:0003676 nucleic acid binding
Pfam PF03104.14 DNA polymerase family B, exonuclease domain 1.5e-30
Pfam PF00136.16 DNA polymerase family B 4.6e-76

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
D. plexippus DPOGS208944PA
H. melpomene HMEL016066-PA
A. aegypti AAEL009851
T. castaneum TC004938
H. sapiens ENSP00000351697
P. humanus PHUM106820-PA
A. mellifera GB18804-PA
A. gambiae AGAP013386
N. vitripennis NV10399-PA
H. sapiens ENSP00000402003
H. sapiens ENSP00000357792
C. quinquefasciatus CPIJ013042
B. mori BGIBMGA002415-TA
S. invicta SI2.2.0_12915
M. musculus ENSMUSG00000019841
D. melanogaster FBgn0002891
P. vanderplanki Pv.08344
H. sapiens ENSP00000357795