MidgeBase gene description page [Pn.00294]
Outline
Gene ID | Pn.00294 |
Type | Protein coding gene |
Scaffold | PnScaf352 |
Start | 74358 |
End | 78996 |
Direction | + |
Sequence
Transcript: 4515 (bp)
ATGGAGAATGTTTTGGCTGTGAAAATCATTACGTGCGATTATTATAACGCTCAGCCAATACCGGGACTAGACGTGACACATTCAGAATTTAGAGAAATTGCTGTGAAATATGTTCCTGTGCTCAGAATTTTCGGAATCACCAAAAATAAGCAAAAGATTTGCTGCAATGTTCATAATGTGATGCCGTATTTGTTTGTGCCATGTCCAGAATCAAATCCGGAAAAGATTGACTCGTTAATGAAGGCAATGGCAGTTGAAATCGACCAAAGGATAAACGTTTCATTTGGTCAATCGAATAGCTCACAGCAACACGTTTACAGAATATCTCTAGCAAAAGGAAGATTCTTCTATGGATATCATCACAACGAGAATATGCTCTTACGAATCGAGTTATACAATCCAAACTTACTTAAACGAACAGCTGGGCTTTTACAGAGCGGCTGCATACTAGGACGAATTTTCCAGAGCTATGAAAGTCACATTCCGTATGCCATGAGATTCTTTATTGACTTTAATTTGTTTGGCATGAGTTATCTTCATGTGCCTCTAAACAAGGTGCATGCACGATTGTCGACCAGCGAAGGAGCTCACAAGAAGAAATCTGTTTCCCATTTAGAAGTGGACTTTAACGCAATTCACATCATTAATCGAACAGTATTAAGCGAGAGCGAAGAGACTGATAAAGCGGCAAACATAGGAATAGAATTTCTTTGGGAAGACGAAAGACGAAGGAGAGAGAGCCACTTTAGTTCACAGGATATGCCGGAATTGAAGCCAGCAGAAATCGAATTCAGACCGCTCTGCACAATCGCTTCCAGTGATACATTTTTTCTGTGCGCATTGAGGGATAAATTGAAGTCGCTCTTTCATGATAAAAACAACTGTGTAGAAGCAGGACCATCGTCGGACAATAAAATCAATAAAAAGACCTTTGACTTAAAACAATTTCTCGATTCATCAACATATGGAGTGGAATTTTCGCAGAGTTCCTCAGAAGACTTTGAAGCGATTAATGAGAGTATATCAATAGAAGAATTGGAAAAGGTTCTTTTAGACTCTGAACCAATGGATGAAGATCAAAAAGAAATTCTCGAAATACAAAAACTGCTGGAGAATGAAAGCAGTGATTCAGACAATGACAGCATTCTGGCTCCGTTGTCACAGCAACAGTTCCCCGATAGCACTTTAAAAAAAGTTCAACAAGCGGTAAAAGCAGACTTCAGTAAAAGTCTGTTGATTAATGAAGAGAGCAACTTCGAAGAGGATGACGACGATGATTATGATGCATTCAACATGACACTCGCGGACATGGAAGCAGAAATATTTGGTAAAGGCGATGAGACCATTCCACAGTTAGATGGCGCTATCGATGAACCTGTACCAAGCACATCAGCCGCCTCAATTTTATCAAGTCATGATATTCAAGAAAGTCTCCAAAATTTAGCTAGCTTTTTTACTGCGTCTCAAGATAGTTCGAGAAATGGAAGCGAGATTGAAGCAGATGAGGAGATGGACGTTAATGATGAGAATGAAGACGAAAAGATGAAAAGCTTTTACAATGAGTCTCAATTTTATGATTTTGAAGATTTGGAAATAAGCAGCCCTGAAGAGTCTGTGAAAATTAAGGAGGAAATTACTGAAGACGATTTGTCGAATTTTAAAAAGTGTGTGATCACACCAAAATCAGAGCCCCCTCATCCCTCCAGTGTGGTTGAAAATTTAAAAGGTTACAATATACCTGAGACGGTCAACATGTCTGCATTCTATAGTAATCCAGCGGATTTAACGGAAAAGAAAGAGATTGGAAACACAATTTTGGATGTACGAAGCGATCGTCTCAATGACTTCGATGACTTCAAAAGCGTTTTATTCGATAAAAATCAATTCCATCTTCAGCAAAACCAGAAAATTTCATACAACCTGGGATTTGCACAGAAAAATAACAAGGTAATTTATGATATTATTGACGATCGGAGGGATGTCGTCATTCATCCAATAAATGTACCACCAACAAACAAAGAAGTTAAATTATGGATTAAATCAACAGCAGCTCCTGAACGTGAATCGCAACAAATAGGCATTGAACAAGATAGTCCGGTAAAAGTTAAACGGGAAAAGACAATAATGGTCCTCGAATCTGATGAAATTGACCCATTCAACGACATCACTCTTCTAGATCTTGATAAAACACTGGTGCCAGAGACCGGAACTCCCAAAGATTACAATACTGTCACGCTAATACCAAACTCAAATGAGAAGGAAACACCGTCTTTAACGGAATTTGTGGAGGAAGGAAAATATTTGAGTTATAGTGCGCGGAAAAAGCGGAAACGAAAAATGAAACAAAGTTTTAGTAAACGATTTCAAGAGATAATGAAGGCAAAGGTAGTTGCCGGTAAAGATGATTCAGTTCCCAATCCTCAAAATTCTCCTCTGACCGGTACATCGGACTCATCAGAAGATTTTCTGCAACAGACTGCCGATAAATGTTCAGACAGCACATCTATCAGCCCTTCTTTCTTTCAAGATGCAAACATCCCGCTAGGCGATAGTCACGATTCATCGACAATAAACAGTTCCTTCGGGTTCAAGGTTAAACTAGAAAGTCTTCACACGAACGACGAACACATTGATCTCACAATTCTGTCGATGGAACTTCATGTACAAACAAAAGGCGAATTCAAACCGAATCCTGAGACAGACGATATCTCTGCCATTTTCTACTCAGTCGAAGGATATTATGTCGATGGTATCGCTACATTCTTAAGCGGGATTATAGTCGTAGAATGTGACGACAACTTGGGTTTCTTTAAGGATGATATTGAGGTGATTCGAGTAAAAAATGAAATGGAGTTGCTTGATAATTTTTTTCAAAAAATTCGACTCTTTGATCCAGACATTTTTGCTGGCTACGAAATTGAGTTAAACTCTTGGGGATTCTTAATTGAGCGAGGCTACGTACTAAGCATGAATTTTTGTAACATTCTATCACGAATGCCACTTGAAAAGGAATACAAGCCTAAAAATACAAATCGTGATGACGACCAAGATTTCGACCAAGGAGATTATCAGACCGAACAAAAAATACCTGGAAGAATATTATTGGACGTGTGGCGGCTTATGAGGCACGAAATTGCTCTCACATCATACACCTTTGAAAATATTGCTTATCATATTCTGCACAGAAGATATCCGAAGCACTCGTACAGCTATCTTACTAAAATGTGGCAGGAGCCATTGAGAAGGTGGATTGTCCTTGAATACTACACGATTCGGTCGAAGACACTCTTAGAAATTTTGACACAGCTCGATTTGGTCGGAAGAACATGTGAATTGGCAAAGCTTTTTGGGATTCAGTTCTTTGAAGTTCTTTCACGTGGTTCACAATTTAGAGTCGAAAGCATGATGTTGCGTATAGCTAAAAGAAGAAATTACGTTGCGGTGTCACCGAGTGTTCAGCAGAGAGCTCATCAACGTGCGCCAGAATATGTGCCGCTCATCTTAGAGCCGGAATCTCGCTTCTACACCGATCCTGTCATCGTTCTCGATTTTCAAAGCTTATATCCGAGCATGATTATTGCGTATAATTATTGCTTTTCGACTTGTCTCGGAAGAATTGAACATTTGCTTAAGGGCTCATCGCAGCCATTCGAGTTTGGGGCATACCAGCTAAAAGTACCGCCAGAGAGACTTAAATTCTTCCTTGATAATGATCTGCTAACTGTTTCACCTGCTGGGATAGCGTTTGTTAAATCATCAGTGAGAGAAGGTGTTTTGCCCAGAATGCTCAAAGAAATTCTTGATACCCGGCTCATGGTGAAGCAGTCGATGAAATTATACAAAAATAACACCGCACTCCAACGAATTCTTCACTCAAGACAATTGGGATTAAAACTGATTGCGAATGTTACCTACGGCTACACAGCAGCCAATTTCAGCGGCAGGATGCCGTCCATTGAGGTCGGTGATTCAGTCGTCAGCAAAGGGAGAGAGACACTTGAAAGAGCAATAAATATCGTTGAGCAAAATAAAAACTGGAATTGTAAAGTTTGCTATGGAGACACCGATTCAATGTTTGTATTGGTACCTGGGAGAACACGTGAAGAGGCATTCCGAATAGGCTCTGAAATCGCAGATGTGATCACTAACGACAACCCGTATCCGGTGAAGCTGAAATTGGAAAAGGTGTACCAGCCGTGCATATTGCAAACCAAAAAACGGTATGTAGGCAACATGTATGAGTCAGTCGATCAAAAAGAACCGATATTTGAGGCCAAAGGAATTGAAACAATAAGGAGAGATGGATGTACAGCCTCTTCAAAAATTTTACAGAAATCGCTAAAAATTCTCTTCGAAACATACGATATTAGTCGAGTAAAGGAGTACGTGTGCAGGCAATTTACGAAGATTCTCGAAGGTAGATACAGCATTCAAGATTTAATTATCGCAAAGGAATTTCGAGGAGTTCAAGGCTACAAGGAAAAGGCAGTGGTTCCTGCTCTCACATTAACCAAG
Protein: 1505 (aa)
MENVLAVKIITCDYYNAQPIPGLDVTHSEFREIAVKYVPVLRIFGITKNKQKICCNVHNVMPYLFVPCPESNPEKIDSLMKAMAVEIDQRINVSFGQSNSSQQHVYRISLAKGRFFYGYHHNENMLLRIELYNPNLLKRTAGLLQSGCILGRIFQSYESHIPYAMRFFIDFNLFGMSYLHVPLNKVHARLSTSEGAHKKKSVSHLEVDFNAIHIINRTVLSESEETDKAANIGIEFLWEDERRRRESHFSSQDMPELKPAEIEFRPLCTIASSDTFFLCALRDKLKSLFHDKNNCVEAGPSSDNKINKKTFDLKQFLDSSTYGVEFSQSSSEDFEAINESISIEELEKVLLDSEPMDEDQKEILEIQKLLENESSDSDNDSILAPLSQQQFPDSTLKKVQQAVKADFSKSLLINEESNFEEDDDDDYDAFNMTLADMEAEIFGKGDETIPQLDGAIDEPVPSTSAASILSSHDIQESLQNLASFFTASQDSSRNGSEIEADEEMDVNDENEDEKMKSFYNESQFYDFEDLEISSPEESVKIKEEITEDDLSNFKKCVITPKSEPPHPSSVVENLKGYNIPETVNMSAFYSNPADLTEKKEIGNTILDVRSDRLNDFDDFKSVLFDKNQFHLQQNQKISYNLGFAQKNNKVIYDIIDDRRDVVIHPINVPPTNKEVKLWIKSTAAPERESQQIGIEQDSPVKVKREKTIMVLESDEIDPFNDITLLDLDKTLVPETGTPKDYNTVTLIPNSNEKETPSLTEFVEEGKYLSYSARKKRKRKMKQSFSKRFQEIMKAKVVAGKDDSVPNPQNSPLTGTSDSSEDFLQQTADKCSDSTSISPSFFQDANIPLGDSHDSSTINSSFGFKVKLESLHTNDEHIDLTILSMELHVQTKGEFKPNPETDDISAIFYSVEGYYVDGIATFLSGIIVVECDDNLGFFKDDIEVIRVKNEMELLDNFFQKIRLFDPDIFAGYEIELNSWGFLIERGYVLSMNFCNILSRMPLEKEYKPKNTNRDDDQDFDQGDYQTEQKIPGRILLDVWRLMRHEIALTSYTFENIAYHILHRRYPKHSYSYLTKMWQEPLRRWIVLEYYTIRSKTLLEILTQLDLVGRTCELAKLFGIQFFEVLSRGSQFRVESMMLRIAKRRNYVAVSPSVQQRAHQRAPEYVPLILEPESRFYTDPVIVLDFQSLYPSMIIAYNYCFSTCLGRIEHLLKGSSQPFEFGAYQLKVPPERLKFFLDNDLLTVSPAGIAFVKSSVREGVLPRMLKEILDTRLMVKQSMKLYKNNTALQRILHSRQLGLKLIANVTYGYTAANFSGRMPSIEVGDSVVSKGRETLERAINIVEQNKNWNCKVCYGDTDSMFVLVPGRTREEAFRIGSEIADVITNDNPYPVKLKLEKVYQPCILQTKKRYVGNMYESVDQKEPIFEAKGIETIRRDGCTASSKILQKSLKILFETYDISRVKEYVCRQFTKILEGRYSIQDLIIAKEFRGVQGYKEKAVVPALTLTK
Type | Start | End | Length |
CDS |
74358 |
74534 |
177 |
CDS |
74598 |
74675 |
78 |
CDS |
74734 |
78993 |
4260 |
intron |
74535 |
74597 |
63 |
intron |
74676 |
74733 |
58 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001660448 |
DNA polymerase zeta catalytic subunit [Aedes aegypti] gb|EAT38236.1| DNA polymerase zeta catalytic subunit [Aedes aegypti] |
0.0 |
InterPro |
IPR023211 |
DNA polymerase, palm domain |
|
InterPro |
IPR006133 |
DNA-directed DNA polymerase, family B, exonuclease domain |
|
InterPro |
IPR012337 |
Ribonuclease H-like domain |
|
InterPro |
IPR017964 |
DNA-directed DNA polymerase, family B, conserved site |
|
InterPro |
IPR006172 |
DNA-directed DNA polymerase, family B |
|
InterPro |
IPR006134 |
DNA-directed DNA polymerase, family B, multifunctional domain |
|
Gene Ontology(BP) |
GO:0006260 |
DNA replication |
|
Gene Ontology(BP) |
GO:0006139 |
nucleobase-containing compound metabolic process |
|
Gene Ontology(MF) |
GO:0003887 |
DNA-directed DNA polymerase activity |
|
Gene Ontology(MF) |
GO:0003677 |
DNA binding |
|
Gene Ontology(MF) |
GO:0000166 |
nucleotide binding |
|
Gene Ontology(MF) |
GO:0003676 |
nucleic acid binding |
|
Pfam |
PF03104.14 |
DNA polymerase family B, exonuclease domain |
1.5e-30 |
Pfam |
PF00136.16 |
DNA polymerase family B |
4.6e-76 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
D. plexippus |
DPOGS208944PA |
H. melpomene |
HMEL016066-PA |
A. aegypti |
AAEL009851 |
T. castaneum |
TC004938 |
H. sapiens |
ENSP00000351697 |
P. humanus |
PHUM106820-PA |
A. mellifera |
GB18804-PA |
A. gambiae |
AGAP013386 |
N. vitripennis |
NV10399-PA |
H. sapiens |
ENSP00000402003 |
H. sapiens |
ENSP00000357792 |
C. quinquefasciatus |
CPIJ013042 |
B. mori |
BGIBMGA002415-TA |
S. invicta |
SI2.2.0_12915 |
M. musculus |
ENSMUSG00000019841 |
D. melanogaster |
FBgn0002891 |
P. vanderplanki |
Pv.08344 |
H. sapiens |
ENSP00000357795 |