MidgeBase gene description page [Pn.04943]
Outline
Gene ID | Pn.04943 |
Type | Protein coding gene |
Scaffold | PnScaf4249 |
Start | 20599 |
End | 24042 |
Direction | + |
Sequence
Transcript: 3270 (bp)
ATGAAGAATAAGTTTAATGGTGGGAGGGCTGGGCCGAGTAAGAAATTCAAAGCTGAGGATGAAGACGAGGAATATCAGAGTGCTTTTGCTTCGGATCTGGCTATGATGGACACTGAAGACATTGGGGATTATGAGGTTGGCGACGGGCCAGAGAATCAAATGCAAAATCAGAAATGGTCAAGGCCAGATTTGCCTGAAATTAATCCCAACAAAGATCCCGTCGTTTTTCAACAAATTGACATCGATCACTACACAGGAAAACCTATGGCTGGAATGCCTGGATCACAGATCGAGCCCGTGCCAATTTTCCGAATGTATGGCGTAACGATGAACGGCAACTCGGTGTGTGCGCACATCCATGGCTTTTCACCCTTCCTCTACGTACAAGCACCAGAAGGCTTCGAGAAATCTCATTTGCCCGATTTCAAATCGAGACTCGACGCAATCGTGCTTAAAGATATGCGATCCAACAAGGAAAACGTGCAGGAAGCTATTCTGCGTGTGGAGCTCATCTACAAGCAATCACTACAGTTTTACGTCGGCGACGATAAAGTCAAGTTCATCAAGATTACGGTCGCTCTGCCGAAACTGATTGCGGCCGTCAAGCGATTGATGGATCGCGAAATCATCATGCCCGAGATGAATTTTCAAGATTGCCGCTGCTTCGAGAGCAACATCGATTTCGATATTCGTTTTATGGTGGAGACAAAAGTGGTCGGCTGCAGTTGGATCGAAATTCCACCAAAGGCTTGGCGAAAGCGAGTGAAAGGATCGCATCCGGAGCCCGAGAGCCGATGCCAGCTGGAAGTTGATGTCGCTTACGACAAATTCATTGCTCACGAGCCGGAGGGAGAGTGGTCAAAGGTGGCTCCGTTTCGAATTCTGAGCTTTGACATTGAGTGCGCGGGAAGAAAGGGAATTTTTCCCGAACCGAAGCACGATCCAGTCATTCAAATCGCAAATATGGTGATGAGGCAAGGCGAGAAGGAACCATTTTTGAGGAACGTCTTTACGCTCAACACTTGTGCGCCGATCGTGGGCTCGCAAGTTCTGAGCTACGCAAGAGAAACAGAACTCCTCGATGCCTGGAGCGCGTTCGTCCGGGAACTCGATCCCGACATCATCACCGGCTACAACATTAACAACTTCGACGTGCCCTACTTGATTGAGCGAGCGAAACACCTGAAGGTCAATAATTTCGTGTATCTCGGACGAATCAAGAACGTCAAATCTGTCATCAAGGAGTCGGTCATTCAGTCGAAACAGATGGGACGACGTGAGAACAAGCAAGTGAACTTCGAGGGACGTGTGCCATTCGACCTGCTTTTCGTTCTGCTTCGCGATTACAAGCTGAGATCCTACACGCTGAACGCCGTCAGTTACCATTTTCTGCAAGAGCAAAAGGAGGACGTTCATCACAGCATCATCACGGACCTTCAGAACGGAACCGACCAGACACGCCGTAGATTGGCAATGTACTGCCTGAAAGACGCTTACCTCCCGCTTCGATTGCTCAACAAACTCATGTGTATCGTCAACTACATGGAAATGGCTCGTGTAACTGGCGTTTCGCTAGCCAGTCTCCTCACACGTGGTCAGCAGATAAAAGTCGTCAGCCAACTCTTGAGGAAGGCTCGTGAAGCGGGCTACTTGATGCCGACCTACACGAGTCAGGGAGGAGACAACGAGCAGTTTGAGGGAGCGACTGTCATCGAGCCTGCGCGCGGCTACTATCCAGAACCGATTGCGACACTCGATTTTGCGTCCCTGTACCCGTCAATCATGATGGCCCACAATTTGTGCTACACAACGCTGATCAAGCCGTCAGACAGAAAGAAACTCGACTTGAAGGACGAGGACGTGACTCAAACGCCGGCTGGAAATTGCTTCGTGAAACCAACTGTCAGAAAGGGTCTGCTGCCTGAAATTCTCGAGTCTCTGCTTTCTGCCCGTAAAAAGGCAAAGGCAGATTTGAAATCCGAGACTGATCCGTTCAAGAGAAGCGTGCTCGACGGTCGTCAACTCGCCCTGAAAATTTCCGCAAATTCAGTATACGGTTTTACGGGCGCACAGGTCGGAAAGCTGCCGTGTTTGGAGATCTCTGGAAGTGTCACTGCCTACGGTCGAACGATGATTGAAATGACCAAGAACGAGGTGGAGAAGCGCTACACGATCGACAACGGATACAAGGCGAATGCATATGTGATCTACGGAGACACCGATTCAGTGATGGTGAACTTTAAAGCGCCAACAGTTCCCGAGGCCATGGAGCTGGGCAGAGAGGCGGCTGAGTTTGTTAGCGCCAAGTTCATCAAACCCATTAAGCTCGAGTTCGAGAAGGTCTATTTCCCATACTTGCTCATCAACAAGAAGCGCTATGCGGGTCTCTATTGGACAAAGACAGAAACATATGACAAAATGGACTGCAAGGGAATCGAAACTGTTCGTCGCGACAACTCGCCGCTCGTTGCCAATCTCATGAACACTTGCTTGCAAAAACTGCTAATCGAACGCAATCCGCAAGAGGCTGTCGAGTATGTGAAATCTGTGATTTCCGATCTGCTCTGCAATCGCATCGACATCTCGCAGTTGGTCATCACGAAGGAACTTACGAAGCACGACTACGCAGCGAAACAGGCGCACGTTGAGTTGGCTGCGAAAATGAAGAAGCGCGATCCTGGAAATGCTCCAAAATTGGGCGACCGTGTGCCGTACGTCATAACTGCCGCCGCCAAAAGCACACCGGCCTATCAGAAGGCCGAAGATCCCGTTTATGTGCTGGAGAACAACATACCGATAGACTTCCAGTACTATCTGGAGAATCAATTGTCCAAGCCGCTCCTGAGAATTTTCGAGCCGATTTTGGGAGACAAGGCAGAGTCGATTTTACTCAAAGGCGATCACACTCGAACGCGCCTCGGGGGAACGTCGAAAGTTAGTGCCTTGGCTGCTTTCGTTCAGCGAAAAGAAACGTGCATCGGATGCAAGTCAGTACTGCCAGCTGATCGTGCAAAGAAAGCACTGTGCCAATTTTGCGAACAAAAGTCTGACGAAATTTATCAGGCAGAAATCACTCAGCAGTGTCAGTTAGAGGAGAGATTCTCGCGACTTTGGTCCGAGTGTCAGCGATGTCAGGGAGCGAGAAATGAGGAAGTTTTATGCACGAGCCGGGATTGCCCGATTTTCTACATGCGAACCAAAGTCAAGATGGACCTCGACACGCAAAACAAACGACTCATGCGGTTCGGCCTCTCGGAAATCAACAACTGG
Protein: 1090 (aa)
MKNKFNGGRAGPSKKFKAEDEDEEYQSAFASDLAMMDTEDIGDYEVGDGPENQMQNQKWSRPDLPEINPNKDPVVFQQIDIDHYTGKPMAGMPGSQIEPVPIFRMYGVTMNGNSVCAHIHGFSPFLYVQAPEGFEKSHLPDFKSRLDAIVLKDMRSNKENVQEAILRVELIYKQSLQFYVGDDKVKFIKITVALPKLIAAVKRLMDREIIMPEMNFQDCRCFESNIDFDIRFMVETKVVGCSWIEIPPKAWRKRVKGSHPEPESRCQLEVDVAYDKFIAHEPEGEWSKVAPFRILSFDIECAGRKGIFPEPKHDPVIQIANMVMRQGEKEPFLRNVFTLNTCAPIVGSQVLSYARETELLDAWSAFVRELDPDIITGYNINNFDVPYLIERAKHLKVNNFVYLGRIKNVKSVIKESVIQSKQMGRRENKQVNFEGRVPFDLLFVLLRDYKLRSYTLNAVSYHFLQEQKEDVHHSIITDLQNGTDQTRRRLAMYCLKDAYLPLRLLNKLMCIVNYMEMARVTGVSLASLLTRGQQIKVVSQLLRKAREAGYLMPTYTSQGGDNEQFEGATVIEPARGYYPEPIATLDFASLYPSIMMAHNLCYTTLIKPSDRKKLDLKDEDVTQTPAGNCFVKPTVRKGLLPEILESLLSARKKAKADLKSETDPFKRSVLDGRQLALKISANSVYGFTGAQVGKLPCLEISGSVTAYGRTMIEMTKNEVEKRYTIDNGYKANAYVIYGDTDSVMVNFKAPTVPEAMELGREAAEFVSAKFIKPIKLEFEKVYFPYLLINKKRYAGLYWTKTETYDKMDCKGIETVRRDNSPLVANLMNTCLQKLLIERNPQEAVEYVKSVISDLLCNRIDISQLVITKELTKHDYAAKQAHVELAAKMKKRDPGNAPKLGDRVPYVITAAAKSTPAYQKAEDPVYVLENNIPIDFQYYLENQLSKPLLRIFEPILGDKAESILLKGDHTRTRLGGTSKVSALAAFVQRKETCIGCKSVLPADRAKKALCQFCEQKSDEIYQAEITQQCQLEERFSRLWSECQRCQGARNEEVLCTSRDCPIFYMRTKVKMDLDTQNKRLMRFGLSEINNW
Type | Start | End | Length |
CDS |
20599 |
20648 |
50 |
CDS |
20711 |
20948 |
238 |
CDS |
21003 |
22910 |
1908 |
CDS |
22966 |
24039 |
1074 |
intron |
20649 |
20710 |
62 |
intron |
20949 |
21002 |
54 |
intron |
22911 |
22965 |
55 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
EFR23299 |
hypothetical protein AND_13138 [Anopheles darlingi] |
0.0 |
InterPro |
IPR006133 |
DNA-directed DNA polymerase, family B, exonuclease domain |
|
InterPro |
IPR025687 |
C4-type zinc-finger of DNA polymerase delta |
|
InterPro |
IPR006134 |
DNA-directed DNA polymerase, family B, multifunctional domain |
|
InterPro |
IPR023211 |
DNA polymerase, palm domain |
|
InterPro |
IPR012337 |
Ribonuclease H-like domain |
|
InterPro |
IPR017964 |
DNA-directed DNA polymerase, family B, conserved site |
|
InterPro |
IPR004578 |
DNA-directed DNA polymerase, family B, pol2 |
|
InterPro |
IPR006172 |
DNA-directed DNA polymerase, family B |
|
Gene Ontology(BP) |
GO:0006260 |
DNA replication |
|
Gene Ontology(BP) |
GO:0006139 |
nucleobase-containing compound metabolic process |
|
Gene Ontology(MF) |
GO:0003887 |
DNA-directed DNA polymerase activity |
|
Gene Ontology(MF) |
GO:0003677 |
DNA binding |
|
Gene Ontology(MF) |
GO:0000166 |
nucleotide binding |
|
Gene Ontology(MF) |
GO:0003676 |
nucleic acid binding |
|
Pfam |
PF03104.14 |
DNA polymerase family B, exonuclease domain |
6.7e-78 |
Pfam |
PF00136.16 |
DNA polymerase family B |
2.6e-146 |
Pfam |
PF03175.8 |
DNA polymerase type B, organellar and viral |
0.0076 |
Pfam |
PF14260.1 |
C4-type zinc-finger of DNA polymerase delta |
6.7e-19 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
C. quinquefasciatus |
CPIJ018287 |
H. sapiens |
ENSP00000406046 |
P. vanderplanki |
Pv.09048 |
N. vitripennis |
NV15693-PA |
T. castaneum |
TC004992 |
D. melanogaster |
FBgn0263600 |
A. aegypti |
AAEL014178 |
D. plexippus |
DPOGS213019PA |
B. mori |
BGIBMGA006939-TA |
H. melpomene |
HMEL012296-PA |
N. vitripennis |
NV17782-PA |
S. invicta |
SI2.2.0_03834 |
P. vanderplanki |
Pv.06545 |
A. mellifera |
GB17691-PA |
P. humanus |
PHUM453000-PA |
M. musculus |
ENSMUSG00000038644 |