MidgeBase gene description page [Pn.03361]

Outline

Gene ID: Pn.03361
Type: Protein coding gene
Scaffold: PnScaf2789
Start: 2707
End: 10438
Direction: - (minus strand)

Sequence

Transcript: 4359 (bp)

 ATGCTCTTAAACACGCATCTCATTAGAAAAGCACCGTCGGCGTACTCGGATGGCGTGTACGGCATGGCCGGCCCCAATCGTCCTTCGCCGCGGAAGCTCAGCCGACTTTTCATGCGCGGCGAGGATGGCCTGAGCTCGATGGAGAACCGGACAGCTCTGCTGGCCTTCTTCGGACAGGTGGTGACCAACGAAATCGTGATGGCTTCCGAGTCCGGATGTCCCATCGAGATGCACCGCATCGAAGTGGAGAAGTGCGACGAGATGTACGACAAAGAGTGCCGCGGCGACCGCTACATTCCCTTTCATCGCGCCATGTACGATCGGAAGACTGGTCAGAGCCCGAATGCGCCGCGCGAACAAATAAATCAAATGACGGCGTGGATCGACGGGAGCTTCATTTACAGCACATCCGAAGCTTGGCTCAACGCAATGCGATCGTTTCAAAATGGATCGCTTCTGACTGATAAGAGCGGTAGCATGCCCGTCAAAAACACGATGCGGGTGCCGCTTTTTAACAATCCCGTGCCGCACGTCATGCGCGCTCTCAGCCCCGAGCGCCTTTATCTTCTGGGCGATCCGCGAACGAACCAAAACCCGGCCGTGCTGACGTTCGCCATCCTCCTCATGCGTTGGCATAACGTGCAGGCACAAGAAGTCAAACGGCTCCATCCGGACTGGAGCGACGAGGAGATTTTCCAGCGCGCGCGCCGACAGGTCATTGCCAGCTTGCAGAGCATTTTCGTGTACGAGTACCTGCCAGCGTTTCTGGGCGGAGTGACGCTCGAGCCTTACACGGGCTACAAGCAAGACATTCATCCCGGCATCTCGCACATGTTTCAAGCCGCCGCGTTTCGCTTTGGACACTCACTCATTCCGCCCGGCATCATGAAGAGAGACGGCAAATGTAATTACAAGTACACGCGAATGGGATCGCCTGCCATTCGACTCTGCTCCACCTGGTGGGACTCGAGCGACATATTTGAAGAGTCGAGCATCGAGGAGTTTCTCATGGGCATGAGCTCGCAGCTGGCCGAGAGAGAGGACCCGTTTCTGTGCTCCGACGTGCGTGACAAGCTGTTTGGACCGATGGAGTTTACGCGCAGAGACCTCGGCGCGCTCAACATCATGCGCGGCCGCGACAACGGACTCGCTGACTTCAACACAGCACGCGCCGCTTACAAGCTCCCACGCTACGAAAACTGGAAAGACATCAACCCGGAGCTGTTCGAGAAGCGTCCCGAGCTCCTGCAGATCCTCATCTCGGCCTACAAGGGCCGCCTCGACAACGTGGACGCCTACATTGGTGGCATGCTCGAGTCGGACGGCAAGCCGGGCGAGCTGTTTACAAACGTGATTCTCGACCAATTCACGCGCATCCGCGACGCCGACCGCTACTGGTTCGAGAACGAAGACAACGGCATCTTCACGCGCGAGGAGATCGAGAAGTTCCGCAGCATTCGCCTCTACGACATCATCGTCAACAGCACAAGTATCGAGCCGGGCCAGATCCAGCGCAATGTTTTCCAGTTCGTCGACGGCGACCCGTGCCCGCAGCCCGAGCAGCTGAACGCGTCGCTGCTGGAGCCGTGCAGCCACATCGAGGGCTACGACTACTTCTCGGGCTCGGAGCTCGCCTACATCTACGCGTGCGTCTTCCTCGGCTTCGTGCCCATCCTGTGCGCCACCGCCGGCTACTGCGTGGTGAAGCTCCAGAACAGCCGTCGACGTCGGCTGAAGGTGAAGCAGGAGACGATGCGCTCGAAGAGCGGCAAGCAGACGGTCGACAAGATGATTGCGCGCGAGTGGCTGCACGCGAACCACAAGCGCCTCGTGACGGTGAAGTTCGGCCCCGAGTCGGCGATCTACACGGTGGACAGGAAGGGCGAGAAGCTGAGGACGTTCAACCTGAAGAACACGGACACCATTCAGGTGGAGTTGAGCGTCGAGAGCTACACGAACAAGCGGCCGCTCGCGCTCCTCCGCATCCACAACGACCACG
ACCTCGTGCTGGAGCTGGAGACGATCGGAGCGCGCCGGAAGTTCGTCAAGAAGCTCGAGGACTTCCTCATCCTGCACAAGAAGGAGATGTCGATCAACGAGCAGAACCGCGAGCTCATGCTGGCCAAGGCGGAGACGCGCGAGCGCCGCCAGAAGAAACTCGAGCACTTCTTCCGCGAGGCGTATGCGCTGACCTTCGGTCTGCGACCGGGCGAGAGACGCCGTCGCTCGGACGCCTCCAGCGACGGCGAAGTGATGACCGTCATGCGAACCAGCCTCTCGAAGTCGGAGTTCGCGGCCGCTCTCGGCATGAAGCCCGACGCCATGTTCGTGCGGAAGATGTTCAACATCGTCGACAAGGACCAGGACGGCCGCATCTCCTTCCAGGAGTTCCTCGAGACGGTGGTGCTGTTCTCACGCGGCAAGACCGAGGACAAGCTGCGCATCATCTTCGACATGTGCGACAACGACCGCAACGGCGTCATCGACAAGTCCGAGTTCGGCGAGATGATGCGGTCGCTGGTGGAGATTGCGCGAACGACCAGCCTCACCGACGACCAGGTCACGGAGCTGATTGACGGCATGTTCTCGGACTGCGGGCTCGAGCACAAAAATCACCTCACCTACAAGGACTTCAAGGAGATGATGAAGGAGTACAAGGGCGAGTTTGTCGCCATCGGACTCGACTGCAAGGGCGCCAAGCAGAACTTCCTCGACACGTCCACCAACATCGCGCGCATGACGTCGTTCGCGATCGAGCCGACCGAGGGCGAGAAGCACTGGATTCTCGAGAAGTGGGACACCTACACGACCTTCTTGGAGGAGAACCGGCAGAACATCTTCTATCTCTTTCTCTTCTATGTCGTCACCATTGCGCTCTTCGTCGAGCGCTTCATTCACTACTCATTTATGGCTGAACACACTGACTTGCGACACATAATGGGCGTAGGTATTGCGATTACGCGAGGTTCTGCCGCTGCTCTCTCCTTCTGCTACTCCCTCCTCCTGCTGACAATGTCAAGGAACTTCCTCACGAAGCTCAAGGAGTTTCCCATCCAGCAGTATATTCCGCTCGACTCGCACATTCAGTTCCACAAAATCGCCGCTTGCACTGCTCTCTTCTTCTCGCTCCTCCACACCGTCGGCCACATCGTCAACTTCTACCACGTCTCGACGCAGTCGCATGAAAATCTGAGATGCCTCACGCGCGAAGTGCACTTTGCGTCGGACTACAAGCCCGACATCACGTTCTGGCTGTTCAAGACGCTGACCGGTACGACTGGCGTGGCGCTCTTTGTCATCATGTGCATCATCTTCGCCTTCGCCCATCCGACCATCAGAAAGCGCGCCTACAAGTTCTTCTGGCACACGCACTCGCTCTACATCCTGCTCTACATTCTCTCGCTGCTGCACGGTCTCGCAAGAATCTCCGGCGCGCCTCGCTTCTGGATTTTCTTCATCGGTCCCGCGATCATCTTCGTCATCGACAAGGTGGTTTCATTGCGCACGAAATACATGGCGCTCGATGTTATTGACGTCGAGTTGCTGCCGTCTGACGTCATAAAAATCAAGTTTTATCGACCGCCGAATTTGAAGTATTTGTCGGGACAATGGGTGCGGTTTTCGTGCACTGCCATCAAACCCGAGGAGATGCATAGCTTTACGCTAACGTCTGCACCGCACGAGAACTTTTTGAGCTGCCACATTAAGGCGCAGGGACCGTGGACGTGGAAGTTGCGCAACTACTTCGATCCGTGCAACTACAATCCCGACGATCAGCCGAGAATCCGAATCGAGGGACCTTACGGCGGCGGAAATCAAGATTGGTACAAGTTCGAGGTCGCTGTCATGGTTGGCGGTGGCATTGGCGTAACGCCGTATGCGAGCATCTTGAATGACTTGGTCTTCGGCACGAGCACAAACAGATACTCGGGAGTTGCCTGCAAGAAGGTCTACTTCCTCTGGATTTGTCCATCACACAAGCACTTCGAGTGGTTCATC
GACGTGCTGCGCGATGTCGAGAAGAAGGACGTGACGAATGTCTTGGAAATACACATTTTCATCACGCAATTCTTCCACAAGTTCGATTTGAGAACGACCATGTTGTACATTTGCGAGAATCACTTCCAGCGGCTCGCAAAGACGTCAATGTTCACGGGACTGAAGGCCGTAAATCACTTCGGTCGTCCCGATATGTCTAGCTTCTTAAAGTTCGTTCAGAAGAAGCACTCGTATGTGTCAAAGATTGGCGTATTCTCATGCGGTCCTAGGCCGCTCACAAAGAGCGTCATGTCGGCATGCGATGAGGTCAACAAGGGACGCAAACTTCCATACTTTATTCACCACTTTGAGAACTTTGGT 

Protein: 1453 (aa)

 MLLNTHLIRKAPSAYSDGVYGMAGPNRPSPRKLSRLFMRGEDGLSSMENRTALLAFFGQVVTNEIVMASESGCPIEMHRIEVEKCDEMYDKECRGDRYIPFHRAMYDRKTGQSPNAPREQINQMTAWIDGSFIYSTSEAWLNAMRSFQNGSLLTDKSGSMPVKNTMRVPLFNNPVPHVMRALSPERLYLLGDPRTNQNPAVLTFAILLMRWHNVQAQEVKRLHPDWSDEEIFQRARRQVIASLQSIFVYEYLPAFLGGVTLEPYTGYKQDIHPGISHMFQAAAFRFGHSLIPPGIMKRDGKCNYKYTRMGSPAIRLCSTWWDSSDIFEESSIEEFLMGMSSQLAEREDPFLCSDVRDKLFGPMEFTRRDLGALNIMRGRDNGLADFNTARAAYKLPRYENWKDINPELFEKRPELLQILISAYKGRLDNVDAYIGGMLESDGKPGELFTNVILDQFTRIRDADRYWFENEDNGIFTREEIEKFRSIRLYDIIVNSTSIEPGQIQRNVFQFVDGDPCPQPEQLNASLLEPCSHIEGYDYFSGSELAYIYACVFLGFVPILCATAGYCVVKLQNSRRRRLKVKQETMRSKSGKQTVDKMIAREWLHANHKRLVTVKFGPESAIYTVDRKGEKLRTFNLKNTDTIQVELSVESYTNKRPLALLRIHNDHDLVLELETIGARRKFVKKLEDFLILHKKEMSINEQNRELMLAKAETRERRQKKLEHFFREAYALTFGLRPGERRRRSDASSDGEVMTVMRTSLSKSEFAAALGMKPDAMFVRKMFNIVDKDQDGRISFQEFLETVVLFSRGKTEDKLRIIFDMCDNDRNGVIDKSEFGEMMRSLVEIARTTSLTDDQVTELIDGMFSDCGLEHKNHLTYKDFKEMMKEYKGEFVAIGLDCKGAKQNFLDTSTNIARMTSFAIEPTEGEKHWILEKWDTYTTFLEENRQNIFYLFLFYVVTIALFVERFIHYSFMAEHTDLRHIMGVGIAITRGSAAALSFCYSLLLLTMSRNFLTKLKEFPIQQYIPLDSHIQFHKIAACTALFFSLLHTVGHIVNFYHVSTQSHENLRCLTREVHFASDYKPDITFWLFKTLTGTTGVALFVIMCIIFAFAHPTIRKRAYKFFWHTHSLYILLYILSLLHGLARISGAPRFWIFFIGPAIIFVIDKVVSLRTKYMALDVIDVELLPSDVIKIKFYRPPNLKYLSGQWVRFSCTAIKPEEMHSFTLTSAPHENFLSCHIKAQGPWTWKLRNYFDPCNYNPDDQPRIRIEGPYGGGNQDWYKFEVAVMVGGGIGVTPYASILNDLVFGTSTNRYSGVACKKVYFLWICPSHKHFEWFIDVLRDVEKKDVTNVLEIHIFITQFFHKFDLRTTMLYICENHFQRLAKTSMFTGLKAVNHFGRPDMSSFLKFVQKKHSYVSKIGVFSCGPRPLTKSVMSACDEVNKGRKLPYFIHHFENFG 
Type Start End Length
CDS 2710 2835 126
CDS 2935 3063 129
CDS 3128 3363 236
CDS 3739 4710 972
CDS 5056 6979 1924
CDS 7502 7908 407
CDS 8004 8208 205
CDS 8590 8939 350
CDS 10429 10438 10
intron 2836 2934 99
intron 3064 3127 64
intron 3364 3738 375
intron 4711 5055 345
intron 6980 7501 522
intron 7909 8003 95
intron 8209 8589 381
intron 8940 10428 1489
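
The Length column above appears to follow the usual inclusive-coordinate convention (End - Start + 1), and the CDS segments should add up to the stated transcript length. The short Python sketch below checks this. The coordinates are copied from the table; the names `CDS`, `INTRONS`, `span_length`, `total_length`, and `interleaves` are illustrative and not part of MidgeBase, and the 1-based inclusive convention is an assumption inferred from the listed lengths.

```python
# Feature coordinates copied from the table above.
# Assumption: coordinates are 1-based and inclusive on scaffold PnScaf2789.
CDS = [
    (2710, 2835), (2935, 3063), (3128, 3363), (3739, 4710), (5056, 6979),
    (7502, 7908), (8004, 8208), (8590, 8939), (10429, 10438),
]
INTRONS = [
    (2836, 2934), (3064, 3127), (3364, 3738), (4711, 5055),
    (6980, 7501), (7909, 8003), (8209, 8589), (8940, 10428),
]

TRANSCRIPT_BP = 4359  # "Transcript" length stated on the page
PROTEIN_AA = 1453     # "Protein" length stated on the page


def span_length(start, end):
    """Length of an inclusive interval."""
    return end - start + 1


def total_length(features):
    """Summed length of a list of (start, end) intervals."""
    return sum(span_length(start, end) for start, end in features)


def interleaves(cds, introns):
    """True if each intron exactly fills the gap between consecutive CDS segments."""
    gaps = [(cds[i][1] + 1, cds[i + 1][0] - 1) for i in range(len(cds) - 1)]
    return gaps == list(introns)


cds_total = total_length(CDS)
print(cds_total == TRANSCRIPT_BP)    # CDS segments sum to the transcript length
print(cds_total == 3 * PROTEIN_AA)   # three bases per amino acid
print(interleaves(CDS, INTRONS))     # introns sit exactly between CDS segments
```

With the values listed on this page, all three checks print `True`.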

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001658452 dual oxidase 1 [Aedes aegypti] (gb|EAT40728.1) 0.0
InterPro IPR017927 Ferredoxin reductase-type FAD-binding domain
InterPro IPR002007 Haem peroxidase, animal
InterPro IPR018247 EF-Hand 1, calcium-binding site
InterPro IPR011992 EF-hand-like domain
InterPro IPR002048 Calcium-binding EF-hand
InterPro IPR018248 EF-hand
InterPro IPR013130 Ferric reductase transmembrane component-like domain
InterPro IPR010255 Haem peroxidase
InterPro IPR013121 Ferric reductase, NAD binding
InterPro IPR019791 Haem peroxidase, animal, subgroup
InterPro IPR018249 EF-HAND 2
InterPro IPR013112 FAD-binding 8
InterPro IPR017938 Riboflavin synthase-like beta-barrel
Gene Ontology(BP) GO:0006979 response to oxidative stress
Gene Ontology(BP) GO:0055114 oxidation-reduction process
Gene Ontology(MF) GO:0020037 heme binding
Gene Ontology(MF) GO:0004601 peroxidase activity
Gene Ontology(MF) GO:0005509 calcium ion binding
Gene Ontology(MF) GO:0016491 oxidoreductase activity
Pfam PF08022.7 FAD-binding domain 5.2e-19
Pfam PF01794.14 Ferric reductase like transmembrane component 3.5e-16
Pfam PF12763.2 Cytoskeletal-regulatory complex EF hand 0.0059
Pfam PF10591.4 Secreted protein acidic and rich in cysteine Ca binding region 0.0097
Pfam PF13499.1 EF-hand domain pair 8.7e-10
Pfam PF03098.10 Animal haem peroxidase 5e-132
Pfam PF08030.7 Ferric reductase NAD binding domain 5.5e-25
Pfam PF00036.27 EF hand 7.4e-14
Pfam PF00175.16 Oxidoreductase NAD-binding domain 0.03
Pfam PF09069.6 EF-hand 0.049
Pfam PF00970.19 Oxidoreductase FAD-binding domain 0.046
Pfam PF13833.1 EF-hand domain pair 9.8e-10
Pfam PF13405.1 EF-hand domain 3.7e-10
Pfam PF13202.1 EF hand 2.2e-09

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
M. musculus ENSMUSG00000068452
H. sapiens ENSP00000400283
B. mori BGIBMGA005478-TA
S. invicta SI2.2.0_02883
H. sapiens ENSP00000317997
M. musculus ENSMUSG00000033268
H. melpomene HMEL017581-PA
A. gambiae AGAP009978
H. sapiens ENSP00000454065
N. vitripennis NV11842-PA
T. castaneum TC002498
S. invicta SI2.2.0_15117
P. humanus PHUM454140-PA
T. castaneum TC004592
A. mellifera GB13459-PA
D. melanogaster FBgn0031464
D. plexippus DPOGS207783PA
H. sapiens ENSP00000373691
P. vanderplanki Pv.08799
A. aegypti AAEL007563
H. sapiens ENSP00000373689
C. quinquefasciatus CPIJ003117
H. sapiens ENSP00000452623