MidgeBase gene description page [Pn.01396]

Outline

Link to gbrowse

Gene ID Pn.01396
Type Protein coding gene
Scaffold PnScaf1346
Start 383
End 5025
Direction +

Sequence

Transcript: 3504 (bp)

 ATGAGCGGCAGGCCGCAGGACGTGAGCGCGTGCATGAGCGACGTGATGCAGGTGCCAAAGGCGGTCATCGACCACGTGATCTTCTGCCATCAGGAGGACTCGCACTGGCCACTCGACGAGGCCCAGAAGGTGAAGAAGAAGTTCGACGAGATTTTCGACACGAAGCCGCACGACGAGGCGCTCGAGAAGCTGCTCAAGATGCGGCGCGAGTTCGAGCAGCGGCTGTGCGAGGCGAAGCCGGAGACGGAAAAAAATTTGATCCACCGAACGCAAGCGCGAGGCATGCGGGAGAAGCTCGAGAGAGACGGCGCACGGTGCGAGGAAATTGCGACGCAGTGCGCCGAGTTGGACGAAAAGCTGGCAGTTTTGGGCGAAAAACGAGCGGAGTTGCGCGAAATTGAGCGAAAATTCGCGGAGTGGCGCGCGCAGAAGCTGAAATGTGAGGAGAAAGTCGCGAGCTTTGGGCGCGAGTGCAAGCGACTGTCGCTCAACGTCCTCCGAAGCAGCCGACAAGAGCTCGAGAGCGAGTTGAAGGCCATCGGGGAGCATGAGGCGGCGACGAGGAGCAAAAACGTCGATTTGAGCGCAAAAAACGAAGCTCTGAGGGCGGAACAGGCGCAGCGCGAAAAAGACCTGGTGGAGCTGAGGAGGGAGCAGGTCGAGGTCTTCTGCAGTCAGCGGCAGATGCAGGCGAAACTGTCGGAGAGATTCGCGAGGGCAAAAGAGCTTTGCGGCCAGCTGGGCATTGACTTCTGTGGCAAACGGCAGGCCACGCATGCTGTTCCGGACGAGATCGTGAGCGAAATTTCCGCGCAGGTCGCCTCCAGGGAGGCCCAGCTGAAGGCTGACTCCCTCTCGTCCGACTCGGCATCGCAGGAGCTGCTGAGCGGCCTTCGCGCGCGTCGAGTCGAGCTCGAAACGCTGATCGCGTCGAGTCGCAGTGAGCTGAAAAGCCGGAGCGAGCAGCTCGAGAAGTCGCGCGACGAGCTCGGAAGGTCGGAAAAGGACTTGTCGACCTCGCGCGAGATCGTCGCGGCTTTCGGGAAAATCGAGGCGGATTTGAGGCGCCGCTGCGAGGACAACCGGTTGGGCAGGCTGGAGGAGAGGAAGAGTCGGCTGAAGGCCGACATTGGCGAGCTGGAGGCGCGGCAGGTCGCGCTCTTCGGCGACCTCAAAGCCCTGCACGTAGTGGCTTCCTTGAAGATCGAGCTGGACGTCAAGGAGGCGGATTTGGAGCAGAAAAAGCGCGAGTTCGCGGCGCTCAAGAGCAGCTCCTCGCAAGTCCTCGCGACGCTCTTCAAAGACGTCAACATCGACGCGAACTTTCACTCCCACATCCAGAAGCGGCAGAACCTGCTGCGACTCGAGGTCGAGGAGTTCGAGCGCAAAATCAAGGCCAACGAGGTCGCCAAGATTCGGCTCGAGCTGAAGACTCGGGCGACTGCCGGCCGAAAGTCCCGCAAAGCCCACAAACTCGCCGAAATCGAGCGAAAAATCAGAAATCTGTGCGAAAATGGCGTCGAAAAGTTTCCCGAAATTCTCGCCGCGCAGCAGCAAAACGTCGCCGAAATCCACGCGAAGTTGGCCGCCGAGGAGTCGTCGAAGGCGACGCAGCAGCAAAACCAGGCCAGAGTCGTGGAGACGGCGTGCTGCCCGCTCTGCTGCAAGAAGTTCGAGGGCGACGAAGGCCACGAACTGTTCGATGCACTGCTGTGCTCCGTCGACGAGGCTTCCAACAAAATGGCGGCCATCGAAGGCGAGCTCGCGAGCGCAACACAAAGACTCGACGAGCTCGCAGCCGCGAAGGTTTTTTTCGAGAAAATTCCTCGATTGCAGTCGGCGCTGCGGGAGGCCGAGGAGGAGCTGAGGGAGTGCGCGAAAAACACGCAAAAATGCGCCGAAGAACGCTCGGATTTGGAGCGGCGCTGCTCGAAGCCGAGGGCAGTGGCGTCGCTGATCAAGCTGTCGGTCGTGGGCGACATGATGAGGCTCGACGCCATCGGCAAAGCCATCGAGGAGCGCTCGAAGGAGGTCGAGGCGTTGAGGCTGAAAATTGCCGGAAAATCCTCGGGAAAATCGTTTTGCGATGCGATCTGCGAGCAGGAGGACGTTTGCAGGGAGCTCAGGCAGAAACGCGAGCAAATGGGCGAAATCGAGGGTAAAATCAGCGGCTTTCAGAACACGATGGTCGAGATCCACAAGCAGCTGAGGACGCTGAGCGACAAGAGAGGCGAGCACTCGTACAAGGTCAAGACACTCGACCTCGCAAGGGAGCAAATCGAGAAAGCGTCCCAAGACAAAAGCCGCCTGCAAGTCAAACTGGGCCAGCTGGAGGAGGAGCTGATCGCCATCGAGGCTCAAACAGTAACCGAAACGCAAGGCCAACTAGACTCGCTCGCAAAACTGAAGGACAAAATCGAAAGCGACGAGCGAAGCCTCAAGGAGCTTCACTTCAAGGCCCGCACACTAGCCGACCTGACGGTGCAGCTGAAGCTCTTCGAGCGCATGAACCTCGACGAAAAATCCCGGAAAATCTCCGCGAAAATCAGCGTCCACGAGCTGAACGCGAGCCTGAAGGCGGCGTGCATGCGCGCTAACGAGGCGGCCATGCAAGAAGGCCGCGAGGCGCTGAGCCGACACGCCGTCGAGCGGCGCAACGTGCAGGACAGCCTGCAGCTGCTGGCGCTCCAGGAGGAGGCCTCGTCGGCGGCCGAGAAGTGCACGCAACTCGAGGCGCTCATGGCGGGCGTGGACGTCGCCAAAATCAGTGAAGAAGGCGCGAAAATCGCGACCGAAATTGACGACTGCCGGAGGTCCAAGCACCAGCTGATCGGCGCCGGAACGTCGCTGAAAAGGTCGATGCAGGAGCTGCGCCGCGAGCTCGACAAACCCCACCTCAAGGACGCCGAGGCAAACTACCGGCGCTCGCACGCCGAGCAGGCCGTGCTGACCGCCACCATCGCCGACCTGCAGCGCTTTTGCGAGCGACTCGAGATGGCGCTGCTCGAGCACCACGAGCTCAAGATGGCGGCCGTGAACGCGAAAATCGAGCGCCTGTGGAGCAGCGTGTATCGGGGCGACGACATCGAGACGATCCTGGTGCGGACGCACGAGGAGAAGTCGCGCACGACGCTTCGGCGACGCAGCTACGACTACCGCGTGCTGCAGCGGAAGGTCGGCGGCGAGCTGAGCGAAATGCGCGGCCGCTGCTCGAGCGGACAGAAGGCGCTCGCTGCGCTGGTCATCCGCATGGCGCTGGCCGAGACCTTCGGCTCGCAGCTCGGTCTGCTTGCGCTCGACGAGCCGACGGCGTGCCTCGACGAGGCGAACGTCAGGGCAGTGGCCGCAGAGCTGGCGGCCATTGCTAGCGCGCGAAACGACGGGAAATTCATGCTGATCGTCATCACGCACGACGCGGAGTTCGTGGAGGCGTTTGAGGGTGGTGCCGTTTGCTACAAGGTGTCGATGGCCGGGGGCGTGTCGCTCGTACGGAAGTTACGCGGT 

Protein: 1168 (aa)

 MSGRPQDVSACMSDVMQVPKAVIDHVIFCHQEDSHWPLDEAQKVKKKFDEIFDTKPHDEALEKLLKMRREFEQRLCEAKPETEKNLIHRTQARGMREKLERDGARCEEIATQCAELDEKLAVLGEKRAELREIERKFAEWRAQKLKCEEKVASFGRECKRLSLNVLRSSRQELESELKAIGEHEAATRSKNVDLSAKNEALRAEQAQREKDLVELRREQVEVFCSQRQMQAKLSERFARAKELCGQLGIDFCGKRQATHAVPDEIVSEISAQVASREAQLKADSLSSDSASQELLSGLRARRVELETLIASSRSELKSRSEQLEKSRDELGRSEKDLSTSREIVAAFGKIEADLRRRCEDNRLGRLEERKSRLKADIGELEARQVALFGDLKALHVVASLKIELDVKEADLEQKKREFAALKSSSSQVLATLFKDVNIDANFHSHIQKRQNLLRLEVEEFERKIKANEVAKIRLELKTRATAGRKSRKAHKLAEIERKIRNLCENGVEKFPEILAAQQQNVAEIHAKLAAEESSKATQQQNQARVVETACCPLCCKKFEGDEGHELFDALLCSVDEASNKMAAIEGELASATQRLDELAAAKVFFEKIPRLQSALREAEEELRECAKNTQKCAEERSDLERRCSKPRAVASLIKLSVVGDMMRLDAIGKAIEERSKEVEALRLKIAGKSSGKSFCDAICEQEDVCRELRQKREQMGEIEGKISGFQNTMVEIHKQLRTLSDKRGEHSYKVKTLDLAREQIEKASQDKSRLQVKLGQLEEELIAIEAQTVTETQGQLDSLAKLKDKIESDERSLKELHFKARTLADLTVQLKLFERMNLDEKSRKISAKISVHELNASLKAACMRANEAAMQEGREALSRHAVERRNVQDSLQLLALQEEASSAAEKCTQLEALMAGVDVAKISEEGAKIATEIDDCRRSKHQLIGAGTSLKRSMQELRRELDKPHLKDAEANYRRSHAEQAVLTATIADLQRFCERLEMALLEHHELKMAAVNAKIERLWSSVYRGDDIETILVRTHEEKSRTTLRRRSYDYRVLQRKVGGELSEMRGRCSSGQKALAALVIRMALAETFGSQLGLLALDEPTACLDEANVRAVAAELAAIASARNDGKFMLIVITHDAEFVEAFEGGAVCYKVSMAGGVSLVRKLRG 
Type Start End Length
CDS 383 1163 781
CDS 1675 3146 1472
CDS 3772 5022 1251
intron 1164 1674 511
intron 3147 3771 625

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001845783 DNA repair protein RAD50 [Culex quinquefasciatus] gb|EDS41690.1| DNA repair protein RAD50 [Culex quinquefasciatus] 1e-104
Pfam PF13558.1 Putative exonuclease SbcCD, C subunit 7.5e-10

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.14177

Orthologous genes

Species Gene ID
B. mori BGIBMGA005449-TA
D. plexippus DPOGS206445PA
A. aegypti AAEL005245
M. musculus ENSMUSG00000020380
P. vanderplanki Pv.02239
P. humanus PHUM549380-PA
A. mellifera GB15340-PA
H. melpomene HMEL014001-PA
P. vanderplanki Pv.07016
N. vitripennis NV18538-PA
A. aegypti AAEL014748
A. aegypti AAEL011772
S. invicta SI2.2.0_80256
H. sapiens ENSP00000400049
C. quinquefasciatus CPIJ004115
H. sapiens ENSP00000390971
A. gambiae AGAP003676
D. melanogaster FBgn0034728
T. castaneum TC015093
H. sapiens ENSP00000368100
H. sapiens ENSP00000265335