MidgeBase gene description page [Pn.01396]
Outline
Gene ID | Pn.01396 |
Type | Protein coding gene |
Scaffold | PnScaf1346 |
Start | 383 |
End | 5025 |
Direction | + |
Sequence
Transcript: 3504 (bp)
ATGAGCGGCAGGCCGCAGGACGTGAGCGCGTGCATGAGCGACGTGATGCAGGTGCCAAAGGCGGTCATCGACCACGTGATCTTCTGCCATCAGGAGGACTCGCACTGGCCACTCGACGAGGCCCAGAAGGTGAAGAAGAAGTTCGACGAGATTTTCGACACGAAGCCGCACGACGAGGCGCTCGAGAAGCTGCTCAAGATGCGGCGCGAGTTCGAGCAGCGGCTGTGCGAGGCGAAGCCGGAGACGGAAAAAAATTTGATCCACCGAACGCAAGCGCGAGGCATGCGGGAGAAGCTCGAGAGAGACGGCGCACGGTGCGAGGAAATTGCGACGCAGTGCGCCGAGTTGGACGAAAAGCTGGCAGTTTTGGGCGAAAAACGAGCGGAGTTGCGCGAAATTGAGCGAAAATTCGCGGAGTGGCGCGCGCAGAAGCTGAAATGTGAGGAGAAAGTCGCGAGCTTTGGGCGCGAGTGCAAGCGACTGTCGCTCAACGTCCTCCGAAGCAGCCGACAAGAGCTCGAGAGCGAGTTGAAGGCCATCGGGGAGCATGAGGCGGCGACGAGGAGCAAAAACGTCGATTTGAGCGCAAAAAACGAAGCTCTGAGGGCGGAACAGGCGCAGCGCGAAAAAGACCTGGTGGAGCTGAGGAGGGAGCAGGTCGAGGTCTTCTGCAGTCAGCGGCAGATGCAGGCGAAACTGTCGGAGAGATTCGCGAGGGCAAAAGAGCTTTGCGGCCAGCTGGGCATTGACTTCTGTGGCAAACGGCAGGCCACGCATGCTGTTCCGGACGAGATCGTGAGCGAAATTTCCGCGCAGGTCGCCTCCAGGGAGGCCCAGCTGAAGGCTGACTCCCTCTCGTCCGACTCGGCATCGCAGGAGCTGCTGAGCGGCCTTCGCGCGCGTCGAGTCGAGCTCGAAACGCTGATCGCGTCGAGTCGCAGTGAGCTGAAAAGCCGGAGCGAGCAGCTCGAGAAGTCGCGCGACGAGCTCGGAAGGTCGGAAAAGGACTTGTCGACCTCGCGCGAGATCGTCGCGGCTTTCGGGAAAATCGAGGCGGATTTGAGGCGCCGCTGCGAGGACAACCGGTTGGGCAGGCTGGAGGAGAGGAAGAGTCGGCTGAAGGCCGACATTGGCGAGCTGGAGGCGCGGCAGGTCGCGCTCTTCGGCGACCTCAAAGCCCTGCACGTAGTGGCTTCCTTGAAGATCGAGCTGGACGTCAAGGAGGCGGATTTGGAGCAGAAAAAGCGCGAGTTCGCGGCGCTCAAGAGCAGCTCCTCGCAAGTCCTCGCGACGCTCTTCAAAGACGTCAACATCGACGCGAACTTTCACTCCCACATCCAGAAGCGGCAGAACCTGCTGCGACTCGAGGTCGAGGAGTTCGAGCGCAAAATCAAGGCCAACGAGGTCGCCAAGATTCGGCTCGAGCTGAAGACTCGGGCGACTGCCGGCCGAAAGTCCCGCAAAGCCCACAAACTCGCCGAAATCGAGCGAAAAATCAGAAATCTGTGCGAAAATGGCGTCGAAAAGTTTCCCGAAATTCTCGCCGCGCAGCAGCAAAACGTCGCCGAAATCCACGCGAAGTTGGCCGCCGAGGAGTCGTCGAAGGCGACGCAGCAGCAAAACCAGGCCAGAGTCGTGGAGACGGCGTGCTGCCCGCTCTGCTGCAAGAAGTTCGAGGGCGACGAAGGCCACGAACTGTTCGATGCACTGCTGTGCTCCGTCGACGAGGCTTCCAACAAAATGGCGGCCATCGAAGGCGAGCTCGCGAGCGCAACACAAAGACTCGACGAGCTCGCAGCCGCGAAGGTTTTTTTCGAGAAAATTCCTCGATTGCAGTCGGCGCTGCGGGAGGCCGAGGAGGAGCTGAGGGAGTGCGCGAAAAACACGCAAAAATGCGCCGAAGAACGCTCGGATTTGGAGCGGCGCTGCTCGAAGCCGAGGGCAGTGGCGTCGCTGATCAAGCTGTCGGTCGTGGGCGACATGATGAGGCTCGACGCCAT
CGGCAAAGCCATCGAGGAGCGCTCGAAGGAGGTCGAGGCGTTGAGGCTGAAAATTGCCGGAAAATCCTCGGGAAAATCGTTTTGCGATGCGATCTGCGAGCAGGAGGACGTTTGCAGGGAGCTCAGGCAGAAACGCGAGCAAATGGGCGAAATCGAGGGTAAAATCAGCGGCTTTCAGAACACGATGGTCGAGATCCACAAGCAGCTGAGGACGCTGAGCGACAAGAGAGGCGAGCACTCGTACAAGGTCAAGACACTCGACCTCGCAAGGGAGCAAATCGAGAAAGCGTCCCAAGACAAAAGCCGCCTGCAAGTCAAACTGGGCCAGCTGGAGGAGGAGCTGATCGCCATCGAGGCTCAAACAGTAACCGAAACGCAAGGCCAACTAGACTCGCTCGCAAAACTGAAGGACAAAATCGAAAGCGACGAGCGAAGCCTCAAGGAGCTTCACTTCAAGGCCCGCACACTAGCCGACCTGACGGTGCAGCTGAAGCTCTTCGAGCGCATGAACCTCGACGAAAAATCCCGGAAAATCTCCGCGAAAATCAGCGTCCACGAGCTGAACGCGAGCCTGAAGGCGGCGTGCATGCGCGCTAACGAGGCGGCCATGCAAGAAGGCCGCGAGGCGCTGAGCCGACACGCCGTCGAGCGGCGCAACGTGCAGGACAGCCTGCAGCTGCTGGCGCTCCAGGAGGAGGCCTCGTCGGCGGCCGAGAAGTGCACGCAACTCGAGGCGCTCATGGCGGGCGTGGACGTCGCCAAAATCAGTGAAGAAGGCGCGAAAATCGCGACCGAAATTGACGACTGCCGGAGGTCCAAGCACCAGCTGATCGGCGCCGGAACGTCGCTGAAAAGGTCGATGCAGGAGCTGCGCCGCGAGCTCGACAAACCCCACCTCAAGGACGCCGAGGCAAACTACCGGCGCTCGCACGCCGAGCAGGCCGTGCTGACCGCCACCATCGCCGACCTGCAGCGCTTTTGCGAGCGACTCGAGATGGCGCTGCTCGAGCACCACGAGCTCAAGATGGCGGCCGTGAACGCGAAAATCGAGCGCCTGTGGAGCAGCGTGTATCGGGGCGACGACATCGAGACGATCCTGGTGCGGACGCACGAGGAGAAGTCGCGCACGACGCTTCGGCGACGCAGCTACGACTACCGCGTGCTGCAGCGGAAGGTCGGCGGCGAGCTGAGCGAAATGCGCGGCCGCTGCTCGAGCGGACAGAAGGCGCTCGCTGCGCTGGTCATCCGCATGGCGCTGGCCGAGACCTTCGGCTCGCAGCTCGGTCTGCTTGCGCTCGACGAGCCGACGGCGTGCCTCGACGAGGCGAACGTCAGGGCAGTGGCCGCAGAGCTGGCGGCCATTGCTAGCGCGCGAAACGACGGGAAATTCATGCTGATCGTCATCACGCACGACGCGGAGTTCGTGGAGGCGTTTGAGGGTGGTGCCGTTTGCTACAAGGTGTCGATGGCCGGGGGCGTGTCGCTCGTACGGAAGTTACGCGGT
Protein: 1168 (aa)
MSGRPQDVSACMSDVMQVPKAVIDHVIFCHQEDSHWPLDEAQKVKKKFDEIFDTKPHDEALEKLLKMRREFEQRLCEAKPETEKNLIHRTQARGMREKLERDGARCEEIATQCAELDEKLAVLGEKRAELREIERKFAEWRAQKLKCEEKVASFGRECKRLSLNVLRSSRQELESELKAIGEHEAATRSKNVDLSAKNEALRAEQAQREKDLVELRREQVEVFCSQRQMQAKLSERFARAKELCGQLGIDFCGKRQATHAVPDEIVSEISAQVASREAQLKADSLSSDSASQELLSGLRARRVELETLIASSRSELKSRSEQLEKSRDELGRSEKDLSTSREIVAAFGKIEADLRRRCEDNRLGRLEERKSRLKADIGELEARQVALFGDLKALHVVASLKIELDVKEADLEQKKREFAALKSSSSQVLATLFKDVNIDANFHSHIQKRQNLLRLEVEEFERKIKANEVAKIRLELKTRATAGRKSRKAHKLAEIERKIRNLCENGVEKFPEILAAQQQNVAEIHAKLAAEESSKATQQQNQARVVETACCPLCCKKFEGDEGHELFDALLCSVDEASNKMAAIEGELASATQRLDELAAAKVFFEKIPRLQSALREAEEELRECAKNTQKCAEERSDLERRCSKPRAVASLIKLSVVGDMMRLDAIGKAIEERSKEVEALRLKIAGKSSGKSFCDAICEQEDVCRELRQKREQMGEIEGKISGFQNTMVEIHKQLRTLSDKRGEHSYKVKTLDLAREQIEKASQDKSRLQVKLGQLEEELIAIEAQTVTETQGQLDSLAKLKDKIESDERSLKELHFKARTLADLTVQLKLFERMNLDEKSRKISAKISVHELNASLKAACMRANEAAMQEGREALSRHAVERRNVQDSLQLLALQEEASSAAEKCTQLEALMAGVDVAKISEEGAKIATEIDDCRRSKHQLIGAGTSLKRSMQELRRELDKPHLKDAEANYRRSHAEQAVLTATIADLQRFCERLEMALLEHHELKMAAVNAKIERLWSSVYRGDDIETILVRTHEEKSRTTLRRRSYDYRVLQRKVGGELSEMRGRCSSGQKALAALVIRMALAETFGSQLGLLALDEPTACLDEANVRAVAAELAAIASARNDGKFMLIVITHDAEFVEAFEGGAVCYKVSMAGGVSLVRKLRG
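The transcript and protein sequences above can be cross-checked by translating the coding sequence with the standard genetic code. The following is a minimal sketch, not the pipeline MidgeBase uses: the names `CODON_TABLE` and `translate` are illustrative, and only the first 30 bases of the transcript are used to keep the example short.

```python
# Standard genetic code, built from the conventional T, C, A, G base ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (length divisible by 3) into one-letter amino acids."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 30 bases of the Pn.01396 transcript as listed on this page.
transcript_prefix = "ATGAGCGGCAGGCCGCAGGACGTGAGCGCG"
protein_prefix = translate(transcript_prefix)
print(protein_prefix)  # MSGRPQDVSA, the first 10 residues of the listed protein
```

Passing the full 3504 bp transcript to the same function should reproduce the 1168 aa protein, assuming the standard genetic code applies and the sequence is read in frame from the first base.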
Type | Start | End | Length |
CDS | 383 | 1163 | 781 |
CDS | 1675 | 3146 | 1472 |
CDS | 3772 | 5022 | 1251 |
intron | 1164 | 1674 | 511 |
intron | 3147 | 3771 | 625 |
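The feature coordinates are 1-based and inclusive, so each length equals End - Start + 1. The sketch below re-derives the lengths from the coordinates on this page and checks them against the stated transcript (3504 bp) and protein (1168 aa) sizes. The variable and function names are illustrative only.

```python
# (type, start, end) for each feature, copied from the table on this page.
features = [
    ("CDS", 383, 1163),
    ("CDS", 1675, 3146),
    ("CDS", 3772, 5022),
    ("intron", 1164, 1674),
    ("intron", 3147, 3771),
]


def feature_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


lengths = [feature_length(start, end) for _, start, end in features]
cds_total = sum(
    feature_length(start, end) for kind, start, end in features if kind == "CDS"
)

print(lengths)         # [781, 1472, 1251, 511, 625]
print(cds_total)       # 3504, matching the transcript length
print(cds_total // 3)  # 1168, matching the protein length
```

The gene End is given as 5025 while the last CDS ends at 5022. The three remaining bases would be consistent with a stop codon excluded from the CDS, although the page does not state this.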
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_001845783 | DNA repair protein RAD50 [Culex quinquefasciatus] (GenBank: EDS41690.1) | 1e-104 |
Pfam | PF13558.1 | Putative exonuclease SbcCD, C subunit | 7.5e-10 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
B. mori | BGIBMGA005449-TA |
D. plexippus | DPOGS206445PA |
A. aegypti | AAEL005245 |
M. musculus | ENSMUSG00000020380 |
P. vanderplanki | Pv.02239 |
P. humanus | PHUM549380-PA |
A. mellifera | GB15340-PA |
H. melpomene | HMEL014001-PA |
P. vanderplanki | Pv.07016 |
N. vitripennis | NV18538-PA |
A. aegypti | AAEL014748 |
A. aegypti | AAEL011772 |
S. invicta | SI2.2.0_80256 |
H. sapiens | ENSP00000400049 |
C. quinquefasciatus | CPIJ004115 |
H. sapiens | ENSP00000390971 |
A. gambiae | AGAP003676 |
D. melanogaster | FBgn0034728 |
T. castaneum | TC015093 |
H. sapiens | ENSP00000368100 |
H. sapiens | ENSP00000265335 |