MidgeBase gene description page [Pn.10910]

Outline

Gene ID Pn.10910
Type Protein coding gene
Scaffold PnScaf12389
Start 28536
End 34232
Direction - (minus strand)

Sequence

Transcript: 3627 bp

 ATGGAATCGAGTGGAAATACAACACTGTATGTTGGCAATTTGGATCACTCCGTGACAGAAGAACTGATTTGTACTTTGTTTTCTCAAATGGGCCCCGTTAAGAGCTGTAAAGTAATTCGCGATTCAGGCAGCGATCCGTACGCCTTTGTCGAATATCATCATCATACTGCAGCTGCGACAGCCTTAGGCGCAATGAATAAACGACTCTTCTTTGATCGCGAGATGAAGGTTAATTGGGCGACGAGTAGCGCCGGAAATCAGCCAAAAACCGATACCAGTCAGCATCATCACATATTTGTCGGCGACCTCAGTCCCGAAATCGAAACGGAGACATTGCGAGAGGCGTTTGCACCATTCGGAGAGATCTCCAATTGTCGCATTGTCCGCGATCCCCAAACATTGAAGTCCAAGGGATATGCGTTTGTGTCGTTTGTGAAGAAGTCCGACGCGGAGAGTGCCATTCAGGCAATGAATGGCCAGTGGCTTGGGTCGAGAGCGATTCGAACAAATTGGTCCACACGAAAGCCGCCAGCGCCGACTGGCAAGGGCACCAAGACGGCAGACTACGAGGAAATCTACAACAAGACGAGTCCCACAAATACCACAGTCTACTGTGGTGGCTTCTCAGCCAACGTTCTCACTGACGATATTATCTACAAACACTTTGGCCAATTCGGACAGATACATGACATACGAGTGTTCAAAGATAAGGGCTATGCATTCGTGAAGTTCACAACAAAGGAAGCAGCCGCACGAGCAATCGAGGGTACAAACAATTCCGATGTTTATGGACACACTGTGAAGTGCTTCTGGGGCAAGGAAAACGGAAACGGCGGCGTCGATCCAGCACAGGCTACTGCGATGATGCAGGGTCAGAGCATAATTGCTGCTCCCCCGCCATCCAATCCGACACAGCAACAGCTTCAACAGCAGCAAATTTATTATCAGCAAATGGGTTATTGGTATCCGACTGCAGCCGCCGGTTATCCTACACAGATGACAGCACAGTACATGACGCAGCAAGGATATGCCGCTGCAGCAGCCGCCGGCTATCCGTCGTTTGCATACGCTAGTCCACAAAATCCTGCTGCAGCCGCCGCCGCTGCTGGATACACTCGGTTAATTCCTCCAACTGCTCTTCCAGCAAACATGGCATGGGTTCCGCCAAACGCAGCCGGAGGCGCCTCAAATCCCATTTCAAATATTTCGAATGGAGGCAGCAGCAATGCCGAAAGTCACATTTTGTTCTTCGACATAGAAGGTTTGATTGTTGAGCTGCATAGCGAAGAATACGCCGCAATGAGTCCGACCACTTTGGAGGCGCAAGGTCTACTCATTCAGCAGCATAATACAACAAACAATCCGTCCTCAGACGCAGGCAAAAAAATTACAGATTTTCTTGGTAAAGTTAAGAATAAGGCAGGAGATGTTGAAAAAATTCTAAAGGAAAAAGACAAACATGGAATTCTTGCAAAAGTAAAGGAGGGAGCGGAGCAAGTTGGAACGAAGATACAAGCCAAAGTCGAGACGGTTGTTAAGCAAATAAATGAGACGGGCGAGAAGAGTTCTGATGGCACCAAACGGTTCACACCTCGCATGGAAGCGACGCACGTTATGGAGGTTGCACAGCTTTTGCTATCGCTTATTCATTCATGGGGTCTCGATCCCCACCTTGATAAAGTCTGCGAGTCGCAGCTCGGTCTTCTTCGACCGATGGTCCCGATTTCGTTTGGCGTTCTCTCCAAAGGAGGCTACATGTCGCTTCTTCTGCCAACTTGGCAGAATACCATCAAGCAAGACACCGTTCAGAGGCTGCTCCATGGCACGGACATGCAGTTGAGCAAATCTCTGCCGGAGAATCTGCTGAGACAGGAGGCTCTCACGAAGCTCTTCACTGCGCGCTTGCACTGGGAATTGAGCACGACCATTACGTCGAATCATCTGCTCGGAATGGTCGCCATGTCCAACACGCTCATGTCCATGAAAATGGCCACTTTCG
TACCGGAATCGGAGAGGGCACGCAAGCTCATGAGGCAGGCGACAAAGAGCAATGTGGCGTGGGCGAGCGACGATGACCACGAGGAGCAGCTGGTGACAGAACAGCAGGCACAAATAAAGCAGGGCTGGAGTTTGCTTTCGACCCACCATTGCTTCCTCCTGCCCGACAAAATCGACGCCATGGAGCCGAAGAACTTCAAGCGACCGCAGGTGGAGCTCATGGCACGCCGTTGGCAACATCACTGTATCGAGATTCGCGAGGCGGCCCAACAAATGCTCCTCGGCGAGCTCGGCCGGATGGGAAAGAAGGGTCGAAAGCAACTTGTTGAAAATTGGGCACAGTATTTACCGCAGTACACGCACACGGAGCCGATTGTTCAACAGGCGCAACCGTCGAGTCCAGGTTCCAGTTCGCCGACATCTTCGCCGCCCGGTGCGTCGCAACAGCCGAATGAGGCCGAAGAGGAGGTCGAGGAGGAGGAAGAAGTCGTCCGACAAAAGCCATCGAGTTTGGCCGAGCTCAAGAGAAAGCAATCGACGGCCGTCATTATTTTGGGCGTGATTGGCGCGGAGTTTGGCCAAGACATTTCGATCGAGACCGGATCAAACGGTAAAAAGCCGACGACTGAGCAACAGCAACGAAGAAAGAGCTCAATTGTTGAGGGCTTCGGAATTGGAAACAACAACTTGGCACGTCTCACGGCAATGGCATTGACGCATTTATTGCTCGCTCCGCCGACTCCCAAATTGCCGGCATACACTCCGTTGAGACGTGCCGCAATCGATCTGATTGGACGCGGATTTACTGTTTGGGAGCCGTACTTGGATGTCAGCAAAGTTCTGCTGGGGCTTTTGGAGATTTCGTGCGACTCGTCTCGACTCATTCCCAGTCTAACGTACAAGCTACCGTTAACACCTCAAGCGGATGCTTGCAGAACTGCAAGACACGCACTGCGTTTGATTGCCACGGCACGTCCTGCTGCATTCATAACGACGATGGCTCGCGAAGTGGCGCGCTACAATACTTTGCAGCAAAATTCACAAGCTCTAAGCGTGCCACTCACGCAGTCAGTGCTGCATCGTGCGAAGAAAGAAATTCTGCAGTGCGTCGAGATGTTGATTGAGAAAATGCAATCGGATATGGCTACACTGCTCGTGGAAGTCACTGATATCACATTACACTGCTTGGACATGTCTGACCTGAAGAATCGTCCACTGAACGAAGTGTGTCCCTCAATATGCAAATTCAACCAGGTCTCGCATTGCTCATCGACACGAAGAATTGCAGTCGGTGCAAACAACGGTCATTTGGCGATTTATGAGCTCCGACAAAACAAGTGCCAAATGATTCCGGCTCACTCGCAACCGATCACAGCTTTGGCATTTAGTCCCGATGGGAAATACCTGGTCAGCTACTCGTGCAGCGAAAATCGATTATCGTTTTGGCAGACGAGTACAGGCATGTTTGGTCTGGGACAAAGTCAAACGCGTTGCATTAAAGGCTACTCTACCGCGCCCATACCGGACGTAACTCGTCTAAATCCGATGCGTCTGGGCAAGCTGATATGGATCAATAATCGAACTGTTACTCTAATGCTGGCCGATGGCTCAGAAACGCGTTTCAATGTT 

Protein: 1209 aa

 MESSGNTTLYVGNLDHSVTEELICTLFSQMGPVKSCKVIRDSGSDPYAFVEYHHHTAAATALGAMNKRLFFDREMKVNWATSSAGNQPKTDTSQHHHIFVGDLSPEIETETLREAFAPFGEISNCRIVRDPQTLKSKGYAFVSFVKKSDAESAIQAMNGQWLGSRAIRTNWSTRKPPAPTGKGTKTADYEEIYNKTSPTNTTVYCGGFSANVLTDDIIYKHFGQFGQIHDIRVFKDKGYAFVKFTTKEAAARAIEGTNNSDVYGHTVKCFWGKENGNGGVDPAQATAMMQGQSIIAAPPPSNPTQQQLQQQQIYYQQMGYWYPTAAAGYPTQMTAQYMTQQGYAAAAAAGYPSFAYASPQNPAAAAAAAGYTRLIPPTALPANMAWVPPNAAGGASNPISNISNGGSSNAESHILFFDIEGLIVELHSEEYAAMSPTTLEAQGLLIQQHNTTNNPSSDAGKKITDFLGKVKNKAGDVEKILKEKDKHGILAKVKEGAEQVGTKIQAKVETVVKQINETGEKSSDGTKRFTPRMEATHVMEVAQLLLSLIHSWGLDPHLDKVCESQLGLLRPMVPISFGVLSKGGYMSLLLPTWQNTIKQDTVQRLLHGTDMQLSKSLPENLLRQEALTKLFTARLHWELSTTITSNHLLGMVAMSNTLMSMKMATFVPESERARKLMRQATKSNVAWASDDDHEEQLVTEQQAQIKQGWSLLSTHHCFLLPDKIDAMEPKNFKRPQVELMARRWQHHCIEIREAAQQMLLGELGRMGKKGRKQLVENWAQYLPQYTHTEPIVQQAQPSSPGSSSPTSSPPGASQQPNEAEEEVEEEEEVVRQKPSSLAELKRKQSTAVIILGVIGAEFGQDISIETGSNGKKPTTEQQQRRKSSIVEGFGIGNNNLARLTAMALTHLLLAPPTPKLPAYTPLRRAAIDLIGRGFTVWEPYLDVSKVLLGLLEISCDSSRLIPSLTYKLPLTPQADACRTARHALRLIATARPAAFITTMAREVARYNTLQQNSQALSVPLTQSVLHRAKKEILQCVEMLIEKMQSDMATLLVEVTDITLHCLDMSDLKNRPLNEVCPSICKFNQVSHCSSTRRIAVGANNGHLAIYELRQNKCQMIPAHSQPITALAFSPDGKYLVSYSCSENRLSFWQTSTGMFGLGQSQTRCIKGYSTAPIPDVTRLNPMRLGKLIWINNRTVTLMLADGSETRFNV 
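
Note: the protein sequence above should correspond to a translation of the transcript. The sketch below is one way to check this, assuming the standard genetic code applies to this gene; only the first 27 bases of the transcript are used for brevity, and the names `translate` and `CODON_TABLE` are illustrative and not part of MidgeBase.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order
# at each of the three positions (assumption: standard code applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence (frame 0) into one-letter amino acids."""
    cds = "".join(cds.split()).upper()  # drop whitespace and line breaks
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 27 bases of the Pn.10910 transcript, copied from this page.
transcript_prefix = "ATGGAATCGAGTGGAAATACAACACTG"
# First 9 residues of the Pn.10910 protein, copied from this page.
protein_prefix = "MESSGNTTL"

translated = translate(transcript_prefix)
print(translated)                    # MESSGNTTL
print(translated == protein_prefix)  # True
```

The same function can be applied to the full 3627 bp transcript once its line break is removed; `translate` already strips whitespace for that purpose.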
Type Start End Length
CDS 28539 28708 170
CDS 28794 28964 171
CDS 29026 29353 328
CDS 29417 30912 1496
CDS 31139 31207 69
CDS 31424 31496 73
CDS 31643 31737 95
CDS 33008 34232 1225
intron 28709 28793 85
intron 28965 29025 61
intron 29354 29416 63
intron 30913 31138 226
intron 31208 31423 216
intron 31497 31642 146
intron 31738 33007 1270
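
Note: the coordinates in the table above are 1-based and inclusive, so each length equals End - Start + 1. The following sketch recomputes the lengths from the listed coordinates and checks them against the transcript and protein sizes given on this page; the variable names are illustrative.

```python
# CDS and intron coordinates copied from the table above (1-based, inclusive).
cds = [
    (28539, 28708), (28794, 28964), (29026, 29353), (29417, 30912),
    (31139, 31207), (31424, 31496), (31643, 31737), (33008, 34232),
]
introns = [
    (28709, 28793), (28965, 29025), (29354, 29416), (30913, 31138),
    (31208, 31423), (31497, 31642), (31738, 33007),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


cds_total = sum(span_length(s, e) for s, e in cds)

# Each intron should exactly fill the gap between two consecutive CDS blocks.
gaps = [(cds[i][1] + 1, cds[i + 1][0] - 1) for i in range(len(cds) - 1)]

print(cds_total)        # 3627, the transcript length in bp
print(cds_total // 3)   # 1209, the protein length in aa
print(gaps == introns)  # True
```

The first CDS block starts at 28539 while the gene starts at 28536; the three-base difference is not covered by any row of the table.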

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_555462 (gb|EAL39674.3) AGAP008003-PA [Anopheles gambiae str. PEST] 0.0
InterPro IPR000504 RNA recognition motif domain
InterPro IPR001680 WD40 repeat
InterPro IPR015943 WD40/YVTN repeat-like-containing domain
InterPro IPR012677 Nucleotide-binding, alpha-beta plait
InterPro IPR003954 RNA recognition motif domain, eukaryote
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0000166 nucleotide binding
Gene Ontology(MF) GO:0003676 nucleic acid binding
Pfam PF00076.17 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) 6.5e-59
Pfam PF08553.5 VID27 cytoplasmic protein 0.0054
Pfam PF06103.6 Bacterial protein of unknown function (DUF948) 0.076
Pfam PF13893.1 RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) 3.4e-38
Pfam PF00400.27 WD domain, G-beta repeat 0.00028
Pfam PF08777.6 RNA binding motif 0.052
Pfam PF11608.3 Limkain b1 0.0028
Pfam PF04847.7 Calcipressin 0.0044
Pfam PF14259.1 RNA recognition motif (a.k.a. RRM, RBD, or RNP domain) 1.8e-36
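
Note: the Pfam hits above span a wide range of expectation values, and several are weak. As a rough sketch, the hits can be filtered and ranked by E-value; the cutoff of 1e-5 used here is an arbitrary assumption for illustration and is not a threshold stated by MidgeBase.

```python
# Pfam hits copied from the table above: (accession, description, E-value).
pfam_hits = [
    ("PF00076.17", "RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain)", 6.5e-59),
    ("PF08553.5", "VID27 cytoplasmic protein", 0.0054),
    ("PF06103.6", "Bacterial protein of unknown function (DUF948)", 0.076),
    ("PF13893.1", "RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain)", 3.4e-38),
    ("PF00400.27", "WD domain, G-beta repeat", 0.00028),
    ("PF08777.6", "RNA binding motif", 0.052),
    ("PF11608.3", "Limkain b1", 0.0028),
    ("PF04847.7", "Calcipressin", 0.0044),
    ("PF14259.1", "RNA recognition motif (a.k.a. RRM, RBD, or RNP domain)", 1.8e-36),
]

E_VALUE_CUTOFF = 1e-5  # hypothetical cutoff, chosen only for this example

# Keep hits at or below the cutoff and list the strongest first.
strong_hits = sorted(
    (hit for hit in pfam_hits if hit[2] <= E_VALUE_CUTOFF),
    key=lambda hit: hit[2],
)

for accession, description, e_value in strong_hits:
    print(f"{accession}\t{e_value:.1e}\t{description}")
```

With this cutoff only the three RNA recognition motif families remain; the WD domain hit (0.00028) falls just outside it, although WD40 repeats are also reported by InterPro above.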

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.10911

Orthologous genes

Species Gene ID
H. sapiens ENSP00000254442
S. invicta SI2.2.0_14674
N. vitripennis NV12897-PA
D. plexippus DPOGS212245PA
A. aegypti AAEL013534
H. sapiens ENSP00000452765
D. melanogaster FBgn0023510
P. vanderplanki Pv.04082
H. sapiens ENSP00000453378
H. sapiens ENSP00000453813
H. sapiens ENSP00000350187
M. musculus ENSMUSG00000040560
H. melpomene HMEL007143-PA
T. castaneum TC008973
H. sapiens ENSP00000353699
A. gambiae AGAP008003
H. sapiens ENSP00000468357
D. plexippus DPOGS213852PA
P. humanus PHUM035190-PA
H. melpomene HMEL007133-PA
H. sapiens ENSP00000379619
C. quinquefasciatus CPIJ801952
A. mellifera GB11315-PA
H. melpomene HMEL009318-PA
B. mori BGIBMGA009657-TA
T. castaneum TC008972
B. mori BGIBMGA009658-TA
M. musculus ENSMUSG00000044976
S. invicta SI2.2.0_15580
D. plexippus DPOGS212483PA
P. vanderplanki Pv.04084