MidgeBase gene description page [Pn.10910]
Outline
Gene ID | Pn.10910 |
Type | Protein coding gene |
Scaffold | PnScaf12389 |
Start | 28536 |
End | 34232 |
Direction | - |
Sequence
Transcript: 3627 (bp)
ATGGAATCGAGTGGAAATACAACACTGTATGTTGGCAATTTGGATCACTCCGTGACAGAAGAACTGATTTGTACTTTGTTTTCTCAAATGGGCCCCGTTAAGAGCTGTAAAGTAATTCGCGATTCAGGCAGCGATCCGTACGCCTTTGTCGAATATCATCATCATACTGCAGCTGCGACAGCCTTAGGCGCAATGAATAAACGACTCTTCTTTGATCGCGAGATGAAGGTTAATTGGGCGACGAGTAGCGCCGGAAATCAGCCAAAAACCGATACCAGTCAGCATCATCACATATTTGTCGGCGACCTCAGTCCCGAAATCGAAACGGAGACATTGCGAGAGGCGTTTGCACCATTCGGAGAGATCTCCAATTGTCGCATTGTCCGCGATCCCCAAACATTGAAGTCCAAGGGATATGCGTTTGTGTCGTTTGTGAAGAAGTCCGACGCGGAGAGTGCCATTCAGGCAATGAATGGCCAGTGGCTTGGGTCGAGAGCGATTCGAACAAATTGGTCCACACGAAAGCCGCCAGCGCCGACTGGCAAGGGCACCAAGACGGCAGACTACGAGGAAATCTACAACAAGACGAGTCCCACAAATACCACAGTCTACTGTGGTGGCTTCTCAGCCAACGTTCTCACTGACGATATTATCTACAAACACTTTGGCCAATTCGGACAGATACATGACATACGAGTGTTCAAAGATAAGGGCTATGCATTCGTGAAGTTCACAACAAAGGAAGCAGCCGCACGAGCAATCGAGGGTACAAACAATTCCGATGTTTATGGACACACTGTGAAGTGCTTCTGGGGCAAGGAAAACGGAAACGGCGGCGTCGATCCAGCACAGGCTACTGCGATGATGCAGGGTCAGAGCATAATTGCTGCTCCCCCGCCATCCAATCCGACACAGCAACAGCTTCAACAGCAGCAAATTTATTATCAGCAAATGGGTTATTGGTATCCGACTGCAGCCGCCGGTTATCCTACACAGATGACAGCACAGTACATGACGCAGCAAGGATATGCCGCTGCAGCAGCCGCCGGCTATCCGTCGTTTGCATACGCTAGTCCACAAAATCCTGCTGCAGCCGCCGCCGCTGCTGGATACACTCGGTTAATTCCTCCAACTGCTCTTCCAGCAAACATGGCATGGGTTCCGCCAAACGCAGCCGGAGGCGCCTCAAATCCCATTTCAAATATTTCGAATGGAGGCAGCAGCAATGCCGAAAGTCACATTTTGTTCTTCGACATAGAAGGTTTGATTGTTGAGCTGCATAGCGAAGAATACGCCGCAATGAGTCCGACCACTTTGGAGGCGCAAGGTCTACTCATTCAGCAGCATAATACAACAAACAATCCGTCCTCAGACGCAGGCAAAAAAATTACAGATTTTCTTGGTAAAGTTAAGAATAAGGCAGGAGATGTTGAAAAAATTCTAAAGGAAAAAGACAAACATGGAATTCTTGCAAAAGTAAAGGAGGGAGCGGAGCAAGTTGGAACGAAGATACAAGCCAAAGTCGAGACGGTTGTTAAGCAAATAAATGAGACGGGCGAGAAGAGTTCTGATGGCACCAAACGGTTCACACCTCGCATGGAAGCGACGCACGTTATGGAGGTTGCACAGCTTTTGCTATCGCTTATTCATTCATGGGGTCTCGATCCCCACCTTGATAAAGTCTGCGAGTCGCAGCTCGGTCTTCTTCGACCGATGGTCCCGATTTCGTTTGGCGTTCTCTCCAAAGGAGGCTACATGTCGCTTCTTCTGCCAACTTGGCAGAATACCATCAAGCAAGACACCGTTCAGAGGCTGCTCCATGGCACGGACATGCAGTTGAGCAAATCTCTGCCGGAGAATCTGCTGAGACAGGAGGCTCTCACGAAGCTCTTCACTGCGCGCTTGCACTGGGAATTGAGCACGACCATTACGTCGAATCATCTGCTCGGAATGGTCGCCATGTCCAACACGCTCATGTCCATGAAAATGGCCACTTTCGTACCGGAATCGGAGAGGGCACGCAAGCTCATGAGGCAGGCGACAAAGAGCAATGTGGCGTGGGCGAGCGACGATGACCACGAGGAGCAGCTGGTGACAGAACAGCAGGCACAAATAAAGCAGGGCTGGAGTTTGCTTTCGACCCACCATTGCTTCCTCCTGCCCGACAAAATCGACGCCATGGAGCCGAAGAACTTCAAGCGACCGCAGGTGGAGCTCATGGCACGCCGTTGGCAACATCACTGTATCGAGATTCGCGAGGCGGCCCAACAAATGCTCCTCGGCGAGCTCGGCCGGATGGGAAAGAAGGGTCGAAAGCAACTTGTTGAAAATTGGGCACAGTATTTACCGCAGTACACGCACACGGAGCCGATTGTTCAACAGGCGCAACCGTCGAGTCCAGGTTCCAGTTCGCCGACATCTTCGCCGCCCGGTGCGTCGCAACAGCCGAATGAGGCCGAAGAGGAGGTCGAGGAGGAGGAAGAAGTCGTCCGACAAAAGCCATCGAGTTTGGCCGAGCTCAAGAGAAAGCAATCGACGGCCGTCATTATTTTGGGCGTGATTGGCGCGGAGTTTGGCCAAGACATTTCGATCGAGACCGGATCAAACGGTAAAAAGCCGACGACTGAGCAACAGCAACGAAGAAAGAGCTCAATTGTTGAGGGCTTCGGAATTGGAAACAACAACTTGGCACGTCTCACGGCAATGGCATTGACGCATTTATTGCTCGCTCCGCCGACTCCCAAATTGCCGGCATACACTCCGTTGAGACGTGCCGCAATCGATCTGATTGGACGCGGATTTACTGTTTGGGAGCCGTACTTGGATGTCAGCAAAGTTCTGCTGGGGCTTTTGGAGATTTCGTGCGACTCGTCTCGACTCATTCCCAGTCTAACGTACAAGCTACCGTTAACACCTCAAGCGGATGCTTGCAGAACTGCAAGACACGCACTGCGTTTGATTGCCACGGCACGTCCTGCTGCATTCATAACGACGATGGCTCGCGAAGTGGCGCGCTACAATACTTTGCAGCAAAATTCACAAGCTCTAAGCGTGCCACTCACGCAGTCAGTGCTGCATCGTGCGAAGAAAGAAATTCTGCAGTGCGTCGAGATGTTGATTGAGAAAATGCAATCGGATATGGCTACACTGCTCGTGGAAGTCACTGATATCACATTACACTGCTTGGACATGTCTGACCTGAAGAATCGTCCACTGAACGAAGTGTGTCCCTCAATATGCAAATTCAACCAGGTCTCGCATTGCTCATCGACACGAAGAATTGCAGTCGGTGCAAACAACGGTCATTTGGCGATTTATGAGCTCCGACAAAACAAGTGCCAAATGATTCCGGCTCACTCGCAACCGATCACAGCTTTGGCATTTAGTCCCGATGGGAAATACCTGGTCAGCTACTCGTGCAGCGAAAATCGATTATCGTTTTGGCAGACGAGTACAGGCATGTTTGGTCTGGGACAAAGTCAAACGCGTTGCATTAAAGGCTACTCTACCGCGCCCATACCGGACGTAACTCGTCTAAATCCGATGCGTCTGGGCAAGCTGATATGGATCAATAATCGAACTGTTACTCTAATGCTGGCCGATGGCTCAGAAACGCGTTTCAATGTT
Protein: 1209 (aa)
MESSGNTTLYVGNLDHSVTEELICTLFSQMGPVKSCKVIRDSGSDPYAFVEYHHHTAAATALGAMNKRLFFDREMKVNWATSSAGNQPKTDTSQHHHIFVGDLSPEIETETLREAFAPFGEISNCRIVRDPQTLKSKGYAFVSFVKKSDAESAIQAMNGQWLGSRAIRTNWSTRKPPAPTGKGTKTADYEEIYNKTSPTNTTVYCGGFSANVLTDDIIYKHFGQFGQIHDIRVFKDKGYAFVKFTTKEAAARAIEGTNNSDVYGHTVKCFWGKENGNGGVDPAQATAMMQGQSIIAAPPPSNPTQQQLQQQQIYYQQMGYWYPTAAAGYPTQMTAQYMTQQGYAAAAAAGYPSFAYASPQNPAAAAAAAGYTRLIPPTALPANMAWVPPNAAGGASNPISNISNGGSSNAESHILFFDIEGLIVELHSEEYAAMSPTTLEAQGLLIQQHNTTNNPSSDAGKKITDFLGKVKNKAGDVEKILKEKDKHGILAKVKEGAEQVGTKIQAKVETVVKQINETGEKSSDGTKRFTPRMEATHVMEVAQLLLSLIHSWGLDPHLDKVCESQLGLLRPMVPISFGVLSKGGYMSLLLPTWQNTIKQDTVQRLLHGTDMQLSKSLPENLLRQEALTKLFTARLHWELSTTITSNHLLGMVAMSNTLMSMKMATFVPESERARKLMRQATKSNVAWASDDDHEEQLVTEQQAQIKQGWSLLSTHHCFLLPDKIDAMEPKNFKRPQVELMARRWQHHCIEIREAAQQMLLGELGRMGKKGRKQLVENWAQYLPQYTHTEPIVQQAQPSSPGSSSPTSSPPGASQQPNEAEEEVEEEEEVVRQKPSSLAELKRKQSTAVIILGVIGAEFGQDISIETGSNGKKPTTEQQQRRKSSIVEGFGIGNNNLARLTAMALTHLLLAPPTPKLPAYTPLRRAAIDLIGRGFTVWEPYLDVSKVLLGLLEISCDSSRLIPSLTYKLPLTPQADACRTARHALRLIATARPAAFITTMAREVARYNTLQQNSQALSVPLTQSVLHRAKKEILQCVEMLIEKMQSDMATLLVEVTDITLHCLDMSDLKNRPLNEVCPSICKFNQVSHCSSTRRIAVGANNGHLAIYELRQNKCQMIPAHSQPITALAFSPDGKYLVSYSCSENRLSFWQTSTGMFGLGQSQTRCIKGYSTAPIPDVTRLNPMRLGKLIWINNRTVTLMLADGSETRFNV
Type | Start | End | Length |
CDS |
28539 |
28708 |
170 |
CDS |
28794 |
28964 |
171 |
CDS |
29026 |
29353 |
328 |
CDS |
29417 |
30912 |
1496 |
CDS |
31139 |
31207 |
69 |
CDS |
31424 |
31496 |
73 |
CDS |
31643 |
31737 |
95 |
CDS |
33008 |
34232 |
1225 |
intron |
28709 |
28793 |
85 |
intron |
28965 |
29025 |
61 |
intron |
29354 |
29416 |
63 |
intron |
30913 |
31138 |
226 |
intron |
31208 |
31423 |
216 |
intron |
31497 |
31642 |
146 |
intron |
31738 |
33007 |
1270 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_555462 |
AGAP008003-PA [Anopheles gambiae str. PEST] gb|EAL39674.3| AGAP008003-PA [Anopheles gambiae str. PEST] |
0.0 |
InterPro |
IPR000504 |
RNA recognition motif domain |
|
InterPro |
IPR001680 |
WD40 repeat |
|
InterPro |
IPR015943 |
WD40/YVTN repeat-like-containing domain |
|
InterPro |
IPR012677 |
Nucleotide-binding, alpha-beta plait |
|
InterPro |
IPR003954 |
RNA recognition motif domain, eukaryote |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Gene Ontology(MF) |
GO:0000166 |
nucleotide binding |
|
Gene Ontology(MF) |
GO:0003676 |
nucleic acid binding |
|
Pfam |
PF00076.17 |
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) |
6.5e-59 |
Pfam |
PF08553.5 |
VID27 cytoplasmic protein |
0.0054 |
Pfam |
PF06103.6 |
Bacterial protein of unknown function (DUF948) |
0.076 |
Pfam |
PF13893.1 |
RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain) |
3.4e-38 |
Pfam |
PF00400.27 |
WD domain, G-beta repeat |
0.00028 |
Pfam |
PF08777.6 |
RNA binding motif |
0.052 |
Pfam |
PF11608.3 |
Limkain b1 |
0.0028 |
Pfam |
PF04847.7 |
Calcipressin |
0.0044 |
Pfam |
PF14259.1 |
RNA recognition motif (a.k.a. RRM, RBD, or RNP domain) |
1.8e-36 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
H. sapiens |
ENSP00000254442 |
S. invicta |
SI2.2.0_14674 |
N. vitripennis |
NV12897-PA |
D. plexippus |
DPOGS212245PA |
A. aegypti |
AAEL013534 |
H. sapiens |
ENSP00000452765 |
D. melanogaster |
FBgn0023510 |
P. vanderplanki |
Pv.04082 |
H. sapiens |
ENSP00000453378 |
H. sapiens |
ENSP00000453813 |
H. sapiens |
ENSP00000350187 |
M. musculus |
ENSMUSG00000040560 |
H. melpomene |
HMEL007143-PA |
T. castaneum |
TC008973 |
H. sapiens |
ENSP00000353699 |
A. gambiae |
AGAP008003 |
H. sapiens |
ENSP00000468357 |
D. plexippus |
DPOGS213852PA |
P. humanus |
PHUM035190-PA |
H. melpomene |
HMEL007133-PA |
H. sapiens |
ENSP00000379619 |
C. quinquefasciatus |
CPIJ801952 |
A. mellifera |
GB11315-PA |
H. melpomene |
HMEL009318-PA |
B. mori |
BGIBMGA009657-TA |
T. castaneum |
TC008972 |
B. mori |
BGIBMGA009658-TA |
M. musculus |
ENSMUSG00000044976 |
S. invicta |
SI2.2.0_15580 |
D. plexippus |
DPOGS212483PA |
P. vanderplanki |
Pv.04084 |