MidgeBase gene description page [Pn.10201]
Outline
Gene ID | Pn.10201 |
Type | Protein coding gene |
Scaffold | PnScaf10994 |
Start | 38531 |
End | 60970 |
Direction | - |
Sequence
Transcript: 6453 (bp)
ATGTCGCGAGAAGCCTACAATTCACTGCCTAATATTGGAACTCGCGGAAGTCCAGGAAATCGTGCCAGAAGACTCACAGAATTGCCAACTGTCGATAAGTCACCCTCACGTGATTTTCAGAAAGGAAGTCCTTTAGGACATTTGCGATATACCAGATTTTCAAGACACAATTACGGTCCGTACCATTCATCACTTTATTATCCACATCATTCTATTGCAGCCATGAGCAGCCCGTATTATCACCGTGACATGGAGGACCCCATAAGTCCAGCGTTGCCGGGACACCATCGCAGTCGAAGCGCAAGTCGACCGCCTGTCTCGCACACGATGGATTATCCAAGACGCTATCAGTCATTGGATCGCGGCGGATTTGCCGATCCACACGACCGCGAATTCGTGCCGATACGAGAACCTCGAGACCGCTCTCGCGACCGCTCACTCGAGCGCGGTCTCTACCTTGAGGAGGAGCTTTACGGACGATCTGCACGACAAAGCCCCAACCCCATGATGGGACCGGACCGAGGCTACTTGGGCGACCTGCAGTACGTCAACAGCGACCTGCAGCGCGAGCTCGGCAACCTCAAGAAGGAACTAGAATTAACCAATCAGAAGCTTAGCAGTTCGATGCACAGTATTAAGACATTCTGGTCGCCGGAGCTGAAGAAGGAGCGGGCGCTGCGCAAGGAGGAGAGCGCCAAGTACAGCCTCATCAACGACCAGCTGAAGCTGCTGAACAACGAGAACCAGAAGCAGGCGATGCTGGTGCGTCAGCTCGAGGAGGAGCTCCGCATGCGAATGCGGCAGCCGAGTCTCGAGGTGCAGCAGCAAATCGAAACGCTCTATGCGGAAAATGAGCACTTGACACGAGAAATAGCAATACTGCGCGACACGATAAAAGAGCTCGAGTTACGCTTGGAGACACAGAAGCAGACGCTCCTGGCCCGCGACGAAAGCATCAAAAAGCTCTTAGAGATGCTGCAGAACAAGGGAATGGGCAAGGAAGAGGAGCGCGCAATGTTCCAGCAAATGCAAGCCATGGCACAAAAACAGATGTTCACTGCCTATCCATCTGAGCGATCAAATTTTTACGGCGGGCCAAGCTCAGCGACGTCTAATACAGCGCCATCTGCTGCTCCATCGAGCAGTCTCTATTCCTCATTACCGCCTATTCCGCAGCCACAGCCAGCGACGCCTACCGCAATGTTGCCGCCCATTCCGCAGCCGGCCGCCGCCGCAGCCGGAGCCAGCAACCTCATGTCAACCAACACCTCCTCGTACATCGGACTGAACCGAATGAGTCCGCTGCCGATGCGCCGCTACAGCTATAGCGGCGTGCCGTCGTCGACCTCGTTCGCCGATCTGAACATGAATTTCGATAGCACAATGGTGCGCGACAGCAGCATAACGAATCTAATTAACGAAACGTGTCAGTCAATCTCGAGATCGTCGAACATTCTGAACCGACAATTGTCGGCCGATATGGCTCTGAGTAGCATGAGTCTTACGAATTTAATCGATGAGAATATTGAGAGCGATATTGTAAAATATGATTTGTTAAATGAAATTACGACGACGGCCGCGTTGCCGCCGCTCAAGTACGCCGACACATACTACTCGCATCCGAGCAAGCATGTGTCGGCCATTACTGCACCTACTTCCTACTTTAATCGCAATGTCTCACACATGCTCAGCAGCGACGACTATCTGTTGTCGAAGCCGATCTTTAGCAAGAGCTATGTTAATCCGATGTCATCATCATACATAAGTGACGATCACCATCATCACTATAATAACAGTAGTTTCTATGACACACAACCGCCACACCAAATACACCACCAGCTACCGCAAATACCGCCGACACACCACTCCTACATGACACCGTCACACTACCACTATCACTACTCACAGCCACACAGCCATCATCATAATTTGCGTCATCATCATTATGCATCGAATCCGTGTCTCTCGTCGCACTCATATCATAATGCCTCCTCCTCCTCTCCCTACTCGCAGCCCCTCAATCCCAACACCATAGCATTTAATCGTAATTATAATTATCCATCTAATCATCAGCAAGCGTTCGCGTCGTCACGTCCGCCATCCATCTCGTCGCTCAATCGCTACGCGGCCCTTACGAATAACTATCAAAACAGTTTCAATGCTCAGCCCTATTATCGGACATCGTCGCGAATCGATCTTGACAATCCCAAGGGCAGCGAGATCAAGCGACAACTGGATGATTTCCGTCTAGAGATACAGAGACGCGACCAAGAAATTATGAGCATGGCGGCGAAGATGAAAACACTCGAGGAGCAGCACCAGGATTATCAGCGTCATATTTCAGTGCTGAAGGAGTCTTTGTGCGCGAAGGAAGAGCACTACAACATGCTTCAGTCCGACGTGGAAGAATTGAGAAATCGCTTGGAGGAGAAGAACCGAATAATTGAGAAGCACACGCAGGGTTCGCTGCAGTCCAATCAGGAGCGAAATCGCGCCCAGCAAGAGGTGGTCGAGCTGAAGGAGCATCTCGACATCAAGGAGCGCAAAATTAGCGTTCTTCAGCGAAAAATCGAGAACCTCGAGGACCTGCTGAAGGAGAAGGACAATCAAGTGGACATGGCACGAGCACGACTGACTGCCATGCAGGCGCACCACTGCAGCTCGGAGGGAGCGCTCTCCAGTCTCGAGGAACAAATTGGCGACAAGGACAAGCAGATGAACATGCTGCGCGAGCAACGAGATCGTGCCGAAGCGGAGAAGATAGAGGAGCGTGAGCTGCATGAGCGTGAAATCGCCGAATACAAGATGAAACTGCACTCGTTCGAGAGCGAAGTCGACAAGCTTACGACGCGCTTGCATCGAGCACTCGCCGAGAAGGATCGGCTCGAGACGAGATTGGAGTCGGGCGATTTGGCCAAGTCGAGGAGCGGCCTTGACCGTGACCGCGAGCGAGAACGCGAGCGTGACCTATACGAAAGACGCGCTGCCAGCGAGTACGACTCGGGTCTCAGCCAGAAGATTCAGCGATTGGAGATCGAAAACGACCGACTGCGAGCTGAGCTCGAGAGGTCGCAGGCAACGTTCGGACGCACGACAGTGACGTCGTCGCACGAGTTGGACCGCGTGCAGGAGCGAGCCGACAAGACAATTGCAGAGCTGAGACGAACACAAGCCGAGCTTCGTGTCACGCAGAGCGATGCCGAGAGATCACGAGCAGAAAATGCCGCCCTACAAGAGAAGCTCGAGAAGAGTCAAGGAGAAGTCTACCGCCTTAAGGCACGATTGGAGAACGCACAACAAGAGCAGGACACGGTGAAACAAGAACTCGAACGTCACCAATCCACCGTGCAGCGCTACTACAGCGAGCGCGACAAGACCGTCAGTGAAATTGAGAAGCTTCGCGAGGAGATGGAACGAACACAAGCGACACTGGGCAAGTCACAGCTAACACAGGAGAAGCTCCAGAACTCACTCGATAAGGCCACAAGCGACGTCGAGCACTTGCAGGACAAACTTGAGAAGGCAACCGCTGAAATCAGACGCTTGACACTCGAGAAGGAGAAGCAGACGTACGAGTTCGAGAACGTTCAGTCGCAACTGGACAAGGCGCTCGGTCAGGCGGCTCGAATTCAGAAGGAGCGCGAGCAGATTCAAGTGGAAATGGAGCGCTACCGCGAGAAGTGCGAGAAGATTCAGAATCAGATGATTCGCGTGCAAAAGGAGCGCGACACATTCTTCGAGGAGCTCGAGAAGGTCAAGGAGCGCAACGAGGGCTCGCAGAGCCTGATGATGAAGGCACAGCGAGAGAAGGAGCAGCTGCAGACCGAGCTCGATGTGCTCAAAGAGCGCTGGGACAAGACGCACGCCATTCACCAGAAGCTACAGATGGAGCGCGACGACTCGATAACGGAAATCGGCATTCTGAAGGAGAAGCTCGACAAGGCGCTCTACGCCAGCCAGAAGCTCATCGACGAGAAAGAGTCGTCGACAAAGGAGTTTGAGAAAATCCTCGAGAAGTACGATAGGGCACAAAATGAAATCTATAGACTTCAATCGAGAGTCGACACAGCCGAGGCTGATCGCAATCGACTCGAAGTTGAAGCCGAGAGATCCACTTTGGCCGCTACGAAGGCTCGCGAGGATTTGCGAAAACTGCAGGAGGAGACGTCGCGCTTGCAGGAGGCCTGCGACCGGGCCGCCCTGCAACTCGGCCGCTCCAAGGAGCTCGAGGACAAGGCAAAGGAGGAGGTTGACTTGTACCGCGAGCGAGCCGACAAGTATCAGAACGACATGAGGAAGCTGCAGGCGGAGAAGGAGCACTTGATAGCGGAGGTCGAGAGACTCACGTACGAGGTCGATCGCAGTCACAATGCTCACACCAAACACAGTGCGGCCCTCGAGAGTGCACACGAGGAGGCCGCCCGACTCAACCTAGAATTAGAAAAAATGCGCGATCGTTACAATAAGGCGCAAGCGGAATTAAGTCACCTGCAGGAGACCGAGCAGTACTCGCGCGAAAACCGCCGTCTCAAGGAGGAGAACGAGCGGTTGCGCGAGCGCATGGACAAGATTATGCTCGAGCTGGAGCAGATTCGCGGCAAGTCGCAGTACGAGCAAGAGGCGTACGAGAAATTCAAGGAGAAGCTTCAGCAAAAGGAGAACGATTTGAGTGCCCTCGAGGCGAAACTGCACGAGACGACGCTGCAGCTCGAGCTGTCGCGTCAAGAGACACAGAAGTATGTTTCGAGTCAGGACAAGCAGCGCAACGACCTCGAGCGAACGCATATTGAGTATGAGAAGCTCAAAGACAAGTACGAGCGCGTTGTGAAGGAAATGGAGCGAATTAACACCGGCGGCGGTGCTGGCAAGTTGAACACCATTAGTCCTAGCCAATCGATGATTCAGTCAATGAATCCCGCCGACAGAAGCGAGGTCGAGAGACTTCGTGAGCGCCTAGAAAAGACCCTTCAGCAACGAGACGCGACAGAGCTCGAAGCCGGCCGACTGGCGAAGGAGCTCGAGAAAGCACAAATTCACATGACGAAGCAGCAAGAAGCCTACGAATCGACGCGCATAGAGTTCGAACGAATGTCTGCCGAACTCAACCGTGTACTGGAACTGCTGGAGAAGTCCGAGGCCGAGAAGGAGGCCATGCGTCACAACGCCAAAGTATACGAGAAGCGAGACATGCACGCGGGTCAGATCGAGAAGAACATGATGAAGATGGAGTCGGACGTGAAGCAGTTGACGGCCGAGCGAGACCAGCTCGTCGTGCAGCTCGAAAAGAGCCAGGAGATGCTAATGAACTTCCAGAAAGAACTGACCGGTGCTGAGCAGGAGCTTCAGCGATTGCGTCAAGAAAATGCCAACCTGCGAAGTAGAAGTCCCCAGCAAGTGCAGCAGATGCAGCAGGAAATTATGCGGCTTCAGCAACTGCTTCAAGAGCAGCAGCAGAGAGGCGCAGCAAACGGCGGAAACGCAGCCGAACTCGAGCAGTGGCGCAAAGTCATCGAGCAAGAAAAGGCGCGCGCCGACCAAGCCGAGCTGGCCATTCAGGAGTTCCAGAAGCACCACCAGGCAATGGACAAGCAATTGCAGCAGCAGAACCAACAGATGCAGACACTTCAGCAGCAGATTCAGCAACAAAACCAGACAGTTCAGCAGCAACAGCAGCAGATTCAGACGCAGCAGCAGCAGCTGCAAGCCAGCCAAAAGCAGCAGCCGAATCAACAGAACAATGCGCAAATCAATCAGCAGCTCGAGCAACTCAAGAAGGAGCTCACGGCGTCGAACACCGAACGAGACCGATTCCAGTCGCAGCTCGAGATGCTCGTGCAGGAGCTCGAAAAGAGTCAGATCGAGCTGCTCGAGGCCAACAAGAAACTGCAGTCGATGCAGCAGAACAACGTCGGCCACCAGCAGCAGGATGAGCAGACGAGGAAGCAACTCGAGATGCAGATCAAACAGATCGAAGACGCAAAGGCGCAACTCGACAACGAGAGGAAACTCGTCGACGATCAGAAGAAATCGATCGAGAAGAAGCGAAAGGAGGTTGAAGGAAAGGAGCAGAAGATGGCTGAGCTCAATCAGCAACTCAATAAGCGCAAGGAGCAGATGGATCAGCTCGAGAAGTCATTGCAGAAAGCGGGCGGATCAGCGGCCGCCACTACAGAGCTTAGCAAGAAACTCGCCGACACTCAACAGCAACTCGAAACCGTTGTCAAGCAACTCGAAGCAGCGAACGAAGAGTCGAAGAGGGCGGCAGCCGAAACTGAGAGGTTGCTTCAACTCGTACAAATGTCACAAGAAGAGCAAAACTCCAAGGAGAAGACAATCATGGACCTGCAACAAGCTCTGAAGAATGCTCAAGCTAAACTCAAAGCACAGCAACAAGCACAAGCAGAGGCATCAAATCCCGCTCAACAGGCTGCTGGCTTCCTCAAAAGCTTTTTC
Protein: 2151 (aa)
MSREAYNSLPNIGTRGSPGNRARRLTELPTVDKSPSRDFQKGSPLGHLRYTRFSRHNYGPYHSSLYYPHHSIAAMSSPYYHRDMEDPISPALPGHHRSRSASRPPVSHTMDYPRRYQSLDRGGFADPHDREFVPIREPRDRSRDRSLERGLYLEEELYGRSARQSPNPMMGPDRGYLGDLQYVNSDLQRELGNLKKELELTNQKLSSSMHSIKTFWSPELKKERALRKEESAKYSLINDQLKLLNNENQKQAMLVRQLEEELRMRMRQPSLEVQQQIETLYAENEHLTREIAILRDTIKELELRLETQKQTLLARDESIKKLLEMLQNKGMGKEEERAMFQQMQAMAQKQMFTAYPSERSNFYGGPSSATSNTAPSAAPSSSLYSSLPPIPQPQPATPTAMLPPIPQPAAAAAGASNLMSTNTSSYIGLNRMSPLPMRRYSYSGVPSSTSFADLNMNFDSTMVRDSSITNLINETCQSISRSSNILNRQLSADMALSSMSLTNLIDENIESDIVKYDLLNEITTTAALPPLKYADTYYSHPSKHVSAITAPTSYFNRNVSHMLSSDDYLLSKPIFSKSYVNPMSSSYISDDHHHHYNNSSFYDTQPPHQIHHQLPQIPPTHHSYMTPSHYHYHYSQPHSHHHNLRHHHYASNPCLSSHSYHNASSSSPYSQPLNPNTIAFNRNYNYPSNHQQAFASSRPPSISSLNRYAALTNNYQNSFNAQPYYRTSSRIDLDNPKGSEIKRQLDDFRLEIQRRDQEIMSMAAKMKTLEEQHQDYQRHISVLKESLCAKEEHYNMLQSDVEELRNRLEEKNRIIEKHTQGSLQSNQERNRAQQEVVELKEHLDIKERKISVLQRKIENLEDLLKEKDNQVDMARARLTAMQAHHCSSEGALSSLEEQIGDKDKQMNMLREQRDRAEAEKIEERELHEREIAEYKMKLHSFESEVDKLTTRLHRALAEKDRLETRLESGDLAKSRSGLDRDRERERERDLYERRAASEYDSGLSQKIQRLEIENDRLRAELERSQATFGRTTVTSSHELDRVQERADKTIAELRRTQAELRVTQSDAERSRAENAALQEKLEKSQGEVYRLKARLENAQQEQDTVKQELERHQSTVQRYYSERDKTVSEIEKLREEMERTQATLGKSQLTQEKLQNSLDKATSDVEHLQDKLEKATAEIRRLTLEKEKQTYEFENVQSQLDKALGQAARIQKEREQIQVEMERYREKCEKIQNQMIRVQKERDTFFEELEKVKERNEGSQSLMMKAQREKEQLQTELDVLKERWDKTHAIHQKLQMERDDSITEIGILKEKLDKALYASQKLIDEKESSTKEFEKILEKYDRAQNEIYRLQSRVDTAEADRNRLEVEAERSTLAATKAREDLRKLQEETSRLQEACDRAALQLGRSKELEDKAKEEVDLYRERADKYQNDMRKLQAEKEHLIAEVERLTYEVDRSHNAHTKHSAALESAHEEAARLNLELEKMRDRYNKAQAELSHLQETEQYSRENRRLKEENERLRERMDKIMLELEQIRGKSQYEQEAYEKFKEKLQQKENDLSALEAKLHETTLQLELSRQETQKYVSSQDKQRNDLERTHIEYEKLKDKYERVVKEMERINTGGGAGKLNTISPSQSMIQSMNPADRSEVERLRERLEKTLQQRDATELEAGRLAKELEKAQIHMTKQQEAYESTRIEFERMSAELNRVLELLEKSEAEKEAMRHNAKVYEKRDMHAGQIEKNMMKMESDVKQLTAERDQLVVQLEKSQEMLMNFQKELTGAEQELQRLRQENANLRSRSPQQVQQMQQEIMRLQQLLQEQQQRGAANGGNAAELEQWRKVIEQEKARADQAELAIQEFQKHHQAMDKQLQQQNQQMQTLQQQIQQQNQTVQQQQQQIQTQQQQLQASQKQQPNQQNNAQINQQLEQLKKELTASNTERDRFQSQLEMLVQELEKSQIELLEANKKLQSMQQNNVGHQQQDEQTRKQLEMQIKQIEDAKAQLDNERKLVDDQKKSIEKKRKEVEGKEQKMAELNQQLNKRKEQMDQLEKSLQKAGGSAAATTELSKKLADTQQQLETVVKQLEAANEESKRAAAETERLLQLVQMSQEEQNSKEKTIMDLQQALKNAQAKLKAQQQAQAEASNPAQQAAGFLKSFF
Type | Start | End | Length |
CDS |
38534 |
38581 |
48 |
CDS |
38659 |
38713 |
55 |
CDS |
38777 |
38911 |
135 |
CDS |
39187 |
39378 |
192 |
CDS |
39443 |
40096 |
654 |
CDS |
40162 |
40559 |
398 |
CDS |
40779 |
42206 |
1428 |
CDS |
42274 |
42624 |
351 |
CDS |
44399 |
44515 |
117 |
CDS |
44580 |
44925 |
346 |
CDS |
45029 |
45357 |
329 |
CDS |
45459 |
45536 |
78 |
CDS |
45623 |
45712 |
90 |
CDS |
49254 |
50435 |
1182 |
CDS |
56240 |
56295 |
56 |
CDS |
57246 |
57342 |
97 |
CDS |
57458 |
57846 |
389 |
CDS |
57914 |
58081 |
168 |
CDS |
58166 |
58369 |
204 |
CDS |
60835 |
60970 |
136 |
intron |
38582 |
38658 |
77 |
intron |
38714 |
38776 |
63 |
intron |
38912 |
39186 |
275 |
intron |
39379 |
39442 |
64 |
intron |
40097 |
40161 |
65 |
intron |
40560 |
40778 |
219 |
intron |
42207 |
42273 |
67 |
intron |
42625 |
44398 |
1774 |
intron |
44516 |
44579 |
64 |
intron |
44926 |
45028 |
103 |
intron |
45358 |
45458 |
101 |
intron |
45537 |
45622 |
86 |
intron |
45713 |
49253 |
3541 |
intron |
50436 |
56239 |
5804 |
intron |
56296 |
57245 |
950 |
intron |
57343 |
57457 |
115 |
intron |
57847 |
57913 |
67 |
intron |
58082 |
58165 |
84 |
intron |
58370 |
60834 |
2465 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001867912 |
bruchpilot [Culex quinquefasciatus] gb|EDS45873.1| bruchpilot [Culex quinquefasciatus] |
0.0 |
InterPro |
IPR009053 |
Prefoldin |
|
InterPro |
IPR019323 |
CAZ complex, RIM-binding protein |
|
Pfam |
PF05791.6 |
Bacillus haemolytic enterotoxin (HBL) |
0.0023 |
Pfam |
PF10174.4 |
RIM-binding protein of the cytomatrix active zone |
2.5e-241 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
B. mori |
BGIBMGA013779-TA |
A. gambiae |
AGAP010286 |
P. humanus |
PHUM380940-PA |
S. invicta |
SI2.2.0_04295 |
D. plexippus |
DPOGS211537PA |
A. aegypti |
AAEL008273 |
T. castaneum |
TC030684 |
S. invicta |
SI2.2.0_06696 |
H. melpomene |
HMEL003122-PA |
N. vitripennis |
NV10238-PA |
A. gambiae |
AGAP010285 |
A. mellifera |
GB19596-PA |
P. vanderplanki |
Pv.04095 |
C. quinquefasciatus |
CPIJ017714 |
D. melanogaster |
FBgn0259246 |