MidgeBase gene description page [Pn.10201]

Outline

Link to gbrowse

Gene ID Pn.10201
Type Protein coding gene
Scaffold PnScaf10994
Start 38531
End 60970
Direction -

Sequence

Transcript: 6453 (bp)

 ATGTCGCGAGAAGCCTACAATTCACTGCCTAATATTGGAACTCGCGGAAGTCCAGGAAATCGTGCCAGAAGACTCACAGAATTGCCAACTGTCGATAAGTCACCCTCACGTGATTTTCAGAAAGGAAGTCCTTTAGGACATTTGCGATATACCAGATTTTCAAGACACAATTACGGTCCGTACCATTCATCACTTTATTATCCACATCATTCTATTGCAGCCATGAGCAGCCCGTATTATCACCGTGACATGGAGGACCCCATAAGTCCAGCGTTGCCGGGACACCATCGCAGTCGAAGCGCAAGTCGACCGCCTGTCTCGCACACGATGGATTATCCAAGACGCTATCAGTCATTGGATCGCGGCGGATTTGCCGATCCACACGACCGCGAATTCGTGCCGATACGAGAACCTCGAGACCGCTCTCGCGACCGCTCACTCGAGCGCGGTCTCTACCTTGAGGAGGAGCTTTACGGACGATCTGCACGACAAAGCCCCAACCCCATGATGGGACCGGACCGAGGCTACTTGGGCGACCTGCAGTACGTCAACAGCGACCTGCAGCGCGAGCTCGGCAACCTCAAGAAGGAACTAGAATTAACCAATCAGAAGCTTAGCAGTTCGATGCACAGTATTAAGACATTCTGGTCGCCGGAGCTGAAGAAGGAGCGGGCGCTGCGCAAGGAGGAGAGCGCCAAGTACAGCCTCATCAACGACCAGCTGAAGCTGCTGAACAACGAGAACCAGAAGCAGGCGATGCTGGTGCGTCAGCTCGAGGAGGAGCTCCGCATGCGAATGCGGCAGCCGAGTCTCGAGGTGCAGCAGCAAATCGAAACGCTCTATGCGGAAAATGAGCACTTGACACGAGAAATAGCAATACTGCGCGACACGATAAAAGAGCTCGAGTTACGCTTGGAGACACAGAAGCAGACGCTCCTGGCCCGCGACGAAAGCATCAAAAAGCTCTTAGAGATGCTGCAGAACAAGGGAATGGGCAAGGAAGAGGAGCGCGCAATGTTCCAGCAAATGCAAGCCATGGCACAAAAACAGATGTTCACTGCCTATCCATCTGAGCGATCAAATTTTTACGGCGGGCCAAGCTCAGCGACGTCTAATACAGCGCCATCTGCTGCTCCATCGAGCAGTCTCTATTCCTCATTACCGCCTATTCCGCAGCCACAGCCAGCGACGCCTACCGCAATGTTGCCGCCCATTCCGCAGCCGGCCGCCGCCGCAGCCGGAGCCAGCAACCTCATGTCAACCAACACCTCCTCGTACATCGGACTGAACCGAATGAGTCCGCTGCCGATGCGCCGCTACAGCTATAGCGGCGTGCCGTCGTCGACCTCGTTCGCCGATCTGAACATGAATTTCGATAGCACAATGGTGCGCGACAGCAGCATAACGAATCTAATTAACGAAACGTGTCAGTCAATCTCGAGATCGTCGAACATTCTGAACCGACAATTGTCGGCCGATATGGCTCTGAGTAGCATGAGTCTTACGAATTTAATCGATGAGAATATTGAGAGCGATATTGTAAAATATGATTTGTTAAATGAAATTACGACGACGGCCGCGTTGCCGCCGCTCAAGTACGCCGACACATACTACTCGCATCCGAGCAAGCATGTGTCGGCCATTACTGCACCTACTTCCTACTTTAATCGCAATGTCTCACACATGCTCAGCAGCGACGACTATCTGTTGTCGAAGCCGATCTTTAGCAAGAGCTATGTTAATCCGATGTCATCATCATACATAAGTGACGATCACCATCATCACTATAATAACAGTAGTTTCTATGACACACAACCGCCACACCAAATACACCACCAGCTACCGCAAATACCGCCGACACACCACTCCTACATGACACCGTCACACTACCACTATCACTACTCACAGCCACACAGCCATCATCATAATTTGCGTCATCATCATTATGCATCGAATCCGTGTCTCTCGTCGCACTCATATCATAATGCCTCCTCCTCCTCTCCCTACTCGCAGCCCCTCAATCCCAACACCATAGCATTTAATCGTAATTATAATTATCCATCTAATCATCAGCAAGCGTTCGCGTCGTCACGTCCGCCATCCATCTCGTCGCTCAATCGCTACGCGGCCCTTACGAATAACTATCAAAACAGTTTCAATGCTCAGCCCTATTATCGGACATCGTCGCGAATCGATCTTGACAATCCCAAGGGCAGCGAGATCAAGCGACAACTGGATGATTTCCGTCTAGAGATACAGAGACGCGACCAAGAAATTATGAGCATGGCGGCGAAGATGAAAACACTCGAGGAGCAGCACCAGGATTATCAGCGTCATATTTCAGTGCTGAAGGAGTCTTTGTGCGCGAAGGAAGAGCACTACAACATGCTTCAGTCCGACGTGGAAGAATTGAGAAATCGCTTGGAGGAGAAGAACCGAATAATTGAGAAGCACACGCAGGGTTCGCTGCAGTCCAATCAGGAGCGAAATCGCGCCCAGCAAGAGGTGGTCGAGCTGAAGGAGCATCTCGACATCAAGGAGCGCAAAATTAGCGTTCTTCAGCGAAAAATCGAGAACCTCGAGGACCTGCTGAAGGAGAAGGACAATCAAGTGGACATGGCACGAGCACGACTGACTGCCATGCAGGCGCACCACTGCAGCTCGGAGGGAGCGCTCTCCAGTCTCGAGGAACAAATTGGCGACAAGGACAAGCAGATGAACATGCTGCGCGAGCAACGAGATCGTGCCGAAGCGGAGAAGATAGAGGAGCGTGAGCTGCATGAGCGTGAAATCGCCGAATACAAGATGAAACTGCACTCGTTCGAGAGCGAAGTCGACAAGCTTACGACGCGCTTGCATCGAGCACTCGCCGAGAAGGATCGGCTCGAGACGAGATTGGAGTCGGGCGATTTGGCCAAGTCGAGGAGCGGCCTTGACCGTGACCGCGAGCGAGAACGCGAGCGTGACCTATACGAAAGACGCGCTGCCAGCGAGTACGACTCGGGTCTCAGCCAGAAGATTCAGCGATTGGAGATCGAAAACGACCGACTGCGAGCTGAGCTCGAGAGGTCGCAGGCAACGTTCGGACGCACGACAGTGACGTCGTCGCACGAGTTGGACCGCGTGCAGGAGCGAGCCGACAAGACAATTGCAGAGCTGAGACGAACACAAGCCGAGCTTCGTGTCACGCAGAGCGATGCCGAGAGATCACGAGCAGAAAATGCCGCCCTACAAGAGAAGCTCGAGAAGAGTCAAGGAGAAGTCTACCGCCTTAAGGCACGATTGGAGAACGCACAACAAGAGCAGGACACGGTGAAACAAGAACTCGAACGTCACCAATCCACCGTGCAGCGCTACTACAGCGAGCGCGACAAGACCGTCAGTGAAATTGAGAAGCTTCGCGAGGAGATGGAACGAACACAAGCGACACTGGGCAAGTCACAGCTAACACAGGAGAAGCTCCAGAACTCACTCGATAAGGCCACAAGCGACGTCGAGCACTTGCAGGACAAACTTGAGAAGGCAACCGCTGAAATCAGACGCTTGACACTCGAGAAGGAGAAGCAGACGTACGAGTTCGAGAACGTTCAGTCGCAACTGGACAAGGCGCTCGGTCAGGCGGCTCGAATTCAGAAGGAGCGCGAGCAGATTCAAGTGGAAATGGAGCGCTACCGCGAGAAGTGCGAGAAGATTCAGAATCAGATGATTCGCGTGCAAAAGGAGCGCGACACATTCTTCGAGGAGCTCGAGAAGGTCAAGGAGCGCAACGAGGGCTCGCAGAGCCTGATGATGAAGGCACAGCGAGAGAAGGAGCAGCTGCAGACCGAGCTCGATGTGCTCAAAGAGCGCTGGGACAAGACGCACGCCATTCACCAGAAGCTACAGATGGAGCGCGACGACTCGATAACGGAAATCGGCATTCTGAAGGAGAAGCTCGACAAGGCGCTCTACGCCAGCCAGAAGCTCATCGACGAGAAAGAGTCGTCGACAAAGGAGTTTGAGAAAATCCTCGAGAAGTACGATAGGGCACAAAATGAAATCTATAGACTTCAATCGAGAGTCGACACAGCCGAGGCTGATCGCAATCGACTCGAAGTTGAAGCCGAGAGATCCACTTTGGCCGCTACGAAGGCTCGCGAGGATTTGCGAAAACTGCAGGAGGAGACGTCGCGCTTGCAGGAGGCCTGCGACCGGGCCGCCCTGCAACTCGGCCGCTCCAAGGAGCTCGAGGACAAGGCAAAGGAGGAGGTTGACTTGTACCGCGAGCGAGCCGACAAGTATCAGAACGACATGAGGAAGCTGCAGGCGGAGAAGGAGCACTTGATAGCGGAGGTCGAGAGACTCACGTACGAGGTCGATCGCAGTCACAATGCTCACACCAAACACAGTGCGGCCCTCGAGAGTGCACACGAGGAGGCCGCCCGACTCAACCTAGAATTAGAAAAAATGCGCGATCGTTACAATAAGGCGCAAGCGGAATTAAGTCACCTGCAGGAGACCGAGCAGTACTCGCGCGAAAACCGCCGTCTCAAGGAGGAGAACGAGCGGTTGCGCGAGCGCATGGACAAGATTATGCTCGAGCTGGAGCAGATTCGCGGCAAGTCGCAGTACGAGCAAGAGGCGTACGAGAAATTCAAGGAGAAGCTTCAGCAAAAGGAGAACGATTTGAGTGCCCTCGAGGCGAAACTGCACGAGACGACGCTGCAGCTCGAGCTGTCGCGTCAAGAGACACAGAAGTATGTTTCGAGTCAGGACAAGCAGCGCAACGACCTCGAGCGAACGCATATTGAGTATGAGAAGCTCAAAGACAAGTACGAGCGCGTTGTGAAGGAAATGGAGCGAATTAACACCGGCGGCGGTGCTGGCAAGTTGAACACCATTAGTCCTAGCCAATCGATGATTCAGTCAATGAATCCCGCCGACAGAAGCGAGGTCGAGAGACTTCGTGAGCGCCTAGAAAAGACCCTTCAGCAACGAGACGCGACAGAGCTCGAAGCCGGCCGACTGGCGAAGGAGCTCGAGAAAGCACAAATTCACATGACGAAGCAGCAAGAAGCCTACGAATCGACGCGCATAGAGTTCGAACGAATGTCTGCCGAACTCAACCGTGTACTGGAACTGCTGGAGAAGTCCGAGGCCGAGAAGGAGGCCATGCGTCACAACGCCAAAGTATACGAGAAGCGAGACATGCACGCGGGTCAGATCGAGAAGAACATGATGAAGATGGAGTCGGACGTGAAGCAGTTGACGGCCGAGCGAGACCAGCTCGTCGTGCAGCTCGAAAAGAGCCAGGAGATGCTAATGAACTTCCAGAAAGAACTGACCGGTGCTGAGCAGGAGCTTCAGCGATTGCGTCAAGAAAATGCCAACCTGCGAAGTAGAAGTCCCCAGCAAGTGCAGCAGATGCAGCAGGAAATTATGCGGCTTCAGCAACTGCTTCAAGAGCAGCAGCAGAGAGGCGCAGCAAACGGCGGAAACGCAGCCGAACTCGAGCAGTGGCGCAAAGTCATCGAGCAAGAAAAGGCGCGCGCCGACCAAGCCGAGCTGGCCATTCAGGAGTTCCAGAAGCACCACCAGGCAATGGACAAGCAATTGCAGCAGCAGAACCAACAGATGCAGACACTTCAGCAGCAGATTCAGCAACAAAACCAGACAGTTCAGCAGCAACAGCAGCAGATTCAGACGCAGCAGCAGCAGCTGCAAGCCAGCCAAAAGCAGCAGCCGAATCAACAGAACAATGCGCAAATCAATCAGCAGCTCGAGCAACTCAAGAAGGAGCTCACGGCGTCGAACACCGAACGAGACCGATTCCAGTCGCAGCTCGAGATGCTCGTGCAGGAGCTCGAAAAGAGTCAGATCGAGCTGCTCGAGGCCAACAAGAAACTGCAGTCGATGCAGCAGAACAACGTCGGCCACCAGCAGCAGGATGAGCAGACGAGGAAGCAACTCGAGATGCAGATCAAACAGATCGAAGACGCAAAGGCGCAACTCGACAACGAGAGGAAACTCGTCGACGATCAGAAGAAATCGATCGAGAAGAAGCGAAAGGAGGTTGAAGGAAAGGAGCAGAAGATGGCTGAGCTCAATCAGCAACTCAATAAGCGCAAGGAGCAGATGGATCAGCTCGAGAAGTCATTGCAGAAAGCGGGCGGATCAGCGGCCGCCACTACAGAGCTTAGCAAGAAACTCGCCGACACTCAACAGCAACTCGAAACCGTTGTCAAGCAACTCGAAGCAGCGAACGAAGAGTCGAAGAGGGCGGCAGCCGAAACTGAGAGGTTGCTTCAACTCGTACAAATGTCACAAGAAGAGCAAAACTCCAAGGAGAAGACAATCATGGACCTGCAACAAGCTCTGAAGAATGCTCAAGCTAAACTCAAAGCACAGCAACAAGCACAAGCAGAGGCATCAAATCCCGCTCAACAGGCTGCTGGCTTCCTCAAAAGCTTTTTC 

Protein: 2151 (aa)

 MSREAYNSLPNIGTRGSPGNRARRLTELPTVDKSPSRDFQKGSPLGHLRYTRFSRHNYGPYHSSLYYPHHSIAAMSSPYYHRDMEDPISPALPGHHRSRSASRPPVSHTMDYPRRYQSLDRGGFADPHDREFVPIREPRDRSRDRSLERGLYLEEELYGRSARQSPNPMMGPDRGYLGDLQYVNSDLQRELGNLKKELELTNQKLSSSMHSIKTFWSPELKKERALRKEESAKYSLINDQLKLLNNENQKQAMLVRQLEEELRMRMRQPSLEVQQQIETLYAENEHLTREIAILRDTIKELELRLETQKQTLLARDESIKKLLEMLQNKGMGKEEERAMFQQMQAMAQKQMFTAYPSERSNFYGGPSSATSNTAPSAAPSSSLYSSLPPIPQPQPATPTAMLPPIPQPAAAAAGASNLMSTNTSSYIGLNRMSPLPMRRYSYSGVPSSTSFADLNMNFDSTMVRDSSITNLINETCQSISRSSNILNRQLSADMALSSMSLTNLIDENIESDIVKYDLLNEITTTAALPPLKYADTYYSHPSKHVSAITAPTSYFNRNVSHMLSSDDYLLSKPIFSKSYVNPMSSSYISDDHHHHYNNSSFYDTQPPHQIHHQLPQIPPTHHSYMTPSHYHYHYSQPHSHHHNLRHHHYASNPCLSSHSYHNASSSSPYSQPLNPNTIAFNRNYNYPSNHQQAFASSRPPSISSLNRYAALTNNYQNSFNAQPYYRTSSRIDLDNPKGSEIKRQLDDFRLEIQRRDQEIMSMAAKMKTLEEQHQDYQRHISVLKESLCAKEEHYNMLQSDVEELRNRLEEKNRIIEKHTQGSLQSNQERNRAQQEVVELKEHLDIKERKISVLQRKIENLEDLLKEKDNQVDMARARLTAMQAHHCSSEGALSSLEEQIGDKDKQMNMLREQRDRAEAEKIEERELHEREIAEYKMKLHSFESEVDKLTTRLHRALAEKDRLETRLESGDLAKSRSGLDRDRERERERDLYERRAASEYDSGLSQKIQRLEIENDRLRAELERSQATFGRTTVTSSHELDRVQERADKTIAELRRTQAELRVTQSDAERSRAENAALQEKLEKSQGEVYRLKARLENAQQEQDTVKQELERHQSTVQRYYSERDKTVSEIEKLREEMERTQATLGKSQLTQEKLQNSLDKATSDVEHLQDKLEKATAEIRRLTLEKEKQTYEFENVQSQLDKALGQAARIQKEREQIQVEMERYREKCEKIQNQMIRVQKERDTFFEELEKVKERNEGSQSLMMKAQREKEQLQTELDVLKERWDKTHAIHQKLQMERDDSITEIGILKEKLDKALYASQKLIDEKESSTKEFEKILEKYDRAQNEIYRLQSRVDTAEADRNRLEVEAERSTLAATKAREDLRKLQEETSRLQEACDRAALQLGRSKELEDKAKEEVDLYRERADKYQNDMRKLQAEKEHLIAEVERLTYEVDRSHNAHTKHSAALESAHEEAARLNLELEKMRDRYNKAQAELSHLQETEQYSRENRRLKEENERLRERMDKIMLELEQIRGKSQYEQEAYEKFKEKLQQKENDLSALEAKLHETTLQLELSRQETQKYVSSQDKQRNDLERTHIEYEKLKDKYERVVKEMERINTGGGAGKLNTISPSQSMIQSMNPADRSEVERLRERLEKTLQQRDATELEAGRLAKELEKAQIHMTKQQEAYESTRIEFERMSAELNRVLELLEKSEAEKEAMRHNAKVYEKRDMHAGQIEKNMMKMESDVKQLTAERDQLVVQLEKSQEMLMNFQKELTGAEQELQRLRQENANLRSRSPQQVQQMQQEIMRLQQLLQEQQQRGAANGGNAAELEQWRKVIEQEKARADQAELAIQEFQKHHQAMDKQLQQQNQQMQTLQQQIQQQNQTVQQQQQQIQTQQQQLQASQKQQPNQQNNAQINQQLEQLKKELTASNTERDRFQSQLEMLVQELEKSQIELLEANKKLQSMQQNNVGHQQQDEQTRKQLEMQIKQIEDAKAQLDNERKLVDDQKKSIEKKRKEVEGKEQKMAELNQQLNKRKEQMDQLEKSLQKAGGSAAATTELSKKLADTQQQLETVVKQLEAANEESKRAAAETERLLQLVQMSQEEQNSKEKTIMDLQQALKNAQAKLKAQQQAQAEASNPAQQAAGFLKSFF 
Type Start End Length
CDS 38534 38581 48
CDS 38659 38713 55
CDS 38777 38911 135
CDS 39187 39378 192
CDS 39443 40096 654
CDS 40162 40559 398
CDS 40779 42206 1428
CDS 42274 42624 351
CDS 44399 44515 117
CDS 44580 44925 346
CDS 45029 45357 329
CDS 45459 45536 78
CDS 45623 45712 90
CDS 49254 50435 1182
CDS 56240 56295 56
CDS 57246 57342 97
CDS 57458 57846 389
CDS 57914 58081 168
CDS 58166 58369 204
CDS 60835 60970 136
intron 38582 38658 77
intron 38714 38776 63
intron 38912 39186 275
intron 39379 39442 64
intron 40097 40161 65
intron 40560 40778 219
intron 42207 42273 67
intron 42625 44398 1774
intron 44516 44579 64
intron 44926 45028 103
intron 45358 45458 101
intron 45537 45622 86
intron 45713 49253 3541
intron 50436 56239 5804
intron 56296 57245 950
intron 57343 57457 115
intron 57847 57913 67
intron 58082 58165 84
intron 58370 60834 2465

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001867912 bruchpilot [Culex quinquefasciatus] gb|EDS45873.1| bruchpilot [Culex quinquefasciatus] 0.0
InterPro IPR009053 Prefoldin
InterPro IPR019323 CAZ complex, RIM-binding protein
Pfam PF05791.6 Bacillus haemolytic enterotoxin (HBL) 0.0023
Pfam PF10174.4 RIM-binding protein of the cytomatrix active zone 2.5e-241

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
B. mori BGIBMGA013779-TA
A. gambiae AGAP010286
P. humanus PHUM380940-PA
S. invicta SI2.2.0_04295
D. plexippus DPOGS211537PA
A. aegypti AAEL008273
T. castaneum TC030684
S. invicta SI2.2.0_06696
H. melpomene HMEL003122-PA
N. vitripennis NV10238-PA
A. gambiae AGAP010285
A. mellifera GB19596-PA
P. vanderplanki Pv.04095
C. quinquefasciatus CPIJ017714
D. melanogaster FBgn0259246