MidgeBase gene description page [Pn.10906]

Outline

Link to gbrowse

Gene ID Pn.10906
Type Protein coding gene
Scaffold PnScaf12389
Start 9017
End 15692
Direction -

Sequence

Transcript: 6000 (bp)

 ATGTTCAAAGTGCCAATTATCATTCTAATTTTCTCTGTTCTGAGAGGAGCGGAGTTAAAGACTTTTACCAAATGCGAGATCGCAGAAGAGTTGATAAGAAACCAAAATTTGTCAGTGGCCGAGGCCAAAAAGCACTTGTGCATCATTGGCAGTGCGAATGATACAAATCTGGTCGAGGATGTCTATTTGGGAATTTACAAGATCAGCTCGCAATGGTGGTGTGGAATGAACGGCAAGCGCGCAGGAAATTGCAATTTGAAGTGCGAGCAGCTGCACGATGACGACATCAGCGACGACGTGGTGTGCGCAAGAAAAATTCTGCTCGACTTTGGAACGGAAGGATGGGGACTTGACAAGAGAAAATGTGAACAGGTGTTGAGTGAAATTAACACGACCTGTTTTCTAGATGATTTTTTACGCGAGAAAAAAGAAGCACCGACAACAACAACAGCGGCCCCCGAAATAGCGGCGAGCTTCGAACAACACGAAACTGATCAGAACGTATTTTTCAATATTTTCTCATTCAATATTAACGGAAGCAATCACAAGATCGAATACCATGTCGCATGTGGAGCGGACGCAAAGAGCTACACGCAGTGTGAGCTTTCCGAAGAGCTTTTAACGGTACACAATGTGACAATTAACGATGCACGCAACCTCGTTTGCATTGCGCAGAAATTTTCGCGTCTCAACACGAACGTCGTCGTCGGCGAGCGCTATGGAATTTTCCAAATCAACGAAAAGTTTTGTGGACACAAGAAGGCGGGCGGAGAGTGCAAAATAAAGTGTCATAGTTTGTTGGATGACGACATAAGCGATGATGCAAGGTGTGCTCAAGAGGTCATCGCAAGGCTGGGCTTAAGCGTGGCATGGCGCATCGAGAAGACGCAAGGCTGCCAGAAGGAGCTGAAAAATCTCGAAGAGAATTGCTCGCTCGGTGACCACGGAAATAATCAAAACAATAACAATAATGATGATGCCGCTAATCGCGACCATTGTGAATACGCTAGAAAATTGGTAGCGTCGTATGGCATTTCTAAGTTAGACGCTGTGACATGGTCATGTATCGCAAAGCATCGCTCGCCGAAAACTGGAAATGTGAACTCGAAACTGGCGGCCGGAGGCGATGAAGATGCGGCATGTGTCTTCAAACATGACGTAGAGTGCTCAATAAAAAATCAAAAACTTCGCAGTTCGAGTGGGTTCACGATTTGGCCAGAATACGAAGAGTTTTGCGCGAATCTCTCTGAAAATGACGAAATCGCGAAGAAGTGCTTCTCCTTCGACGAGAGCGAAAGTGATGACAGAAAAACGTTGAAAATCTTCGCGAGCCCGACAACAGAGCGACAGCACTTTACAACATCGACTGATATTTCGGTCATCGTTGAGGAAGACGCGCCAAAAGTGACGACCGAGGCTCAGCCCACAACGGAATTTCTGTCGAAGACGATTGAAAAGCCCGCCAGCGACGAGCAGGACATCATGGTGGAGGGCTACTCGAAGGTCAGAGTGATGTTTTCCGATGCGAGCGTCATCGAGCCAACGGAAGAACTGACGACGACGGCGGAGAAAATTTCAGAGGAAATTTTCAAGGAATCTCTAAAATGTCACTTCACGCGAAACTTTGTTCAGTCGCGCAGCATTCCTCAAAACCTGATTGCCACGTTTGTTTGCATCGCGGAGCATGAGTCACACTTTAATGTCTCATTGATCACCACGAGCGAGAAGTCGCGAAAATATGGACTTTTTCAGATCGACAACGCGCAATATTGTAATACAAACGAAAAAATTAACATCTGTGACACTCTCTGCGAGCACCTCACCGACGATCAGTACGACAATGATTTGGATTGCGCTTTAAAAGTCCATGAGACGGCCGGCTTCGATTATTGGCCGTCGTATGCGCAGCACTGTCAGCAAGTGAGCCAAAGTCTCGTCGAGGACTGCCATGAAACACACAGTACGACTCATCAACCTTACACCATTGTTAATTATGAGCTATTGAAGAAAAAAGTGCTAGAGGAATTAACATCAACAACAACAACGACATTCGTCGATGCTTTAAATGATGAAGAGCAGAGAATAGCGACAACTCCTGTTCTAGTGGGTGAATCACAATCGGTTGTGAGGCAATTTGATGTTTGCGAATTTTCGCAAGAACTTAATGTAAATCAACGCATTGAGCTAAACAAGCTGAATGATTTTGTTTGCATTGCTGACCTAATGACGAAGTTGCGAGTTCAAAAAAACACCGAAAGTACCGATAAAATTGGAATTTTTGGACTCAAATCTGACTCATGCGGTGAAAATGCGGAAATCGGTGGGAGATGCTCGGCCTCGTGCTTGAGTTTCTTCGATGAAAGTCTAACGGAAGTCGTTAAATGTGCATCTAAAATTTATGAAAGCGAATCATTGAGTTACTGGAATTTGACTGATGAAATTTGCAAACCATACAGCAATAAGATTCTCAAGTGCATTCACAAAGGAACCTTCGTTCCGACCGAAGGCATCGATTCTGACGATGAGGATGAGTTCGATGTGATAAACTTCGGAGACCTCAACGAGAGCTCCACAGAGTCCACAGCTGAGGAAAATTCGACAAAGATTGATGAAGAAGAAACTTCAACAGAGGAACTAACTACGAAGGGTGTAAACGACGAAATTTTCCATCAAAGCACAGCTGACCAAGAAAACCTCGATGAACTGCTGACAAATGTTGCATCGGATGATGCAGACGAAACTACTGCGCAGCCACTAGCAGCAGGGATAGATATAGATCAAGAAATCTTGGCACAACTCAACAAAACCTCTGAAAATATCTTAAAACATAACGAACAAAATTTAACGTCTCTGATTTCTGATTTTCTCGAATCTGTAGTGCAAACTTCTGAACTGCCTCAACAGCTTGACTCGGAAACTGTCTCAGAGCCTTCTGAGAGCGATGACGTGATAGAGGAAATTGCAGTAAAGTTGGAGGAAAAACTAAAAGTGAAGCAAGAGTTGACTGAAAACTCGAACGTTGGCCTCACTTTCGATGTAGACGAAACTACTGCGCAGCCACTAGCAACACAGATAGTTATAGAGGAAGAAATCTTGGCACAACTCAACAAAACCATTGAAAATGTCTTAAAACATTTTAAACAAAATTTTACGTCTGAAGAGGAAATTTCTGATTTCCTCGAATCTGAAGTGCAAACTTCTGAACTGCCTCAACAGCTTGACTCGGAAACTGTCTTGGAGCCTTCTGAGAGCGAAGACTTGATTGAGGAAATTACAGTAAAGTTGGAGGAAAAGCTAAAAGTGAAGCAAGAGTTGACTGAAAACTCAAACGTTGACCTCATTTTCGATGCAACGGAGAGAAGCGAAAGTAAGGAAACAAACGAGCAGAGCGATGAGTTGCTTAGCAGTTCCGTCAACCTGGATGTTTTGTCCTCCAAAGCCAATAAAACATTTGAGAGTGAACGCGATGAAAATTCGACACTCGTGAGCACTACGGAGTCGCTCTATTACACAAGTGTGCAGGAGATAATTGACGATTTTGAAGACAAGAATCTCATAAAACGCGAAACGACTCTTTTCCCGGAGAGCGGCGAGGACAATGAGACTACGACCGAAAATGACAATGATGTCGACGTCGTTTATGTCTTCAAACCTACTTACCCACCACCTAGCGAGGGCCATATCGAGAAGTGTGCGCTAGCTCGCTACATGCGGGAATCGACAAAAATACCCTTGAATCTCGTATCGCCGCTCATGTGCATCGCCGAGCATGAGTCAAACCTTAACATTTCGCTGATTCGCAGCGAAGGCGGCAAGACTCGTTACACCCGATATGGCCTCTTCCAAATTGACGACGTAAATTACTGCAACACCAATCAAAAAATCAACAAATGCGACGTCATTTGTCCTCATTTGGTCGACGATCAGCTGGACAACGATCTGCAGTGTGTCATGGACATTTATCGCAGGGAGGGCCTCGAATACTGGCCATCCTACAACAAACACTGCAAAGATGCAAAGTCAATCGACTACTTGAGCATGTGTCGTGAAGTCTTTACCACTACTCACTATCCATACACCTTCTACGACAGAGAGCTCGCCATCCAAAATATTTTCACGACAACGACCACGAGTGCAACCACAACTACTGTCCCGACCACAACCGTCTACATTCCAAAGGCTAATTATACTGATGACGATGGACGCTTCGTGATTAGGCGTTTCGACGATTGCGAATTGACAAACGAAATTTACCAAAACAATGTGACTATCGATGAAGTCTCGAAACTCGTGTGTGTTGGTGATCTGAAATCCGGCTTAAAAATAAGAAAGCCAAATGAAGACGAAATGCATTTTGGAATTTTTTCGATTAACAACACTTTTTGTGGAGAAGGCGGCGTCTGCGGCATCGAGTGCTCTCAACTGTTGGACGATGACATAACGGACGACATTTTGTGCGCCAAGATAATCCTCGAAAGCGACGAAGTCAGTGCCTGGAAATTGTCTGAAGAAGAATGCAAGCCATATGGCGCAAAATTACTGGAATGTGCCGATGAGGGCCACGTGCAAACGACCAGAGATCCAGAAAATGCAGAAATCACGCCTTGGTCGGGTTTCTCTTACACAACTGCCGCCTCGCCGATGACAAAGGAACCGTTTAGCTACAAGCCAAACAAGTATGTGTACTTCGAAGACTACTACACGCCTGCAACGACAAAGTACACGACACCACACGACCTTGGAGACTTGGACTTTTTGGCTGTTTTGACGTCTGAGAATCCTCTACCGACTCGCAGCACAACAACCGAACAAACCACTAGCACAACTACCAGCAGTGAAAAGCCTGTTACTCTTTCAGCGTTTCTCGAGCCGCCACATTTGAGCACAGCGTCCGACGACGACGAAATCGTAGAAGCGAAGACCGAGGCTTCGGTGAAACACATTGATAGCGAGCCGAATGCTTCAGAGGACATCGAATTGACCACCACTAGCATTACTGAGGAGGAATTGAAGCGAATAGTCAAAGATGTCATTGCACCGCCTCAGCAAGTTCTGTTTCTGCCGAAGGAAAAATTCGCCGATGAAATTGAGGAAGAATTCGAGATTGAAAAGCTAAGAAGTGAGGAGACGACAGAACTGTCCGAAACTGAAACTACAACCGTTACGTCGAGAATTGACACAAAAAACAATGAGGAATCAAATGAGGTGCCAGAAACTGGTGAATCGACCGAAAAACCTTTGCAAAAAGACCTAGAGAGCGGCAAAATAATTTCGTTGCATCAGTCGTTGCTTCCCCCGCACGATATCGTTGCGATCACGGAGGAGGAGACGGAAAAACCAGAGTTTAAGATTATCGTTGAAAGTGGAGAAATTAATTCAGCTGAGGATGTTGTGAAAGAGAAGGAGAGCTCCGAAGCAGTTACGCTGAGTTCTGAAAGTTTGCTGATGCATGAGAGTGTTGGCAGCGACGAACAGACGTCTGAAAGTTCATCGAAAAATGCAGACGAAGTGAGTACAACAGTGATTCCAGAAACAACACCATCATCGAGCACAATACCATACTTCCGACTGAAGTCACGAACCACTACGGAAGCTCCAGCGACGACACAAACTCACCAGGATTTTAAAGAACAAAATATTCAAAACTCAGCTGAAGAAGTAGAGATAACTTTGACGACTGCATCAACGGAAACGGCAATTGAAACAACTACAGAAAAAGAAGTCATCTCTTCGAGAATAATTGGAACTTTGATGACCGCAGCAACATCAGAAGCGCCAGAGTCAAGCGAGGATGACAATGATGAGCCGAAAATTCCGCAAATCATTCTGAACGACCAGAACCTAACGAATCAAGTTAAATCGATATTGAAGGCGCACTCTGGCACTAAAAACAGCACAAAGCCTATCGTCTTTAATATTTTCAATTTCAACATCGCTGGCCAGAATCACGAGATAAATTACGCGATAGATGGAAACAAACAAAAGCAGCCGCAGAAACCT 

Protein: 2000 (aa)

 MFKVPIIILIFSVLRGAELKTFTKCEIAEELIRNQNLSVAEAKKHLCIIGSANDTNLVEDVYLGIYKISSQWWCGMNGKRAGNCNLKCEQLHDDDISDDVVCARKILLDFGTEGWGLDKRKCEQVLSEINTTCFLDDFLREKKEAPTTTTAAPEIAASFEQHETDQNVFFNIFSFNINGSNHKIEYHVACGADAKSYTQCELSEELLTVHNVTINDARNLVCIAQKFSRLNTNVVVGERYGIFQINEKFCGHKKAGGECKIKCHSLLDDDISDDARCAQEVIARLGLSVAWRIEKTQGCQKELKNLEENCSLGDHGNNQNNNNNDDAANRDHCEYARKLVASYGISKLDAVTWSCIAKHRSPKTGNVNSKLAAGGDEDAACVFKHDVECSIKNQKLRSSSGFTIWPEYEEFCANLSENDEIAKKCFSFDESESDDRKTLKIFASPTTERQHFTTSTDISVIVEEDAPKVTTEAQPTTEFLSKTIEKPASDEQDIMVEGYSKVRVMFSDASVIEPTEELTTTAEKISEEIFKESLKCHFTRNFVQSRSIPQNLIATFVCIAEHESHFNVSLITTSEKSRKYGLFQIDNAQYCNTNEKINICDTLCEHLTDDQYDNDLDCALKVHETAGFDYWPSYAQHCQQVSQSLVEDCHETHSTTHQPYTIVNYELLKKKVLEELTSTTTTTFVDALNDEEQRIATTPVLVGESQSVVRQFDVCEFSQELNVNQRIELNKLNDFVCIADLMTKLRVQKNTESTDKIGIFGLKSDSCGENAEIGGRCSASCLSFFDESLTEVVKCASKIYESESLSYWNLTDEICKPYSNKILKCIHKGTFVPTEGIDSDDEDEFDVINFGDLNESSTESTAEENSTKIDEEETSTEELTTKGVNDEIFHQSTADQENLDELLTNVASDDADETTAQPLAAGIDIDQEILAQLNKTSENILKHNEQNLTSLISDFLESVVQTSELPQQLDSETVSEPSESDDVIEEIAVKLEEKLKVKQELTENSNVGLTFDVDETTAQPLATQIVIEEEILAQLNKTIENVLKHFKQNFTSEEEISDFLESEVQTSELPQQLDSETVLEPSESEDLIEEITVKLEEKLKVKQELTENSNVDLIFDATERSESKETNEQSDELLSSSVNLDVLSSKANKTFESERDENSTLVSTTESLYYTSVQEIIDDFEDKNLIKRETTLFPESGEDNETTTENDNDVDVVYVFKPTYPPPSEGHIEKCALARYMRESTKIPLNLVSPLMCIAEHESNLNISLIRSEGGKTRYTRYGLFQIDDVNYCNTNQKINKCDVICPHLVDDQLDNDLQCVMDIYRREGLEYWPSYNKHCKDAKSIDYLSMCREVFTTTHYPYTFYDRELAIQNIFTTTTTSATTTTVPTTTVYIPKANYTDDDGRFVIRRFDDCELTNEIYQNNVTIDEVSKLVCVGDLKSGLKIRKPNEDEMHFGIFSINNTFCGEGGVCGIECSQLLDDDITDDILCAKIILESDEVSAWKLSEEECKPYGAKLLECADEGHVQTTRDPENAEITPWSGFSYTTAASPMTKEPFSYKPNKYVYFEDYYTPATTKYTTPHDLGDLDFLAVLTSENPLPTRSTTTEQTTSTTTSSEKPVTLSAFLEPPHLSTASDDDEIVEAKTEASVKHIDSEPNASEDIELTTTSITEEELKRIVKDVIAPPQQVLFLPKEKFADEIEEEFEIEKLRSEETTELSETETTTVTSRIDTKNNEESNEVPETGESTEKPLQKDLESGKIISLHQSLLPPHDIVAITEEETEKPEFKIIVESGEINSAEDVVKEKESSEAVTLSSESLLMHESVGSDEQTSESSSKNADEVSTTVIPETTPSSSTIPYFRLKSRTTTEAPATTQTHQDFKEQNIQNSAEEVEITLTTASTETAIETTTEKEVISSRIIGTLMTAATSEAPESSEDDNDEPKIPQIILNDQNLTNQVKSILKAHSGTKNSTKPIVFNIFNFNIAGQNHEINYAIDGNKQKQPQKP 
Type Start End Length
CDS 9020 10827 1808
CDS 11032 13149 2118
CDS 13232 14742 1511
CDS 15130 15692 563
intron 10828 11031 204
intron 13150 13231 82
intron 14743 15129 387

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr EFR28988 hypothetical protein AND_02410 [Anopheles darlingi] 6e-32
InterPro IPR001916 Glycoside hydrolase, family 22
InterPro IPR019799 Glycoside hydrolase, family 22, conserved site
InterPro IPR023346 Lysozyme-like domain
Pfam PF01464.15 Transglycosylase SLT domain 7.1e-06
Pfam PF00062.15 C-type lysozyme/alpha-lactalbumin family 8.3e-95

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
P. vanderplanki Pv.04077