MidgeBase gene description page [Pn.10906]
Outline
Gene ID | Pn.10906 |
Type | Protein coding gene |
Scaffold | PnScaf12389 |
Start | 9017 |
End | 15692 |
Direction | - |
Sequence
Transcript: 6000 (bp)
ATGTTCAAAGTGCCAATTATCATTCTAATTTTCTCTGTTCTGAGAGGAGCGGAGTTAAAGACTTTTACCAAATGCGAGATCGCAGAAGAGTTGATAAGAAACCAAAATTTGTCAGTGGCCGAGGCCAAAAAGCACTTGTGCATCATTGGCAGTGCGAATGATACAAATCTGGTCGAGGATGTCTATTTGGGAATTTACAAGATCAGCTCGCAATGGTGGTGTGGAATGAACGGCAAGCGCGCAGGAAATTGCAATTTGAAGTGCGAGCAGCTGCACGATGACGACATCAGCGACGACGTGGTGTGCGCAAGAAAAATTCTGCTCGACTTTGGAACGGAAGGATGGGGACTTGACAAGAGAAAATGTGAACAGGTGTTGAGTGAAATTAACACGACCTGTTTTCTAGATGATTTTTTACGCGAGAAAAAAGAAGCACCGACAACAACAACAGCGGCCCCCGAAATAGCGGCGAGCTTCGAACAACACGAAACTGATCAGAACGTATTTTTCAATATTTTCTCATTCAATATTAACGGAAGCAATCACAAGATCGAATACCATGTCGCATGTGGAGCGGACGCAAAGAGCTACACGCAGTGTGAGCTTTCCGAAGAGCTTTTAACGGTACACAATGTGACAATTAACGATGCACGCAACCTCGTTTGCATTGCGCAGAAATTTTCGCGTCTCAACACGAACGTCGTCGTCGGCGAGCGCTATGGAATTTTCCAAATCAACGAAAAGTTTTGTGGACACAAGAAGGCGGGCGGAGAGTGCAAAATAAAGTGTCATAGTTTGTTGGATGACGACATAAGCGATGATGCAAGGTGTGCTCAAGAGGTCATCGCAAGGCTGGGCTTAAGCGTGGCATGGCGCATCGAGAAGACGCAAGGCTGCCAGAAGGAGCTGAAAAATCTCGAAGAGAATTGCTCGCTCGGTGACCACGGAAATAATCAAAACAATAACAATAATGATGATGCCGCTAATCGCGACCATTGTGAATACGCTAGAAAATTGGTAGCGTCGTATGGCATTTCTAAGTTAGACGCTGTGACATGGTCATGTATCGCAAAGCATCGCTCGCCGAAAACTGGAAATGTGAACTCGAAACTGGCGGCCGGAGGCGATGAAGATGCGGCATGTGTCTTCAAACATGACGTAGAGTGCTCAATAAAAAATCAAAAACTTCGCAGTTCGAGTGGGTTCACGATTTGGCCAGAATACGAAGAGTTTTGCGCGAATCTCTCTGAAAATGACGAAATCGCGAAGAAGTGCTTCTCCTTCGACGAGAGCGAAAGTGATGACAGAAAAACGTTGAAAATCTTCGCGAGCCCGACAACAGAGCGACAGCACTTTACAACATCGACTGATATTTCGGTCATCGTTGAGGAAGACGCGCCAAAAGTGACGACCGAGGCTCAGCCCACAACGGAATTTCTGTCGAAGACGATTGAAAAGCCCGCCAGCGACGAGCAGGACATCATGGTGGAGGGCTACTCGAAGGTCAGAGTGATGTTTTCCGATGCGAGCGTCATCGAGCCAACGGAAGAACTGACGACGACGGCGGAGAAAATTTCAGAGGAAATTTTCAAGGAATCTCTAAAATGTCACTTCACGCGAAACTTTGTTCAGTCGCGCAGCATTCCTCAAAACCTGATTGCCACGTTTGTTTGCATCGCGGAGCATGAGTCACACTTTAATGTCTCATTGATCACCACGAGCGAGAAGTCGCGAAAATATGGACTTTTTCAGATCGACAACGCGCAATATTGTAATACAAACGAAAAAATTAACATCTGTGACACTCTCTGCGAGCACCTCACCGACGATCAGTACGACAATGATTTGGATTGCGCTTTAAAAGTCCATGAGACGGCCGGCTTCGATTATTGGCCGTCGTATGCGCAGCACTGTCAGCAAGTGAGCCAAAGTCTCGTCGAGGACTGCCATGAAACACACAGTACGACTCATCAACCTTACACCATTGTTAATTATGAGCTATTGAAGAAAAAAGTGCTAGAGGAATTAACATCAACAACAACAACGACATTCGTCGATGCTTTAAATGATGAAGAGCAGAGAATAGCGACAACTCCTGTTCTAGTGGGTGAATCACAATCGGTTGTGAGGCAATTTGATGTTTGCGAATTTTCGCAAGAACTTAATGTAAATCAACGCATTGAGCTAAACAAGCTGAATGATTTTGTTTGCATTGCTGACCTAATGACGAAGTTGCGAGTTCAAAAAAACACCGAAAGTACCGATAAAATTGGAATTTTTGGACTCAAATCTGACTCATGCGGTGAAAATGCGGAAATCGGTGGGAGATGCTCGGCCTCGTGCTTGAGTTTCTTCGATGAAAGTCTAACGGAAGTCGTTAAATGTGCATCTAAAATTTATGAAAGCGAATCATTGAGTTACTGGAATTTGACTGATGAAATTTGCAAACCATACAGCAATAAGATTCTCAAGTGCATTCACAAAGGAACCTTCGTTCCGACCGAAGGCATCGATTCTGACGATGAGGATGAGTTCGATGTGATAAACTTCGGAGACCTCAACGAGAGCTCCACAGAGTCCACAGCTGAGGAAAATTCGACAAAGATTGATGAAGAAGAAACTTCAACAGAGGAACTAACTACGAAGGGTGTAAACGACGAAATTTTCCATCAAAGCACAGCTGACCAAGAAAACCTCGATGAACTGCTGACAAATGTTGCATCGGATGATGCAGACGAAACTACTGCGCAGCCACTAGCAGCAGGGATAGATATAGATCAAGAAATCTTGGCACAACTCAACAAAACCTCTGAAAATATCTTAAAACATAACGAACAAAATTTAACGTCTCTGATTTCTGATTTTCTCGAATCTGTAGTGCAAACTTCTGAACTGCCTCAACAGCTTGACTCGGAAACTGTCTCAGAGCCTTCTGAGAGCGATGACGTGATAGAGGAAATTGCAGTAAAGTTGGAGGAAAAACTAAAAGTGAAGCAAGAGTTGACTGAAAACTCGAACGTTGGCCTCACTTTCGATGTAGACGAAACTACTGCGCAGCCACTAGCAACACAGATAGTTATAGAGGAAGAAATCTTGGCACAACTCAACAAAACCATTGAAAATGTCTTAAAACATTTTAAACAAAATTTTACGTCTGAAGAGGAAATTTCTGATTTCCTCGAATCTGAAGTGCAAACTTCTGAACTGCCTCAACAGCTTGACTCGGAAACTGTCTTGGAGCCTTCTGAGAGCGAAGACTTGATTGAGGAAATTACAGTAAAGTTGGAGGAAAAGCTAAAAGTGAAGCAAGAGTTGACTGAAAACTCAAACGTTGACCTCATTTTCGATGCAACGGAGAGAAGCGAAAGTAAGGAAACAAACGAGCAGAGCGATGAGTTGCTTAGCAGTTCCGTCAACCTGGATGTTTTGTCCTCCAAAGCCAATAAAACATTTGAGAGTGAACGCGATGAAAATTCGACACTCGTGAGCACTACGGAGTCGCTCTATTACACAAGTGTGCAGGAGATAATTGACGATTTTGAAGACAAGAATCTCATAAAACGCGAAACGACTCTTTTCCCGGAGAGCGGCGAGGACAATGAGACTACGACCGAAAATGACAATGATGTCGACGTCGTTTATGTCTTCAAACCTACTTACCCACCACCTAGCGAGGGCCATATCGAGAAGTGTGCGCTAGCTCGCTACATGCGGGAATCGACAAAAATACCCTTGAATCTCGTATCGCCGCTCATGTGCATCGCCGAGCATGAGTCAAACCTTAACATTTCGCTGATTCGCAGCGAAGGCGGCAAGACTCGTTACACCCGATATGGCCTCTTCCAAATTGACGACGTAAATTACTGCAACACCAATCAAAAAATCAACAAATGCGACGTCATTTGTCCTCATTTGGTCGACGATCAGCTGGACAACGATCTGCAGTGTGTCATGGACATTTATCGCAGGGAGGGCCTCGAATACTGGCCATCCTACAACAAACACTGCAAAGATGCAAAGTCAATCGACTACTTGAGCATGTGTCGTGAAGTCTTTACCACTACTCACTATCCATACACCTTCTACGACAGAGAGCTCGCCATCCAAAATATTTTCACGACAACGACCACGAGTGCAACCACAACTACTGTCCCGACCACAACCGTCTACATTCCAAAGGCTAATTATACTGATGACGATGGACGCTTCGTGATTAGGCGTTTCGACGATTGCGAATTGACAAACGAAATTTACCAAAACAATGTGACTATCGATGAAGTCTCGAAACTCGTGTGTGTTGGTGATCTGAAATCCGGCTTAAAAATAAGAAAGCCAAATGAAGACGAAATGCATTTTGGAATTTTTTCGATTAACAACACTTTTTGTGGAGAAGGCGGCGTCTGCGGCATCGAGTGCTCTCAACTGTTGGACGATGACATAACGGACGACATTTTGTGCGCCAAGATAATCCTCGAAAGCGACGAAGTCAGTGCCTGGAAATTGTCTGAAGAAGAATGCAAGCCATATGGCGCAAAATTACTGGAATGTGCCGATGAGGGCCACGTGCAAACGACCAGAGATCCAGAAAATGCAGAAATCACGCCTTGGTCGGGTTTCTCTTACACAACTGCCGCCTCGCCGATGACAAAGGAACCGTTTAGCTACAAGCCAAACAAGTATGTGTACTTCGAAGACTACTACACGCCTGCAACGACAAAGTACACGACACCACACGACCTTGGAGACTTGGACTTTTTGGCTGTTTTGACGTCTGAGAATCCTCTACCGACTCGCAGCACAACAACCGAACAAACCACTAGCACAACTACCAGCAGTGAAAAGCCTGTTACTCTTTCAGCGTTTCTCGAGCCGCCACATTTGAGCACAGCGTCCGACGACGACGAAATCGTAGAAGCGAAGACCGAGGCTTCGGTGAAACACATTGATAGCGAGCCGAATGCTTCAGAGGACATCGAATTGACCACCACTAGCATTACTGAGGAGGAATTGAAGCGAATAGTCAAAGATGTCATTGCACCGCCTCAGCAAGTTCTGTTTCTGCCGAAGGAAAAATTCGCCGATGAAATTGAGGAAGAATTCGAGATTGAAAAGCTAAGAAGTGAGGAGACGACAGAACTGTCCGAAACTGAAACTACAACCGTTACGTCGAGAATTGACACAAAAAACAATGAGGAATCAAATGAGGTGCCAGAAACTGGTGAATCGACCGAAAAACCTTTGCAAAAAGACCTAGAGAGCGGCAAAATAATTTCGTTGCATCAGTCGTTGCTTCCCCCGCACGATATCGTTGCGATCACGGAGGAGGAGACGGAAAAACCAGAGTTTAAGATTATCGTTGAAAGTGGAGAAATTAATTCAGCTGAGGATGTTGTGAAAGAGAAGGAGAGCTCCGAAGCAGTTACGCTGAGTTCTGAAAGTTTGCTGATGCATGAGAGTGTTGGCAGCGACGAACAGACGTCTGAAAGTTCATCGAAAAATGCAGACGAAGTGAGTACAACAGTGATTCCAGAAACAACACCATCATCGAGCACAATACCATACTTCCGACTGAAGTCACGAACCACTACGGAAGCTCCAGCGACGACACAAACTCACCAGGATTTTAAAGAACAAAATATTCAAAACTCAGCTGAAGAAGTAGAGATAACTTTGACGACTGCATCAACGGAAACGGCAATTGAAACAACTACAGAAAAAGAAGTCATCTCTTCGAGAATAATTGGAACTTTGATGACCGCAGCAACATCAGAAGCGCCAGAGTCAAGCGAGGATGACAATGATGAGCCGAAAATTCCGCAAATCATTCTGAACGACCAGAACCTAACGAATCAAGTTAAATCGATATTGAAGGCGCACTCTGGCACTAAAAACAGCACAAAGCCTATCGTCTTTAATATTTTCAATTTCAACATCGCTGGCCAGAATCACGAGATAAATTACGCGATAGATGGAAACAAACAAAAGCAGCCGCAGAAACCT
Protein: 2000 (aa)
MFKVPIIILIFSVLRGAELKTFTKCEIAEELIRNQNLSVAEAKKHLCIIGSANDTNLVEDVYLGIYKISSQWWCGMNGKRAGNCNLKCEQLHDDDISDDVVCARKILLDFGTEGWGLDKRKCEQVLSEINTTCFLDDFLREKKEAPTTTTAAPEIAASFEQHETDQNVFFNIFSFNINGSNHKIEYHVACGADAKSYTQCELSEELLTVHNVTINDARNLVCIAQKFSRLNTNVVVGERYGIFQINEKFCGHKKAGGECKIKCHSLLDDDISDDARCAQEVIARLGLSVAWRIEKTQGCQKELKNLEENCSLGDHGNNQNNNNNDDAANRDHCEYARKLVASYGISKLDAVTWSCIAKHRSPKTGNVNSKLAAGGDEDAACVFKHDVECSIKNQKLRSSSGFTIWPEYEEFCANLSENDEIAKKCFSFDESESDDRKTLKIFASPTTERQHFTTSTDISVIVEEDAPKVTTEAQPTTEFLSKTIEKPASDEQDIMVEGYSKVRVMFSDASVIEPTEELTTTAEKISEEIFKESLKCHFTRNFVQSRSIPQNLIATFVCIAEHESHFNVSLITTSEKSRKYGLFQIDNAQYCNTNEKINICDTLCEHLTDDQYDNDLDCALKVHETAGFDYWPSYAQHCQQVSQSLVEDCHETHSTTHQPYTIVNYELLKKKVLEELTSTTTTTFVDALNDEEQRIATTPVLVGESQSVVRQFDVCEFSQELNVNQRIELNKLNDFVCIADLMTKLRVQKNTESTDKIGIFGLKSDSCGENAEIGGRCSASCLSFFDESLTEVVKCASKIYESESLSYWNLTDEICKPYSNKILKCIHKGTFVPTEGIDSDDEDEFDVINFGDLNESSTESTAEENSTKIDEEETSTEELTTKGVNDEIFHQSTADQENLDELLTNVASDDADETTAQPLAAGIDIDQEILAQLNKTSENILKHNEQNLTSLISDFLESVVQTSELPQQLDSETVSEPSESDDVIEEIAVKLEEKLKVKQELTENSNVGLTFDVDETTAQPLATQIVIEEEILAQLNKTIENVLKHFKQNFTSEEEISDFLESEVQTSELPQQLDSETVLEPSESEDLIEEITVKLEEKLKVKQELTENSNVDLIFDATERSESKETNEQSDELLSSSVNLDVLSSKANKTFESERDENSTLVSTTESLYYTSVQEIIDDFEDKNLIKRETTLFPESGEDNETTTENDNDVDVVYVFKPTYPPPSEGHIEKCALARYMRESTKIPLNLVSPLMCIAEHESNLNISLIRSEGGKTRYTRYGLFQIDDVNYCNTNQKINKCDVICPHLVDDQLDNDLQCVMDIYRREGLEYWPSYNKHCKDAKSIDYLSMCREVFTTTHYPYTFYDRELAIQNIFTTTTTSATTTTVPTTTVYIPKANYTDDDGRFVIRRFDDCELTNEIYQNNVTIDEVSKLVCVGDLKSGLKIRKPNEDEMHFGIFSINNTFCGEGGVCGIECSQLLDDDITDDILCAKIILESDEVSAWKLSEEECKPYGAKLLECADEGHVQTTRDPENAEITPWSGFSYTTAASPMTKEPFSYKPNKYVYFEDYYTPATTKYTTPHDLGDLDFLAVLTSENPLPTRSTTTEQTTSTTTSSEKPVTLSAFLEPPHLSTASDDDEIVEAKTEASVKHIDSEPNASEDIELTTTSITEEELKRIVKDVIAPPQQVLFLPKEKFADEIEEEFEIEKLRSEETTELSETETTTVTSRIDTKNNEESNEVPETGESTEKPLQKDLESGKIISLHQSLLPPHDIVAITEEETEKPEFKIIVESGEINSAEDVVKEKESSEAVTLSSESLLMHESVGSDEQTSESSSKNADEVSTTVIPETTPSSSTIPYFRLKSRTTTEAPATTQTHQDFKEQNIQNSAEEVEITLTTASTETAIETTTEKEVISSRIIGTLMTAATSEAPESSEDDNDEPKIPQIILNDQNLTNQVKSILKAHSGTKNSTKPIVFNIFNFNIAGQNHEINYAIDGNKQKQPQKP
Type | Start | End | Length |
CDS |
9020 |
10827 |
1808 |
CDS |
11032 |
13149 |
2118 |
CDS |
13232 |
14742 |
1511 |
CDS |
15130 |
15692 |
563 |
intron |
10828 |
11031 |
204 |
intron |
13150 |
13231 |
82 |
intron |
14743 |
15129 |
387 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
EFR28988 |
hypothetical protein AND_02410 [Anopheles darlingi] |
6e-32 |
InterPro |
IPR001916 |
Glycoside hydrolase, family 22 |
|
InterPro |
IPR019799 |
Glycoside hydrolase, family 22, conserved site |
|
InterPro |
IPR023346 |
Lysozyme-like domain |
|
Pfam |
PF01464.15 |
Transglycosylase SLT domain |
7.1e-06 |
Pfam |
PF00062.15 |
C-type lysozyme/alpha-lactalbumin family |
8.3e-95 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
P. vanderplanki |
Pv.04077 |