MidgeBase gene description page [Pn.13097]
Outline
Gene ID | Pn.13097 |
Type | Protein coding gene |
Scaffold | PnScaf18409 |
Start | 86 |
End | 5107 |
Direction | - |
Sequence
Transcript: 2604 (bp)
ATGGAGCAGTCGACAGCGACGACGACCACCACCACCACCAGCACGAGTGCATGTCGTCACGCACACAAGCGCCTCGCTCGAAAATTCGAAGTCACCGAAGTCACAAGTCAACTGTCGACGCACGGAACGAACCATCCGCACCGCGACGCCAATGAGCAGAGCCGTCCAATGAGCAAGCATCACCACTATCACCATCCCTCCGCGGGTGGCTTCCACCATCCGGCGGCCAAGGGGAGCGGCGCGAGTTTCGAGCTCAGCGACGTGCGCGTCCAGCCGTCGCCGTCCAACGGGCTCGCGCGAGTCAACGGACAAACGAACGGCGCCGCCAGCGACGTCGACTCCAATGACACAAACAACAACAACAACTACAACAAGCACCACCACAACCATCACAACCACCACCACCACCATCACCACAACCACCATCATGGCCATCACAAGAGGAGCGCTTCCTCCAGCGGCGGTGGCGGCGGCGGCGGCGGCGGCAGCGGCGAGTCGAGCAGCTCGAAGCGCACGAAACTCAGCGGCAGCGGCGGTGGAAGCAACAGCCAAAGTGTGCCAAATTTTCATCATACGAAATATGGAAATTCGAACTTCGGCGGCGGCGGCGGCGGCGGCGGAAGCTCTCTCGACCACACGACGGCGACGCCGCCGAGTGCGTCGATAGCTAGCAGTAGTAATAGTAACGGTAATCATAGTAGCGTTAACAGTAGCAGTAATTTCGGTGCTGGATTGCAACATGGCGGTCACTACAATATCATGGAGAAGCTAAAGGAATTGTATCGCGAGCTCAAGGCAGACAAGTCGCTGAAGGATCCTCACTTATCGGCGTCGTCGTTCCTCCTCGACAAGCTGGTGACGCGCGAGCGCCTCAACACACTCATCATCAACCTCTACCCGGGCAATAAGGGCTACACGCTGGCCTACCGCTCGCCCTCCACGACCGCCTCCTCGACGACGGCCACCGCCAAGACGGCGGCTCAGTCGCCGGGGAGCGCCAGTAGCGGCGGCGCCACCACAGCCGCCGCCGCTCAAGCAGACCGCGAGACGCTGCACGAGACGCCGTCGTGGCTGTACGACAACGCCTACCTGCTGGACGCGCTGGACGCGGAGGAGCTGCCGCCGCTGCTGATCGACTTCTGCAACGAGCACTGCCCGCACCTGTTCTACAACGGCGCCGTGATTGCGCAAATCCGCGACTACCGCCAGAGCTACCCGGTCGTCGACATCTGTGATATTCACTACGTGCTGCTGAGGCCGACGGCGGCGGCGCTGTGGGACGAGGTGAACGGCGCGTGCGACTCGAGCTGGACGACCAAGGACCGCGTGGTGCTCGAGTCGCAGCTGGTGCTGGCGACGGCGCCGCCCATCTGCCTCGACCCCTCGCCGGCGATCGGCATCGCGGCGATAAACGCGACGACTGAACGCTCACCGATTGCCTCCGCCGACGGCGTCCTCAAGATGGCCAAAAAGTTCCTCCAAGTCACCAAGAACCGCAACAACAAGCTCGAGAAGAAGACGCACTACCCGCAGCTCTCGCTCGCCAACTTTGTCGCCGTCGAACGGCGTCAGAGGCGGGACGCGGGCGGCGTGCGGCATGTCGCCGCTCCGAAACGGACGGCGCCGCCAGTGGCCGAAGCAACGCACGAGGTCCTGCACAAGACGCTGCCGGCGGTGCCGCCGTCGCAGCGCTACGACTGGCAGGCGCGCAACACCGAGCGACTCACGTTCGAGACCGACCGCGACAACTGCCAGTACCGGGTGCGCGTGGAGCTCTTCGAGCGGCCGACGACGTGCGAGATGAGCGGCCAGCTGACGCTCGAGCGGCAGAAGAGCGGCGGCGGCGGCGCCGTCAACGGGCGCATGTGCCCGTTCAAGCTCGCGTCGCCGCTCGCCGCGCGCCGCTACGTGAAGGAGTTCATGGGCATCTTCACGGAGGAGGGCCGGAAGTTCGTGAAGATCACGCACGAGCGCGAGACGCGCGGCGGCCGCGAGATCCT
CGAGATGCAGACCGGCGAGAAGCTGCCGCACCAGAAGGTCGCCACGCCCGTGCTCATCGCCGCCGCCGCCGCCCAGCCCTCGCCGCAGACGCAGCTGCAGTCGCTGCAGCAGCTCATCTCATCGCCTTCCGCGTCGTCGATAACGTCGCCGCCGCCGGTGACGCCGCCCACGCCGCAACCTCCCGCGCCGCCCGCCTTGGTGGCAGCCACACCGACCGCCGCCCTCAAGCTGACGGCGACGCCGCAGCGAATCATCGCGGCGCGCCAGCCCGGACAGCAGGTCGTGGTGCTCGCGCAAGTGAAGCAGGAGTCGGCCGGCAACCAGGCCATCAAGAATCTCCTCAACCAGCCGCGCGCGCCCATGACGCAGAGCATGGTCAACGGCCTCAACCTGCAGACCGCGCAGGAGGTGCAGACGGTGTACGTGCAGGTCGACAACAACAGCCTCATACACATGGGCGGCGCCGTTCCCGCCAACCTCCAGTTCCAGCAGGTGATCGTTCCGACGCAGCCAGCAGCGCAGACGGGGCCGCAGACGTACACGATGAACGTGCAGACGGTGACGCCGCAGCAGGTGACGGTGGTGAAGCGGCACCGCGACATG
Protein: 868 (aa)
MEQSTATTTTTTTSTSACRHAHKRLARKFEVTEVTSQLSTHGTNHPHRDANEQSRPMSKHHHYHHPSAGGFHHPAAKGSGASFELSDVRVQPSPSNGLARVNGQTNGAASDVDSNDTNNNNNYNKHHHNHHNHHHHHHHNHHHGHHKRSASSSGGGGGGGGGSGESSSSKRTKLSGSGGGSNSQSVPNFHHTKYGNSNFGGGGGGGGSSLDHTTATPPSASIASSSNSNGNHSSVNSSSNFGAGLQHGGHYNIMEKLKELYRELKADKSLKDPHLSASSFLLDKLVTRERLNTLIINLYPGNKGYTLAYRSPSTTASSTTATAKTAAQSPGSASSGGATTAAAAQADRETLHETPSWLYDNAYLLDALDAEELPPLLIDFCNEHCPHLFYNGAVIAQIRDYRQSYPVVDICDIHYVLLRPTAAALWDEVNGACDSSWTTKDRVVLESQLVLATAPPICLDPSPAIGIAAINATTERSPIASADGVLKMAKKFLQVTKNRNNKLEKKTHYPQLSLANFVAVERRQRRDAGGVRHVAAPKRTAPPVAEATHEVLHKTLPAVPPSQRYDWQARNTERLTFETDRDNCQYRVRVELFERPTTCEMSGQLTLERQKSGGGGAVNGRMCPFKLASPLAARRYVKEFMGIFTEEGRKFVKITHERETRGGREILEMQTGEKLPHQKVATPVLIAAAAAQPSPQTQLQSLQQLISSPSASSITSPPPVTPPTPQPPAPPALVAATPTAALKLTATPQRIIAARQPGQQVVVLAQVKQESAGNQAIKNLLNQPRAPMTQSMVNGLNLQTAQEVQTVYVQVDNNSLIHMGGAVPANLQFQQVIVPTQPAAQTGPQTYTMNVQTVTPQQVTVVKRHRDM
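The protein above appears to be a direct translation of the transcript under the standard genetic code. The following is a minimal sketch of that check, not MidgeBase's own pipeline: it translates only the first 48 bases of the transcript, and the names `CODON_TABLE`, `translate`, and `transcript_prefix` are illustrative assumptions.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 0, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 48 bases of the Pn.13097 transcript listed above.
transcript_prefix = "ATGGAGCAGTCGACAGCGACGACGACCACCACCACCACCAGCACGAGT"
protein_prefix = translate(transcript_prefix)
print(protein_prefix)  # MEQSTATTTTTTTSTS
```

The printed peptide matches the first 16 residues of the listed protein. Passing the full 2604 bp transcript to `translate` should yield all 868 residues, since 2604 / 3 = 868 and the listed sequence carries no stop codon.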
Type | Start | End | Length |
CDS | 89 | 208 | 120 |
CDS | 510 | 2177 | 1668 |
CDS | 3575 | 4254 | 680 |
CDS | 4886 | 5015 | 130 |
CDS | 5102 | 5107 | 6 |
intron | 209 | 509 | 301 |
intron | 2178 | 3574 | 1397 |
intron | 4255 | 4885 | 631 |
intron | 5016 | 5101 | 86 |
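The feature coordinates can be cross-checked against the stated transcript and protein lengths. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the listed lengths. The names `CDS`, `INTRONS`, `length`, and `check_structure` are illustrative and not part of MidgeBase.

```python
# Feature coordinates copied from the table above (assumed 1-based, inclusive).
CDS = [(89, 208), (510, 2177), (3575, 4254), (4886, 5015), (5102, 5107)]
INTRONS = [(209, 509), (2178, 3574), (4255, 4885), (5016, 5101)]


def length(start, end):
    """Length of a feature with inclusive coordinates."""
    return end - start + 1


def check_structure(cds, introns):
    """Verify that CDS and intron features tile the locus without gaps
    or overlaps, then return the total CDS length in base pairs."""
    features = sorted(cds + introns)
    for (_, prev_end), (next_start, _) in zip(features, features[1:]):
        if next_start != prev_end + 1:
            raise ValueError(f"gap or overlap between {prev_end} and {next_start}")
    return sum(length(start, end) for start, end in cds)


cds_total = check_structure(CDS, INTRONS)
print(cds_total)       # 2604, the stated transcript length in bp
print(cds_total // 3)  # 868, the stated protein length in aa
```

The five CDS segments sum to 2604 bp, which divides evenly into 868 codons, so the table, the transcript length, and the protein length agree with one another.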
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | EFR20330 | hypothetical protein AND_20270 [Anopheles darlingi] | 2e-77 |
InterPro | IPR021950 | Spt20 family | |
Pfam | PF12090.3 | Spt20 family | 5.2e-26 |
Pfam | PF01297.12 | Periplasmic solute binding protein family | 0.0047 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
M. musculus | ENSMUSG00000027751 |
A. gambiae | AGAP012403 |
H. sapiens | ENSP00000218894 |
H. sapiens | ENSP00000417510 |
T. castaneum | TC007192 |
H. melpomene | HMEL016663-PA |
H. sapiens | ENSP00000353388 |
S. invicta | SI2.2.0_00501 |
H. sapiens | ENSP00000439000 |
P. humanus | PHUM006420-PA |
P. vanderplanki | Pv.05778 |
B. mori | BGIBMGA001187-TA |
N. vitripennis | NV15037-PA |
D. melanogaster | FBgn0036374 |
H. sapiens | ENSP00000419754 |
H. sapiens | ENSP00000348512 |
D. plexippus | DPOGS213271PA |