MidgeBase gene description page [Pn.13097]

Outline

Link to gbrowse

Gene ID Pn.13097
Type Protein coding gene
Scaffold PnScaf18409
Start 86
End 5107
Direction -

Sequence

Transcript: 2604 (bp)

 ATGGAGCAGTCGACAGCGACGACGACCACCACCACCACCAGCACGAGTGCATGTCGTCACGCACACAAGCGCCTCGCTCGAAAATTCGAAGTCACCGAAGTCACAAGTCAACTGTCGACGCACGGAACGAACCATCCGCACCGCGACGCCAATGAGCAGAGCCGTCCAATGAGCAAGCATCACCACTATCACCATCCCTCCGCGGGTGGCTTCCACCATCCGGCGGCCAAGGGGAGCGGCGCGAGTTTCGAGCTCAGCGACGTGCGCGTCCAGCCGTCGCCGTCCAACGGGCTCGCGCGAGTCAACGGACAAACGAACGGCGCCGCCAGCGACGTCGACTCCAATGACACAAACAACAACAACAACTACAACAAGCACCACCACAACCATCACAACCACCACCACCACCATCACCACAACCACCATCATGGCCATCACAAGAGGAGCGCTTCCTCCAGCGGCGGTGGCGGCGGCGGCGGCGGCGGCAGCGGCGAGTCGAGCAGCTCGAAGCGCACGAAACTCAGCGGCAGCGGCGGTGGAAGCAACAGCCAAAGTGTGCCAAATTTTCATCATACGAAATATGGAAATTCGAACTTCGGCGGCGGCGGCGGCGGCGGCGGAAGCTCTCTCGACCACACGACGGCGACGCCGCCGAGTGCGTCGATAGCTAGCAGTAGTAATAGTAACGGTAATCATAGTAGCGTTAACAGTAGCAGTAATTTCGGTGCTGGATTGCAACATGGCGGTCACTACAATATCATGGAGAAGCTAAAGGAATTGTATCGCGAGCTCAAGGCAGACAAGTCGCTGAAGGATCCTCACTTATCGGCGTCGTCGTTCCTCCTCGACAAGCTGGTGACGCGCGAGCGCCTCAACACACTCATCATCAACCTCTACCCGGGCAATAAGGGCTACACGCTGGCCTACCGCTCGCCCTCCACGACCGCCTCCTCGACGACGGCCACCGCCAAGACGGCGGCTCAGTCGCCGGGGAGCGCCAGTAGCGGCGGCGCCACCACAGCCGCCGCCGCTCAAGCAGACCGCGAGACGCTGCACGAGACGCCGTCGTGGCTGTACGACAACGCCTACCTGCTGGACGCGCTGGACGCGGAGGAGCTGCCGCCGCTGCTGATCGACTTCTGCAACGAGCACTGCCCGCACCTGTTCTACAACGGCGCCGTGATTGCGCAAATCCGCGACTACCGCCAGAGCTACCCGGTCGTCGACATCTGTGATATTCACTACGTGCTGCTGAGGCCGACGGCGGCGGCGCTGTGGGACGAGGTGAACGGCGCGTGCGACTCGAGCTGGACGACCAAGGACCGCGTGGTGCTCGAGTCGCAGCTGGTGCTGGCGACGGCGCCGCCCATCTGCCTCGACCCCTCGCCGGCGATCGGCATCGCGGCGATAAACGCGACGACTGAACGCTCACCGATTGCCTCCGCCGACGGCGTCCTCAAGATGGCCAAAAAGTTCCTCCAAGTCACCAAGAACCGCAACAACAAGCTCGAGAAGAAGACGCACTACCCGCAGCTCTCGCTCGCCAACTTTGTCGCCGTCGAACGGCGTCAGAGGCGGGACGCGGGCGGCGTGCGGCATGTCGCCGCTCCGAAACGGACGGCGCCGCCAGTGGCCGAAGCAACGCACGAGGTCCTGCACAAGACGCTGCCGGCGGTGCCGCCGTCGCAGCGCTACGACTGGCAGGCGCGCAACACCGAGCGACTCACGTTCGAGACCGACCGCGACAACTGCCAGTACCGGGTGCGCGTGGAGCTCTTCGAGCGGCCGACGACGTGCGAGATGAGCGGCCAGCTGACGCTCGAGCGGCAGAAGAGCGGCGGCGGCGGCGCCGTCAACGGGCGCATGTGCCCGTTCAAGCTCGCGTCGCCGCTCGCCGCGCGCCGCTACGTGAAGGAGTTCATGGGCATCTTCACGGAGGAGGGCCGGAAGTTCGTGAAGATCACGCACGAGCGCGAGACGCGCGGCGGCCGCGAGATCCTCGAGATGCAGACCGGCGAGAAGCTGCCGCACCAGAAGGTCGCCACGCCCGTGCTCATCGCCGCCGCCGCCGCCCAGCCCTCGCCGCAGACGCAGCTGCAGTCGCTGCAGCAGCTCATCTCATCGCCTTCCGCGTCGTCGATAACGTCGCCGCCGCCGGTGACGCCGCCCACGCCGCAACCTCCCGCGCCGCCCGCCTTGGTGGCAGCCACACCGACCGCCGCCCTCAAGCTGACGGCGACGCCGCAGCGAATCATCGCGGCGCGCCAGCCCGGACAGCAGGTCGTGGTGCTCGCGCAAGTGAAGCAGGAGTCGGCCGGCAACCAGGCCATCAAGAATCTCCTCAACCAGCCGCGCGCGCCCATGACGCAGAGCATGGTCAACGGCCTCAACCTGCAGACCGCGCAGGAGGTGCAGACGGTGTACGTGCAGGTCGACAACAACAGCCTCATACACATGGGCGGCGCCGTTCCCGCCAACCTCCAGTTCCAGCAGGTGATCGTTCCGACGCAGCCAGCAGCGCAGACGGGGCCGCAGACGTACACGATGAACGTGCAGACGGTGACGCCGCAGCAGGTGACGGTGGTGAAGCGGCACCGCGACATG 

Protein: 868 (aa)

 MEQSTATTTTTTTSTSACRHAHKRLARKFEVTEVTSQLSTHGTNHPHRDANEQSRPMSKHHHYHHPSAGGFHHPAAKGSGASFELSDVRVQPSPSNGLARVNGQTNGAASDVDSNDTNNNNNYNKHHHNHHNHHHHHHHNHHHGHHKRSASSSGGGGGGGGGSGESSSSKRTKLSGSGGGSNSQSVPNFHHTKYGNSNFGGGGGGGGSSLDHTTATPPSASIASSSNSNGNHSSVNSSSNFGAGLQHGGHYNIMEKLKELYRELKADKSLKDPHLSASSFLLDKLVTRERLNTLIINLYPGNKGYTLAYRSPSTTASSTTATAKTAAQSPGSASSGGATTAAAAQADRETLHETPSWLYDNAYLLDALDAEELPPLLIDFCNEHCPHLFYNGAVIAQIRDYRQSYPVVDICDIHYVLLRPTAAALWDEVNGACDSSWTTKDRVVLESQLVLATAPPICLDPSPAIGIAAINATTERSPIASADGVLKMAKKFLQVTKNRNNKLEKKTHYPQLSLANFVAVERRQRRDAGGVRHVAAPKRTAPPVAEATHEVLHKTLPAVPPSQRYDWQARNTERLTFETDRDNCQYRVRVELFERPTTCEMSGQLTLERQKSGGGGAVNGRMCPFKLASPLAARRYVKEFMGIFTEEGRKFVKITHERETRGGREILEMQTGEKLPHQKVATPVLIAAAAAQPSPQTQLQSLQQLISSPSASSITSPPPVTPPTPQPPAPPALVAATPTAALKLTATPQRIIAARQPGQQVVVLAQVKQESAGNQAIKNLLNQPRAPMTQSMVNGLNLQTAQEVQTVYVQVDNNSLIHMGGAVPANLQFQQVIVPTQPAAQTGPQTYTMNVQTVTPQQVTVVKRHRDM 
Type Start End Length
CDS 89 208 120
CDS 510 2177 1668
CDS 3575 4254 680
CDS 4886 5015 130
CDS 5102 5107 6
intron 209 509 301
intron 2178 3574 1397
intron 4255 4885 631
intron 5016 5101 86

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr EFR20330 hypothetical protein AND_20270 [Anopheles darlingi] 2e-77
InterPro IPR021950 Spt20 family
Pfam PF12090.3 Spt20 family 5.2e-26
Pfam PF01297.12 Periplasmic solute binding protein family 0.0047

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
M. musculus ENSMUSG00000027751
A. gambiae AGAP012403
H. sapiens ENSP00000218894
H. sapiens ENSP00000417510
T. castaneum TC007192
H. melpomene HMEL016663-PA
H. sapiens ENSP00000353388
S. invicta SI2.2.0_00501
H. sapiens ENSP00000439000
P. humanus PHUM006420-PA
P. vanderplanki Pv.05778
B. mori BGIBMGA001187-TA
N. vitripennis NV15037-PA
D. melanogaster FBgn0036374
H. sapiens ENSP00000419754
H. sapiens ENSP00000348512
D. plexippus DPOGS213271PA