MidgeBase gene description page [Pn.11854]

Outline

Link to gbrowse

Gene ID Pn.11854
Type Protein coding gene
Scaffold PnScaf14689
Start 42048
End 47865
Direction -

Sequence

Transcript: 4530 (bp)

 ATGGAAAACCTGGGAAATTTAGAACAAGCAATTAATCTTTTCTATAAGAGTCAGTCTACAGACCAAGCACAACTGAATGAGTATCTAATACTTCAGCAAAGGAGTCCATTGGCATGGCAGTGGTCATGGGAATTTTTGGACTTTTCGAAAACCGTGGAAGTACAGTTTTTCGGCGCTGTAACTCTTCATCAGAAAATAGAAAAAAGCTTCTTAGAAGTTCCAGAACAGATGAGAAATGAGCTCAAAGAAAAGCTACTGCAAAAGATCATCGAGTTTGGAAGTGGAACAAAGCTAGTTCTCAATAGGCTATCTATGTGTTTGAGTGCTTTTGTGGTCCACATGCTGAAAGAATGGCCAACAGCAATAAACGATGTAATCGACATGTTTCTCAATCGACAGCTTCCTAACTTGAATCCGCAAATACAGCAATGGATACTCTTCGATGTCCTATCAGGCGTACCCGAAGAAGCTTGTAATATAATAACTGTGCAAAGAGCTCAACTAAAAATGGAGGTGTATAAGAACTCAGATCCAGTGCTTAGAACATTGGAGCAATTCATTAATGCCAAATGTGAGAGACCTCAGTTGGAGGATGAGGACATCACTGCACTGCAGAATGTAGCAAAATGCTGTCAAAATTGGTTTAAAAACGGTCTCATCGTACTTGACGGATGTCAGCAAATTACCGCTCAAATATTGAAACTGGTGTCAAAAGTTTATTGGTCAGTACTGGAAAATGATGGCTGTCTATCTCCGGATGAAAGTGAATTAACGGAAACCTGTCTCAAAGCGCTCTCGAATATGATGATTATGCCCGAGGCTCACAAGTATGTCAATTCTGCGCTCACAATAATGCGTATGTTTCTCGAGTCGCTCAGTCCTATCGTCAAAAACGAGTGGAAACTTGACAATCTGAACGAGGATATTGCTTTCTGCATTTATTCGCTTTTTATAGCATCGGTCGAGTGCCATTCACGTACTATTTTGAGTGGAATATGCGCTGATTCAGTTGAGCACCATGAAATTTACGTGAGCTTTGTCAATGAGATTCTTTTGTGCACAAACAAGCCAGGCAACTATCCCGTCGAAGAATCCTGCTCCACATTGGCCATGGGATTTTGGTTCATGCTTCAGGATGAAGTCCTGTCATATGACAATCCGACTGAGCGACAACGCTGCTTAGAAGCAATTCAACCGGTCTATGTACATCTCGTGAGGATTCTTGTTCGAAAATCTCAACTGCCGGACGAAAATAACATCGGAAAATGGAATTCTGACGATTTAGAGACTTTTCGATGCTATCGACAAGATATTGCCGACACATTGCTGTGTTGCTTTGATGTTTTGCATATTCAAATTTTGAAAATTCTTTCCGAGCTCTTAGACGAAGGAATTCTAGCTATTCAAGTTGACATAAAGAATTGGCCAATTTTGGAGGCAGCTATACATGGATTTTGTGCTATTTCGCAGCAAATTGAAAGCGTCGAGTACGAGGAAATCGTAAAGTTGATGCGAGTGCTTAATGAAATTCCTTATGAAACGATGAATGAAAAGCTATTGGGGACTGCTTTGGAAACGATGGGGTCGTTTAGTGAGTGGGTGAATGACAATCCAAAATATCTTATGAGCGCAATACAGTTGCTTGTTAAAGGTCTCGATTCGTCGATGGCGAGCCAAGCGACACTTGGACTAAAAGATTTAACCAGCGACTGTCAAAGTGAGCAAATGATTCCACTGGCAGAGCCTCTGTTGGAAGCATGTCAGAGGTCTCTTCAAAAAGGTCATTTGGCCAATTCAGAGTCGATTAGACTAATGTACAGTATCGGGAACATAATGAGTGTAGTGCCAAGCGAAAAAATACCAAATTATTTGGATAACGTTATATCACCCTGCTTTACTGAGCTTCAAATGCATGCTGAGCAACAGAATACAAGTGATAGTGCAAGGATAAGAGTCATGTTTTGTCTCAATATGATTTCGACGCTGTTTTCTTCTCTCAACACCAACAAGAGTAAGAACGAAAAGAAAAGAATCCATCAGGTGGCAGTCAGCAACAATGAACAACCGCAGCCGATTTTAATGATTCTTCAGAAAACGATGCCCATATTCAAGCAAATTTGCAACTTATACATCAACGACGTTCAGGTCGTTGAAATTTTATGCAAAGCAATTCAACAAGCGCTGGGAAATTTGATGGATGACATCAAGCCAATTCTAAACGACATTTGTACGTTGATTTTGTCCATTTTTCAAAGCAAATGTGTGCCGCCCGTCAACGATATCGGTGGCTTGTGCATCCTTATGTTTTATGGCGATGAAAACTATAAGGAGGCAATGAGGCAGCTGCTTTTGCAGATTGTTGTCTACAACTTCCAAACCTTTGAGCAAACACCACCAAATAAATTATCCGATGTCTCTGATCTTTTGGAATCGTTTTATGCTCTCAACACGAAAATCGTTAAGAAAATTCCAAGTGCGTACACTGTAGAGAATGTCGACTTTATGAAAATGATGGATTACGCCTTGAAAGGTGTAACATTGCCGGAGACGGGTTCAATCAAAAAGTCTGCATCATTTATCGCAGCTTTTGTTAAAGAGTCCCGAAATCACGTGAGCATGACCAATACTGTTCTTCTGAAGGGAGAAGACATCATAAAGACATCTTTATGCTGCATCGCCGGTGCAATTCCACGAGTTAACGTCGAAGTGTTTGGCGACGTTTTTATTGCTTTAAATACCAAGTATCCATCGGAATTTATCGTTTGGATGAAGATTCTCGAGCAACCGAGTTTTCCGACTGCTTATATAAATCACGATGAAAAGATCAACTTTATGAAGTCAATCATAAGAGAAAAGGTAAACAAACGTCTCATACAGGACCACATCAAGAGATTTGCAGCAAAGTGTCGCGGCGAGAGTGAAGCTACAAGCGAAGGCAGTGATAAGAAAAGAGCAAAAAAATATGGGTCAACGGCATTCTGTGACGACTCATCTTCAGAATTTACAGAGCAAGATGCATTGGATACCACAATATTTCCAACTGCCGTCATTTTTGTATTGCTCTCAAAAACATTTGAAGCTACTGCTGCCAATGGAATTCGAAGTGTGCTTAGCTTGTTCTTACGCGATAGCTTATTTTTCAACGAAAGAGTTTCCATCGTGATCCTTCATGCGTTCAACTTTTTCTCACAGTTCCTACCGATTTTTGGTGCAATTCTCGCCGACAGCTACATTGGAAACGTGAAGACAGTCAGTTTTTTCTTCATACCTTATGCAATCGGATACTTGGGAATATTTATGTCAACTCTGCCCGATCTGTTCACATTTCATTCGATCGTCTACACATCTCTACTGCTTATTGCTGTTGGAAACGGATTTCTCCGGGCATGCATCACTGCGCTAGGAGCTCATCAGTATGTTTTGCCGGAACAAAAAGCGGGACTCGACAAATACTTTTCGGCCTATTACTTCTTCTACTATGCTGGTATTCTAATGGGAAAAATTTTACCACCTTACGTCCGTTCAACGGTTCTTATCACGCCATTCTGTGTGGAAACTGGCGAATGCTATACATCAGTATTCGGTGTGATTGCTCTGACTTTTTTCATATCCTGGATTTTATTTCTGTGTGGATTGCAAATTTACAAGAAGGAAACGCCGTCAAGGGACAATACAATGTTGAAAACATTCAACTGCATTGCATACGCAGTTTTCAGAAGGCTAAGGTCAGGGAGGAAGGCCACTGGAATTATGGATTGCGCAGTAGGAAGAAAATACTCGCAAGAGTTCGTTAACGATGTGAAAGAATTTTTGAGAGTTGTCAAAGTTTTCTTGCCTCTGCCAATCTATTACAGTCTACTGAACCAACAAGACTCGACGTGGACGTTTCAGGCAAATCTTACGGATACCACAATTTTTGGCATTCAAATTGAAGCCGATCAGTTTAAGGCAATTGGGCCAATCCTGTTGATAATTTTGATACCCATGTGGGAGAAGATTATAAATCCTCTGCTCAAAAAATGTGGCATTTTCCTGTTGTCACTTGAGTGCGTTTTTATTGGTTGCATTTTTGCAACGCTAAGCTTCGTTTCGGCAGGTTTTCTTCAGTATTTCATTTTCTTGGACATGAGCGAGAAAAAGTATTCGGTGCTGCTTCAGTTTCCGCAATTTTTGTTGATAATGGTAGCAGAAATGCTGATATCGGTGCCAGGTCTTAAATATGCATATACTAATTCGCCGACCAGCATGAAGTCTGTGCTGACTGCAATTTTTTTTGTTAACAACGCTTTGGGAAACTTGGTTGTGATTGGAGTGACACAGCTCAGACTTTTCAACGGCGACCATTCGATGGAGTTCTTTTTCTACGCCTTCCTCATGCTTGCCGCAGCAGTAATCATTAAAATCTTGTCAAGCGCAGAGCAAGAGCCATCAAATTTGAACAGCATAGCAAACGACGAAGGTACATATGAAACTTTCATTTATGTCGACGAAGTCCAAACGACAAATATGGAGGAGAACTCGTGCGCAATT 

Protein: 1510 (aa)

 MENLGNLEQAINLFYKSQSTDQAQLNEYLILQQRSPLAWQWSWEFLDFSKTVEVQFFGAVTLHQKIEKSFLEVPEQMRNELKEKLLQKIIEFGSGTKLVLNRLSMCLSAFVVHMLKEWPTAINDVIDMFLNRQLPNLNPQIQQWILFDVLSGVPEEACNIITVQRAQLKMEVYKNSDPVLRTLEQFINAKCERPQLEDEDITALQNVAKCCQNWFKNGLIVLDGCQQITAQILKLVSKVYWSVLENDGCLSPDESELTETCLKALSNMMIMPEAHKYVNSALTIMRMFLESLSPIVKNEWKLDNLNEDIAFCIYSLFIASVECHSRTILSGICADSVEHHEIYVSFVNEILLCTNKPGNYPVEESCSTLAMGFWFMLQDEVLSYDNPTERQRCLEAIQPVYVHLVRILVRKSQLPDENNIGKWNSDDLETFRCYRQDIADTLLCCFDVLHIQILKILSELLDEGILAIQVDIKNWPILEAAIHGFCAISQQIESVEYEEIVKLMRVLNEIPYETMNEKLLGTALETMGSFSEWVNDNPKYLMSAIQLLVKGLDSSMASQATLGLKDLTSDCQSEQMIPLAEPLLEACQRSLQKGHLANSESIRLMYSIGNIMSVVPSEKIPNYLDNVISPCFTELQMHAEQQNTSDSARIRVMFCLNMISTLFSSLNTNKSKNEKKRIHQVAVSNNEQPQPILMILQKTMPIFKQICNLYINDVQVVEILCKAIQQALGNLMDDIKPILNDICTLILSIFQSKCVPPVNDIGGLCILMFYGDENYKEAMRQLLLQIVVYNFQTFEQTPPNKLSDVSDLLESFYALNTKIVKKIPSAYTVENVDFMKMMDYALKGVTLPETGSIKKSASFIAAFVKESRNHVSMTNTVLLKGEDIIKTSLCCIAGAIPRVNVEVFGDVFIALNTKYPSEFIVWMKILEQPSFPTAYINHDEKINFMKSIIREKVNKRLIQDHIKRFAAKCRGESEATSEGSDKKRAKKYGSTAFCDDSSSEFTEQDALDTTIFPTAVIFVLLSKTFEATAANGIRSVLSLFLRDSLFFNERVSIVILHAFNFFSQFLPIFGAILADSYIGNVKTVSFFFIPYAIGYLGIFMSTLPDLFTFHSIVYTSLLLIAVGNGFLRACITALGAHQYVLPEQKAGLDKYFSAYYFFYYAGILMGKILPPYVRSTVLITPFCVETGECYTSVFGVIALTFFISWILFLCGLQIYKKETPSRDNTMLKTFNCIAYAVFRRLRSGRKATGIMDCAVGRKYSQEFVNDVKEFLRVVKVFLPLPIYYSLLNQQDSTWTFQANLTDTTIFGIQIEADQFKAIGPILLIILIPMWEKIINPLLKKCGIFLLSLECVFIGCIFATLSFVSAGFLQYFIFLDMSEKKYSVLLQFPQFLLIMVAEMLISVPGLKYAYTNSPTSMKSVLTAIFFVNNALGNLVVIGVTQLRLFNGDHSMEFFFYAFLMLAAAVIIKILSSAEQEPSNLNSIANDEGTYETFIYVDEVQTTNMEENSCAI 
Type Start End Length
CDS 42051 43251 1201
CDS 43362 43669 308
CDS 43726 43833 108
CDS 44536 44599 64
CDS 44660 44987 328
CDS 45054 45189 136
CDS 45257 47323 2067
CDS 47394 47599 206
CDS 47754 47865 112
intron 43252 43361 110
intron 43670 43725 56
intron 43834 44535 702
intron 44600 44659 60
intron 44988 45053 66
intron 45190 45256 67
intron 47324 47393 70
intron 47600 47753 154

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001650815 importin [Aedes aegypti] gb|EAT43142.1| importin [Aedes aegypti] 0.0
InterPro IPR000109 Oligopeptide transporter
InterPro IPR016024 Armadillo-type fold
InterPro IPR016196 Major facilitator superfamily domain, general substrate transporter
Gene Ontology(BP) GO:0006857 oligopeptide transport
Gene Ontology(CC) GO:0016020 membrane
Gene Ontology(MF) GO:0005488 binding
Gene Ontology(MF) GO:0005215 transporter activity
Pfam PF03810.14 Importin-beta N-terminal domain 7.1e-05
Pfam PF00854.16 POT family 5.6e-40

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
T. castaneum TC011842
C. quinquefasciatus CPIJ001782
N. vitripennis NV10052-PA
M. musculus ENSMUSG00000033365
H. sapiens ENSP00000361418
A. mellifera GB11502-PA
A. gambiae AGAP009571
B. mori BGIBMGA004690-TA
H. melpomene HMEL017141-PA
D. plexippus DPOGS207582PA
P. humanus PHUM040490-PA
P. vanderplanki Pv.01264
A. aegypti AAEL005390
S. invicta SI2.2.0_08876
D. melanogaster FBgn0261532