MidgeBase gene description page [Pn.15772]

Outline

Link to gbrowse

Gene ID Pn.15772
Type Protein coding gene
Scaffold PnScaf33047
Start 8734
End 12948
Direction -

Sequence

Transcript: 3489 (bp)

 ATGGAAAGTGAGAAAAGCCAAGTCGAGCCAATATTCAAGTACATACGAATAGCGTCAGATTTGCGTGATATATTGAAGAGTGACTCAATTTCGGTTTGTTATCCATCGTCAAAAATCTTATTTATCGGAACAAACTGGGGACAACTCTATATTCTCGACCATGAAGGTAATGCAACGAGTGATCAAAAGTTCCCCAAGCATATGGTATCAATTAACATGATATCAGCTGACTCCAAAGGAGAATTCATTGCGACATGTAGTGATGACGGAAAAATACACATAAACTGCCTCTACTCAATGGACAACAGTATTTACTTGACCGTTGAGAAACAAGTCAAGTGTATTGAGCTAGATCCGAACTACGGACTTAGTGGGAAACGATTCATCGTCGGTGATCAACAGTTGAAGCTCTATGAGAAGACACTGCTTAGAGGCATAAAGCCAACAGTACTCTCAGAATCCGAGGGCATCGTGAACTCTCTAAAGTGGAATGGATCTTTTGTAGCGTGGTCAAATGCCATTGGTGTGAGGGTATATGATCTGACTGAAAGGTGTTCACTCGGTCTGATTAAATGGGAAGAGCCACAAGAAGGGAAGCTCACAGACTTTCGATGCAATCTCCTCTGGCACAACAATACGCTGTTCATCGGATGGGCTGAAACAATCAGAATATGTGTGATAAGAAAGAGAAGTGTCGTTGAGATTTCTACAAGAAATTTGCCAGGCTACATTGTGGACCCCATTTCGATGTTCAAGACCGAGTTTTACGTTTGCGGTCTTGCCCCATTGGAGAAGAACCAGCTAGTCGTACTTGGTCTTCCCAAGGAGAAAGAGGAGGACAATAAATCACAGCGGCCAGTTCTGTGCGTCATTCAATATAATTCAAATGAATATGAAGAGCTCTGCACTGACAGCCTAACACTAAAAGACTACGAGTTCTACACGTCCAACGACTACAGCTTGGCATGTTTAATAGACGAAAACAAATATTTTGTGGTTTCTCCAAAGGACATCGTTACCGCTTCATTGTACGAAACAGATGATCGAATCAAGTATCTCATAGAGCATGATCATTTGTTCAGCGAATACAAGTGCAAATCAAATGAATACGAAGAAATTGATGAGAATAATTACTTACTAGTTGCATTAGTGCTGACCGCACCAAAGAATTTCGATAGGCGGAACGCAATGCGTGACACTTGGATTAGTTTAAGGCCGTGGCAACTAAATGACAGTTTTTATCAGAACGAAGTTATTTACATTCCACCGGAACAGTCAAATGGATTTCTCGAACACGAAACTGTTGAGCAGCAAGAGAAAAGCCTGAAAAATTACCAAAAATGGTTGTCTTCTAGTAAGCACAGAGGATCAAACATTAAAGTTCCAAATCTTAAAATTAAAAATCTCTTCGTCATCGGAATGAAAGACCTCGACAGTGATACAACGAAAAGAATCAGAGCGGAAAATGATGTCTACAGCGACTTGCTTCTGCTCGAAGATCTTAAAGACTCATACAAAAATCTAACGCTGAAACTACTAAGTGCACTAGAAAGAATAAATTATGTTACGCCCAACTTTAAATACTTGCTCAAATGTGACGACGACTCGTACGTCAAGCTAGATTACTTAGCATTCGATCTCTTAGAATATGCCAAAAAGAAGACAACGAGAAAGAATCTCGAGCTCTATTGGGGCTTTTTTAATGGTCGTGCGAACATTAAGAAATCGGGGCAGTGGAAAGAAGTCAATTACGACTTATGTGACCGCTACTTGCCCTACGCACTGGGCGGTGGTTACGTCATATCAAAGAATCTCGTAACATACCTTGGGAATCACAGTAAATACCTTAATCGATACGAGAGTGAAGACATATCAATGGGTACATGGCTCTCGCCATTCAGAAACATTCATAGGAGGCACGATCCTCGATTCGATACTACTTACATGCCTCGCAAGTGCAAGACTTATCATATAGTAATGCACAAGAGGACAGTTGAAGATATGCGTGAAATACACAAGGGAAATCAGTGCTTTAGCGAGGTTACATATGAAGAAAGTAGAAAACCGGTTGACCAATACGAGAAAGCACTGGATGTGATACGAGAGAAGGGTGGAAAGTTTTCGATGACACAGATCGCGCGACTCTACGTCGATCATCTTCTCAAAAGACAGAAGTACGAGGAAGCTGCAAAGTTGTGTCTGAGCGTGTACGGCAATAACAAGGACTTGTGGGAGGAGGAGGTGTACAAGTTTGTGAAAGTCAAGCAACTGCGAGCCATTAGCTCCTATTTACCGCGAACAAACGAGTGCAAGCTTAACCCGCAGGTTTACGAGATGGTGATGTACGAGTACTTGCAGTATGATAAGCCCGGGTTTCTCAATTTGATCAAGGAATGGCAACCGAGTCTCTACAACACGACAGCCGTGATCAATGCCATCCAAATGAACTTTGACAAGAAGGAAAAGCGAATTCTTTTGGAGGCACTGGCGATCCTTTATTCGCACGAAAAAGAGTATGACAAGGCCGTGACGATGTACATTAAATTGGAGCATAAGGACGTCTTCACCCTCATCAAAAACCACAAACTCTATGCGGTGATCAAGCCCATGATCATTGAACTTCTGAAACTAGATAGCGAAAAAACCATCAGCGTCCTCCTCGGTCAAAAACAAATACCGCCAGCTGAAATTGTCGAGAAACTAGAAGAGTACGAAAATGAGTCGTTTGTCTACCAGTACTTGGATGCCTTCAAGAAAACCGATTCCACGGGGAAATTCGACTGGAAATTAATAAATCTCTACGCGAAGTACGACCGCGAAAAATTACTACCGTTATTGAGGAAGTCAAACAGTTATAAACTAGAGTTGGCCTACGAACTCTGCAAAGTCAACTCGTTCTACCCCGAAATGGTCTACGTGCTCGAGCAAATGGGCAACACCAACGAGGCGCTTGAGGTCATAATGAAGAAGATCGGCAACGTTCGGATGGCCGTCGATTTCTGCTGTGAACACAACGACATGCAGTTGTGGGAGTACTTGATCAACGAAAGTTTCGAAAAGCCGGAAATTATCAAGCTTTTGATGGACGGCATATCGGGATCTGGCTACTTGGTCGACCCACAGATACTCATCGATCGGCTGCGCGTGGGACAAGAAATTCCCGAACTGAAAAGCGCTCTCATCAGAATGCTCACCGGCTACAGTCTTCAAGTCTCCATACACGGCGGTTGCAATCAAATTCTCAAGACAGATTATTTTGACGTGTTCAACAAGCTCATCAAGCAGCAGAACCGAGCGATGTATTTCGGGGGTCAAACGGTCTGCAGCCTCTGTCAGAGGAATATCGTTGGCGTAAAGGACAGGAACACAAACAAGACGCCGACAGACATTATCATTTTCAATTGCAAACACATTTTCCACGAGTCATGCTGCGTTACTGATAGATTCGATCTTGATCACTGCTCGATCTGTATTCCGACGAGACGGACC 

Protein: 1163 (aa)

 MESEKSQVEPIFKYIRIASDLRDILKSDSISVCYPSSKILFIGTNWGQLYILDHEGNATSDQKFPKHMVSINMISADSKGEFIATCSDDGKIHINCLYSMDNSIYLTVEKQVKCIELDPNYGLSGKRFIVGDQQLKLYEKTLLRGIKPTVLSESEGIVNSLKWNGSFVAWSNAIGVRVYDLTERCSLGLIKWEEPQEGKLTDFRCNLLWHNNTLFIGWAETIRICVIRKRSVVEISTRNLPGYIVDPISMFKTEFYVCGLAPLEKNQLVVLGLPKEKEEDNKSQRPVLCVIQYNSNEYEELCTDSLTLKDYEFYTSNDYSLACLIDENKYFVVSPKDIVTASLYETDDRIKYLIEHDHLFSEYKCKSNEYEEIDENNYLLVALVLTAPKNFDRRNAMRDTWISLRPWQLNDSFYQNEVIYIPPEQSNGFLEHETVEQQEKSLKNYQKWLSSSKHRGSNIKVPNLKIKNLFVIGMKDLDSDTTKRIRAENDVYSDLLLLEDLKDSYKNLTLKLLSALERINYVTPNFKYLLKCDDDSYVKLDYLAFDLLEYAKKKTTRKNLELYWGFFNGRANIKKSGQWKEVNYDLCDRYLPYALGGGYVISKNLVTYLGNHSKYLNRYESEDISMGTWLSPFRNIHRRHDPRFDTTYMPRKCKTYHIVMHKRTVEDMREIHKGNQCFSEVTYEESRKPVDQYEKALDVIREKGGKFSMTQIARLYVDHLLKRQKYEEAAKLCLSVYGNNKDLWEEEVYKFVKVKQLRAISSYLPRTNECKLNPQVYEMVMYEYLQYDKPGFLNLIKEWQPSLYNTTAVINAIQMNFDKKEKRILLEALAILYSHEKEYDKAVTMYIKLEHKDVFTLIKNHKLYAVIKPMIIELLKLDSEKTISVLLGQKQIPPAEIVEKLEEYENESFVYQYLDAFKKTDSTGKFDWKLINLYAKYDREKLLPLLRKSNSYKLELAYELCKVNSFYPEMVYVLEQMGNTNEALEVIMKKIGNVRMAVDFCCEHNDMQLWEYLINESFEKPEIIKLLMDGISGSGYLVDPQILIDRLRVGQEIPELKSALIRMLTGYSLQVSIHGGCNQILKTDYFDVFNKLIKQQNRAMYFGGQTVCSLCQRNIVGVKDRNTNKTPTDIIIFNCKHIFHESCCVTDRFDLDHCSICIPTRRT 
Type Start End Length
CDS 8737 9015 279
CDS 9088 10225 1138
CDS 10367 11368 1002
CDS 11611 11751 141
CDS 11822 12477 656
CDS 12676 12948 273
intron 9016 9087 72
intron 10226 10366 141
intron 11369 11610 242
intron 11752 11821 70
intron 12478 12675 198

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001658175 light protein [Aedes aegypti] gb|EAT47723.1| light protein [Aedes aegypti] 1e-158
InterPro IPR015943 WD40/YVTN repeat-like-containing domain
InterPro IPR002659 Glycosyl transferase, family 31
InterPro IPR000547 Clathrin, heavy chain/VPS, 7-fold repeat
Gene Ontology(BP) GO:0006886 intracellular protein transport
Gene Ontology(BP) GO:0006486 protein glycosylation
Gene Ontology(BP) GO:0016192 vesicle-mediated transport
Gene Ontology(CC) GO:0016020 membrane
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0008378 galactosyltransferase activity
Pfam PF10366.4 Vacuolar sorting protein 39 domain 1 0.0078
Pfam PF00515.23 Tetratricopeptide repeat 1.8
Pfam PF13176.1 Tetratricopeptide repeat 0.02
Pfam PF00637.15 Region in Clathrin and VPS 3e-21
Pfam PF00400.27 WD domain, G-beta repeat 0.00056
Pfam PF13639.1 Ring finger domain 0.26
Pfam PF01762.16 Galactosyltransferase 2.1e-28

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
T. castaneum TC015204
P. humanus PHUM253950-PA
M. musculus ENSMUSG00000041236
A. mellifera GB16155-PA
D. melanogaster FBgn0002566
B. mori BGIBMGA005526-TA
P. vanderplanki Pv.11914
H. melpomene HMEL013113-PA
A. gambiae AGAP009174
D. plexippus DPOGS209384PA
S. invicta SI2.2.0_15800
A. aegypti AAEL001157
C. quinquefasciatus CPIJ010406
N. vitripennis NV50092-PA
H. sapiens ENSP00000379297
H. sapiens ENSP00000309457