MidgeBase gene description page [Pn.02319]

Outline

Link to gbrowse

Gene ID Pn.02319
Type Protein coding gene
Scaffold PnScaf1938
Start 1103
End 8947
Direction -

Sequence

Transcript: 3462 (bp)

 ATGACAAACGCAAACACATTAACTGATAAGGAGTATTATCAACTGACAATAAGCGTCGGCAAGGCGCTGAATCCGCAGGAACTGCCGATCAAGCAAAAGCACGTCAGAGCGTCAATAATCGCGACATGGATGTCAAATGGCGGTCACGCATTCTGGGCAATTGCCATCCGTCAGCAGCTCAATGACAACCGCATCACGGCGTGGAAGTTTCTGTACATGCTGCACAAGATTCTGCGCGAGGGCCACCCGCTGGTGATCCAGCACTCGATGCGGCACCGCACGATGCTGACGGAGCTGGGGAAGCTGTGGGGCCACCTCAACGACGGCTACGGCATTTGCATCCTGCAGTACACGAAGCTGCTCGTGACCAAGCTCAACTTCCACGACCGCAACGCGCGCTTTCCCGGCAACCTCGTCCTCAAGCCCGGCGAGCTCGAGAAGATCGCGGCCAACGACATCAACATGTACTTTCAGCTCGCCATCGAGATGTTCGACTACCTCGACGAGATCATCGCTCTCCAGGCCACAATTTTCAACTCGATCACCACATTTGCGGTGTCCTCGATGACGGCCGCGGGTCAGTGTCGCCTTACGCCGCTCATCCCGTGCATCCAGGACTCGAACCAGCTGTACGACTTCTGCGTGCGGCTCATGTTCCTGCTGCACGCGAACCTGCAGGAGGACCTGCTCGTGCACCACCGCGAACGCTTCCGGACGATCTTCAGGCAGCTGAACAGCTTCTACAAGCAGGCGGGCCAGCTGCAGTACTTCCACAACCTCATCTCGGTGCCGCATCTGCCCAACACGCCGCCCAACTTCCTCGTGCAGGCCGACCTCGGAAACTACACGGCGCCGCGCATCGTGCTCATGAACGAGGCCGACAGCCAGAGCGAGAGCGACACGCACTCCATCTCCGAGACGCTCGTCGATACGTCGATGGTCGACGCGGCACCGCCGCCGAGGGCTGAGACGCCGCCACCGCCGCCACCAAAAATAGACTACGAGAGGCTGCTCACCGAGAGAGACGAGCTCATAAACCAACTGAGGCACGAACAGCAGAACCAAATCCAAATGAGTCGCAGAGCGCTGAGCGAGAAGGCAGAGGCCGAGCATAAGCTGCACGAGCAGGTCATGAGGCTGACTGTCGAGTGCTCGGAGCTTCACAGCGAAATTTCGAATTTAAAAATGCAAAAGCAAGAGTTGGAGCTGAAGGCGGAGACTGCACCGGAACTTGAACAAAAAGTCCAAGTCGAGGAGGAAAAAGCGAAGCAAACTGAAGAAAAATTCCAAAAACTGAAGAACATGTACACGCAGATCCGTGACGAGCACATCAAGCTCCTTCGAAAGCACGACGAGATCAACAAGTCGCTGCAGGAAAGGTCGAAGGAGCTGGCGGAGGTGTCGCGCGAGAGGGAGGAGAGCCAGTCGAGGCTGCAGGAGATCGAGTCGCAGCGGTCGCTGATGTCCGAGAGCTTCCAGAGGAGCAGCATGGAGAGCGAGCAGCTGAGGCAGCAGTTCACGAGCATCGAGGAGGAGAAGCAGAGCTTGCTGGACCAAATCCAGGACGTCGAGTCGAAGAAGTCGGCGGAGATTGCCGAGCTGAGGATCAGCGTGGAGGCGGTCGAGGCGAGGTGCCGGCAGCTGGAGGAGCAGCTCGGGAAGGTCGAGGAGGAGAAAAAGCTCTTGAGCGACGAGAGCGAGGAGAAGTTGAGGGAGAGCGAGGGTAAGATTGAGGAGTTGGCGGCTGAAAAGGAGCGGCTGGAGAGCGAGATGAAGGAGAAGGAGCAGCGGTTTTTGGAGGAGCTCGAGTCGACAAAGTCCGAGCTGGCGACGCAGGGCGCGCAGCAGGTGACCGAGATGAAAACGCACAACGAAAGCGCACTTCGAGCCTTGGCGGAAGCGCACGAGCAGCAGCTCGCGAGCGTCGAGGACGAGAGGAAAAGCTTGGCGAGCCAGATGGAGGAGCTCGAGGCGACAAAGTCGGAGGAAATTGCGGAGCTGCGGGGCAGCGTCGAGGCAGTCGAGGACAAGTGCAGGCAGCTGGAGGAGCAGCTGAGCCAGGTGCAGGAGGAGAAGGCAGCTTTGAGGGAAGAGAGCGAGGGGAAGTTGAGGGAGAGCGAGGGTAAGGTTGAGGAGTTGGCGGCTGAAAAGGAGCGGCTGGAGGAGGAGATGAAGGCGAAGGAGCAGCGGTTCTTGGAGGAGCTCGAGTCGACGAGGTGCGAGCTGGTGGCGCAGGGCGAGCTGCGCGTCAGCGAGATCAACACTCACAACGAGAGCTCACTGCGTGCCTTAATGGAGGCCCTTCTGCGGGGCTGCGAGGAGATCAGCCTGCGCTCGGTGCAGGAGAACGAAACGCTCGGCACGCAGACCTCCGCCGCCTACTACATCATGATCATGCAGGAGCTGCAGGACCTGCTCGAGAAGCTGAAGGGCGCCTACGGCGGCTACAGCGAGAACTGCAGCGACAACGCGGAGCAGCTGGCCGTGTCGGTCGTCAACTGCGGGCACATGCTGTCGCTCGTCTTCGACCGGGGCATGACCATCGCCAATGCCTCCACCAACATGGTCTCCGGCGAGAAAATCGCATCGGAAATCAAAGAGTGCGGCAACACGAGCGCCGAGTTCTTCAAGCTCCTCGCCTCAAACGCCGACCCATCGACAATCAACGACTCTCTACAGCAGCTGAAGGACAAACTGTACGCGATCACGAACATGATCGGCGACCTCTCGAGCAACAAGGACGAGATCGAGAAGCTCGAGGAGATGGTGGAGGCGGAGCTGAACGGCATGGACAAGGCCATCGAGGAGGCGTCCAAGCAGATCATGGAGCTGCTCGCGCAGTCGCGCGCCTCCGACACCGGCATCAAGCTCGAGGTCAACGAGAAGATCCTCGACTCGTGCACGACCCTCATGCAGTGCATCAAGGTGCTCGTGCAGAAGTCGCGGAAGGTGCAGGCCGAAATCATCGCCACCGGCAAGGGCACGGCCACCGCCAAGGAGTTCTACAAGCGCAACCACCAGTGGACCGAGGGACTCATTTCCGCCGCCGGCTCCGTCGCCGCCGCCGCCAAACTACTCGTTGAGTCAGCGAATAAGGCAGTTAGCGAGCAATCGAAACACACACTCGACGTCGTCGTGGCAGCCCAGGAGATCGCAGCATGCGTAGCAACGCTGGTGGTAGCCTCACGAGTGAAGGCATCGCGTGACAGCAAGAGCCTTCGCGAGCTGACACTCGCTTCGAAGGACGTGACACAGTCGACGTCGATGGTCGTGGCGACGGCCAAGAGTTGCAGCCAGCAGCTGGAGGAGACGCAGGAGCTTGACTTTACGAAGCTCTCGATTCACCAGGCAAAGACCACCGAGATGGAGCTGCAAGTGAAAATTCTGGAGCTCGAGCAGGCGATTCAGACGGAGCGCATGAGGTTGGCCGCGTTGCGTAGGCAAAACTACCAAAATGGCGAT 

Protein: 1154 (aa)

 MTNANTLTDKEYYQLTISVGKALNPQELPIKQKHVRASIIATWMSNGGHAFWAIAIRQQLNDNRITAWKFLYMLHKILREGHPLVIQHSMRHRTMLTELGKLWGHLNDGYGICILQYTKLLVTKLNFHDRNARFPGNLVLKPGELEKIAANDINMYFQLAIEMFDYLDEIIALQATIFNSITTFAVSSMTAAGQCRLTPLIPCIQDSNQLYDFCVRLMFLLHANLQEDLLVHHRERFRTIFRQLNSFYKQAGQLQYFHNLISVPHLPNTPPNFLVQADLGNYTAPRIVLMNEADSQSESDTHSISETLVDTSMVDAAPPPRAETPPPPPPKIDYERLLTERDELINQLRHEQQNQIQMSRRALSEKAEAEHKLHEQVMRLTVECSELHSEISNLKMQKQELELKAETAPELEQKVQVEEEKAKQTEEKFQKLKNMYTQIRDEHIKLLRKHDEINKSLQERSKELAEVSREREESQSRLQEIESQRSLMSESFQRSSMESEQLRQQFTSIEEEKQSLLDQIQDVESKKSAEIAELRISVEAVEARCRQLEEQLGKVEEEKKLLSDESEEKLRESEGKIEELAAEKERLESEMKEKEQRFLEELESTKSELATQGAQQVTEMKTHNESALRALAEAHEQQLASVEDERKSLASQMEELEATKSEEIAELRGSVEAVEDKCRQLEEQLSQVQEEKAALREESEGKLRESEGKVEELAAEKERLEEEMKAKEQRFLEELESTRCELVAQGELRVSEINTHNESSLRALMEALLRGCEEISLRSVQENETLGTQTSAAYYIMIMQELQDLLEKLKGAYGGYSENCSDNAEQLAVSVVNCGHMLSLVFDRGMTIANASTNMVSGEKIASEIKECGNTSAEFFKLLASNADPSTINDSLQQLKDKLYAITNMIGDLSSNKDEIEKLEEMVEAELNGMDKAIEEASKQIMELLAQSRASDTGIKLEVNEKILDSCTTLMQCIKVLVQKSRKVQAEIIATGKGTATAKEFYKRNHQWTEGLISAAGSVAAAAKLLVESANKAVSEQSKHTLDVVVAAQEIAACVATLVVASRVKASRDSKSLRELTLASKDVTQSTSMVVATAKSCSQQLEETQELDFTKLSIHQAKTTEMELQVKILELEQAIQTERMRLAALRRQNYQNGD 
Type Start End Length
CDS 1106 1417 312
CDS 1499 1568 70
CDS 1753 2254 502
CDS 2616 2997 382
CDS 3455 3853 399
CDS 4549 4998 450
CDS 5317 5426 110
CDS 5487 5540 54
CDS 5980 6171 192
CDS 6285 6753 469
CDS 6955 7277 323
CDS 7616 7707 92
CDS 8841 8947 107
intron 1418 1498 81
intron 1569 1752 184
intron 2255 2615 361
intron 2998 3454 457
intron 3854 4548 695
intron 4999 5316 318
intron 5427 5486 60
intron 5541 5979 439
intron 6172 6284 113
intron 6754 6954 201
intron 7278 7615 338
intron 7708 8840 1133

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001660844 huntingtin interacting protein [Aedes aegypti] gb|EAT37579.1| huntingtin interacting protein [Aedes aegypti] 0.0
InterPro IPR002558 I/LWEQ
InterPro IPR013809 Epsin-like, N-terminal
InterPro IPR011417 ANTH
InterPro IPR008942 ENTH/VHS
Gene Ontology(MF) GO:0003779 actin binding
Gene Ontology(MF) GO:0005543 phospholipid binding
Pfam PF07651.11 ANTH domain 1.7e-62
Pfam PF01608.12 I/LWEQ domain 2.3e-65
Pfam PF07106.8 Tat binding protein 1(TBP-1)-interacting protein (TBPIP) 0.00034

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
A. aegypti AAEL010449
H. sapiens ENSP00000336747
M. musculus ENSMUSG00000000915
C. quinquefasciatus CPIJ004416
M. musculus ENSMUSG00000039959
P. vanderplanki Pv.04564
B. mori BGIBMGA005521-TA
P. vanderplanki Pv.04563
A. gambiae AGAP004801
H. sapiens ENSP00000253083
T. castaneum TC011686
S. invicta SI2.2.0_12725
N. vitripennis NV12217-PA
A. mellifera GB16663-PA
A. aegypti AAEL014598
D. plexippus DPOGS207840PA
P. humanus PHUM138710-PA
H. sapiens ENSP00000459517
H. melpomene HMEL015308-PA
H. sapiens ENSP00000460815
H. sapiens ENSP00000410300
D. melanogaster FBgn0036309