MidgeBase gene description page [Pn.02089]

Outline

Link to gbrowse

Gene ID Pn.02089
Type Protein coding gene
Scaffold PnScaf1822
Start 1191
End 6933
Direction -

Sequence

Transcript: 5361 (bp)

 ATGTTTATAGTACGAGCTTTAGAGAAAATTCTTGCTGACAAAGATATCAAACGACAAGGCCAATTGAAAAAGGCATGCGAATCGGCAATAGCAACACTCAAAGAGGAGCTGAAAGAAGAAAATGGACAAGCAACAGCCAACAGTGAACACCAGCAACAGCAGCAGCCGCCACAACACTCGTCGACTGCACTGCCGCTTCCGAAAAATGATTCCTCCAACAGCATCAATGCAGAAAAGTATTTTTTGCCATTCGAATTGGCATGCCAAAGCAAAACGCCCCGCATTGTGGTGACTGCACTCGATTGCCTGCAAAAGCTAATTGCCTACGGTCATTTGACGGGCAACATTGCTGACTCAGCCAACCCGGGCAAATTTCTTATCGATAGAATCGTTACGACCATTTGTAATTGCTTTATGGGACCGCAGACAGACGAAGGCGTTCAACTGCAAATAATCAAGGCTCTCCTGACGGTCGTCACCTCGCAAAATGTGGAAGTCCATGAAGGCACCGTTCTACAAGCTGTGCGAACATGCTATGACATTTATTTATCAAGTAAAAATTTGATAAACCAAACCACTGCGCGAGCGACACTAACGCAAATGCTCAACGTCATCTTCACTCGCATGGAGAATCAGGCCTTCGAAGTGACTACCATCCCGGTTCCGGCTCACAATAACAATAATCAACAGTCTCAGCAGCAACAGCCGCAGACGAGCGAGGAGAAGTCAGCCGCGAACAATGAAGGCGAATCGAAATCAGAAATTATTAATAGGAGTGAAGTCAAAGGTGAAGAATGTGAGAATAAAAACAACAACTCTGATGGAATTTCATCGAAAATGAATGAGCTTGCGATAAACGAGACCGACGAGAGCGAAAACATGGAAGAGTATGTGCGTAAAATTGTCGACGAAATATTGGATAACGCCGTAGAAATCGTAGAGAATAAGTTGGTAAACAACAATAGTGACAGTAGCATTAATAACATTAGTGGCGATGCAATTAGTGACAAAGTGACCGTTGGAGAGAAGCCCGAAGAGTTGAAGAAGGTGTCGAGCGTCAGCAGCATTTCGAACGACGGGTCTGTTGATCAAGCACCGTCGCACGAGAACGCCGAAATGAGCTCCGAAAACGAGAACATTGCGGCCACAAAATTCACACACGTACTGCAGAAAGACGCGTTTCTGGTGTTTCGTGCCCTTTGCAAGCTGTCCATGAAGCCGCTGCCCGAAGGCCAACCCGACCCGAAGTCCCACGAGTTACGATCAAAAATTTTATCATTGCACCTGCTGCTCTCCATTCTCCAGAATGCCGGGCCAGTTTTTCGATCAAACGAAATGTTCATCATGGCCATCAGGCAATATCTGTGCGTTGCGCTATCGAAAAATGGTGTCAGTTCCGTCCCCGAAATTTTTGAGCTCTCATTGTCCATTTTCGTGGCCTTGTTATCGAATTTTAAGGTTCATTTGAAGAAGCAAATAGAGGTGTTTTTTAAAGAGATTTTCCTCAACATTCTCGAGACGTCGAGCTCGACGTTCGAGCACAAGTTCATGGTGATTCAGGCGCTGGTGAGAATTTGTGCCGACGCCCAGAGCGTCGTCGACATTTACATCAACTACGATTGTGACTTTAGCGCGGCAAATCTTTTCGAGCGACTCGTCAACGACCTCTCGAAAATCGCCCAAGGCCGGCAAGCGTTGGAATTGGGAGCGACTACGCTGCAAGAGAAATCGATGCGAATCAAGGGACTCGAGTGTCTCGTTTCGATTCTAAAGTGCAAAGTCGAGTGGAGCAAAGACCTCTACATGAATCCAAATCTCCAGACGTCGCTGGGCGAAACGACACACAAGGCTGCGCCACACGACAGCGACACGAGCGATCAGCAGTCGATAAAGCACAGCGGATCGAGCTTGAGTCTGAACAGCGCGAGCAGCATCAACAACAACCACTCGGTCAATCGGGAAGTTCTTGACTTTCCCGAGGAGCTCGAGGAAAGAAAGCAGCGCAAGGAGTTGATGGAAACGGGCATCGAGATGTTCAACAATAAGCCGAAGAAGGGAATTCAGTTCTTGCAGGAGCGCGGTTTGTGCGGACTGAAGGTCGACGATATCGCGAAGTTCCTCATCGAGGACGACCGGCTGGATAAGACGCAAGTTGGCGACTTTCTCGGCGACAACGACGAGCTCAGCAAGTCGGTGATGTGCGCGTACATTGACGCAAAGGACTTCAGCGGAATGGAGATTGTGGCGGCGCTGAGGTTCTTCCTCGAAGGATTCCGGTTGCCGGGCGAGGCGCAGAAGATCGATCGACTCATGGAGAAGTTTGCGAGTCGATATTGCGAGTGCAATCCAAACAATCAGCTTTTCACGAGCGCCGACACGGTCTACGTTCTGGCCTTTTCGATCATCATGCTGACGACGGATCTGCACTCGCCGCAAGTCAAAAACAAGATGAGCAAAGAACAATACATTAGACTTAATCGGGGCATAAGCGACAGCAAGGATCTGCCCGAAGAGTACCTCTCACAGATTTACGATGAGATTTCCGGTCAGGAGATTAAGATGAAGAACACGGTGGTGACGAAGCCGAGCGGCAAGCCCGTCATAATTAACGAGAAAAAGCGCAAGCTCGTGTGGAACATGGAAATGGAGGCCATCTCGACGACGGCCAAAAACCTCATGGAGTCTGTGTCGCACGTCCGCGAGGCGTTCACTTCGGCGAAGCACTTGGATCACGTGCGGCCCATGTTCAAGCTCTCGTGGACGCCCTTCCTGGCGGCCTTCTCCGTTGGCCTGCAGGACTGCGACGATCCCGAAATTGCGCAGCTATGCTTGGACGGCATCCGATGCGCAATCAGAATTGCGTGCATTTTCCACATGACACTCGAGCGCGACGCGTATGTGCAGGCGCTCGCCCGGTTCACACTGCTCACGGCCAACTCGCCGATAACGGAAATGAAGGCGAAGAACATTGACACCATCAAGACGCTGATAATGGTCGCGCACACCGACGGCAATTATCTCGGCACGAGCTGGCTGGACATCGTCAAGTGCATATCGCAGCTGGAGCTTGCACAACTGATCGGAACGGGCGTGCGGCCACAATATTTGTCGGGCCCGACGCATCACCGCGATGCCCTGCTCGATCCGAGCGTGAAGGAACACATCGGCGAGACGAGTTCGCAATCGGTGGTCGTTGCAGTCGATCGCATCTTCACCGGCTCGATTCGCCTCGACGGCGATGCAATTGTCGATTTCGTCAAAGCCCTCTGCCACGTCTCGCTCGACGAGCTGAACAACGCTCAGCCGCGGATGTTTTCGCTGCAGAAAATTGTCGAAATCTCCTACTACAACATGGGCCGAATTCGTCTGCAGTGGTCGCGAATATGGCAAGTGCTTGGCGACTACTTCAATACGGTCGGCGCGTATGCAAACGAGGAGATTGCATTTTTCGCGCTCGACTCCCTTCGACAGCTCTCGATGAAGTTCATCGAGAAGGGCGAGTTCACGAACTTTCGCTTTCAGAAAGACTTCCTCAGGCCCTTCGAGCACATCATGAAGAAGAACAATTCGCCGGCGACGCGCGACATGGTCGTTCGGTGTGTCGCTCAAATGGTCAACTCGCAGTCGCAGAACATCAAGTCGGGCTGGAAGAACATTTTCTCGGTGTTTCATCTCGCCGCCAGCGACATGGATGAGTCGATTGTGGAGCTCGCCTTTCAAACAACCGGCAAGATCATCACGGTTCTGTACAAGAAACAATTCCACATCATGATTGACTCCTTTCCCGATGCCGTCAAGTGTCTGTCCGAGTTCGCCTCGAATACCCGCTTCCCCGACATCTCCATGGAGGCCATTCGCTTGCTGAGGACGTGCGCTGTCTCGGTCAATGAGTCGCCACAGCAATTTGCCGATCACACCGGCATGGAGAACGACATTCACGTGCTAGAAGAGGATCGCGTTTGGGTGCGCGGCTGGTTCCCGATGCTCTTCTCGCTCTCGTGCGTCGTCAATCGCTGCAAGCTCGACGTTCGAACGCGCGCCCTCACGGTGCTCTTCGAGATCGTGAAGACGTACGGCGAGAGCTACAAGCCCCACTGGTGGCGCGACTTGTTCAACATCTTATTCCGAATCTTTGACAATATGAAGCTCCCCGAGCACTACACCGAGAAGGCCGAGTGGATGACGACGACGTGCAACCACGCTTTGTATGCGATCATCGATGTGTTCACGCAGTACTTTGACACTCTCGGACCGCTGTTGCTGAAGGATCTGTACAGTCAGCTTCAGTGGTGCGTTCAGCAGAATAATGAGCAGTTGGCGCGCAGCGGCACAAATTGCCTCGAGAACCTCGTCATTTCGAATGGCACCAAGTTCAGCATCGAGACGTGGGAGGCGACATCGTGCTGCATCCTCGACATCTTCAGGCAAACACTTCCGAAGGAGCTGCTCACGTGGCGACCCGACCCGAACCAGCCGATCTCGCCGACCTCATTGACCGCACCCAACACAAACGCCCACCGGCACTCGATCACGGAAAATGGCGAAATCCGTCACGGCATTCTCAAGCGAAGCAACTCTCAACATTCCGTTTATAGTCTCAATTCCGAGGATGGCAAGACTGAGCTAATCACCCACACGAGCGGACTCTTTTCTAGTTTACTGATTAAGTGTGTCGTTCAATTGGAACTCATTCAGACGATCGACAACATCATCTTCTTCCCGGCCACGTCGAAGAAGGAGGACGCCGAGACGCTGGCGATGGCGACGGCTGATCTGAACCAAAGCGGCAATGTGTCGCTGCTGTCGACGGGCGAACACAGCGAGTGCCAACGCGAGGAGCAGGGAATGTATAGCTTCCTCAGCACGATGCATCTGATAAAGTTTGTCGACTGTCTCGTCGAGAGCCATCGCTTTGCGAAGAGCTTCAATCAGAACAATGACCAGCGGATGGTGCTGTGGAAGGCGGGCTTCAAGGGAAGCGTCAAGCCAAATTTGTTGAAGCAGGAAACGCAGTCGCTCGCCTGTGTGCTGAGAATTCTCTTCAAAATGTATGGCGATGAGAGTCGTCGCGACGATTGGATGGAGATTGAGCAGAGATTGATTAGCGTGTGCAGGGAGGCTCTCGAGTATTTCTTGGCACTACAAAGCGAACCACATCGCGAGGCGTGGACGTCCCTATTGCTTCTCATCATGACACGTCTGCTTAAAATGCCTGATCAAAGATTTGCAACACATTCGACAAATTACTATCCATTATTCTGCGACATGATGTGCTATGACATGAAGCCCGAGCTACGAAGTGTACTTAGAAGGTTGTTTGTACGAATTGGTCCCGTGTTTGGCATAAACAGTCCGAGTCAA 

Protein: 1787 (aa)

 MFIVRALEKILADKDIKRQGQLKKACESAIATLKEELKEENGQATANSEHQQQQQPPQHSSTALPLPKNDSSNSINAEKYFLPFELACQSKTPRIVVTALDCLQKLIAYGHLTGNIADSANPGKFLIDRIVTTICNCFMGPQTDEGVQLQIIKALLTVVTSQNVEVHEGTVLQAVRTCYDIYLSSKNLINQTTARATLTQMLNVIFTRMENQAFEVTTIPVPAHNNNNQQSQQQQPQTSEEKSAANNEGESKSEIINRSEVKGEECENKNNNSDGISSKMNELAINETDESENMEEYVRKIVDEILDNAVEIVENKLVNNNSDSSINNISGDAISDKVTVGEKPEELKKVSSVSSISNDGSVDQAPSHENAEMSSENENIAATKFTHVLQKDAFLVFRALCKLSMKPLPEGQPDPKSHELRSKILSLHLLLSILQNAGPVFRSNEMFIMAIRQYLCVALSKNGVSSVPEIFELSLSIFVALLSNFKVHLKKQIEVFFKEIFLNILETSSSTFEHKFMVIQALVRICADAQSVVDIYINYDCDFSAANLFERLVNDLSKIAQGRQALELGATTLQEKSMRIKGLECLVSILKCKVEWSKDLYMNPNLQTSLGETTHKAAPHDSDTSDQQSIKHSGSSLSLNSASSINNNHSVNREVLDFPEELEERKQRKELMETGIEMFNNKPKKGIQFLQERGLCGLKVDDIAKFLIEDDRLDKTQVGDFLGDNDELSKSVMCAYIDAKDFSGMEIVAALRFFLEGFRLPGEAQKIDRLMEKFASRYCECNPNNQLFTSADTVYVLAFSIIMLTTDLHSPQVKNKMSKEQYIRLNRGISDSKDLPEEYLSQIYDEISGQEIKMKNTVVTKPSGKPVIINEKKRKLVWNMEMEAISTTAKNLMESVSHVREAFTSAKHLDHVRPMFKLSWTPFLAAFSVGLQDCDDPEIAQLCLDGIRCAIRIACIFHMTLERDAYVQALARFTLLTANSPITEMKAKNIDTIKTLIMVAHTDGNYLGTSWLDIVKCISQLELAQLIGTGVRPQYLSGPTHHRDALLDPSVKEHIGETSSQSVVVAVDRIFTGSIRLDGDAIVDFVKALCHVSLDELNNAQPRMFSLQKIVEISYYNMGRIRLQWSRIWQVLGDYFNTVGAYANEEIAFFALDSLRQLSMKFIEKGEFTNFRFQKDFLRPFEHIMKKNNSPATRDMVVRCVAQMVNSQSQNIKSGWKNIFSVFHLAASDMDESIVELAFQTTGKIITVLYKKQFHIMIDSFPDAVKCLSEFASNTRFPDISMEAIRLLRTCAVSVNESPQQFADHTGMENDIHVLEEDRVWVRGWFPMLFSLSCVVNRCKLDVRTRALTVLFEIVKTYGESYKPHWWRDLFNILFRIFDNMKLPEHYTEKAEWMTTTCNHALYAIIDVFTQYFDTLGPLLLKDLYSQLQWCVQQNNEQLARSGTNCLENLVISNGTKFSIETWEATSCCILDIFRQTLPKELLTWRPDPNQPISPTSLTAPNTNAHRHSITENGEIRHGILKRSNSQHSVYSLNSEDGKTELITHTSGLFSSLLIKCVVQLELIQTIDNIIFFPATSKKEDAETLAMATADLNQSGNVSLLSTGEHSECQREEQGMYSFLSTMHLIKFVDCLVESHRFAKSFNQNNDQRMVLWKAGFKGSVKPNLLKQETQSLACVLRILFKMYGDESRRDDWMEIEQRLISVCREALEYFLALQSEPHREAWTSLLLLIMTRLLKMPDQRFATHSTNYYPLFCDMMCYDMKPELRSVLRRLFVRIGPVFGINSPSQ 
Type Start End Length
CDS 1194 1331 138
CDS 1402 6105 4704
CDS 6248 6663 416
CDS 6831 6933 103
intron 1332 1401 70
intron 6106 6247 142
intron 6664 6830 167

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_319652 AGAP008906-PA [Anopheles gambiae str. PEST] gb|EAA14874.4| AGAP008906-PA [Anopheles gambiae str. PEST] 0.0
InterPro IPR023394 SEC7-like, alpha orthogonal bundle
InterPro IPR011989 Armadillo-like helical
InterPro IPR000904 SEC7-like
InterPro IPR016024 Armadillo-type fold
InterPro IPR015403 Domain of unknown function DUF1981, SEC7 associated
Gene Ontology(BP) GO:0032012 regulation of ARF protein signal transduction
Gene Ontology(CC) GO:0005622 intracellular
Gene Ontology(MF) GO:0005086 ARF guanyl-nucleotide exchange factor activity
Gene Ontology(MF) GO:0005488 binding
Pfam PF01369.15 Sec7 domain 4.5e-68
Pfam PF12783.2 Guanine nucleotide exchange factor in Golgi transport N-terminal 1.2e-55
Pfam PF09324.5 Domain of unknown function (DUF1981) 2.4e-30

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
S. invicta SI2.2.0_05038
H. sapiens ENSP00000360985
A. gambiae AGAP008906
H. sapiens ENSP00000430891
H. sapiens ENSP00000428429
D. plexippus DPOGS212337PA
P. humanus PHUM191910-PA
M. musculus ENSMUSG00000067851
P. vanderplanki Pv.17593
T. castaneum TC002423
B. mori BGIBMGA004647-TA
C. quinquefasciatus CPIJ004831
P. vanderplanki Pv.17594
N. vitripennis NV13991-PA
A. aegypti AAEL013012
S. invicta SI2.2.0_10415
H. sapiens ENSP00000262215
A. mellifera GB12468-PA
H. melpomene HMEL003944-PA
D. melanogaster FBgn0028538
S. invicta SI2.2.0_02931