MidgeBase gene description page [Pn.02089]
Outline
Gene ID | Pn.02089 |
Type | Protein coding gene |
Scaffold | PnScaf1822 |
Start | 1191 |
End | 6933 |
Direction | - |
Sequence
Transcript: 5361 (bp)
ATGTTTATAGTACGAGCTTTAGAGAAAATTCTTGCTGACAAAGATATCAAACGACAAGGCCAATTGAAAAAGGCATGCGAATCGGCAATAGCAACACTCAAAGAGGAGCTGAAAGAAGAAAATGGACAAGCAACAGCCAACAGTGAACACCAGCAACAGCAGCAGCCGCCACAACACTCGTCGACTGCACTGCCGCTTCCGAAAAATGATTCCTCCAACAGCATCAATGCAGAAAAGTATTTTTTGCCATTCGAATTGGCATGCCAAAGCAAAACGCCCCGCATTGTGGTGACTGCACTCGATTGCCTGCAAAAGCTAATTGCCTACGGTCATTTGACGGGCAACATTGCTGACTCAGCCAACCCGGGCAAATTTCTTATCGATAGAATCGTTACGACCATTTGTAATTGCTTTATGGGACCGCAGACAGACGAAGGCGTTCAACTGCAAATAATCAAGGCTCTCCTGACGGTCGTCACCTCGCAAAATGTGGAAGTCCATGAAGGCACCGTTCTACAAGCTGTGCGAACATGCTATGACATTTATTTATCAAGTAAAAATTTGATAAACCAAACCACTGCGCGAGCGACACTAACGCAAATGCTCAACGTCATCTTCACTCGCATGGAGAATCAGGCCTTCGAAGTGACTACCATCCCGGTTCCGGCTCACAATAACAATAATCAACAGTCTCAGCAGCAACAGCCGCAGACGAGCGAGGAGAAGTCAGCCGCGAACAATGAAGGCGAATCGAAATCAGAAATTATTAATAGGAGTGAAGTCAAAGGTGAAGAATGTGAGAATAAAAACAACAACTCTGATGGAATTTCATCGAAAATGAATGAGCTTGCGATAAACGAGACCGACGAGAGCGAAAACATGGAAGAGTATGTGCGTAAAATTGTCGACGAAATATTGGATAACGCCGTAGAAATCGTAGAGAATAAGTTGGTAAACAACAATAGTGACAGTAGCATTAATAACATTAGTGGCGATGCAATTAGTGACAAAGTGACCGTTGGAGAGAAGCCCGAAGAGTTGAAGAAGGTGTCGAGCGTCAGCAGCATTTCGAACGACGGGTCTGTTGATCAAGCACCGTCGCACGAGAACGCCGAAATGAGCTCCGAAAACGAGAACATTGCGGCCACAAAATTCACACACGTACTGCAGAAAGACGCGTTTCTGGTGTTTCGTGCCCTTTGCAAGCTGTCCATGAAGCCGCTGCCCGAAGGCCAACCCGACCCGAAGTCCCACGAGTTACGATCAAAAATTTTATCATTGCACCTGCTGCTCTCCATTCTCCAGAATGCCGGGCCAGTTTTTCGATCAAACGAAATGTTCATCATGGCCATCAGGCAATATCTGTGCGTTGCGCTATCGAAAAATGGTGTCAGTTCCGTCCCCGAAATTTTTGAGCTCTCATTGTCCATTTTCGTGGCCTTGTTATCGAATTTTAAGGTTCATTTGAAGAAGCAAATAGAGGTGTTTTTTAAAGAGATTTTCCTCAACATTCTCGAGACGTCGAGCTCGACGTTCGAGCACAAGTTCATGGTGATTCAGGCGCTGGTGAGAATTTGTGCCGACGCCCAGAGCGTCGTCGACATTTACATCAACTACGATTGTGACTTTAGCGCGGCAAATCTTTTCGAGCGACTCGTCAACGACCTCTCGAAAATCGCCCAAGGCCGGCAAGCGTTGGAATTGGGAGCGACTACGCTGCAAGAGAAATCGATGCGAATCAAGGGACTCGAGTGTCTCGTTTCGATTCTAAAGTGCAAAGTCGAGTGGAGCAAAGACCTCTACATGAATCCAAATCTCCAGACGTCGCTGGGCGAAACGACACACAAGGCTGCGCCACACGACAGCGACACGAGCGATCAGCAGTCGATAAAGCACAGCGGATCGAGCTTGAGTCTGAACAGCGCGAGCAGCATCAACAACAACCACTCGGTCAATCGGGAAGTTCTTGACTTTCCCGAGGAGCTCGAGGAAAGAAAGCAGCGCAAGGAGTTGATGGAAACGGGCATCGAGATGTTCAACAATAAGCCGAAGAAGGGAATTCAGTTCTTGCAGGAGCGCGGTTTGTGCGGACTGAAGGTCGACGATATCGCGAAGTTCCTCATCGAGGACGACCGGCTGGATAAGACGCAAGTTGGCGACTTTCTCGGCGACAACGACGAGCTCAGCAAGTCGGTGATGTGCGCGTACATTGACGCAAAGGACTTCAGCGGAATGGAGATTGTGGCGGCGCTGAGGTTCTTCCTCGAAGGATTCCGGTTGCCGGGCGAGGCGCAGAAGATCGATCGACTCATGGAGAAGTTTGCGAGTCGATATTGCGAGTGCAATCCAAACAATCAGCTTTTCACGAGCGCCGACACGGTCTACGTTCTGGCCTTTTCGATCATCATGCTGACGACGGATCTGCACTCGCCGCAAGTCAAAAACAAGATGAGCAAAGAACAATACATTAGACTTAATCGGGGCATAAGCGACAGCAAGGATCTGCCCGAAGAGTACCTCTCACAGATTTACGATGAGATTTCCGGTCAGGAGATTAAGATGAAGAACACGGTGGTGACGAAGCCGAGCGGCAAGCCCGTCATAATTAACGAGAAAAAGCGCAAGCTCGTGTGGAACATGGAAATGGAGGCCATCTCGACGACGGCCAAAAACCTCATGGAGTCTGTGTCGCACGTCCGCGAGGCGTTCACTTCGGCGAAGCACTTGGATCACGTGCGGCCCATGTTCAAGCTCTCGTGGACGCCCTTCCTGGCGGCCTTCTCCGTTGGCCTGCAGGACTGCGACGATCCCGAAATTGCGCAGCTATGCTTGGACGGCATCCGATGCGCAATCAGAATTGCGTGCATTTTCCACATGACACTCGAGCGCGACGCGTATGTGCAGGCGCTCGCCCGGTTCACACTGCTCACGGCCAACTCGCCGATAACGGAAATGAAGGCGAAGAACATTGACACCATCAAGACGCTGATAATGGTCGCGCACACCGACGGCAATTATCTCGGCACGAGCTGGCTGGACATCGTCAAGTGCATATCGCAGCTGGAGCTTGCACAACTGATCGGAACGGGCGTGCGGCCACAATATTTGTCGGGCCCGACGCATCACCGCGATGCCCTGCTCGATCCGAGCGTGAAGGAACACATCGGCGAGACGAGTTCGCAATCGGTGGTCGTTGCAGTCGATCGCATCTTCACCGGCTCGATTCGCCTCGACGGCGATGCAATTGTCGATTTCGTCAAAGCCCTCTGCCACGTCTCGCTCGACGAGCTGAACAACGCTCAGCCGCGGATGTTTTCGCTGCAGAAAATTGTCGAAATCTCCTACTACAACATGGGCCGAATTCGTCTGCAGTGGTCGCGAATATGGCAAGTGCTTGGCGACTACTTCAATACGGTCGGCGCGTATGCAAACGAGGAGATTGCATTTTTCGCGCTCGACTCCCTTCGACAGCTCTCGATGAAGTTCATCGAGAAGGGCGAGTTCACGAACTTTCGCTTTCAGAAAGACTTCCTCAGGCCCTTCGAGCACATCATGAAGAAGAACAATTCGCCGGCGACGCGCGACATGGTCGTTCGGTGTGTCGCTCAAATGGTCAACTCGCAGTCGCAGAACATCAAGTCGGGCTGGAAGAACATTTTCTCGGTGTTTCATCTCGCCGCCAGCGACATGGATGAGTCGATTGTGGAGCTCGCCTTTCAAACAACCGGCAAGATCATCACGGTTCTGTACAAGAAACAATTCCACATCATGATTGACTCCTTTCCCGATGCCGTCAAGTGTCTGTCCGAGTTCGCCTCGAATACCCGCTTCCCCGACATCTCCATGGAGGCCATTCGCTTGCTGAGGACGTGCGCTGTCTCGGTCAATGAGTCGCCACAGCAATTTGCCGATCACACCGGCATGGAGAACGACATTCACGTGCTAGAAGAGGATCGCGTTTGGGTGCGCGGCTGGTTCCCGATGCTCTTCTCGCTCTCGTGCGTCGTCAATCGCTGCAAGCTCGACGTTCGAACGCGCGCCCTCACGGTGCTCTTCGAGATCGTGAAGACGTACGGCGAGAGCTACAAGCCCCACTGGTGGCGCGACTTGTTCAACATCTTATTCCGAATCTTTGACAATATGAAGCTCCCCGAGCACTACACCGAGAAGGCCGAGTGGATGACGACGACGTGCAACCACGCTTTGTATGCGATCATCGATGTGTTCACGCAGTACTTTGACACTCTCGGACCGCTGTTGCTGAAGGATCTGTACAGTCAGCTTCAGTGGTGCGTTCAGCAGAATAATGAGCAGTTGGCGCGCAGCGGCACAAATTGCCTCGAGAACCTCGTCATTTCGAATGGCACCAAGTTCAGCATCGAGACGTGGGAGGCGACATCGTGCTGCATCCTCGACATCTTCAGGCAAACACTTCCGAAGGAGCTGCTCACGTGGCGACCCGACCCGAACCAGCCGATCTCGCCGACCTCATTGACCGCACCCAACACAAACGCCCACCGGCACTCGATCACGGAAAATGGCGAAATCCGTCACGGCATTCTCAAGCGAAGCAACTCTCAACATTCCGTTTATAGTCTCAATTCCGAGGATGGCAAGACTGAGCTAATCACCCACACGAGCGGACTCTTTTCTAGTTTACTGATTAAGTGTGTCGTTCAATTGGAACTCATTCAGACGATCGACAACATCATCTTCTTCCCGGCCACGTCGAAGAAGGAGGACGCCGAGACGCTGGCGATGGCGACGGCTGATCTGAACCAAAGCGGCAATGTGTCGCTGCTGTCGACGGGCGAACACAGCGAGTGCCAACGCGAGGAGCAGGGAATGTATAGCTTCCTCAGCACGATGCATCTGATAAAGTTTGTCGACTGTCTCGTCGAGAGCCATCGCTTTGCGAAGAGCTTCAATCAGAACAATGACCAGCGGATGGTGCTGTGGAAGGCGGGCTTCAAGGGAAGCGTCAAGCCAAATTTGTTGAAGCAGGAAACGCAGTCGCTCGCCTGTGTGCTGAGAATTCTCTTCAAAATGTATGGCGATGAGAGTCGTCGCGACGATTGGATGGAGATTGAGCAGAGATTGATTAGCGTGTGCAGGGAGGCTCTCGAGTATTTCTTGGCACTACAAAGCGAACCACATCGCGAGGCGTGGACGTCCCTATTGCTTCTCATCATGACACGTCTGCTTAAAATGCCTGATCAAAGATTTGCAACACATTCGACAAATTACTATCCATTATTCTGCGACATGATGTGCTATGACATGAAGCCCGAGCTACGAAGTGTACTTAGAAGGTTGTTTGTACGAATTGGTCCCGTGTTTGGCATAAACAGTCCGAGTCAA
Protein: 1787 (aa)
MFIVRALEKILADKDIKRQGQLKKACESAIATLKEELKEENGQATANSEHQQQQQPPQHSSTALPLPKNDSSNSINAEKYFLPFELACQSKTPRIVVTALDCLQKLIAYGHLTGNIADSANPGKFLIDRIVTTICNCFMGPQTDEGVQLQIIKALLTVVTSQNVEVHEGTVLQAVRTCYDIYLSSKNLINQTTARATLTQMLNVIFTRMENQAFEVTTIPVPAHNNNNQQSQQQQPQTSEEKSAANNEGESKSEIINRSEVKGEECENKNNNSDGISSKMNELAINETDESENMEEYVRKIVDEILDNAVEIVENKLVNNNSDSSINNISGDAISDKVTVGEKPEELKKVSSVSSISNDGSVDQAPSHENAEMSSENENIAATKFTHVLQKDAFLVFRALCKLSMKPLPEGQPDPKSHELRSKILSLHLLLSILQNAGPVFRSNEMFIMAIRQYLCVALSKNGVSSVPEIFELSLSIFVALLSNFKVHLKKQIEVFFKEIFLNILETSSSTFEHKFMVIQALVRICADAQSVVDIYINYDCDFSAANLFERLVNDLSKIAQGRQALELGATTLQEKSMRIKGLECLVSILKCKVEWSKDLYMNPNLQTSLGETTHKAAPHDSDTSDQQSIKHSGSSLSLNSASSINNNHSVNREVLDFPEELEERKQRKELMETGIEMFNNKPKKGIQFLQERGLCGLKVDDIAKFLIEDDRLDKTQVGDFLGDNDELSKSVMCAYIDAKDFSGMEIVAALRFFLEGFRLPGEAQKIDRLMEKFASRYCECNPNNQLFTSADTVYVLAFSIIMLTTDLHSPQVKNKMSKEQYIRLNRGISDSKDLPEEYLSQIYDEISGQEIKMKNTVVTKPSGKPVIINEKKRKLVWNMEMEAISTTAKNLMESVSHVREAFTSAKHLDHVRPMFKLSWTPFLAAFSVGLQDCDDPEIAQLCLDGIRCAIRIACIFHMTLERDAYVQALARFTLLTANSPITEMKAKNIDTIKTLIMVAHTDGNYLGTSWLDIVKCISQLELAQLIGTGVRPQYLSGPTHHRDALLDPSVKEHIGETSSQSVVVAVDRIFTGSIRLDGDAIVDFVKALCHVSLDELNNAQPRMFSLQKIVEISYYNMGRIRLQWSRIWQVLGDYFNTVGAYANEEIAFFALDSLRQLSMKFIEKGEFTNFRFQKDFLRPFEHIMKKNNSPATRDMVVRCVAQMVNSQSQNIKSGWKNIFSVFHLAASDMDESIVELAFQTTGKIITVLYKKQFHIMIDSFPDAVKCLSEFASNTRFPDISMEAIRLLRTCAVSVNESPQQFADHTGMENDIHVLEEDRVWVRGWFPMLFSLSCVVNRCKLDVRTRALTVLFEIVKTYGESYKPHWWRDLFNILFRIFDNMKLPEHYTEKAEWMTTTCNHALYAIIDVFTQYFDTLGPLLLKDLYSQLQWCVQQNNEQLARSGTNCLENLVISNGTKFSIETWEATSCCILDIFRQTLPKELLTWRPDPNQPISPTSLTAPNTNAHRHSITENGEIRHGILKRSNSQHSVYSLNSEDGKTELITHTSGLFSSLLIKCVVQLELIQTIDNIIFFPATSKKEDAETLAMATADLNQSGNVSLLSTGEHSECQREEQGMYSFLSTMHLIKFVDCLVESHRFAKSFNQNNDQRMVLWKAGFKGSVKPNLLKQETQSLACVLRILFKMYGDESRRDDWMEIEQRLISVCREALEYFLALQSEPHREAWTSLLLLIMTRLLKMPDQRFATHSTNYYPLFCDMMCYDMKPELRSVLRRLFVRIGPVFGINSPSQ
Type | Start | End | Length |
CDS |
1194 |
1331 |
138 |
CDS |
1402 |
6105 |
4704 |
CDS |
6248 |
6663 |
416 |
CDS |
6831 |
6933 |
103 |
intron |
1332 |
1401 |
70 |
intron |
6106 |
6247 |
142 |
intron |
6664 |
6830 |
167 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_319652 |
AGAP008906-PA [Anopheles gambiae str. PEST] gb|EAA14874.4| AGAP008906-PA [Anopheles gambiae str. PEST] |
0.0 |
InterPro |
IPR023394 |
SEC7-like, alpha orthogonal bundle |
|
InterPro |
IPR011989 |
Armadillo-like helical |
|
InterPro |
IPR000904 |
SEC7-like |
|
InterPro |
IPR016024 |
Armadillo-type fold |
|
InterPro |
IPR015403 |
Domain of unknown function DUF1981, SEC7 associated |
|
Gene Ontology(BP) |
GO:0032012 |
regulation of ARF protein signal transduction |
|
Gene Ontology(CC) |
GO:0005622 |
intracellular |
|
Gene Ontology(MF) |
GO:0005086 |
ARF guanyl-nucleotide exchange factor activity |
|
Gene Ontology(MF) |
GO:0005488 |
binding |
|
Pfam |
PF01369.15 |
Sec7 domain |
4.5e-68 |
Pfam |
PF12783.2 |
Guanine nucleotide exchange factor in Golgi transport N-terminal |
1.2e-55 |
Pfam |
PF09324.5 |
Domain of unknown function (DUF1981) |
2.4e-30 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
S. invicta |
SI2.2.0_05038 |
H. sapiens |
ENSP00000360985 |
A. gambiae |
AGAP008906 |
H. sapiens |
ENSP00000430891 |
H. sapiens |
ENSP00000428429 |
D. plexippus |
DPOGS212337PA |
P. humanus |
PHUM191910-PA |
M. musculus |
ENSMUSG00000067851 |
P. vanderplanki |
Pv.17593 |
T. castaneum |
TC002423 |
B. mori |
BGIBMGA004647-TA |
C. quinquefasciatus |
CPIJ004831 |
P. vanderplanki |
Pv.17594 |
N. vitripennis |
NV13991-PA |
A. aegypti |
AAEL013012 |
S. invicta |
SI2.2.0_10415 |
H. sapiens |
ENSP00000262215 |
A. mellifera |
GB12468-PA |
H. melpomene |
HMEL003944-PA |
D. melanogaster |
FBgn0028538 |
S. invicta |
SI2.2.0_02931 |