MidgeBase gene description page [Pn.03748]
Outline
Gene ID | Pn.03748 |
Type | Protein coding gene |
Scaffold | PnScaf3094 |
Start | 3989 |
End | 12445 |
Direction | - |
Sequence
Transcript: 3588 (bp)
ATGAATACTAAACCTGCATTGACACCTCAGAACTCATCGAAACAAGTCGTTGAGTTGGAGGGATATGTCATAATATTGGTGGAGGGAAGAGATGGAAAGATAAAGTTGTATGGCAGTCCAGCCGATCGCGATGGTTTGGAAGTCGCTGACGAGATTCTCGACGTCAACGAACGAAAGCTCGACGATGTCCCTCGTGCTCTCGTCATAAAGCATATTCATGAGTGTATACAATCGTGCATGATAAAACTGAGAGTGAAGCGTCGAAGCGACTCGCGCCTTGCTGGCGAATTGTGTAACACGGTTCAAGATGCATTTCTCATTGCTGTTGAACAACAGGCTCGTGAACGCTTGCAACGTTTGTCAGCTCTCAAGAGAATCACGCCAGTCGACATAAGTCAGCTGTCAATAAAGTTAAATCAACAGCATCCGAAGGGAGGTGTAACGACGACACAGGACTATAGTTTCTTGAAGGACTCATCGCCAATCTACGTTACATCGCTCTCGAATAACAGTAATAGTACCGTGACAAGTGCAGCAATATCGACCGCGAAAATATCGATCGCCCAGCCGCCCGTCAAGAGTCCGACGGTGAAGAGCATTAGTAATCCACTGTGCGACACAACCGGTAATAGTCAGCACCATCATCCAGAACTTCATCAGACTAATAGCACGAGCAATCAATTACTAACCACTTCAATAATTCGCAAACCAATCACCTCCACCACTACGTTTGACAATATTAATAATAAAACGGTTCAGAATAACGTTAACTATCTTGCTTCCTCGGCTCTAACATCCAATCCCTCCTCCTCCGCGTCTATCTCAGCGACGGCACAGCAAATCAAACAGGCGGGCATCGTGTCCCACGGAACAACAGCAATACCATTAACTGACCCTAATAGTAGTGATTTAATCAAACATAGCGCTAATCTAACCAATAACATTAATGTGCTCAACAACAATGTTGTCAGTAGTTGTCAGCTGCCAAATAACATAGATTTCGATTCCAACGACGGTTGCACTAGTTACTTTTCAAATCGCACTGAGTTCTCTGTAGATTCTGCTGCTGCACAGGAAGATCACTTTGAAGGCAATCAAGGCATATCAAAGGGCGGAACTGAGGTTCTTCTCGGCACCGATCCGCTGGTGAAAAACGAAAATAGAACACGAAGACCATCACGTTCCAGCATATTGCTGGGTGATGACTCACTGAAACAAGTGTCGAGTGGAAATTACGCAAGTCACAACACAGTCGAGATGTACTCAGTCGAGAATGGTCAGCATCGTGAGATGGCCGTCGATGTGCCAGAGTCGTTTATCGCTCACAACAAGACACAGCCTCGCTATCCGCCGCCGCGAAACCCCAACAGCTCCACGCTCCCCATGTCGACGCCCTCGTCGAAGGCAATGGCGAACGACATGCGGCCGGTGCCGCCGCCGCGCGATCACATCCGCAGCGACAGCGCCCAGCAGTCGCAACTCCAGACGAAAGCCGGCAAAATCGGACAAATTTTGGAGCCAACCTCCGATCAAATGGACAGTATCAAGAAGTACCAAGACCAGCTGAAACAGCGGCGCGAGAAGGAGGAGCGCGTCGCGGCCACTAACGAGTTTCTGCGCAATAGCGTGCGTGGCTCGGAGAAGCTGAGAGCGTTGAAGGCAGAGCACATTGCACCGCCGCCCGTGGGCTTCGACAACGAGGCCTACATTGTCGAGGACGATGACGATGTTAACAAAATGAATTTAATTGATTACGATGAGATGGTAGCAACAATGCAAAGACTTCAAACAAACTTCAAAAAGCATGGCATGCATGCACTAGCCAGTCGTTTAAACATTGCTCAAAATTTATTACTTAAATCGGTGGTTGCCAAGGCATTAGATACACGAGTTCAGCTTCTTCATAGACGTTATCCGCGTGTTCAAAATCCAATTTCATACAACGTTCAAAAGTTGACAAAAGATTGTGTGGAGGCATTATCAGAATCAAATTCACATTTAGCGAACGAGCTCTGCGACTTGCTCACGTCTTATGAGATGGAGGGTTTGTTGCAGGCGCACGATAGCATAGCCTCGTTAACCGACAGAGCTTACTGCTCGCCATCGAACCTTAACATCATGCCCGTCAAAATTCCACTCCACCATGCGCCTTCGAATGTGTCCAGCAATTTGAAGCATGAAACGAAAGGCTACAGCAGTTCCAATGAGGACATCAAGAAGAAGCCAGAACCCGTTCCAATACCGTTTGGCGTGCTGCGTGATGGAAGTCAAGATCACATTAGAATAATTCAAATAGAAAAGTCCTCGGAGCCGCTGGGCGCGACGATTCGCAACGAAGGCGAGGCGGTCGTGATAGGACGCGTGGTGAGAGGCGGCGCCGCGGAGAAGTCGGGACTCTTGCACGAGGGCGACGAGATCCTCGAGGTGAATGGCATCGAAATGCGTGGTAAATCGGTCAACGACGTATGCTCGATTCTCGCCGGCATGACCGGGACGCTTACCTTTCTCGTGGTGCCAGCAAATCGCGTTCCTGTGCTCGTCGGCGGCCGCGATCCGCCGGTGCTGCACGTGCGCGCGCATTTCGATTACGACCCCGAAGATGACCTTTACATTCCGTGCCGTGAGCTGGGCATAAGCTTTCAAAAGGGCGACGTCCTGCACGTCATTTCACGCGAGGACCCCAACTGGTGGCAGGCGTATCGCGAGGGTGAAGAAGACCAGACGCTGGCTGGCCTCATACCGAGTCAGTCGTTCCAGCATCAGCGCGAGTCAATGAAACTGGCGATTGCCGGCGAGCTCGGAAGTCGGGCGAAGAATAATCGCGAAAGTAAAAGTGCCACTTCAACACTCCTGTGTGCTCGCAAGGGCCGAAGAAAACGAAAGAAGGCCAACAACGAGCATGGCTATCCGCTGTATGCGACGACGACACACGAGGAGCCCGAACCGGAAATCCTGACGTACGAGGAGGTGACGTTATACTATCCTCGGGCGTCACACAAGCGACCTATCGTACTCATCGGTCCACCCAACATCGGTCGTCACGAGCTTAGACAGCGACTGATGGCAGACTCAGACCGATTCGCCGCAGCCGTTCCACATACGTCGAGACCGCGACGAGAAGGCGAAGTTCCCGGTGTCGACTATCACTTCATTTCGCGACAGCAATTCGAGGCCGACATCCTCGGTCGTAAATTCGTGGAACATGGAGAATACGAAAAGGCATACTATGGTACGTCGCTGGAAGCAATACGAACAGTGGTGGAGAGCGGAAAAATTTGCGTTCTAAATTTGCATCCTCAAAGCTTGAAAATTCTACGGCAATCGGACTTAAAACCTTACACCGTGCTAGTCGCTCCGCCGAGTCTGGAAAAGCTACGCCAGAAGAAGATTCGAGCGGGAGAGCCTTATAAGGAGGAAGAGTTGAAAGAAATTATTGCGACAGCTCGCGATATGGAAGCACGATGGGGACATCTCTTCGATATGATAATTATAAATAATGATACGGAGCGAGCGTATCATCAGCTATTGGCCGAAATTAATTCACTTGAAAGAGAGCCGCAATGGGTGGAGAAACTGAACAACAAGAGT
Protein: 1196 (aa)
MNTKPALTPQNSSKQVVELEGYVIILVEGRDGKIKLYGSPADRDGLEVADEILDVNERKLDDVPRALVIKHIHECIQSCMIKLRVKRRSDSRLAGELCNTVQDAFLIAVEQQARERLQRLSALKRITPVDISQLSIKLNQQHPKGGVTTTQDYSFLKDSSPIYVTSLSNNSNSTVTSAAISTAKISIAQPPVKSPTVKSISNPLCDTTGNSQHHHPELHQTNSTSNQLLTTSIIRKPITSTTTFDNINNKTVQNNVNYLASSALTSNPSSSASISATAQQIKQAGIVSHGTTAIPLTDPNSSDLIKHSANLTNNINVLNNNVVSSCQLPNNIDFDSNDGCTSYFSNRTEFSVDSAAAQEDHFEGNQGISKGGTEVLLGTDPLVKNENRTRRPSRSSILLGDDSLKQVSSGNYASHNTVEMYSVENGQHREMAVDVPESFIAHNKTQPRYPPPRNPNSSTLPMSTPSSKAMANDMRPVPPPRDHIRSDSAQQSQLQTKAGKIGQILEPTSDQMDSIKKYQDQLKQRREKEERVAATNEFLRNSVRGSEKLRALKAEHIAPPPVGFDNEAYIVEDDDDVNKMNLIDYDEMVATMQRLQTNFKKHGMHALASRLNIAQNLLLKSVVAKALDTRVQLLHRRYPRVQNPISYNVQKLTKDCVEALSESNSHLANELCDLLTSYEMEGLLQAHDSIASLTDRAYCSPSNLNIMPVKIPLHHAPSNVSSNLKHETKGYSSSNEDIKKKPEPVPIPFGVLRDGSQDHIRIIQIEKSSEPLGATIRNEGEAVVIGRVVRGGAAEKSGLLHEGDEILEVNGIEMRGKSVNDVCSILAGMTGTLTFLVVPANRVPVLVGGRDPPVLHVRAHFDYDPEDDLYIPCRELGISFQKGDVLHVISREDPNWWQAYREGEEDQTLAGLIPSQSFQHQRESMKLAIAGELGSRAKNNRESKSATSTLLCARKGRRKRKKANNEHGYPLYATTTHEEPEPEILTYEEVTLYYPRASHKRPIVLIGPPNIGRHELRQRLMADSDRFAAAVPHTSRPRREGEVPGVDYHFISRQQFEADILGRKFVEHGEYEKAYYGTSLEAIRTVVESGKICVLNLHPQSLKILRQSDLKPYTVLVAPPSLEKLRQKKIRAGEPYKEEELKEIIATARDMEARWGHLFDMIIINNDTERAYHQLLAEINSLEREPQWVEKLNNKS
Type | Start | End | Length |
CDS |
3992 |
4168 |
177 |
CDS |
4247 |
4428 |
182 |
CDS |
4515 |
4646 |
132 |
CDS |
5024 |
5900 |
877 |
CDS |
5978 |
6232 |
255 |
CDS |
6612 |
6826 |
215 |
CDS |
6917 |
7503 |
587 |
CDS |
7713 |
7828 |
116 |
CDS |
8297 |
8932 |
636 |
CDS |
9342 |
9472 |
131 |
CDS |
11607 |
11664 |
58 |
CDS |
11742 |
11921 |
180 |
CDS |
12404 |
12445 |
42 |
intron |
4169 |
4246 |
78 |
intron |
4429 |
4514 |
86 |
intron |
4647 |
5023 |
377 |
intron |
5901 |
5977 |
77 |
intron |
6233 |
6611 |
379 |
intron |
6827 |
6916 |
90 |
intron |
7504 |
7712 |
209 |
intron |
7829 |
8296 |
468 |
intron |
8933 |
9341 |
409 |
intron |
9473 |
11606 |
2134 |
intron |
11665 |
11741 |
77 |
intron |
11922 |
12403 |
482 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_003436217 |
AGAP002711-PF [Anopheles gambiae str. PEST] gb|EGK96796.1| AGAP002711-PF [Anopheles gambiae str. PEST] |
0.0 |
InterPro |
IPR011511 |
Variant SH3 |
|
InterPro |
IPR001452 |
Src homology-3 domain |
|
InterPro |
IPR008144 |
Guanylate kinase |
|
InterPro |
IPR008145 |
Guanylate kinase/L-type calcium channel |
|
InterPro |
IPR001478 |
PDZ domain |
|
InterPro |
IPR004172 |
L27 |
|
InterPro |
IPR020590 |
Guanylate kinase, conserved site |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Pfam |
PF02828.11 |
L27 domain |
0.001 |
Pfam |
PF07653.12 |
Variant SH3 domain |
4e-13 |
Pfam |
PF13180.1 |
PDZ domain |
1.4e-09 |
Pfam |
PF00625.16 |
Guanylate kinase |
2.4e-45 |
Pfam |
PF00595.19 |
PDZ domain (Also known as DHR or GLGF) |
2.6e-17 |
Pfam |
PF00018.23 |
SH3 domain |
4.6e-06 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
B. mori |
BGIBMGA008411-TA |
N. vitripennis |
NV12091-PA |
A. gambiae |
AGAP002711 |
P. vanderplanki |
Pv.04331 |
A. mellifera |
GB18930-PA |
P. humanus |
PHUM287330-PA |
D. melanogaster |
FBgn0261873 |
M. musculus |
ENSMUSG00000021112 |
H. sapiens |
ENSP00000261681 |
D. plexippus |
DPOGS210401PA |
A. aegypti |
AAEL014012 |
C. quinquefasciatus |
CPIJ002874 |
H. sapiens |
ENSP00000451488 |
H. melpomene |
HMEL006754-PA |
T. castaneum |
TC007820 |