MidgeBase gene description page [Pn.09740]
Outline
Gene ID | Pn.09740 |
Type | Protein coding gene |
Scaffold | PnScaf10089 |
Start | 1344 |
End | 9909 |
Direction | + |
Sequence
Transcript: 4170 (bp)
ATGATTTTTAAACGACGATTCAGCGCAGCTCTATTAGTTTTATTGTGCAGCACATTTGCGGCGACGGAGCGTGGCGGCGATGTGAAGCTCGATAGCGACGATTTACAGGTCGTGTACTCGGATCCTCCGGAAGCTGTCGCGTCGACGCCGTCACGTCACTTGTACACAAAAGAAATCGCACAATGTTCGCCCGGGGAGAGCAAAATGATGTTGACACTTTTGCTTCAATCACTTAAATGGAGCGACGTTGGCGAGCGCAAGCAGGAGAAGGCCATTAAAAAGATGTCACGTTTCTTCCTCGTGCCAGTGAAATACATCAGACACGATGACATCAGCACCACGGAAATGCTCGAAATGCTGAAGCACTCGATTAACAAGAGCAATAACTTGCTCAACACCAAAGACGCCGTCGGACGCATCGAGTTCATGATCGGATGCGGCGATAAGTTGTTTGCGAGCAGCAAATCAGTTGCGCACAAGGTCGACGATTACGTCAAGTTGGGCATGATGGAGGAGCTGTCGGACCTGCCCATCCAGTGGTGGAACATCTGGACGAAGCACGTCAAGAGTCGCTTGCCGAGGACGAGACGGCAAGTCGAGGGATCGGGAAATGCCGACGAGGAGGACGACTACGAGGAAGAGTACACGGACGAGCAAATCGAGGACGACGAGGATGAGGATGTCAACAATAATGACGATCAATCTCCGACATCTGCCGTGAATGCGAATCAGCGAAAGAAGCAGCATTCGAAGGGCAATAAGGAGCTTTCGACAGCGCCGAAAAGTCAAGATGATCTGCCGCCTCCTCCGCAGCAAAGACCCACTGACGGTGGCGCCGAAACGGAGGCTAATGAGAAAATTGAAGATAATGTCGAGAGCGAGAATGGCGTGGCTCGTGGCGGCGAGAAAATTGAATCGGAATCCGACTACGCGAATCCTCTCGAAGAGGAGTCATTAGCACCTATTGATGGGGAAAGCGATAGCGTCGATGATTCAATTAAGGGCGAGCAAATTGAAGACATTACGCAGCGCAACGCGGAGGCGAATAGTCCGACGCCGATCGGCACCAACGAGATAATCAAGTTGGAGAAGGAGGTGGACAAGACCATCGAGAACGCCAACAATGTCGACGATGTTCCCTTGCCGCCCAAGCAACAGGACGACAAAATTCTCGAGGAGTTCGTCAACAGCATGATGCCGAATGTGCCGGTCCAGGAGCTCACGACGGACAGCGAGGTTTACACGATTCTCACCTCGCCGACGACGACCGTCGAGACGCCGCACACGTCACGTCATCAGCATCATCATCGCATCCATCATCGACCCGATGCACCCGAGACAGCGACGAGTGGAGGTGAGGAGCCTCGACCGGACACGTCGTTAGCGACAGCCGGTGCTGAAGACGTGACGAGTGAATTTTTCACTACAATCGCCACCGCCACGAACACGGCGGTGGTGCAGCAGCAACATGTACCCGAGGTGGAAACAGTTTTCACTCCCATAAATAACAGCCGCGCACACAATGAGCCCATTAATGTGTTTATCGATGAAAAATTGCCGATTGTTACGGCGACTGTGGACGTTACGACTGCGACAATATCGACCAGTGAGAAAGAATCGATTACGATGATTGAGGAAAGTGTTACGGATCGAATGCCATCATCTGCATCACCATCACAACCGCCATTAATTATTTCATCAACCTCACATGAACATGAAACGACTTACAATTATCCCATTACGACAAATCCAACTATAATAACAACAACAGCTGATATAACAACAATCATAACCGAGCAAAGCATAACAGAACCTACAACAACAACAACAACCCTTAGAAATGTAGACGCGACAGTTGCTCATGTTGTCGCTGAGGATAAGGTCTCGACGGCGGCAAACGACTACGAGGAGCGAAAATCAATCGAGGAATCGAATTACGATGACGAGTACGATGATGATGACGAGAATTTCATTGACCAGCCAACGCAGAGACCTTCGAG
CATCGCAACAACGGCTCGCGCCCCGACCACCATCAAGGAGGACCGACGAACGCCGCTCCCACTCATCCCGACCCTGCCAAACGAGGACGAGGACGACGAGGATGACGAGGACAACGAGCTGGACGAGGACCTCCTTGCACAAATTTCGACGTCCACGCAAGCACCGACAACCCAAACACCACCTGCGACCACTACCATAGCTGCCACTACGACCATCACCACGACCACTGAGCAGACGACCACGGTGGTGTCGACAACTCAGTCGGCAGCCACAGAAGTCACGAGCGAGTCTTCGGAGGTGACGAGCGACGGTTCGGACTACGACGACAACGAGCCGCCGCGAGTCGTAAAGAGAATCAAGAAGATTCAGGCGACCGCCGGCAAGACGTTCATCTACAACCTTGAGGGTCCGATATTCGAGGACAAGGAGGGCAAGACGAACCTCCGGCTAGAGATGCTGGACAAGAACGATGAGCCGCTGCCGTCGAGCTCGTGGATTCGCTTCGATGCGGCTAAACAGGAAATCTACGGACTGCCGCTCGAGAAAGACGTCAGCCGACACGAGTTTAAGCTGCGAGCGACCGACAAGGACGGCGCTTACGTGGACGAGGACGTCGACGTCACAGTGCAGCAGCACAAGAGCTTCCGCAGCGTGAACCACGAGATTTTCATTAAGGTGTCGCTGGAGAAGCAATTCGAGTCGGACGTCGACTGGAAGATTCGCCTCATGCGCGGCATCAACGCCGTTCTCGGCGACAACTCGCTCGGCAACATCTACGTGCGCGACGTGACGCCGCACAAGTACGAGGATACGCGCTACACCTTCTCCTACACGAACGACTCGCTGCCCAAGGAACACTGCCCGAAGGCCGAGCTCGACGACCTGATGCTGAAGCTCACCAAGCAGGCCTTGAACAGCGAAATGCGCCGGGAAATTGCCGTGCACAACATCGAGAAGGAGCTGATTAACTCGTGCGCCAAAGTCAAGACTCCGCGGCCGCCGTCGCTGCCGCCGAGCCGCACAAACTTCCCGCCAACCGTCCGCAATCACGTGGACAAGATCGAGGCACACGTCGGCCAGCTGTTGGTGTTTGCCGTGCCTGAAGACACTTTCTACGATCCGGAGGATCAGAACGACCTGAAACTGTCGCTTCTCTACGAGAACCGCTCGCTGCTCGAGTCCTCCAACTGGTTGCAGTTCGACGCCAAGAACCAGGAGTTCTACGGCGTACCGACTATCTACGACAAGACGCAAACTTACGTGCTCGTGGCAGAGGACAAGAACGGCCTCACCACCAACGACGCCCTCGTCGTCGAGATCCAAAATCCTCACTCGAAACGCGACTTTAGCGTGACCTTCGAGTACCAGTTGGACATCGGCTATGAGCAGTTCAAGAGCGCTGCCACAAAGCGCAAGTTCATTGAGCGCATTCAGCAGCTGTTTGGCGACGCGGACGCCAGCGCGATTCTCATAAAGTCCGTCAAGGAACTCAAGCACTACGGACGGACGTCTGTTGTCGTGCAGAACACGACACTCACTCATCGCATCTGTCCGATCAACCTTATCGATAGCCTTCGAACGCGTCTCGTGCGTACAGACGGCAATCTGAGAGACGAAGTCAAGCAGGCCATCGGAAGCGAATTCAACGTCCTGAAAATTAGTATAGTTCCTACATCAAAATGTTCCGGCGGAGACACATATCACCATCCCGACGAAACCAGTCCCGTTGATCGTCCCGAAGAGCACGAGTCGCCCTTGTTGAATCAAGAAGTACTTATCACATATGTTCTTCCCACAGCCATTATATTGTTGATGCTGTTAATTGCACTGTTGATCGCATGCCTTCTTTACAAACGCCGCAATACCGGCAAAATGGAGCTGGGCGACGAGGAGGAAAGAAAGTCCTTCCGCTCGAAGGGAATTCCTGTGATTTTCCAAGATGAATTGGATGAGAAGCCTGAAATTATTACAAAGTCACCTGTCATCTTGAAAGACGAGA
AGCCGCCACTTTTACCACAATACAACGGATTGAACCAGGACGGTGATGAGGACGTCGACCAGTATATTCCACCACAACCGCTTCTCATGGGAAGCCGTGACTCGCGTGGAAAGTCTCCAGTTTCACAGAACACACCTAGTTATCGCAAGCCTCCACCATACGTTAGTCCA
Protein: 1390 (aa)
MIFKRRFSAALLVLLCSTFAATERGGDVKLDSDDLQVVYSDPPEAVASTPSRHLYTKEIAQCSPGESKMMLTLLLQSLKWSDVGERKQEKAIKKMSRFFLVPVKYIRHDDISTTEMLEMLKHSINKSNNLLNTKDAVGRIEFMIGCGDKLFASSKSVAHKVDDYVKLGMMEELSDLPIQWWNIWTKHVKSRLPRTRRQVEGSGNADEEDDYEEEYTDEQIEDDEDEDVNNNDDQSPTSAVNANQRKKQHSKGNKELSTAPKSQDDLPPPPQQRPTDGGAETEANEKIEDNVESENGVARGGEKIESESDYANPLEEESLAPIDGESDSVDDSIKGEQIEDITQRNAEANSPTPIGTNEIIKLEKEVDKTIENANNVDDVPLPPKQQDDKILEEFVNSMMPNVPVQELTTDSEVYTILTSPTTTVETPHTSRHQHHHRIHHRPDAPETATSGGEEPRPDTSLATAGAEDVTSEFFTTIATATNTAVVQQQHVPEVETVFTPINNSRAHNEPINVFIDEKLPIVTATVDVTTATISTSEKESITMIEESVTDRMPSSASPSQPPLIISSTSHEHETTYNYPITTNPTIITTTADITTIITEQSITEPTTTTTTLRNVDATVAHVVAEDKVSTAANDYEERKSIEESNYDDEYDDDDENFIDQPTQRPSSIATTARAPTTIKEDRRTPLPLIPTLPNEDEDDEDDEDNELDEDLLAQISTSTQAPTTQTPPATTTIAATTTITTTTEQTTTVVSTTQSAATEVTSESSEVTSDGSDYDDNEPPRVVKRIKKIQATAGKTFIYNLEGPIFEDKEGKTNLRLEMLDKNDEPLPSSSWIRFDAAKQEIYGLPLEKDVSRHEFKLRATDKDGAYVDEDVDVTVQQHKSFRSVNHEIFIKVSLEKQFESDVDWKIRLMRGINAVLGDNSLGNIYVRDVTPHKYEDTRYTFSYTNDSLPKEHCPKAELDDLMLKLTKQALNSEMRREIAVHNIEKELINSCAKVKTPRPPSLPPSRTNFPPTVRNHVDKIEAHVGQLLVFAVPEDTFYDPEDQNDLKLSLLYENRSLLESSNWLQFDAKNQEFYGVPTIYDKTQTYVLVAEDKNGLTTNDALVVEIQNPHSKRDFSVTFEYQLDIGYEQFKSAATKRKFIERIQQLFGDADASAILIKSVKELKHYGRTSVVVQNTTLTHRICPINLIDSLRTRLVRTDGNLRDEVKQAIGSEFNVLKISIVPTSKCSGGDTYHHPDETSPVDRPEEHESPLLNQEVLITYVLPTAIILLMLLIALLIACLLYKRRNTGKMELGDEEERKSFRSKGIPVIFQDELDEKPEIITKSPVILKDEKPPLLPQYNGLNQDGDEDVDQYIPPQPLLMGSRDSRGKSPVSQNTPSYRKPPPYVSP
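The transcript length (4170 bp) is exactly three times the protein length (1390 aa), which suggests the listed transcript is the coding sequence without a stop codon. As a rough check, the Python sketch below translates the first six codons of the transcript and compares them with the start of the protein. It assumes the standard genetic code; the function name translate and the compact codon-table string are illustrative and not part of MidgeBase.

```python
# Minimal sketch: translate the first codons of the transcript.
# Assumption: the standard genetic code applies to this nuclear gene.
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base over "TCAG".
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        a, b, c = (BASES.index(base) for base in dna[i:i + 3])
        protein.append(AMINO_ACIDS[16 * a + 4 * b + c])
    return "".join(protein)

# First 18 bases of the transcript, copied from the Sequence section.
prefix = "ATGATTTTTAAACGACGA"
print(translate(prefix))  # MIFKRR, matching the first six residues of the protein
```

The same function could be applied to the full transcript once its line-wrapped pieces are joined into one string.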
Type | Start | End | Length |
CDS | 1344 | 1451 | 108 |
CDS | 1520 | 1720 | 201 |
CDS | 1806 | 2182 | 377 |
CDS | 2342 | 2387 | 46 |
CDS | 3923 | 4819 | 897 |
CDS | 6003 | 6254 | 252 |
CDS | 7006 | 7101 | 96 |
CDS | 7159 | 8857 | 1699 |
CDS | 9337 | 9702 | 366 |
CDS | 9779 | 9906 | 128 |
intron | 1452 | 1519 | 68 |
intron | 1721 | 1805 | 85 |
intron | 2183 | 2341 | 159 |
intron | 2388 | 3922 | 1535 |
intron | 4820 | 6002 | 1183 |
intron | 6255 | 7005 | 751 |
intron | 7102 | 7158 | 57 |
intron | 8858 | 9336 | 479 |
intron | 9703 | 9778 | 76 |
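The coordinates in the table above appear to be 1-based and inclusive, since each Length equals End - Start + 1. The Python sketch below checks that reading: it recomputes the CDS lengths, derives the introns as the gaps between consecutive CDS segments, and compares the summed CDS length with the stated transcript and protein lengths. The tuples are copied by hand from the table, and all variable and function names are illustrative.

```python
# Minimal sketch: consistency check of the CDS/intron table.
# Assumption: coordinates are 1-based and inclusive on the scaffold.
cds = [
    (1344, 1451), (1520, 1720), (1806, 2182), (2342, 2387), (3923, 4819),
    (6003, 6254), (7006, 7101), (7159, 8857), (9337, 9702), (9779, 9906),
]

def segment_length(start: int, end: int) -> int:
    """Length of an inclusive interval."""
    return end - start + 1

cds_lengths = [segment_length(s, e) for s, e in cds]

# Each intron spans the gap between one CDS segment and the next.
introns = [
    (prev_end + 1, next_start - 1)
    for (_, prev_end), (next_start, _) in zip(cds, cds[1:])
]
intron_lengths = [segment_length(s, e) for s, e in introns]

transcript_length = sum(cds_lengths)      # compare with "Transcript: 4170 (bp)"
protein_length = transcript_length // 3   # compare with "Protein: 1390 (aa)"

print(cds_lengths)
print(introns)
print(transcript_length, protein_length)
```

Under this reading, the last CDS ends at 9906 while the gene End is 9909; the three remaining bases may correspond to a stop codon that is not counted in the CDS, though the page does not say so.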
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_314057 | AGAP005162-PB [Anopheles gambiae str. PEST] gb|EAA09416.5| AGAP005162-PB [Anopheles gambiae str. PEST] | 0.0 |
InterPro | IPR015919 | Cadherin-like | |
InterPro | IPR006644 | Dystroglycan-type cadherin-like | |
InterPro | IPR013783 | Immunoglobulin-like fold | |
InterPro | IPR008465 | Dystroglycan | |
Gene Ontology(CC) | GO:0016020 | membrane | |
Gene Ontology(MF) | GO:0005509 | calcium ion binding | |
Pfam | PF05454.6 | Dystroglycan (Dystrophin-associated glycoprotein 1) | 1.4e-59 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
H. sapiens | ENSP00000439334 |
T. castaneum | TC009157 |
P. humanus | PHUM379620-PA |
M. musculus | ENSMUSG00000039952 |
A. mellifera | GB14967-PA |
A. gambiae | AGAP005162 |
D. plexippus | DPOGS207511PA |
B. mori | BGIBMGA001935-TA |
D. melanogaster | FBgn0034072 |
P. vanderplanki | Pv.02661 |
H. sapiens | ENSP00000440705 |
S. invicta | SI2.2.0_05379 |
C. quinquefasciatus | CPIJ018999 |
A. aegypti | AAEL013147 |
H. melpomene | HMEL015566-PA |
H. sapiens | ENSP00000440590 |
H. sapiens | ENSP00000438421 |
H. sapiens | ENSP00000312435 |
H. sapiens | ENSP00000442600 |
N. vitripennis | NV11301-PA |