MidgeBase gene description page [Pn.09740]

Outline

Link to gbrowse

Gene ID Pn.09740
Type Protein coding gene
Scaffold PnScaf10089
Start 1344
End 9909
Direction +

Sequence

Transcript: 4170 (bp)

 ATGATTTTTAAACGACGATTCAGCGCAGCTCTATTAGTTTTATTGTGCAGCACATTTGCGGCGACGGAGCGTGGCGGCGATGTGAAGCTCGATAGCGACGATTTACAGGTCGTGTACTCGGATCCTCCGGAAGCTGTCGCGTCGACGCCGTCACGTCACTTGTACACAAAAGAAATCGCACAATGTTCGCCCGGGGAGAGCAAAATGATGTTGACACTTTTGCTTCAATCACTTAAATGGAGCGACGTTGGCGAGCGCAAGCAGGAGAAGGCCATTAAAAAGATGTCACGTTTCTTCCTCGTGCCAGTGAAATACATCAGACACGATGACATCAGCACCACGGAAATGCTCGAAATGCTGAAGCACTCGATTAACAAGAGCAATAACTTGCTCAACACCAAAGACGCCGTCGGACGCATCGAGTTCATGATCGGATGCGGCGATAAGTTGTTTGCGAGCAGCAAATCAGTTGCGCACAAGGTCGACGATTACGTCAAGTTGGGCATGATGGAGGAGCTGTCGGACCTGCCCATCCAGTGGTGGAACATCTGGACGAAGCACGTCAAGAGTCGCTTGCCGAGGACGAGACGGCAAGTCGAGGGATCGGGAAATGCCGACGAGGAGGACGACTACGAGGAAGAGTACACGGACGAGCAAATCGAGGACGACGAGGATGAGGATGTCAACAATAATGACGATCAATCTCCGACATCTGCCGTGAATGCGAATCAGCGAAAGAAGCAGCATTCGAAGGGCAATAAGGAGCTTTCGACAGCGCCGAAAAGTCAAGATGATCTGCCGCCTCCTCCGCAGCAAAGACCCACTGACGGTGGCGCCGAAACGGAGGCTAATGAGAAAATTGAAGATAATGTCGAGAGCGAGAATGGCGTGGCTCGTGGCGGCGAGAAAATTGAATCGGAATCCGACTACGCGAATCCTCTCGAAGAGGAGTCATTAGCACCTATTGATGGGGAAAGCGATAGCGTCGATGATTCAATTAAGGGCGAGCAAATTGAAGACATTACGCAGCGCAACGCGGAGGCGAATAGTCCGACGCCGATCGGCACCAACGAGATAATCAAGTTGGAGAAGGAGGTGGACAAGACCATCGAGAACGCCAACAATGTCGACGATGTTCCCTTGCCGCCCAAGCAACAGGACGACAAAATTCTCGAGGAGTTCGTCAACAGCATGATGCCGAATGTGCCGGTCCAGGAGCTCACGACGGACAGCGAGGTTTACACGATTCTCACCTCGCCGACGACGACCGTCGAGACGCCGCACACGTCACGTCATCAGCATCATCATCGCATCCATCATCGACCCGATGCACCCGAGACAGCGACGAGTGGAGGTGAGGAGCCTCGACCGGACACGTCGTTAGCGACAGCCGGTGCTGAAGACGTGACGAGTGAATTTTTCACTACAATCGCCACCGCCACGAACACGGCGGTGGTGCAGCAGCAACATGTACCCGAGGTGGAAACAGTTTTCACTCCCATAAATAACAGCCGCGCACACAATGAGCCCATTAATGTGTTTATCGATGAAAAATTGCCGATTGTTACGGCGACTGTGGACGTTACGACTGCGACAATATCGACCAGTGAGAAAGAATCGATTACGATGATTGAGGAAAGTGTTACGGATCGAATGCCATCATCTGCATCACCATCACAACCGCCATTAATTATTTCATCAACCTCACATGAACATGAAACGACTTACAATTATCCCATTACGACAAATCCAACTATAATAACAACAACAGCTGATATAACAACAATCATAACCGAGCAAAGCATAACAGAACCTACAACAACAACAACAACCCTTAGAAATGTAGACGCGACAGTTGCTCATGTTGTCGCTGAGGATAAGGTCTCGACGGCGGCAAACGACTACGAGGAGCGAAAATCAATCGAGGAATCGAATTACGATGACGAGTACGATGATGATGACGAGAATTTCATTGACCAGCCAACGCAGAGACCTTCGAGCATCGCAACAACGGCTCGCGCCCCGACCACCATCAAGGAGGACCGACGAACGCCGCTCCCACTCATCCCGACCCTGCCAAACGAGGACGAGGACGACGAGGATGACGAGGACAACGAGCTGGACGAGGACCTCCTTGCACAAATTTCGACGTCCACGCAAGCACCGACAACCCAAACACCACCTGCGACCACTACCATAGCTGCCACTACGACCATCACCACGACCACTGAGCAGACGACCACGGTGGTGTCGACAACTCAGTCGGCAGCCACAGAAGTCACGAGCGAGTCTTCGGAGGTGACGAGCGACGGTTCGGACTACGACGACAACGAGCCGCCGCGAGTCGTAAAGAGAATCAAGAAGATTCAGGCGACCGCCGGCAAGACGTTCATCTACAACCTTGAGGGTCCGATATTCGAGGACAAGGAGGGCAAGACGAACCTCCGGCTAGAGATGCTGGACAAGAACGATGAGCCGCTGCCGTCGAGCTCGTGGATTCGCTTCGATGCGGCTAAACAGGAAATCTACGGACTGCCGCTCGAGAAAGACGTCAGCCGACACGAGTTTAAGCTGCGAGCGACCGACAAGGACGGCGCTTACGTGGACGAGGACGTCGACGTCACAGTGCAGCAGCACAAGAGCTTCCGCAGCGTGAACCACGAGATTTTCATTAAGGTGTCGCTGGAGAAGCAATTCGAGTCGGACGTCGACTGGAAGATTCGCCTCATGCGCGGCATCAACGCCGTTCTCGGCGACAACTCGCTCGGCAACATCTACGTGCGCGACGTGACGCCGCACAAGTACGAGGATACGCGCTACACCTTCTCCTACACGAACGACTCGCTGCCCAAGGAACACTGCCCGAAGGCCGAGCTCGACGACCTGATGCTGAAGCTCACCAAGCAGGCCTTGAACAGCGAAATGCGCCGGGAAATTGCCGTGCACAACATCGAGAAGGAGCTGATTAACTCGTGCGCCAAAGTCAAGACTCCGCGGCCGCCGTCGCTGCCGCCGAGCCGCACAAACTTCCCGCCAACCGTCCGCAATCACGTGGACAAGATCGAGGCACACGTCGGCCAGCTGTTGGTGTTTGCCGTGCCTGAAGACACTTTCTACGATCCGGAGGATCAGAACGACCTGAAACTGTCGCTTCTCTACGAGAACCGCTCGCTGCTCGAGTCCTCCAACTGGTTGCAGTTCGACGCCAAGAACCAGGAGTTCTACGGCGTACCGACTATCTACGACAAGACGCAAACTTACGTGCTCGTGGCAGAGGACAAGAACGGCCTCACCACCAACGACGCCCTCGTCGTCGAGATCCAAAATCCTCACTCGAAACGCGACTTTAGCGTGACCTTCGAGTACCAGTTGGACATCGGCTATGAGCAGTTCAAGAGCGCTGCCACAAAGCGCAAGTTCATTGAGCGCATTCAGCAGCTGTTTGGCGACGCGGACGCCAGCGCGATTCTCATAAAGTCCGTCAAGGAACTCAAGCACTACGGACGGACGTCTGTTGTCGTGCAGAACACGACACTCACTCATCGCATCTGTCCGATCAACCTTATCGATAGCCTTCGAACGCGTCTCGTGCGTACAGACGGCAATCTGAGAGACGAAGTCAAGCAGGCCATCGGAAGCGAATTCAACGTCCTGAAAATTAGTATAGTTCCTACATCAAAATGTTCCGGCGGAGACACATATCACCATCCCGACGAAACCAGTCCCGTTGATCGTCCCGAAGAGCACGAGTCGCCCTTGTTGAATCAAGAAGTACTTATCACATATGTTCTTCCCACAGCCATTATATTGTTGATGCTGTTAATTGCACTGTTGATCGCATGCCTTCTTTACAAACGCCGCAATACCGGCAAAATGGAGCTGGGCGACGAGGAGGAAAGAAAGTCCTTCCGCTCGAAGGGAATTCCTGTGATTTTCCAAGATGAATTGGATGAGAAGCCTGAAATTATTACAAAGTCACCTGTCATCTTGAAAGACGAGAAGCCGCCACTTTTACCACAATACAACGGATTGAACCAGGACGGTGATGAGGACGTCGACCAGTATATTCCACCACAACCGCTTCTCATGGGAAGCCGTGACTCGCGTGGAAAGTCTCCAGTTTCACAGAACACACCTAGTTATCGCAAGCCTCCACCATACGTTAGTCCA 

Protein: 1390 (aa)

 MIFKRRFSAALLVLLCSTFAATERGGDVKLDSDDLQVVYSDPPEAVASTPSRHLYTKEIAQCSPGESKMMLTLLLQSLKWSDVGERKQEKAIKKMSRFFLVPVKYIRHDDISTTEMLEMLKHSINKSNNLLNTKDAVGRIEFMIGCGDKLFASSKSVAHKVDDYVKLGMMEELSDLPIQWWNIWTKHVKSRLPRTRRQVEGSGNADEEDDYEEEYTDEQIEDDEDEDVNNNDDQSPTSAVNANQRKKQHSKGNKELSTAPKSQDDLPPPPQQRPTDGGAETEANEKIEDNVESENGVARGGEKIESESDYANPLEEESLAPIDGESDSVDDSIKGEQIEDITQRNAEANSPTPIGTNEIIKLEKEVDKTIENANNVDDVPLPPKQQDDKILEEFVNSMMPNVPVQELTTDSEVYTILTSPTTTVETPHTSRHQHHHRIHHRPDAPETATSGGEEPRPDTSLATAGAEDVTSEFFTTIATATNTAVVQQQHVPEVETVFTPINNSRAHNEPINVFIDEKLPIVTATVDVTTATISTSEKESITMIEESVTDRMPSSASPSQPPLIISSTSHEHETTYNYPITTNPTIITTTADITTIITEQSITEPTTTTTTLRNVDATVAHVVAEDKVSTAANDYEERKSIEESNYDDEYDDDDENFIDQPTQRPSSIATTARAPTTIKEDRRTPLPLIPTLPNEDEDDEDDEDNELDEDLLAQISTSTQAPTTQTPPATTTIAATTTITTTTEQTTTVVSTTQSAATEVTSESSEVTSDGSDYDDNEPPRVVKRIKKIQATAGKTFIYNLEGPIFEDKEGKTNLRLEMLDKNDEPLPSSSWIRFDAAKQEIYGLPLEKDVSRHEFKLRATDKDGAYVDEDVDVTVQQHKSFRSVNHEIFIKVSLEKQFESDVDWKIRLMRGINAVLGDNSLGNIYVRDVTPHKYEDTRYTFSYTNDSLPKEHCPKAELDDLMLKLTKQALNSEMRREIAVHNIEKELINSCAKVKTPRPPSLPPSRTNFPPTVRNHVDKIEAHVGQLLVFAVPEDTFYDPEDQNDLKLSLLYENRSLLESSNWLQFDAKNQEFYGVPTIYDKTQTYVLVAEDKNGLTTNDALVVEIQNPHSKRDFSVTFEYQLDIGYEQFKSAATKRKFIERIQQLFGDADASAILIKSVKELKHYGRTSVVVQNTTLTHRICPINLIDSLRTRLVRTDGNLRDEVKQAIGSEFNVLKISIVPTSKCSGGDTYHHPDETSPVDRPEEHESPLLNQEVLITYVLPTAIILLMLLIALLIACLLYKRRNTGKMELGDEEERKSFRSKGIPVIFQDELDEKPEIITKSPVILKDEKPPLLPQYNGLNQDGDEDVDQYIPPQPLLMGSRDSRGKSPVSQNTPSYRKPPPYVSP 
Type Start End Length
CDS 1344 1451 108
CDS 1520 1720 201
CDS 1806 2182 377
CDS 2342 2387 46
CDS 3923 4819 897
CDS 6003 6254 252
CDS 7006 7101 96
CDS 7159 8857 1699
CDS 9337 9702 366
CDS 9779 9906 128
intron 1452 1519 68
intron 1721 1805 85
intron 2183 2341 159
intron 2388 3922 1535
intron 4820 6002 1183
intron 6255 7005 751
intron 7102 7158 57
intron 8858 9336 479
intron 9703 9778 76

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_314057 AGAP005162-PB [Anopheles gambiae str. PEST] gb|EAA09416.5| AGAP005162-PB [Anopheles gambiae str. PEST] 0.0
InterPro IPR015919 Cadherin-like
InterPro IPR006644 Dystroglycan-type cadherin-like
InterPro IPR013783 Immunoglobulin-like fold
InterPro IPR008465 Dystroglycan
Gene Ontology(CC) GO:0016020 membrane
Gene Ontology(MF) GO:0005509 calcium ion binding
Pfam PF05454.6 Dystroglycan (Dystrophin-associated glycoprotein 1) 1.4e-59

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
H. sapiens ENSP00000439334
T. castaneum TC009157
P. humanus PHUM379620-PA
M. musculus ENSMUSG00000039952
A. mellifera GB14967-PA
A. gambiae AGAP005162
D. plexippus DPOGS207511PA
B. mori BGIBMGA001935-TA
D. melanogaster FBgn0034072
P. vanderplanki Pv.02661
H. sapiens ENSP00000440705
S. invicta SI2.2.0_05379
C. quinquefasciatus CPIJ018999
A. aegypti AAEL013147
H. melpomene HMEL015566-PA
H. sapiens ENSP00000440590
H. sapiens ENSP00000438421
H. sapiens ENSP00000312435
H. sapiens ENSP00000442600
N. vitripennis NV11301-PA