MidgeBase gene description page [Pn.08910]

Outline

Link to gbrowse

Gene ID Pn.08910
Type Protein coding gene
Scaffold PnScaf8764
Start 6997
End 11902
Direction -

Sequence

Transcript: 4764 (bp)

 ATGTGGAACATCACCGTGCGCCGCGGACGCAAAATTCAGTTCGAGATCCTCGACATCAACATCCGCAACGCCTCCGACGGCTGCAGCAACTTCATCTCCATCAAGAACGGCCACGACGACTCGGCGCCTTATCTCGGCAGCGGCCAGTTCTGCACGAAAATGGAAATCCCGCAGACGACCGGCAATCGAGCCTTTGTCTCGTACAAGGTCAACACGCCCTTCTTCAACTCCTTCCGGCTGCGCTACGCGGAGGTGCAGCACGAGTGCGGCGGACAAATCACGCTCTCGAGCGCCTACACCGCTTCCACCATCAGCTCACCCAACTATCCCAACATCCCGCCGCCGCACGTCGAGTGCACGTGGATCATCATTGCGCCGGTGGGCCGCGAGATCCGCGTCGACTTCCTCGAGCGCTTCGACCTCACGCGCGGCGAGTTTTGCGACAAGGAGTTTGTGGAGCTGCGCGAGGGATCGACGTCGGCAGCTAGCTTGATCGGCACCTACTGCGGCAGCACAAAGCCACAGACGCAATACACAAAGTCGAACGTGCTGCGCGTGCGCTTCGTCACGGACGTCCTCGAGCCGAAGAACGGCTTCAAAGCGAACGTCTCGATCGGCGTGTGTGGCGGCACGGTGCGAACCACGGGCGTCGGTTATGTGTCGTCACCGAAGTACCCGGGCATCGGCGCATATCCGAGCAACGTGACGTGCGACTATCGCATCGTCGGACCGCCAAACACGATCTTCACCGTCGAAATTATCGACATCGACTTGCCGGGTCGGGAGGATGAGAGCCGCGACTACGAGAGCAGTGAGATGTCGAACTCGCCGTCGCATGGCTGCGACACGCAGAAGGACCATTTCGTGCTCTACTCGGTGATGCCAGACGTCGGCACGAACGAAACGCTCCTCGACACCGGAACATTCTGCGGCTCGCACGCCCCCAAAAGAGCCATCACCAGCGACTCGAACGAAATTCTCGTGCGGTTCAGGACGTTCGCGCGCACCAACAAGCTCTTCAAGGGCTTCCGCCTCTTCTACAATGCCTCGCGGCAGAGCTGCGGCGGCGAACTCAACGCCGACTCCGGCATCATCACGAGCATCGCTTATCCGTCGCGCACGCTCAACCGCGCCTTCTGCGAGTGGCGCATCACGGTCAGGAAGGGCCGTCGCGTCAAACTCGAGTTCCTCGACCTCGACTTTGTGCCGCCCGACGTGCGCAACATGCAGCGCCTCATCATCTACAGCGACTTTTCCTACGCGAGCCGCATGATGTTCATCACGAGCAACAACAACCCGGGCGTCGTTTACTCGTCCGACAACAAGCTCATGATCACGGCGTGGATCCGTGCGCCGTCGCAGAACCGCGGCTTCAAGATCAGGTACTCGAGCGAGGAGCCGTCGCTGTGCGAGGGCGACTTGAACGCCGACGAGGGCTACATTTATCCGCCGAACATCGAGAACGCGACGAGCTACTCGTGCACGTACGAGAGGCAGAACAGGCCGATTGTTGGCGACGCACTCGGCAAGGGCACCGTCGCCTTTTACCTGAGCAACCTCGACGTCGGTCGGCGAGTGGTGAACTGCCGGTTTGCCGCCACAAACTTGGCCTTCTCGCGCGCCTCCGCGATGGTCGAGAGCGAGAAATATCTGGCTCGGATTTGCGGGAATGCCTCGGAAACTGCGACCGTCCTCTCGCCCTTCCCCGACGTCCGCTTCGAGGTCAAGCAGAGTCCGTTCTTCGGCAAGGTGAACTTCTCGCTACACTACAAAGTGCACAACTGCGGAGGCCTCATCAAGGGCGGCCCATCGGCCATTAACATCACGAACCCGCCGCCGAACACGGCGGACTACAAGAAACTCGACTGCGCCTACCACATCAAGTACGAGGAGGGCTTCGCAGTGGCTCTGAAAATCCAGCGCCTCCGACTGAAGCTCTCGTGCGAGCAGGAGCACATCACCATCTACAACGGCCCGAACGCCATGAGCCCCGTGCTGGCCAGAATATGCGGCGACGACTTCGACCGCACGCCCCTCATCTCGCAGAAGGAGACCGTCTACATCGAGTACCACACGGAGAATTTCAACGGCGAGAGCCGCAACTCGGAGTTCAGCATCAACGTCGACTCGTCGTCGTTCGGCTGCGGTGGCATCCTTAACCACCTGATCAGGAGGTTCAGCACGCCGCTCTACGACAAACCCTACCCGCCCAACACCGAGTGCATGTGGGAAATTCGCGCCGACTCGGGCTACCAGGTCGGCCTCTACTTCACCGAGCGATTCTTCATCGAGGCCTCCGACAACTGCACCAAGGACTTCCTCGAGTTCTACGACTTTGTCGACAACGAGTGGATCTTCATGAAGCGCGTGTGCGGCCGAGACACGCCGCAGCCAGTCAACTCGACGTCTACGCGACTGCGCGTTCTCTTCCACTCGGACGGCGCCCTCAACGGCGACGGCTTTGTTGCGCAGTGGGACCAAAACTGCGGCGGCGTCATCGCCGTCGACACGAAGAAGCGAGTGCTCACGAGCCCGCGCTATCCGCTGGCGTACGAGGCCAATCTCAGGTGCAACTACACGTTCGTGACGTCGAACGCGGAGGACTTTGTGAACATCAAGTTCCTGGACTTCAACCTCGAGGCGGTCAACGGAAAGTGCATTTACGACAACGTGACCGTCTACAAGTGGCTCGAGTACATGTCGCCGCAGGAGATGAGCAAGGTGGGCGTCTTCTGTGGCACGAAAAACCCCGGCAATTTCCGGTACAAGAACAAGATCACGATCATCTTTGTCAGCGACCAGTGGGTGGAGCGCAAGGGCTTCCAGCTCGAGTACAGCACAGACAGCTGCGGAGGCGTCGTCACGTCGCCGACGACCATCGCCTCGGTGCCGATCGTGCGGTCGACCAACAGCGAAACCTACGAGTACCTCGGAGCGATGAGCTGCGTGTGGAACATCACGGCGCCGCCGAACCACAAGATCGTGCTGAAGTTCGAGAAAATCGCGCTCGACTTCAGCGAGTACTGCAGCTACGACTTCGTGGAGATTTTCAGCGGCGCGCTCGATGACGAGAAGAACCGGCTGGTGAAGCTGTGCAACAACCTCACGATCCCGCCGATCTCCGTCTTGGACAACCAGGCGGTCGTGAAGATGAGGACCGAGCAGTCCAAGGACTACCTCGGCTTCAGCGCAGTCGTTTACTTCCAGGAGAAGTGCGACCAGGCATTCACGCTCACGCTCAGCAATCCCTCGGTGGTGATCGACATTTCGCACCAGCAGACCGCCGCCAACCTCGAGTGCACCTACAAAATCAGCGGCGATCCTCTCAGCACGTTGCGTGTCCGCTTTGACGAGATGCACCTCTCGATCTGCGATCCCGACCAGCGCAAGGACAACTGCAGCTGCGACTATCTCGAGTTCTTCGACGGCAACGGGCCGTTCAGCCGTCCCATCGGTCGGTACTGCGGCCACGACGCGCCCATCGACATCATCTCGACCACCTCGAGCCTGTACATTCGATTCGCGACCGACTCGATTCGCAGCAGCAATGGCTTCAAGCTGACGATCACGATGCAGGAGTCGCCGTGCGGACCCGTGCCCTACTTCAACTTCACCAATCCCGAGCAGGCCGTGGTTCTCACGTCGCCGCGCGTATCGGCCAGCACTAGGCGCTACCCGCCCAACATCCGCTGCCAATGGACCATCGAAGTTCCCGAACGCCAGAACATCGAGATCATCTTCGCTCTGTTCGAGCTCGAGGACTCGGAGGGCTGCCGGAACGACAGCCTCAGGATCGAGGACGACATGGTGAAGGACTACGTGCTCGAGGGTCTCGGACAGGAGACGGTCTACCGCGGCCACAGCACGCACACTTTCCAGCCGAGCTTCTACATGGGCATCACGGGGCCGACGGCGCCGCACGTCTACTGCGGCTCCGTCCTGCCGCACGACTACTTCTCGACGACCAACAAAATCCGCATCTACTTCGAGACAAACTCGGAGCTCGAGTTCGGGGGCTTCGACCTCACCGTGCGGCAGGTGAGCTCGTGCAACCGCAACTACACGGCGCTCCAGGGACGACTGATGGCGGACGTGAGCGTGAACGGTTGCCACACGACCATCACCGTGCCGGAGAACTACACGATTTCGCTGTACTTTGCGCGATTCTACTTCTACGCCAACGACTGCGAGAAGTCCTTCATGAAGATCTACGACGGCACGTTCGAGACGGGAGCGCTGCTGCGAACGCTCTGCGGGTACTCGACGCCCGATCCGGTCTTCTCGGCGGGCAATCGGCTGTCGATTCGAACCAAGTACGAGGACGACTCGACGTTTTTTACGCGCGGCAACTACGACATCATGTATGTGGCGTCCGAGAGGAGCAAAGGTCCGGGCTGTGGCGGCGAAATTTTCAATTACGGAGGGCTTTTCACGAGTCCTCAGTATCCGAGCGTCAATCGCACAAATTACGACTGCACGTGGACGGTGCGGGTGCCCTCGAACCTCGTCGTGGCTCTGAAGTTCCAAATCTTCGACATGGGCTCAAAGCTCTCCTGCGGCAAGGACTACGTGGAGTTTTTGGAGGAAAACGAAGCAAACGAATTTAGGTCGATCAAGAGCTACTGCGGCGACGACGAGCCGGCGATGTATGTGAGCAGTAGAAGCCAAATAAAGATCCACTACGTGCAGACCGTGCACTTTGCTGGAACTGGCTGGATGCTCCACTTCATGGGCATTCACGAGGGTTCAACGCCTCTGGACTGG 

Protein: 1588 (aa)

 MWNITVRRGRKIQFEILDINIRNASDGCSNFISIKNGHDDSAPYLGSGQFCTKMEIPQTTGNRAFVSYKVNTPFFNSFRLRYAEVQHECGGQITLSSAYTASTISSPNYPNIPPPHVECTWIIIAPVGREIRVDFLERFDLTRGEFCDKEFVELREGSTSAASLIGTYCGSTKPQTQYTKSNVLRVRFVTDVLEPKNGFKANVSIGVCGGTVRTTGVGYVSSPKYPGIGAYPSNVTCDYRIVGPPNTIFTVEIIDIDLPGREDESRDYESSEMSNSPSHGCDTQKDHFVLYSVMPDVGTNETLLDTGTFCGSHAPKRAITSDSNEILVRFRTFARTNKLFKGFRLFYNASRQSCGGELNADSGIITSIAYPSRTLNRAFCEWRITVRKGRRVKLEFLDLDFVPPDVRNMQRLIIYSDFSYASRMMFITSNNNPGVVYSSDNKLMITAWIRAPSQNRGFKIRYSSEEPSLCEGDLNADEGYIYPPNIENATSYSCTYERQNRPIVGDALGKGTVAFYLSNLDVGRRVVNCRFAATNLAFSRASAMVESEKYLARICGNASETATVLSPFPDVRFEVKQSPFFGKVNFSLHYKVHNCGGLIKGGPSAINITNPPPNTADYKKLDCAYHIKYEEGFAVALKIQRLRLKLSCEQEHITIYNGPNAMSPVLARICGDDFDRTPLISQKETVYIEYHTENFNGESRNSEFSINVDSSSFGCGGILNHLIRRFSTPLYDKPYPPNTECMWEIRADSGYQVGLYFTERFFIEASDNCTKDFLEFYDFVDNEWIFMKRVCGRDTPQPVNSTSTRLRVLFHSDGALNGDGFVAQWDQNCGGVIAVDTKKRVLTSPRYPLAYEANLRCNYTFVTSNAEDFVNIKFLDFNLEAVNGKCIYDNVTVYKWLEYMSPQEMSKVGVFCGTKNPGNFRYKNKITIIFVSDQWVERKGFQLEYSTDSCGGVVTSPTTIASVPIVRSTNSETYEYLGAMSCVWNITAPPNHKIVLKFEKIALDFSEYCSYDFVEIFSGALDDEKNRLVKLCNNLTIPPISVLDNQAVVKMRTEQSKDYLGFSAVVYFQEKCDQAFTLTLSNPSVVIDISHQQTAANLECTYKISGDPLSTLRVRFDEMHLSICDPDQRKDNCSCDYLEFFDGNGPFSRPIGRYCGHDAPIDIISTTSSLYIRFATDSIRSSNGFKLTITMQESPCGPVPYFNFTNPEQAVVLTSPRVSASTRRYPPNIRCQWTIEVPERQNIEIIFALFELEDSEGCRNDSLRIEDDMVKDYVLEGLGQETVYRGHSTHTFQPSFYMGITGPTAPHVYCGSVLPHDYFSTTNKIRIYFETNSELEFGGFDLTVRQVSSCNRNYTALQGRLMADVSVNGCHTTITVPENYTISLYFARFYFYANDCEKSFMKIYDGTFETGALLRTLCGYSTPDPVFSAGNRLSIRTKYEDDSTFFTRGNYDIMYVASERSKGPGCGGEIFNYGGLFTSPQYPSVNRTNYDCTWTVRVPSNLVVALKFQIFDMGSKLSCGKDYVEFLEENEANEFRSIKSYCGDDEPAMYVSSRSQIKIHYVQTVHFAGTGWMLHFMGIHEGSTPLDW 
Type Start End Length
CDS 7000 7019 20
CDS 7082 7297 216
CDS 7375 11902 4528
intron 7020 7081 62
intron 7298 7374 77

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001843996 cubilin [Culex quinquefasciatus] gb|EDS35495.1| cubilin [Culex quinquefasciatus] 0.0
InterPro IPR000859 CUB
Pfam PF02408.15 CUB-like domain 3e-13
Pfam PF00431.15 CUB domain 7.5e-196

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.04271

Orthologous genes

Species Gene ID
S. invicta SI2.2.0_03406
P. vanderplanki Pv.14800
N. vitripennis NV13464-PA
T. castaneum TC007013
A. mellifera GB17517-PA
D. plexippus DPOGS207300PA
H. sapiens ENSP00000367064
B. mori BGIBMGA014545-TA
A. aegypti AAEL010965
D. melanogaster FBgn0052702
P. humanus PHUM104920-PA
P. vanderplanki Pv.12269
C. quinquefasciatus CPIJ002327
A. gambiae AGAP005526
D. melanogaster FBgn0259140
A. aegypti AAEL014312
M. musculus ENSMUSG00000026726