MidgeBase gene description page [Pn.00342]

Outline

Link to gbrowse

Gene ID Pn.00342
Type Protein coding gene
Scaffold PnScaf394
Start 28631
End 34009
Direction -

Sequence

Transcript: 3966 (bp)

 ATGGGCAAGGACTGCAACCAGTGCATGCCGGAGACCTACGGACTGTCCGAGAGCCGAGACGGATGCACGCCTTGCGACTGCGACCCGGGTGGTTCGCTCGACAACAACTGCGACGTCATCACTGGACAGTGCAAGTGCCGACCATACATGCAGGGCAGAAACTGCAGCGAGCCGAAACAGAACCACTTTATTCCTAATCTCCACATCGTGGTCGAGGCCGAGGGACCGAGTGCCATCTGCGAGACTCAGTCGTCATACAATAACTGCTCGGTGGTGATCCGCGAAAACCACCCAGATAGACCGACCTTCTCGGGCCCGGGTTTCTTGAAGGTCTACGAGGGCGGCGACATTAGCTTTACTGTCGACAACGTGCCGAAGACAATGAACTACGACGTTCAGATTCGTTACCAGGCGAATCAGCACGGCGACTGGGAGGACATTCGAGTGAGTCTCATTCGTCCGGACACCGTCGAGTACGGCAGTGCTTGCTACAACGTCAATCCAGTGGAAGAGCAAGAAAAGTTTTTGCGTTTCAACGAGTATGAATCGAGCGTGATTGCTCTCTCGGACTTGTGCCTCGAGGAGGGAAAGACCTACAAGTTCATCGTGTCGCTGCACCGCCAAAACAACTACGATCCCAATCCAAAGGCGCAAATTCTCATCGACTCTCTGGCCCTGATTCCGCGCATCGACGTCACGACAATCTTCAGCGGCTCGACCATCGCCGAGATCCGCAAGCAAGACTTTGACTACTACGGCTGCAACGACACCTACTACAGCGTGGCACTCACTCAGAGCGCCGACCCAAGATGCAAGGAACTCATCGAAATGTCATCGGCTTTGGTCTTCAACGGTGCAACGCAAATTCCCCACAAATTAATCATGAGCCGTGCAGTGTTTACTCTCAGCATAATTTTCCTCATTCAAATATTGTCACTCAATCTCGACGCCGCCGAGCTCGGAGATTCGTGCGACTGCAACCCGACTGGATCGTTAAGTAAGAAGTGCGACGAGCACGGAGGATTCTGCAGGTGCAAGACCAATGTCGTCGGACGTCAATGTGACCGTTGCGCACCACGCACGTACGGATTCTCGCAGGACGGCTGCCAGTCGTGCGACTGCGACAGCATCGGCTCGAAAGACGGTGAATGTCACCTGACCACCGGCCAGTGTAACTGCCAGCCCAACACCTACGGACGCGAGTGCAACCAGTGCCAGCCTGGCTACTGGAACTTCCCCGATTGCCAGCAATGCAACTGCAATGGACACGCAACGACCTGCGACTCAAAGACCGGCGAGTGCATCGGCTGCCAGGACTTTACGACCGGCAACCACTGCGACGAGTGCATCACGGGCTATTACGGTGATCCGTTTTTGGGCAGCGAAATCGGCTGCCGTCCCTGCCGCTGTCCTGACACCATCGCCTCGGGCCACTCACACGCCTCGCAGTGCGCGCTCATTTCGAACAACAACGACGTCATCTGCTACTGCGAACCGGGCTATGCCGGCACAAAGTGCGACGTCTGCGACAACAATTACTTTGGCAATCCGGACAAGCCCGGCGGCGATTGCAAGCCGTGCAACTGCTCGAGCAATGTCGACTTGAGTCGCTCGGGTAATTGCGATGCGCGCAGTGGTAAATGCCTGCAGTGCTTGTACGACACCGACGGCGACCACTGCGAGTACTGCCGCGATGGATTCTATGGAGATGCTTCGAGACAGGACTGCAGAGAATGCGACTGCGATGTCCTTGGAAGCGTTGGCGGAAAGTTCTGCGACCGCTACACTGGCCAGTGTCCATGCTTGCCGAACGTGGTTGGAATGAGATGCGATCAGTGCGAGAACAATCATTGGAAAATTGCCAGCGGTGAGGGCTGCGAGCCTTGCAATTGCGACGAAATCGGTGCCTACAGCGAGCAGTGCAATCCCTACGACGGCCAGTGTAGCTGTCGACCGGGCTTTGGCGGACGAGCTTGCGATCAGTGCGAGGCGAACTTCTGGGGAGATCCGAATGTTGAGTGCAAGCCATGCGAGTGCAACACCTACGGCTCATTCACTTATCAGTGCGACCGCGAGACAGGACAGTGCAAGTGCATCAAGGGAATCGGCGGCTACAAGTGCGACCAATGCGATCGCGGCTATCTCGGCGCAGCGCCTTACTGCTCACCCTGTGGCGAGTGCTTCGACAACTGGGACGACATTCTCAACGGACTGAAGTCGGAGACTGACGAGATCATCGAGAAGGCGAAGCTGATCAAGCAGCAGGGCGCAACCGGTGCCTACACGAAGGAGTTCGAGGACGTCGAGAAGAAAATTGCCACGATCAAGACCATCCTGAACAACACGACGGTGAGTGCGCGCGACATCAAGAAGATCGACGACAAAATCGCGAAGCTGCGCGATCATCTGGAAAAATCCGAGGAGAGTCTGAAGCAAACGGACGCCGAGCTCGATCGACTCAACGAGGAGATGAACTTGGCGGGCGTCGAAATCTCCAACTTGGAGGAGAAGAGCGACAAGATCAAAGCGCTGGCGAACGCTCTCAAAGAAAATGCTACGACCCTGCAGGAGGCGAACATCGAGGGAGCTCTTAATTTGACACGCGACGCGTGGATTCGCGTGAAAGGTCTGTCGGAGGTGTACACCGAAATCGGCGACCTCAACTCGGAAGCCGACCGCCAGTGCAAGCGAATCGAGAATCTCATCAGTCGACAACTGGAGAACGAAAATATCTTGTCGGCGAACGACAAACAGATCGAGGAGCTGCAGACGAACTTGAACGAGCTCATTGCCGAGATTCCGAGTCTAAACGAGCAGATTTGCGACAAGCGCGGCGACCCTTGCGACGCACTTTGTGGCGGTGCCGGATGCGGCAAGTGCGGTGGAATCTCGTGCGAAAAGGGCGCCTTCACAAAGGCCTCGAACGCTCTCAACTACGCAAAGGACACCGAGAAGATCATCAAGGAGAAGGACCAGCATGCGGAGGAGTACATTCGATCGGTGTCGCACGCCAAGACTGAGGCCGTCGATGCCTTCAAGAAGGCGAAAGAGACGTTCGACAAGGTCGAGCAGACGTACAATACGACCGAGGCGCAGCTGAAGGAGGGACGTGAGCTGATTGCAGATTTGACGAACATCATTGCGAACACGACTGCGTCGCCTGGCGAAATCAAAGAAATCGCCGAGAGCATCTTGAAGCTCGACCTTCACTTGGATCCGTCCGAAATAAAGCATTTGGCGAACAATATCGATCAAACGGTCGCCGCACTCGAGAACGTCGATGACATCATTGAAAATACTCGAGGAGACCTGGAAATGGTCGAGCGACTCAAGACCAGCGCCACTGAAGCACAAAAGAGAGCAAACGACGTTTTAAAGAGAGCGACCGAAGTGAGCAAGGCACTGGAGGAGGCCGAATCATCGCAGAATGCTGCAAGAGACGCGATTAAGAAAGCCAACAGCGACATCTCGTCCGCCAAATCTGACTTGGAAGAGATCAACGTTGACGCAACGAAGGCGCGCACCAGTGTTGATGAAACAGCGGCCGCCGTCAACGCTCTCCAGACGAAACTCAACAAGCTGCAACGAAAGTTCCTCAAAAACTCGCACGACGCCAAGGAGGTGAAGACACAGGCCGATCTCGCCAAGGACATTTCGAATGTGACACACGAGAAGGCGAAGAAGCTGAAGAGTCAGTACAAGTATGCGAACGACACACTGAATTCAAAGGCCCAATCGACTGAATATGCGCGAAACAAAGCCCAGCAATTACTGCAGCGAGCATCGAAAATAACAGTCGACACGAACAATAAATTGAAGGAGCTGAAAGATATGGTGGCACAGACAACTAGCAACGACGACGAGCTGGGCGAAATGGAGCGAAAGATTTTGAGACTTAACGGCGAAATTGACGTGTACATACAGCGCATCAGAGACCACGCCGATCGTTACCGAACCTGCACTTCT 

Protein: 1322 (aa)

 MGKDCNQCMPETYGLSESRDGCTPCDCDPGGSLDNNCDVITGQCKCRPYMQGRNCSEPKQNHFIPNLHIVVEAEGPSAICETQSSYNNCSVVIRENHPDRPTFSGPGFLKVYEGGDISFTVDNVPKTMNYDVQIRYQANQHGDWEDIRVSLIRPDTVEYGSACYNVNPVEEQEKFLRFNEYESSVIALSDLCLEEGKTYKFIVSLHRQNNYDPNPKAQILIDSLALIPRIDVTTIFSGSTIAEIRKQDFDYYGCNDTYYSVALTQSADPRCKELIEMSSALVFNGATQIPHKLIMSRAVFTLSIIFLIQILSLNLDAAELGDSCDCNPTGSLSKKCDEHGGFCRCKTNVVGRQCDRCAPRTYGFSQDGCQSCDCDSIGSKDGECHLTTGQCNCQPNTYGRECNQCQPGYWNFPDCQQCNCNGHATTCDSKTGECIGCQDFTTGNHCDECITGYYGDPFLGSEIGCRPCRCPDTIASGHSHASQCALISNNNDVICYCEPGYAGTKCDVCDNNYFGNPDKPGGDCKPCNCSSNVDLSRSGNCDARSGKCLQCLYDTDGDHCEYCRDGFYGDASRQDCRECDCDVLGSVGGKFCDRYTGQCPCLPNVVGMRCDQCENNHWKIASGEGCEPCNCDEIGAYSEQCNPYDGQCSCRPGFGGRACDQCEANFWGDPNVECKPCECNTYGSFTYQCDRETGQCKCIKGIGGYKCDQCDRGYLGAAPYCSPCGECFDNWDDILNGLKSETDEIIEKAKLIKQQGATGAYTKEFEDVEKKIATIKTILNNTTVSARDIKKIDDKIAKLRDHLEKSEESLKQTDAELDRLNEEMNLAGVEISNLEEKSDKIKALANALKENATTLQEANIEGALNLTRDAWIRVKGLSEVYTEIGDLNSEADRQCKRIENLISRQLENENILSANDKQIEELQTNLNELIAEIPSLNEQICDKRGDPCDALCGGAGCGKCGGISCEKGAFTKASNALNYAKDTEKIIKEKDQHAEEYIRSVSHAKTEAVDAFKKAKETFDKVEQTYNTTEAQLKEGRELIADLTNIIANTTASPGEIKEIAESILKLDLHLDPSEIKHLANNIDQTVAALENVDDIIENTRGDLEMVERLKTSATEAQKRANDVLKRATEVSKALEEAESSQNAARDAIKKANSDISSAKSDLEEINVDATKARTSVDETAAAVNALQTKLNKLQRKFLKNSHDAKEVKTQADLAKDISNVTHEKAKKLKSQYKYANDTLNSKAQSTEYARNKAQQLLQRASKITVDTNNKLKELKDMVAQTTSNDDELGEMERKILRLNGEIDVYIQRIRDHADRYRTCTS 
Type Start End Length
CDS 28634 28770 137
CDS 28891 29035 145
CDS 29139 29327 189
CDS 29398 29466 69
CDS 29555 29627 73
CDS 29995 31321 1327
CDS 31384 31480 97
CDS 31545 31741 197
CDS 31801 32565 765
CDS 32632 32736 105
CDS 32852 33044 193
CDS 33106 33513 408
CDS 33749 34009 261
intron 28771 28890 120
intron 29036 29138 103
intron 29328 29397 70
intron 29467 29554 88
intron 29628 29994 367
intron 31322 31383 62
intron 31481 31544 64
intron 31742 31800 59
intron 32566 32631 66
intron 32737 32851 115
intron 33045 33105 61
intron 33514 33748 235

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001862577 laminin subunit beta-1 [Culex quinquefasciatus] gb|EDS37215.1| laminin subunit beta-1 [Culex quinquefasciatus] 0.0
InterPro IPR009053 Prefoldin
InterPro IPR013015 Laminin IV
InterPro IPR006210 Epidermal growth factor-like
InterPro IPR002049 EGF-like, laminin
Gene Ontology(MF) GO:0005515 protein binding
Pfam PF04582.7 Reovirus sigma C capsid protein 5.1
Pfam PF09824.4 ArsR transcriptional regulator 1.2
Pfam PF00053.19 Laminin EGF-like (Domains III and V) 4.7e-80
Pfam PF13166.1 AAA domain 4
Pfam PF10174.4 RIM-binding protein of the cytomatrix active zone 5.3

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
N. vitripennis NV14179-PA
D. plexippus DPOGS211962PA
P. vanderplanki Pv.06237
H. sapiens ENSP00000402265
B. mori BGIBMGA000913-TA
D. melanogaster FBgn0261800
M. musculus ENSMUSG00000052911
H. sapiens ENSP00000222399
H. sapiens ENSP00000373432
H. sapiens ENSP00000307156
H. sapiens ENSP00000205386
C. quinquefasciatus CPIJ011714
H. melpomene HMEL004807-PA
A. gambiae AGAP001381
H. sapiens ENSP00000416562
S. invicta SI2.2.0_06367
M. musculus ENSMUSG00000002900
H. sapiens ENSP00000377190
A. aegypti AAEL003658
B. mori BGIBMGA000910-TA
S. invicta SI2.2.0_16537
P. humanus PHUM032490-PA
H. sapiens ENSP00000377191
H. sapiens ENSP00000388325
H. sapiens ENSP00000402353
H. sapiens ENSP00000373433
A. mellifera GB13917-PA
T. castaneum TC005184
B. mori BGIBMGA000911-TA