MidgeBase gene description page [Pn.00342]
Outline
Gene ID | Pn.00342 |
Type | Protein coding gene |
Scaffold | PnScaf394 |
Start | 28631 |
End | 34009 |
Direction | - |
Sequence
Transcript: 3966 (bp)
ATGGGCAAGGACTGCAACCAGTGCATGCCGGAGACCTACGGACTGTCCGAGAGCCGAGACGGATGCACGCCTTGCGACTGCGACCCGGGTGGTTCGCTCGACAACAACTGCGACGTCATCACTGGACAGTGCAAGTGCCGACCATACATGCAGGGCAGAAACTGCAGCGAGCCGAAACAGAACCACTTTATTCCTAATCTCCACATCGTGGTCGAGGCCGAGGGACCGAGTGCCATCTGCGAGACTCAGTCGTCATACAATAACTGCTCGGTGGTGATCCGCGAAAACCACCCAGATAGACCGACCTTCTCGGGCCCGGGTTTCTTGAAGGTCTACGAGGGCGGCGACATTAGCTTTACTGTCGACAACGTGCCGAAGACAATGAACTACGACGTTCAGATTCGTTACCAGGCGAATCAGCACGGCGACTGGGAGGACATTCGAGTGAGTCTCATTCGTCCGGACACCGTCGAGTACGGCAGTGCTTGCTACAACGTCAATCCAGTGGAAGAGCAAGAAAAGTTTTTGCGTTTCAACGAGTATGAATCGAGCGTGATTGCTCTCTCGGACTTGTGCCTCGAGGAGGGAAAGACCTACAAGTTCATCGTGTCGCTGCACCGCCAAAACAACTACGATCCCAATCCAAAGGCGCAAATTCTCATCGACTCTCTGGCCCTGATTCCGCGCATCGACGTCACGACAATCTTCAGCGGCTCGACCATCGCCGAGATCCGCAAGCAAGACTTTGACTACTACGGCTGCAACGACACCTACTACAGCGTGGCACTCACTCAGAGCGCCGACCCAAGATGCAAGGAACTCATCGAAATGTCATCGGCTTTGGTCTTCAACGGTGCAACGCAAATTCCCCACAAATTAATCATGAGCCGTGCAGTGTTTACTCTCAGCATAATTTTCCTCATTCAAATATTGTCACTCAATCTCGACGCCGCCGAGCTCGGAGATTCGTGCGACTGCAACCCGACTGGATCGTTAAGTAAGAAGTGCGACGAGCACGGAGGATTCTGCAGGTGCAAGACCAATGTCGTCGGACGTCAATGTGACCGTTGCGCACCACGCACGTACGGATTCTCGCAGGACGGCTGCCAGTCGTGCGACTGCGACAGCATCGGCTCGAAAGACGGTGAATGTCACCTGACCACCGGCCAGTGTAACTGCCAGCCCAACACCTACGGACGCGAGTGCAACCAGTGCCAGCCTGGCTACTGGAACTTCCCCGATTGCCAGCAATGCAACTGCAATGGACACGCAACGACCTGCGACTCAAAGACCGGCGAGTGCATCGGCTGCCAGGACTTTACGACCGGCAACCACTGCGACGAGTGCATCACGGGCTATTACGGTGATCCGTTTTTGGGCAGCGAAATCGGCTGCCGTCCCTGCCGCTGTCCTGACACCATCGCCTCGGGCCACTCACACGCCTCGCAGTGCGCGCTCATTTCGAACAACAACGACGTCATCTGCTACTGCGAACCGGGCTATGCCGGCACAAAGTGCGACGTCTGCGACAACAATTACTTTGGCAATCCGGACAAGCCCGGCGGCGATTGCAAGCCGTGCAACTGCTCGAGCAATGTCGACTTGAGTCGCTCGGGTAATTGCGATGCGCGCAGTGGTAAATGCCTGCAGTGCTTGTACGACACCGACGGCGACCACTGCGAGTACTGCCGCGATGGATTCTATGGAGATGCTTCGAGACAGGACTGCAGAGAATGCGACTGCGATGTCCTTGGAAGCGTTGGCGGAAAGTTCTGCGACCGCTACACTGGCCAGTGTCCATGCTTGCCGAACGTGGTTGGAATGAGATGCGATCAGTGCGAGAACAATCATTGGAAAATTGCCAGCGGTGAGGGCTGCGAGCCTTGCAATTGCGACGAAATCGGTGCCTACAGCGAGCAGTGCAATCCCTACGACGGCCAGTGTAGCTGTCGACCGGGCTTTGGCGGACGAGCTTGCGATCAGTGCGAGGCGAACTTCTGGGGAGATCCGAATGTTGAGTGCAAGCCATGCGAGTGCAACACCTACGGCTCATTCACTTATCAGTGCGACCGCGAGACAGGACAGTGCAAGTGCATCAAGGGAATCGGCGGCTACAAGTGCGACCAATGCGATCGCGGCTATCTCGGCGCAGCGCCTTACTGCTCACCCTGTGGCGAGTGCTTCGACAACTGGGACGACATTCTCAACGGACTGAAGTCGGAGACTGACGAGATCATCGAGAAGGCGAAGCTGATCAAGCAGCAGGGCGCAACCGGTGCCTACACGAAGGAGTTCGAGGACGTCGAGAAGAAAATTGCCACGATCAAGACCATCCTGAACAACACGACGGTGAGTGCGCGCGACATCAAGAAGATCGACGACAAAATCGCGAAGCTGCGCGATCATCTGGAAAAATCCGAGGAGAGTCTGAAGCAAACGGACGCCGAGCTCGATCGACTCAACGAGGAGATGAACTTGGCGGGCGTCGAAATCTCCAACTTGGAGGAGAAGAGCGACAAGATCAAAGCGCTGGCGAACGCTCTCAAAGAAAATGCTACGACCCTGCAGGAGGCGAACATCGAGGGAGCTCTTAATTTGACACGCGACGCGTGGATTCGCGTGAAAGGTCTGTCGGAGGTGTACACCGAAATCGGCGACCTCAACTCGGAAGCCGACCGCCAGTGCAAGCGAATCGAGAATCTCATCAGTCGACAACTGGAGAACGAAAATATCTTGTCGGCGAACGACAAACAGATCGAGGAGCTGCAGACGAACTTGAACGAGCTCATTGCCGAGATTCCGAGTCTAAACGAGCAGATTTGCGACAAGCGCGGCGACCCTTGCGACGCACTTTGTGGCGGTGCCGGATGCGGCAAGTGCGGTGGAATCTCGTGCGAAAAGGGCGCCTTCACAAAGGCCTCGAACGCTCTCAACTACGCAAAGGACACCGAGAAGATCATCAAGGAGAAGGACCAGCATGCGGAGGAGTACATTCGATCGGTGTCGCACGCCAAGACTGAGGCCGTCGATGCCTTCAAGAAGGCGAAAGAGACGTTCGACAAGGTCGAGCAGACGTACAATACGACCGAGGCGCAGCTGAAGGAGGGACGTGAGCTGATTGCAGATTTGACGAACATCATTGCGAACACGACTGCGTCGCCTGGCGAAATCAAAGAAATCGCCGAGAGCATCTTGAAGCTCGACCTTCACTTGGATCCGTCCGAAATAAAGCATTTGGCGAACAATATCGATCAAACGGTCGCCGCACTCGAGAACGTCGATGACATCATTGAAAATACTCGAGGAGACCTGGAAATGGTCGAGCGACTCAAGACCAGCGCCACTGAAGCACAAAAGAGAGCAAACGACGTTTTAAAGAGAGCGACCGAAGTGAGCAAGGCACTGGAGGAGGCCGAATCATCGCAGAATGCTGCAAGAGACGCGATTAAGAAAGCCAACAGCGACATCTCGTCCGCCAAATCTGACTTGGAAGAGATCAACGTTGACGCAACGAAGGCGCGCACCAGTGTTGATGAAACAGCGGCCGCCGTCAACGCTCTCCAGACGAAACTCAACAAGCTGCAACGAAAGTTCCTCAAAAACTCGCACGACGCCAAGGAGGTGAAGACACAGGCCGATCTCGCCAAGGACATTTCGAATGTGACACACGAGAAGGCGAAGAAGCTGAAGAGTCAGTACAAGTATGCGAACGACACACTGAATTCAAAGGCCCAATCGACTGAATATGCGCGAAACAAAGCCCAGCAATTACTGCAGCGAGCATCGAAAATAACAGTCGACACGAACAATAAATTGAAGGAGCTGAAAGATATGGTGGCACAGACAACTAGCAACGACGACGAGCTGGGCGAAATGGAGCGAAAGATTTTGAGACTTAACGGCGAAATTGACGTGTACATACAGCGCATCAGAGACCACGCCGATCGTTACCGAACCTGCACTTCT
Protein: 1322 (aa)
MGKDCNQCMPETYGLSESRDGCTPCDCDPGGSLDNNCDVITGQCKCRPYMQGRNCSEPKQNHFIPNLHIVVEAEGPSAICETQSSYNNCSVVIRENHPDRPTFSGPGFLKVYEGGDISFTVDNVPKTMNYDVQIRYQANQHGDWEDIRVSLIRPDTVEYGSACYNVNPVEEQEKFLRFNEYESSVIALSDLCLEEGKTYKFIVSLHRQNNYDPNPKAQILIDSLALIPRIDVTTIFSGSTIAEIRKQDFDYYGCNDTYYSVALTQSADPRCKELIEMSSALVFNGATQIPHKLIMSRAVFTLSIIFLIQILSLNLDAAELGDSCDCNPTGSLSKKCDEHGGFCRCKTNVVGRQCDRCAPRTYGFSQDGCQSCDCDSIGSKDGECHLTTGQCNCQPNTYGRECNQCQPGYWNFPDCQQCNCNGHATTCDSKTGECIGCQDFTTGNHCDECITGYYGDPFLGSEIGCRPCRCPDTIASGHSHASQCALISNNNDVICYCEPGYAGTKCDVCDNNYFGNPDKPGGDCKPCNCSSNVDLSRSGNCDARSGKCLQCLYDTDGDHCEYCRDGFYGDASRQDCRECDCDVLGSVGGKFCDRYTGQCPCLPNVVGMRCDQCENNHWKIASGEGCEPCNCDEIGAYSEQCNPYDGQCSCRPGFGGRACDQCEANFWGDPNVECKPCECNTYGSFTYQCDRETGQCKCIKGIGGYKCDQCDRGYLGAAPYCSPCGECFDNWDDILNGLKSETDEIIEKAKLIKQQGATGAYTKEFEDVEKKIATIKTILNNTTVSARDIKKIDDKIAKLRDHLEKSEESLKQTDAELDRLNEEMNLAGVEISNLEEKSDKIKALANALKENATTLQEANIEGALNLTRDAWIRVKGLSEVYTEIGDLNSEADRQCKRIENLISRQLENENILSANDKQIEELQTNLNELIAEIPSLNEQICDKRGDPCDALCGGAGCGKCGGISCEKGAFTKASNALNYAKDTEKIIKEKDQHAEEYIRSVSHAKTEAVDAFKKAKETFDKVEQTYNTTEAQLKEGRELIADLTNIIANTTASPGEIKEIAESILKLDLHLDPSEIKHLANNIDQTVAALENVDDIIENTRGDLEMVERLKTSATEAQKRANDVLKRATEVSKALEEAESSQNAARDAIKKANSDISSAKSDLEEINVDATKARTSVDETAAAVNALQTKLNKLQRKFLKNSHDAKEVKTQADLAKDISNVTHEKAKKLKSQYKYANDTLNSKAQSTEYARNKAQQLLQRASKITVDTNNKLKELKDMVAQTTSNDDELGEMERKILRLNGEIDVYIQRIRDHADRYRTCTS
Type | Start | End | Length |
CDS |
28634 |
28770 |
137 |
CDS |
28891 |
29035 |
145 |
CDS |
29139 |
29327 |
189 |
CDS |
29398 |
29466 |
69 |
CDS |
29555 |
29627 |
73 |
CDS |
29995 |
31321 |
1327 |
CDS |
31384 |
31480 |
97 |
CDS |
31545 |
31741 |
197 |
CDS |
31801 |
32565 |
765 |
CDS |
32632 |
32736 |
105 |
CDS |
32852 |
33044 |
193 |
CDS |
33106 |
33513 |
408 |
CDS |
33749 |
34009 |
261 |
intron |
28771 |
28890 |
120 |
intron |
29036 |
29138 |
103 |
intron |
29328 |
29397 |
70 |
intron |
29467 |
29554 |
88 |
intron |
29628 |
29994 |
367 |
intron |
31322 |
31383 |
62 |
intron |
31481 |
31544 |
64 |
intron |
31742 |
31800 |
59 |
intron |
32566 |
32631 |
66 |
intron |
32737 |
32851 |
115 |
intron |
33045 |
33105 |
61 |
intron |
33514 |
33748 |
235 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001862577 |
laminin subunit beta-1 [Culex quinquefasciatus] gb|EDS37215.1| laminin subunit beta-1 [Culex quinquefasciatus] |
0.0 |
InterPro |
IPR009053 |
Prefoldin |
|
InterPro |
IPR013015 |
Laminin IV |
|
InterPro |
IPR006210 |
Epidermal growth factor-like |
|
InterPro |
IPR002049 |
EGF-like, laminin |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Pfam |
PF04582.7 |
Reovirus sigma C capsid protein |
5.1 |
Pfam |
PF09824.4 |
ArsR transcriptional regulator |
1.2 |
Pfam |
PF00053.19 |
Laminin EGF-like (Domains III and V) |
4.7e-80 |
Pfam |
PF13166.1 |
AAA domain |
4 |
Pfam |
PF10174.4 |
RIM-binding protein of the cytomatrix active zone |
5.3 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
N. vitripennis |
NV14179-PA |
D. plexippus |
DPOGS211962PA |
P. vanderplanki |
Pv.06237 |
H. sapiens |
ENSP00000402265 |
B. mori |
BGIBMGA000913-TA |
D. melanogaster |
FBgn0261800 |
M. musculus |
ENSMUSG00000052911 |
H. sapiens |
ENSP00000222399 |
H. sapiens |
ENSP00000373432 |
H. sapiens |
ENSP00000307156 |
H. sapiens |
ENSP00000205386 |
C. quinquefasciatus |
CPIJ011714 |
H. melpomene |
HMEL004807-PA |
A. gambiae |
AGAP001381 |
H. sapiens |
ENSP00000416562 |
S. invicta |
SI2.2.0_06367 |
M. musculus |
ENSMUSG00000002900 |
H. sapiens |
ENSP00000377190 |
A. aegypti |
AAEL003658 |
B. mori |
BGIBMGA000910-TA |
S. invicta |
SI2.2.0_16537 |
P. humanus |
PHUM032490-PA |
H. sapiens |
ENSP00000377191 |
H. sapiens |
ENSP00000388325 |
H. sapiens |
ENSP00000402353 |
H. sapiens |
ENSP00000373433 |
A. mellifera |
GB13917-PA |
T. castaneum |
TC005184 |
B. mori |
BGIBMGA000911-TA |