MidgeBase gene description page [Pn.10235]
Outline
Gene ID | Pn.10235 |
Type | Protein coding gene |
Scaffold | PnScaf11037 |
Start | 235 |
End | 5358 |
Direction | + |
Sequence
Transcript: 2190 (bp)
ATGGAGTTCACTCCGATCTATTTAATTTTTGTAATTATTTTCGCTCCACTAGCTCATGGATTTTCCATTCCAAAAGTCAATGTGAAACTTTCGAAACCGAGAGGAATTCAATTTTCATTACAAGAGACAGACGATGTTGTCTTCGACGACGTTCGACTATTTGTGTTCGCATCGAAGAGTCTGATAAACAATTCAAAGTTTGTCGAGAGCGAGCTGACGAGAGATGTCAATGGATTGTGGATTTATGAAATAGCAAATGTTGATTTAGCACTCGATGATGACATCGAGTATTGGCTGTATGTGGAGCACAACAAGCTTGGACATTACGCGACCGATGCTGTGAAGGTTGCAGACATCGAAGAGCAGCGAGAAGCACCGAGCGAGTTGACGGCAGCAGCGATGCCAACATCATCAACCAATGGCACTTATCCACATGTCACTGTCAAGCTGCTAAAACCGAATGGCATTCAATTTCTCATTGACGACGCAGAGAGCGGTCCATTAACCAATGTCAAATTAATTATTTTTGCATCAAAACCCATCGCTAATAACTCAAAATTAATTGATGTGCCACTCGCAAAAAATTATGATAATGGAGTGTGGAGCGGTGAGCTCGCAAATACACCGTTGACGGCCGATGATGAGATTGAATTTTGGTCGTATGGCGAGTTGCGCGGTTTGGGCTACTTCGCTAATGATGTCATTAAATTACGGGATTTGGACGCAGCTGAAGAGGACGATGTGCGGACGAGACAGCTGCCTGTGGGCCTGTGCAACGAGACCTTTCCGATTGTCAACGTGAAGATGATTGGCAGCAATAGCTTGCGAATCTCGCTGCAAGCACCAGAAGAGTTCGACGAATACAGCGCTGTGAAAATTTTAATATTCTCGCCCCGGAGCCTCGTAAATAATGATTTAAAATTTATTGAGAATCAGTTAAGACGCGACGATAGTGGCTTGTGGATTTACGAGTTGCGAAACGTCGCTCTGCAGCCGTCTGATGTCTTCGAGTACTGGATTTATGTGGAAAAGTCGAGCGTCGGTTATTACGTGAGCCAAAAGTTCGGAGTTCAAGATATTGCGCCGCCACCACCTGAACCGCCGACAACATCGACGACGAGCACAACGACATCGCCATGCGAGGCGTCTGTCAGCGTCGCTAATGGAAAGCCAGTGGCATGCAAGAATTCCATTATTTTCAATGAAGATTTTAATTTGGATAATTTGAAATACTGGAGCTTCGACACTCGCTTTCCATTAGACGATGCGACAGCCGACGCCGAGTTCTGTGTGTACGAGAAACGCGCCGAGACCTCCTTCATCCGCAACGGCACGATGACGCTGCGAGCCGAGTCGCTGAAGCGCGTTGCGGGCTTCGATGACGTGCGCATCAGGCTCGGAAAATTTAATCTGGATGAGCGATGCACGCCCATCGCGGGCGACGAGCGCGAGTGCGAAAGACATGCACAATTTGGTTACATCTTGCCGCCGGTTACGTCCGCCTATCTCACGACGCGCGGCAAATTCTCGTTCATGTATGGGCGAGTTGAGGCGCGTATTCGGGCGCCCATCGGCGACTACCTCTACGCACAAATGACTCTTAAGCCACAAGTGGAAGCGGCCGATGAGGAGGGAAGAAACAGCAGCCGTTCGCAGCACTTGAAAGTGTTCTTCACGCGAGGCAATGAGCATCTCCGCGACGGCGACGAGGAAGTGGGCGGAAGTCGTGTTTATGGCGGTGCGATTCTCTCGAAAAATCCGAAAAACAATCTCCGATGGCTTAAGAGTCGGCACTTTCCGGACTCGCATCTCGGCCATGAGTTTCACATTTACGAGCTGCTGTGGACGCCGACGGAGATCTCGCTCTCGATCGATGGCATAAAGTACGGTTCATTAGGCAGCGATTTGAGGGAGTCAGCGATGGTGGCAAAAATTAAATCGGCCGTGAACTGGTCGCAAAACGGTCCGTTTAATAGAGAGCACTTTTTATCGCTAAACTT
GGCAGCCGGCAGCGTCAAGAATTTCTACTCGACAAACGGCACCGTGCTGAACGGCATTGCGCTCGAACCGAAGCCATGGAGCGACACAGATCCCCGAGCTGAGCGCAGCTTCTACATGGCTCATGACAAATGGTACTCAACATGGAAGCAACCGACCCTCGACATCGACTACGTACGAGTTTATGCTGTT
Protein: 730 (aa)
MEFTPIYLIFVIIFAPLAHGFSIPKVNVKLSKPRGIQFSLQETDDVVFDDVRLFVFASKSLINNSKFVESELTRDVNGLWIYEIANVDLALDDDIEYWLYVEHNKLGHYATDAVKVADIEEQREAPSELTAAAMPTSSTNGTYPHVTVKLLKPNGIQFLIDDAESGPLTNVKLIIFASKPIANNSKLIDVPLAKNYDNGVWSGELANTPLTADDEIEFWSYGELRGLGYFANDVIKLRDLDAAEEDDVRTRQLPVGLCNETFPIVNVKMIGSNSLRISLQAPEEFDEYSAVKILIFSPRSLVNNDLKFIENQLRRDDSGLWIYELRNVALQPSDVFEYWIYVEKSSVGYYVSQKFGVQDIAPPPPEPPTTSTTSTTTSPCEASVSVANGKPVACKNSIIFNEDFNLDNLKYWSFDTRFPLDDATADAEFCVYEKRAETSFIRNGTMTLRAESLKRVAGFDDVRIRLGKFNLDERCTPIAGDERECERHAQFGYILPPVTSAYLTTRGKFSFMYGRVEARIRAPIGDYLYAQMTLKPQVEAADEEGRNSSRSQHLKVFFTRGNEHLRDGDEEVGGSRVYGGAILSKNPKNNLRWLKSRHFPDSHLGHEFHIYELLWTPTEISLSIDGIKYGSLGSDLRESAMVAKIKSAVNWSQNGPFNREHFLSLNLAAGSVKNFYSTNGTVLNGIALEPKPWSDTDPRAERSFYMAHDKWYSTWKQPTLDIDYVRVYAV
Type | Start | End | Length |
CDS | 235 | 358 | 124 |
CDS | 443 | 670 | 228 |
CDS | 1039 | 1170 | 132 |
CDS | 1328 | 1558 | 231 |
CDS | 1652 | 1777 | 126 |
CDS | 2619 | 2852 | 234 |
CDS | 3272 | 3617 | 346 |
CDS | 3687 | 3856 | 170 |
CDS | 4355 | 4548 | 194 |
CDS | 4642 | 4836 | 195 |
CDS | 5040 | 5148 | 109 |
CDS | 5255 | 5355 | 101 |
intron | 359 | 442 | 84 |
intron | 671 | 1038 | 368 |
intron | 1171 | 1327 | 157 |
intron | 1559 | 1651 | 93 |
intron | 1778 | 2618 | 841 |
intron | 2853 | 3271 | 419 |
intron | 3618 | 3686 | 69 |
intron | 3857 | 4354 | 498 |
intron | 4549 | 4641 | 93 |
intron | 4837 | 5039 | 203 |
intron | 5149 | 5254 | 106 |
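The coordinates in the table above appear to be 1-based and inclusive, so each length is End - Start + 1. The following is a minimal sketch of that arithmetic, assuming this convention holds for every row; the helper names `segment_length` and `introns_between` are hypothetical and not part of MidgeBase. It checks that the CDS lengths sum to the listed transcript length (2190 bp), that this corresponds to 730 codons (the listed protein length), and that the intron rows are the gaps between consecutive CDS segments.

```python
# CDS segments (start, end) copied from the table above.
# Assumption: coordinates are 1-based and inclusive on the scaffold.
CDS = [
    (235, 358), (443, 670), (1039, 1170), (1328, 1558),
    (1652, 1777), (2619, 2852), (3272, 3617), (3687, 3856),
    (4355, 4548), (4642, 4836), (5040, 5148), (5255, 5355),
]


def segment_length(start, end):
    """Length of an inclusive 1-based interval."""
    return end - start + 1


def introns_between(segments):
    """Return the gaps between consecutive segments as (start, end) pairs."""
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(segments, segments[1:])
    ]


cds_total = sum(segment_length(s, e) for s, e in CDS)
introns = introns_between(CDS)
intron_lengths = [segment_length(s, e) for s, e in introns]

print("CDS total (bp):", cds_total)
print("Codons:", cds_total // 3)
print("Introns:", introns)
print("Intron lengths:", intron_lengths)
```

Deriving the introns from the CDS rows, rather than typing them in separately, makes the check sensitive to a transcription error in either half of the table.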
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_001850624 | conserved hypothetical protein [Culex quinquefasciatus] gb|EDS32378.1| conserved hypothetical protein [Culex quinquefasciatus] | 3e-59 |
InterPro | IPR001202 | WW/Rsp5/WWP | |
InterPro | IPR013320 | Concanavalin A-like lectin/glucanase, subgroup | |
InterPro | IPR008985 | Concanavalin A-like lectin/glucanase | |
Gene Ontology(MF) | GO:0005515 | protein binding | |
Pfam | PF00722.16 | Glycosyl hydrolases family 16 | 2.6e-07 |
Pfam | PF09866.4 | Uncharacterized protein conserved in bacteria (DUF2093) | 0.051 |
Pfam | PF12843.2 | Protein of unknown function (DUF3820) | 0.016 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
A. aegypti | AAEL007626 |
N. vitripennis | NV10893-PA |
A. mellifera | GB19452-PA |
D. plexippus | DPOGS212941PA |
T. castaneum | TC011529 |
C. quinquefasciatus | CPIJ008997 |
D. plexippus | DPOGS212964PA |
A. mellifera | GB19961-PA |
C. quinquefasciatus | CPIJ005217 |
H. melpomene | HMEL003270-PA |
D. plexippus | DPOGS212963PA |
A. aegypti | AAEL000652 |
C. quinquefasciatus | CPIJ013557 |
D. melanogaster | FBgn0040323 |
N. vitripennis | NV21393-PA |
P. vanderplanki | Pv.11776 |
H. melpomene | HMEL005461-PA |
C. quinquefasciatus | CPIJ013556 |
B. mori | BGIBMGA011609-TA |
S. invicta | SI2.2.0_14729 |
B. mori | BGIBMGA011607-TA |
A. gambiae | AGAP012409 |
H. melpomene | HMEL005460-PA |
A. gambiae | AGAP006761 |
D. melanogaster | FBgn0040321 |
T. castaneum | TC003991 |
B. mori | BGIBMGA011608-TA |