MidgeBase gene description page [Pn.03611]
Outline
Gene ID | Pn.03611 |
Type | Protein coding gene |
Scaffold | PnScaf2973 |
Start | 31 |
End | 8575 |
Direction | + |
Sequence
Transcript: 3858 (bp)
ATGATATGCCGATTAAACAACAATAAAATTAAAACGCTTCCCGATAATTTGTTGGCGGGCATGTCCAATTTAGTGCGAGTTGACTTATCAAGTAATGAGATCGTTGCAATATCGAAGAGAATGTTTAAAGGAGTTACCGCGTTGAGGAGTCTTCAATTGGACCATAATCAAATTGTATGCATCGATGAGCAAGCGCTCAAGAATTTGCATGAACTTGAGATTCTTACGCTTAACAACAATAACATTACCACACTGCCGCGCGAAATGTTTAGCAGCATGCCACGTTTAAGAGCACTCCGTCTCTCTGACAATCCATTTGCATGCGATTGTCATTTGGCATGGCTCTCGAAATTTCTCAGAAGTGCACCTCGATTAGCACCGTATACACGCTGTCATTCTCCGAGCCAGCTCAAGGGACAAAACGTGGCTGATTTGCACGACCAAGAGTTCAAATGTTCAGGTCTCACCGAACACGCTCCGATTGAATGCGGTGGCAGGAGTTTGTGTCCACATCCATGCCGCTGTGCCGAGGGTATTGTTGACTGTCGTGAAAAGAGCTTATCAAACGTTCCAATGACACTGCCAGAAGACACTACTGAAATTCGATTGGAGCAGAATTACATTACAGAAATCCCACCCAAAGCGTTTTCAAATCATCGTCGCTTGAGAAGAATAGATTTATCTAATAACAACATCTCAAAAATTGCTTACGACGCTTTCTCTGGATTAAAGTCTCTTAATTCGTTAGTGCTATACGGAAACAAGATCAAGGAGCTGCCAGCTGGTGTTTTCAAAGGGTTAACATCATTACAATTGTTGTTGCTAAACGCTAATGAAATTTCGTGCATTCGGAAGGACTCGTTCAAGGATTTGACGGGCTTGAGTCTACTATCACTTTACGACAATAATATTCAGTCGCTTTCAAATGGAACTTTTGATGCGCTAAAGCAAATTCAGACGTTACATTTGGCACGTAATCCATTTATATGTGATTGCAATCTCAAGTGGTTAGCTGACTATTTACACAAGAACCCAATTGAGACGAGTGGTGCCAAGTGCGAAGCACCGAAAAGAATGCATCGCAGGAAAATTGAATCATTAAGAGACGAAAAGCTAAAATGTACTGAAGATTTGCGTAGCAAATATGCAGCTGATTGCCGTGCTGAGCCAATTGAATGTCCCGCCGTGTGTCACTGCGAGCGTACAACTGTCGATTGTTCTGGACGCGCATTAAAAGAGATTCCACGCGACATTCCATTATTCACAATGGAACTCTTGCTGAATGACAACCAGTTAGGGCGCATAAAATCTGATGGACTTTTTGGTCGATTGCCAAATCTTGTCAAATTGGATTTACGACGTAATCAAATCACGTCTATCGAGCCACGCGCTTTTGAAGGTGCATCGAAGATTCAGGAAATGCTAATTAGCGAGAATAAAATGAAAGAAGTACATAATAAAATGTTTATGGGACTTCACAATTTAAAAATACTCTCACTATACGACAACCTAATTTCGTGCGTTATGCCTGGAAGTTTCGAACACTTGACGTCACTTAATCAACTGAATTTGGCGTCAAATCCATTTTCATGCAGTTGCCATTTGGCATGGTTTTCGGATTGGTTAAGGAAGAAGCAATTGGGCGGCACACCTGCTCGATGTGTAACACCGGCAAGAGTAAAAGACGTGGCTATTAAGGATTTACCACATCACGAGTTCCGCTGCAGTGGAAAGAGTGACCAGGGATGCCTCGGAGAGGACTATTGTCCTACTTTGTGCACTTGCACAGGCACTGTTGTACGCTGCTCGAGAAATTCTCTCAACGAGATACCTCGCGGCATTCCTAGTGAAACGACCGAGCTGTACTTAGAATCAAACAACATCTCAACAATTCACGCAGACCGCATTCGACATTTGAAATCACTAACGCGCTTGGATTTGAGCAATAACCGCATTAATATTCTCTCCAACTATACATTCTCAAACCTCACGAAGCTTTCGACTCTCATAATAAGCTACAATAATCTTCAATGTGTTCAACGACATGCATTAACGGGTTTGAAGAATTTACGTGTTTTATCGCTGCATGGCAATCAAATATCTATGATACCAGAGGGTTCATTCAACGATCTACAATCGATAACTCACATTGCACTAGGAAGTAATCCATTGTATTGCGATTGCTCATTGAAATGGCTTTCAGAATGGGTGAAATTGGATTATGTCGAGCCGGGTATCGCACGATGTACCGAACCGGAGCGTATGAAGGATAAACTAATTTTATCGACACCAGCGAATCAATTCGTGTGCAATGAGAAAGTCAGCAACGAAATTTTATCCAAGTGTGACGCATGCTATACTTTCCCTTGCAAGAATAACGCAGAATGTGTTATCAAGCCAGAGAAGCAGTACGAGTGTCGATGCGTGGCCGGTTACCATGGAGAACACTGCGAGCACATGATTGATGCATGCTATGGAAATCCGTGCCGAAACAACGGCACATGCAGTGTGCTAGAGGAAGGACGATTCAACTGCGATTGTTTGCCCGGCTACAAAGGGGCGCGATGCGAAATCAACATCGACGACTGTGAGGAGCACAAGTGCCAGAACAATGGCACATGTGTCGACGGTGTTGAGTCATACACTTGCAACTGTCTGCCCGGTTATACTGGCGAATTCTGCGAACAAAAAATACAGTTTTGTGGAATCAATTTTAACCCATGTGAAAATGGCGCGAAATGCGTGGATTATAGCACGCATTATACATGCGAGTGTTTGGCAGGCTTCCGCGGTGTCAACTGCTCCGAAAACATTGATGACTGCGAAAATAACATGTGTCAAAACGGCGGCACATGTGTCGATGGCATAAACACATACTATTGCGAGTGTCCAAGCGATTTCACTGGAAAGTTCTGTGAGGGAACTCCGATGGTCGCTATGATGTATCCACAAACTTCGCCGTGCCAGAATCATGAATGCAAATTTGGCGTTTGTTTTCAACCGAATCCCTCATCATCGGAGTATATGTGCAAATGCAACCAGGGCTATACAGGAAAGCGATGCGAATATCTAACGTCTCTGACTTTCCTTCACAACAATAGCTTTGTTGAAATGGAGCCGCTGCGAATGAAGCCGGAAGCCAATGTCACTGTAATTTTCAGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACGAATACTCTCAGATACCACATGGTTGAGTTGTATGCAGTCAAAAAAAACTTTACGCTACGTGTCAATAAAGGACAGGCACGTTCGATTGTGAACGATGGCACGAAGGAATATTTGAAATTCTCTTCACCTATGTTCCTTGGTGGCATTCCACCCGAAGCAGCACATCAAGCATATAATCAATATCACTTGAGAAACATGACGAGCTTCCAAGGGTGCATGAAAGAAGTGTGGATAAATCACAAGCAAGTCGACTTTGTGAATGCGGCGCGTCAACAAAAAGTCAGCCCTGGATGTGCACTCTACGATTCAGACACGGAAAACGATAGTGAAAGCTATCAAGACCAGTTCATTCAGGAGCCACCGGAGGATGCGTCGAAGGAAGAGGATCCGTGTGATGTTCATCAATGCAAAAACAATGGCAAGTGTGTGGCAAATAATAAAGGCTCATACAGTTGTAAATGATTTTCTTCTGTTGTGATTCAAGTAGTTATGCTGAGAAAAATTATC
Protein: 1286 (aa)
MICRLNNNKIKTLPDNLLAGMSNLVRVDLSSNEIVAISKRMFKGVTALRSLQLDHNQIVCIDEQALKNLHELEILTLNNNNITTLPREMFSSMPRLRALRLSDNPFACDCHLAWLSKFLRSAPRLAPYTRCHSPSQLKGQNVADLHDQEFKCSGLTEHAPIECGGRSLCPHPCRCAEGIVDCREKSLSNVPMTLPEDTTEIRLEQNYITEIPPKAFSNHRRLRRIDLSNNNISKIAYDAFSGLKSLNSLVLYGNKIKELPAGVFKGLTSLQLLLLNANEISCIRKDSFKDLTGLSLLSLYDNNIQSLSNGTFDALKQIQTLHLARNPFICDCNLKWLADYLHKNPIETSGAKCEAPKRMHRRKIESLRDEKLKCTEDLRSKYAADCRAEPIECPAVCHCERTTVDCSGRALKEIPRDIPLFTMELLLNDNQLGRIKSDGLFGRLPNLVKLDLRRNQITSIEPRAFEGASKIQEMLISENKMKEVHNKMFMGLHNLKILSLYDNLISCVMPGSFEHLTSLNQLNLASNPFSCSCHLAWFSDWLRKKQLGGTPARCVTPARVKDVAIKDLPHHEFRCSGKSDQGCLGEDYCPTLCTCTGTVVRCSRNSLNEIPRGIPSETTELYLESNNISTIHADRIRHLKSLTRLDLSNNRINILSNYTFSNLTKLSTLIISYNNLQCVQRHALTGLKNLRVLSLHGNQISMIPEGSFNDLQSITHIALGSNPLYCDCSLKWLSEWVKLDYVEPGIARCTEPERMKDKLILSTPANQFVCNEKVSNEILSKCDACYTFPCKNNAECVIKPEKQYECRCVAGYHGEHCEHMIDACYGNPCRNNGTCSVLEEGRFNCDCLPGYKGARCEINIDDCEEHKCQNNGTCVDGVESYTCNCLPGYTGEFCEQKIQFCGINFNPCENGAKCVDYSTHYTCECLAGFRGVNCSENIDDCENNMCQNGGTCVDGINTYYCECPSDFTGKFCEGTPMVAMMYPQTSPCQNHECKFGVCFQPNPSSSEYMCKCNQGYTGKRCEYLTSLTFLHNNSFVEMEPLRMKPEANVTVIFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTNTLRYHMVELYAVKKNFTLRVNKGQARSIVNDGTKEYLKFSSPMFLGGIPPEAAHQAYNQYHLRNMTSFQGCMKEVWINHKQVDFVNAARQQKVSPGCALYDSDTENDSESYQDQFIQEPPEDASKEEDPCDVHQCKNNGKCVANNKGSYSCKXFSSVVIQVVMLRKII
Type | Start | End | Length |
CDS |
31 |
33 |
3 |
CDS |
92 |
96 |
5 |
CDS |
360 |
431 |
72 |
CDS |
498 |
877 |
380 |
CDS |
952 |
1093 |
142 |
CDS |
2365 |
2724 |
360 |
CDS |
2807 |
2964 |
158 |
CDS |
3834 |
3987 |
154 |
CDS |
4055 |
4273 |
219 |
CDS |
5316 |
5828 |
513 |
CDS |
5931 |
6074 |
144 |
CDS |
6499 |
8083 |
1585 |
CDS |
8148 |
8223 |
76 |
CDS |
8526 |
8572 |
47 |
intron |
34 |
91 |
58 |
intron |
97 |
359 |
263 |
intron |
432 |
497 |
66 |
intron |
878 |
951 |
74 |
intron |
1094 |
2364 |
1271 |
intron |
2725 |
2806 |
82 |
intron |
2965 |
3833 |
869 |
intron |
3988 |
4054 |
67 |
intron |
4274 |
5315 |
1042 |
intron |
5829 |
5930 |
102 |
intron |
6075 |
6498 |
424 |
intron |
8084 |
8147 |
64 |
intron |
8224 |
8525 |
302 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001845970 |
slit protein [Culex quinquefasciatus] gb|EDS42151.1| slit protein [Culex quinquefasciatus] |
0.0 |
InterPro |
IPR000483 |
Cysteine-rich flanking region, C-terminal |
|
InterPro |
IPR000372 |
Leucine-rich repeat-containing N-terminal |
|
InterPro |
IPR001881 |
EGF-like calcium-binding |
|
InterPro |
IPR001611 |
Leucine-rich repeat |
|
InterPro |
IPR000742 |
Epidermal growth factor-like domain |
|
InterPro |
IPR013320 |
Concanavalin A-like lectin/glucanase, subgroup |
|
InterPro |
IPR001791 |
Laminin G domain |
|
InterPro |
IPR003590 |
Leucine-rich repeat, ribonuclease inhibitor subtype |
|
InterPro |
IPR013032 |
EGF-like, conserved site |
|
InterPro |
IPR006209 |
EGF-like domain |
|
InterPro |
IPR000152 |
EGF-type aspartate/asparagine hydroxylation site |
|
InterPro |
IPR018097 |
EGF-like calcium-binding, conserved site |
|
InterPro |
IPR003591 |
Leucine-rich repeat, typical subtype |
|
InterPro |
IPR006210 |
Epidermal growth factor-like |
|
InterPro |
IPR008985 |
Concanavalin A-like lectin/glucanase |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Gene Ontology(MF) |
GO:0005509 |
calcium ion binding |
|
Pfam |
PF12799.2 |
Leucine Rich repeats (2 copies) |
3.5e-54 |
Pfam |
PF01463.19 |
Leucine rich repeat C-terminal domain |
9.8e-11 |
Pfam |
PF07974.8 |
EGF-like domain |
0.0062 |
Pfam |
PF12661.2 |
Human growth factor-like EGF |
2.7e-11 |
Pfam |
PF00560.28 |
Leucine Rich Repeat |
4.7e-26 |
Pfam |
PF13855.1 |
Leucine rich repeat |
1.1e-85 |
Pfam |
PF07725.7 |
Leucine Rich Repeat |
0.44 |
Pfam |
PF12946.2 |
MSP1 EGF domain 1 |
0.0061 |
Pfam |
PF02210.19 |
Laminin G domain |
1.8e-10 |
Pfam |
PF00054.18 |
Laminin G domain |
4.4e-14 |
Pfam |
PF00008.22 |
EGF-like domain |
1.4e-30 |
Pfam |
PF13504.1 |
Leucine rich repeat |
1.1 |
Pfam |
PF01462.13 |
Leucine rich repeat N-terminal domain |
4.3e-16 |
Pfam |
PF13516.1 |
Leucine Rich repeat |
0.011 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
M. musculus |
ENSMUSG00000031558 |
H. sapiens |
ENSP00000315005 |
A. gambiae |
AGAP002792 |
A. aegypti |
AAEL009175 |
D. melanogaster |
FBgn0264089 |
H. sapiens |
ENSP00000422591 |
H. sapiens |
ENSP00000384890 |
H. sapiens |
ENSP00000332164 |
H. sapiens |
ENSP00000427548 |
H. sapiens |
ENSP00000266058 |
A. gambiae |
AGAP002793 |
P. vanderplanki |
Pv.09707 |
H. melpomene |
HMEL007794-PA |
N. vitripennis |
NV10549-PA |
M. musculus |
ENSMUSG00000056427 |
H. sapiens |
ENSP00000360080 |
A. mellifera |
GB19929-PA |
S. invicta |
SI2.2.0_14174 |
C. quinquefasciatus |
CPIJ004328 |
S. invicta |
SI2.2.0_07851 |
H. sapiens |
ENSP00000422261 |
H. sapiens |
ENSP00000273739 |
H. sapiens |
ENSP00000360109 |
M. musculus |
ENSMUSG00000025020 |
P. humanus |
PHUM419730-PA |
H. sapiens |
ENSP00000405183 |
T. castaneum |
TC000214 |
D. plexippus |
DPOGS211056PA |
B. mori |
BGIBMGA009611-TA |
H. sapiens |
ENSP00000430333 |