MidgeBase gene description page [Pn.03611]

Outline

Link to gbrowse

Gene ID Pn.03611
Type Protein coding gene
Scaffold PnScaf2973
Start 31
End 8575
Direction +

Sequence

Transcript: 3858 (bp)

 ATGATATGCCGATTAAACAACAATAAAATTAAAACGCTTCCCGATAATTTGTTGGCGGGCATGTCCAATTTAGTGCGAGTTGACTTATCAAGTAATGAGATCGTTGCAATATCGAAGAGAATGTTTAAAGGAGTTACCGCGTTGAGGAGTCTTCAATTGGACCATAATCAAATTGTATGCATCGATGAGCAAGCGCTCAAGAATTTGCATGAACTTGAGATTCTTACGCTTAACAACAATAACATTACCACACTGCCGCGCGAAATGTTTAGCAGCATGCCACGTTTAAGAGCACTCCGTCTCTCTGACAATCCATTTGCATGCGATTGTCATTTGGCATGGCTCTCGAAATTTCTCAGAAGTGCACCTCGATTAGCACCGTATACACGCTGTCATTCTCCGAGCCAGCTCAAGGGACAAAACGTGGCTGATTTGCACGACCAAGAGTTCAAATGTTCAGGTCTCACCGAACACGCTCCGATTGAATGCGGTGGCAGGAGTTTGTGTCCACATCCATGCCGCTGTGCCGAGGGTATTGTTGACTGTCGTGAAAAGAGCTTATCAAACGTTCCAATGACACTGCCAGAAGACACTACTGAAATTCGATTGGAGCAGAATTACATTACAGAAATCCCACCCAAAGCGTTTTCAAATCATCGTCGCTTGAGAAGAATAGATTTATCTAATAACAACATCTCAAAAATTGCTTACGACGCTTTCTCTGGATTAAAGTCTCTTAATTCGTTAGTGCTATACGGAAACAAGATCAAGGAGCTGCCAGCTGGTGTTTTCAAAGGGTTAACATCATTACAATTGTTGTTGCTAAACGCTAATGAAATTTCGTGCATTCGGAAGGACTCGTTCAAGGATTTGACGGGCTTGAGTCTACTATCACTTTACGACAATAATATTCAGTCGCTTTCAAATGGAACTTTTGATGCGCTAAAGCAAATTCAGACGTTACATTTGGCACGTAATCCATTTATATGTGATTGCAATCTCAAGTGGTTAGCTGACTATTTACACAAGAACCCAATTGAGACGAGTGGTGCCAAGTGCGAAGCACCGAAAAGAATGCATCGCAGGAAAATTGAATCATTAAGAGACGAAAAGCTAAAATGTACTGAAGATTTGCGTAGCAAATATGCAGCTGATTGCCGTGCTGAGCCAATTGAATGTCCCGCCGTGTGTCACTGCGAGCGTACAACTGTCGATTGTTCTGGACGCGCATTAAAAGAGATTCCACGCGACATTCCATTATTCACAATGGAACTCTTGCTGAATGACAACCAGTTAGGGCGCATAAAATCTGATGGACTTTTTGGTCGATTGCCAAATCTTGTCAAATTGGATTTACGACGTAATCAAATCACGTCTATCGAGCCACGCGCTTTTGAAGGTGCATCGAAGATTCAGGAAATGCTAATTAGCGAGAATAAAATGAAAGAAGTACATAATAAAATGTTTATGGGACTTCACAATTTAAAAATACTCTCACTATACGACAACCTAATTTCGTGCGTTATGCCTGGAAGTTTCGAACACTTGACGTCACTTAATCAACTGAATTTGGCGTCAAATCCATTTTCATGCAGTTGCCATTTGGCATGGTTTTCGGATTGGTTAAGGAAGAAGCAATTGGGCGGCACACCTGCTCGATGTGTAACACCGGCAAGAGTAAAAGACGTGGCTATTAAGGATTTACCACATCACGAGTTCCGCTGCAGTGGAAAGAGTGACCAGGGATGCCTCGGAGAGGACTATTGTCCTACTTTGTGCACTTGCACAGGCACTGTTGTACGCTGCTCGAGAAATTCTCTCAACGAGATACCTCGCGGCATTCCTAGTGAAACGACCGAGCTGTACTTAGAATCAAACAACATCTCAACAATTCACGCAGACCGCATTCGACATTTGAAATCACTAACGCGCTTGGATTTGAGCAATAACCGCATTAATATTCTCTCCAACTATACATTCTCAAACCTCACGAAGCTTTCGACTCTCATAATAAGCTACAATAATCTTCAATGTGTTCAACGACATGCATTAACGGGTTTGAAGAATTTACGTGTTTTATCGCTGCATGGCAATCAAATATCTATGATACCAGAGGGTTCATTCAACGATCTACAATCGATAACTCACATTGCACTAGGAAGTAATCCATTGTATTGCGATTGCTCATTGAAATGGCTTTCAGAATGGGTGAAATTGGATTATGTCGAGCCGGGTATCGCACGATGTACCGAACCGGAGCGTATGAAGGATAAACTAATTTTATCGACACCAGCGAATCAATTCGTGTGCAATGAGAAAGTCAGCAACGAAATTTTATCCAAGTGTGACGCATGCTATACTTTCCCTTGCAAGAATAACGCAGAATGTGTTATCAAGCCAGAGAAGCAGTACGAGTGTCGATGCGTGGCCGGTTACCATGGAGAACACTGCGAGCACATGATTGATGCATGCTATGGAAATCCGTGCCGAAACAACGGCACATGCAGTGTGCTAGAGGAAGGACGATTCAACTGCGATTGTTTGCCCGGCTACAAAGGGGCGCGATGCGAAATCAACATCGACGACTGTGAGGAGCACAAGTGCCAGAACAATGGCACATGTGTCGACGGTGTTGAGTCATACACTTGCAACTGTCTGCCCGGTTATACTGGCGAATTCTGCGAACAAAAAATACAGTTTTGTGGAATCAATTTTAACCCATGTGAAAATGGCGCGAAATGCGTGGATTATAGCACGCATTATACATGCGAGTGTTTGGCAGGCTTCCGCGGTGTCAACTGCTCCGAAAACATTGATGACTGCGAAAATAACATGTGTCAAAACGGCGGCACATGTGTCGATGGCATAAACACATACTATTGCGAGTGTCCAAGCGATTTCACTGGAAAGTTCTGTGAGGGAACTCCGATGGTCGCTATGATGTATCCACAAACTTCGCCGTGCCAGAATCATGAATGCAAATTTGGCGTTTGTTTTCAACCGAATCCCTCATCATCGGAGTATATGTGCAAATGCAACCAGGGCTATACAGGAAAGCGATGCGAATATCTAACGTCTCTGACTTTCCTTCACAACAATAGCTTTGTTGAAATGGAGCCGCTGCGAATGAAGCCGGAAGCCAATGTCACTGTAATTTTCAGCNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACGAATACTCTCAGATACCACATGGTTGAGTTGTATGCAGTCAAAAAAAACTTTACGCTACGTGTCAATAAAGGACAGGCACGTTCGATTGTGAACGATGGCACGAAGGAATATTTGAAATTCTCTTCACCTATGTTCCTTGGTGGCATTCCACCCGAAGCAGCACATCAAGCATATAATCAATATCACTTGAGAAACATGACGAGCTTCCAAGGGTGCATGAAAGAAGTGTGGATAAATCACAAGCAAGTCGACTTTGTGAATGCGGCGCGTCAACAAAAAGTCAGCCCTGGATGTGCACTCTACGATTCAGACACGGAAAACGATAGTGAAAGCTATCAAGACCAGTTCATTCAGGAGCCACCGGAGGATGCGTCGAAGGAAGAGGATCCGTGTGATGTTCATCAATGCAAAAACAATGGCAAGTGTGTGGCAAATAATAAAGGCTCATACAGTTGTAAATGATTTTCTTCTGTTGTGATTCAAGTAGTTATGCTGAGAAAAATTATC 

Protein: 1286 (aa)

 MICRLNNNKIKTLPDNLLAGMSNLVRVDLSSNEIVAISKRMFKGVTALRSLQLDHNQIVCIDEQALKNLHELEILTLNNNNITTLPREMFSSMPRLRALRLSDNPFACDCHLAWLSKFLRSAPRLAPYTRCHSPSQLKGQNVADLHDQEFKCSGLTEHAPIECGGRSLCPHPCRCAEGIVDCREKSLSNVPMTLPEDTTEIRLEQNYITEIPPKAFSNHRRLRRIDLSNNNISKIAYDAFSGLKSLNSLVLYGNKIKELPAGVFKGLTSLQLLLLNANEISCIRKDSFKDLTGLSLLSLYDNNIQSLSNGTFDALKQIQTLHLARNPFICDCNLKWLADYLHKNPIETSGAKCEAPKRMHRRKIESLRDEKLKCTEDLRSKYAADCRAEPIECPAVCHCERTTVDCSGRALKEIPRDIPLFTMELLLNDNQLGRIKSDGLFGRLPNLVKLDLRRNQITSIEPRAFEGASKIQEMLISENKMKEVHNKMFMGLHNLKILSLYDNLISCVMPGSFEHLTSLNQLNLASNPFSCSCHLAWFSDWLRKKQLGGTPARCVTPARVKDVAIKDLPHHEFRCSGKSDQGCLGEDYCPTLCTCTGTVVRCSRNSLNEIPRGIPSETTELYLESNNISTIHADRIRHLKSLTRLDLSNNRINILSNYTFSNLTKLSTLIISYNNLQCVQRHALTGLKNLRVLSLHGNQISMIPEGSFNDLQSITHIALGSNPLYCDCSLKWLSEWVKLDYVEPGIARCTEPERMKDKLILSTPANQFVCNEKVSNEILSKCDACYTFPCKNNAECVIKPEKQYECRCVAGYHGEHCEHMIDACYGNPCRNNGTCSVLEEGRFNCDCLPGYKGARCEINIDDCEEHKCQNNGTCVDGVESYTCNCLPGYTGEFCEQKIQFCGINFNPCENGAKCVDYSTHYTCECLAGFRGVNCSENIDDCENNMCQNGGTCVDGINTYYCECPSDFTGKFCEGTPMVAMMYPQTSPCQNHECKFGVCFQPNPSSSEYMCKCNQGYTGKRCEYLTSLTFLHNNSFVEMEPLRMKPEANVTVIFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTNTLRYHMVELYAVKKNFTLRVNKGQARSIVNDGTKEYLKFSSPMFLGGIPPEAAHQAYNQYHLRNMTSFQGCMKEVWINHKQVDFVNAARQQKVSPGCALYDSDTENDSESYQDQFIQEPPEDASKEEDPCDVHQCKNNGKCVANNKGSYSCKXFSSVVIQVVMLRKII 
Type Start End Length
CDS 31 33 3
CDS 92 96 5
CDS 360 431 72
CDS 498 877 380
CDS 952 1093 142
CDS 2365 2724 360
CDS 2807 2964 158
CDS 3834 3987 154
CDS 4055 4273 219
CDS 5316 5828 513
CDS 5931 6074 144
CDS 6499 8083 1585
CDS 8148 8223 76
CDS 8526 8572 47
intron 34 91 58
intron 97 359 263
intron 432 497 66
intron 878 951 74
intron 1094 2364 1271
intron 2725 2806 82
intron 2965 3833 869
intron 3988 4054 67
intron 4274 5315 1042
intron 5829 5930 102
intron 6075 6498 424
intron 8084 8147 64
intron 8224 8525 302

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001845970 slit protein [Culex quinquefasciatus] gb|EDS42151.1| slit protein [Culex quinquefasciatus] 0.0
InterPro IPR000483 Cysteine-rich flanking region, C-terminal
InterPro IPR000372 Leucine-rich repeat-containing N-terminal
InterPro IPR001881 EGF-like calcium-binding
InterPro IPR001611 Leucine-rich repeat
InterPro IPR000742 Epidermal growth factor-like domain
InterPro IPR013320 Concanavalin A-like lectin/glucanase, subgroup
InterPro IPR001791 Laminin G domain
InterPro IPR003590 Leucine-rich repeat, ribonuclease inhibitor subtype
InterPro IPR013032 EGF-like, conserved site
InterPro IPR006209 EGF-like domain
InterPro IPR000152 EGF-type aspartate/asparagine hydroxylation site
InterPro IPR018097 EGF-like calcium-binding, conserved site
InterPro IPR003591 Leucine-rich repeat, typical subtype
InterPro IPR006210 Epidermal growth factor-like
InterPro IPR008985 Concanavalin A-like lectin/glucanase
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0005509 calcium ion binding
Pfam PF12799.2 Leucine Rich repeats (2 copies) 3.5e-54
Pfam PF01463.19 Leucine rich repeat C-terminal domain 9.8e-11
Pfam PF07974.8 EGF-like domain 0.0062
Pfam PF12661.2 Human growth factor-like EGF 2.7e-11
Pfam PF00560.28 Leucine Rich Repeat 4.7e-26
Pfam PF13855.1 Leucine rich repeat 1.1e-85
Pfam PF07725.7 Leucine Rich Repeat 0.44
Pfam PF12946.2 MSP1 EGF domain 1 0.0061
Pfam PF02210.19 Laminin G domain 1.8e-10
Pfam PF00054.18 Laminin G domain 4.4e-14
Pfam PF00008.22 EGF-like domain 1.4e-30
Pfam PF13504.1 Leucine rich repeat 1.1
Pfam PF01462.13 Leucine rich repeat N-terminal domain 4.3e-16
Pfam PF13516.1 Leucine Rich repeat 0.011

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
M. musculus ENSMUSG00000031558
H. sapiens ENSP00000315005
A. gambiae AGAP002792
A. aegypti AAEL009175
D. melanogaster FBgn0264089
H. sapiens ENSP00000422591
H. sapiens ENSP00000384890
H. sapiens ENSP00000332164
H. sapiens ENSP00000427548
H. sapiens ENSP00000266058
A. gambiae AGAP002793
P. vanderplanki Pv.09707
H. melpomene HMEL007794-PA
N. vitripennis NV10549-PA
M. musculus ENSMUSG00000056427
H. sapiens ENSP00000360080
A. mellifera GB19929-PA
S. invicta SI2.2.0_14174
C. quinquefasciatus CPIJ004328
S. invicta SI2.2.0_07851
H. sapiens ENSP00000422261
H. sapiens ENSP00000273739
H. sapiens ENSP00000360109
M. musculus ENSMUSG00000025020
P. humanus PHUM419730-PA
H. sapiens ENSP00000405183
T. castaneum TC000214
D. plexippus DPOGS211056PA
B. mori BGIBMGA009611-TA
H. sapiens ENSP00000430333