MidgeBase gene description page [Pn.07096]
Outline
Gene ID | Pn.07096 |
Type | Protein coding gene |
Scaffold | PnScaf6348 |
Start | 1419 |
End | 6453 |
Direction | - |
Sequence
Transcript: 2613 (bp)
ATGCTGAGCCTGCGAACCCTCAGGAAGGACCTCAAGACACTCAGACGAACCTTCGACCGGAATGTGGAGCAAATCACGAGCTTCAGCGAGAACTTCAGGAATGCGACGTTCACGCTGACCGTCAACAGGTTCGCCATTCTGGACGACAAGGACCGACGCAGGCTGACGAGCTTCAGGTCCTCGATTCTCGGGAAGGTCGCGCACAAGGTCGAGACCGTCTCGATGAGGTCCGACACGCCTTGCCTGCCGAGGGAGCTGGACTGGCGGGAGTACGGCGCGGTGACGCCCGTTCAGGACCAAGGATTCTATTGCGGCTCTTGCTGGGCTTTCTCGGCGACGGGGCACCTGGAAGGCATCATTTCGATATCGACGGGCGAGCCAGCTGTGAAGCTCTCCGACCAGCAATTTATTGACTGCAACTACAACAAGCTTATCGGGAACTTTGGGTGTGATGGAGGCCAAATGTCGCTCGCCCTCTCCTACGCCCTCAAAAAGGACATAACAACGGCCGACACATACCCGTATGCCGACTCGAGGGGCGAATGCGCCTACAGCAAGCCACAGACGAGGTTCAATTTTTCGCAGCCGGTCATGCTGCCGGCGGGCGACGAGGAGGCTCTGAAGGTCGCTGTTGCCACCTCTGGACCGATCGCAGTAGCCATCGACGCATCGCGCGAGTCCTTCTTCTACTACTCCGAAGGTGTCTACTTCGACCCCAACTGCACGCAGTGGATCAACCATGCGGTACTGGTAGTCGGGTACGGAACTGATCCTGTTGGAGGTGACTACTGGCTCATCAAGAACTCCTGGAGCGAGTCCTGGGGCATGCAGGGCTACATGAAGCTCGCTCGCAACCGCGACAACCACTGCGGCATCTCCAAGCTCCATAAATCAGAGGATAAGAGCGTCATTTTTGCGAGCTATAAGGCGCGCAAGAGCGAGGCCGTCGCCGTAGTGCTCCGCAAAATGTTGGCGCATTTGATTGTGGCTTCGGCAGCCTTCGCGCTTGTGTCTTCCCAAGAGCAAATCGAGCTCGAGTGCGTGTTTGAGAGTCAGACCAACTTTGAGCGCGAGACGTACACTTGCGACGTCCAGTCGGACCTTCCGTCCCTCGAGCCGGCCTTCATTGCCGATGTCGGCGGGGCGCACACGGCGGGCCAGGATGTGAGCCAAGTCTTCGGCATCGCGATGGTAGACAGAGTCATTTACCGCATACCAAGCAACCTCCATCTGCAGTTTCCGGCTGCCACTGAGATTAAAATTGACGGGGCGCCCATGCGACATTTGCGGGCCAGCGACTTGGCTGGTTTCAAGGCAGTCCTGATTTTTTTCGCCCTGACCTGCGCGAGACTCGAGGTCATCGAGGCCGGCACTTTCGAGGGATTCGAGCGGCTGGAGAGAGTTGACTTGAGCAACAACAACATCACCCTCATCGAGGCAGGGACATTCTCGAACGTCCCGCAACTTGCTTTCCTCGACATCACCGCGAGCCTCTGCAACGACCCAGATCTTCTCACCGCCTCGTCGACCGCTGCCTCCATCGCTGACTTCCTCGCCCAACTCGAATCCTCTGCGTGCATCGCGTCGGCCGACGTCGTAGCTCGCGTGCTGCCCTCGATTCTCGAGAAGGTGGAGCTGGACGTGCAGATTCGGTCGCAAAACAGCGACCTTGAGGCGCAAATCGCCGAAATCGCGTCGCAAACCTCGCAAATTTCCGAAGCAACCGCGCAACTCGAGACGTGCACGAGTCGGGAGGCGGAGCTGACGGCGGCCATTGACAGCCTCAGCAACCGGCTCGCGACGCTCGAGGAGCTCGAGACCGCGAACGAGACGTTGACGGAGAATCTCGGCATCTGCCAGACGGCGCTCGACCAGCGCACGGCCGAGAACGAGCGCCTCGAAGACGAGCTGAGGGAGCTCGAGGGCGAGGCCGAGAAGTGCCGCGAGGCCAACGGCACGTGCCGCTTCGTCGACGACCCGACCTACGGCTACACGTGCCTCGGCCACGACGTCCGTGTGAGCGCCGAGAGCGACCGCGTCGAGTGGGGCGGCACGCACTTGCGTGGTCGTGCGGACGACGCCGTGCGCGGCTTGATTTTGCGCGGACTCGAGGTCGTCTTCGTGCCGAGGAAGATTTCCGAGGTCTTTGGACGACTCGAGGCGCTCGTCGTCACCGGCTGCGGGCTGCGCAGCATCGAGAAGCGCGACCTCGACGGGCTCGACAGACTCACGACGCTGCAGGTCTCCGACAACCAAATTTCGAGCATCCAAGCGGGCAGCTTCGACGAGGTTTTGCTTCTGCAGACGCTCGACCTGTCCTTCAACGAAATTTCGTCGCTGCCGACGAGAGCTTTTGCGAACCTCGCGAGACTCGCGCACATCGACCTGAGCAACAACCGATTGACGAATATCAGATTCGACGCGATTCCGGCGACCAACGGCATCACCAGCTTCCTGGCCACGAGCAATCAGCTGAGAAGTGTCGACGTCTCGCTCGTGTGGCGACTCAATCGCGCATCGCTGATCGACTTCCGCGGAAATGCGTGCAACTTCAACTACGACAGCGGAAGTGGCAGCTTTTTGGCGTTTTACAACAGCATTCTTGCGAGCTGC
Protein: 871 (aa)
MLSLRTLRKDLKTLRRTFDRNVEQITSFSENFRNATFTLTVNRFAILDDKDRRRLTSFRSSILGKVAHKVETVSMRSDTPCLPRELDWREYGAVTPVQDQGFYCGSCWAFSATGHLEGIISISTGEPAVKLSDQQFIDCNYNKLIGNFGCDGGQMSLALSYALKKDITTADTYPYADSRGECAYSKPQTRFNFSQPVMLPAGDEEALKVAVATSGPIAVAIDASRESFFYYSEGVYFDPNCTQWINHAVLVVGYGTDPVGGDYWLIKNSWSESWGMQGYMKLARNRDNHCGISKLHKSEDKSVIFASYKARKSEAVAVVLRKMLAHLIVASAAFALVSSQEQIELECVFESQTNFERETYTCDVQSDLPSLEPAFIADVGGAHTAGQDVSQVFGIAMVDRVIYRIPSNLHLQFPAATEIKIDGAPMRHLRASDLAGFKAVLIFFALTCARLEVIEAGTFEGFERLERVDLSNNNITLIEAGTFSNVPQLAFLDITASLCNDPDLLTASSTAASIADFLAQLESSACIASADVVARVLPSILEKVELDVQIRSQNSDLEAQIAEIASQTSQISEATAQLETCTSREAELTAAIDSLSNRLATLEELETANETLTENLGICQTALDQRTAENERLEDELRELEGEAEKCREANGTCRFVDDPTYGYTCLGHDVRVSAESDRVEWGGTHLRGRADDAVRGLILRGLEVVFVPRKISEVFGRLEALVVTGCGLRSIEKRDLDGLDRLTTLQVSDNQISSIQAGSFDEVLLLQTLDLSFNEISSLPTRAFANLARLAHIDLSNNRLTNIRFDAIPATNGITSFLATSNQLRSVDVSLVWRLNRASLIDFRGNACNFNYDSGSGSFLAFYNSILASC
Type | Start | End | Length |
CDS |
1422 |
1793 |
372 |
CDS |
2192 |
3019 |
828 |
CDS |
3537 |
4068 |
532 |
CDS |
4271 |
4360 |
90 |
CDS |
4865 |
4910 |
46 |
CDS |
5185 |
5476 |
292 |
CDS |
5870 |
5998 |
129 |
CDS |
6130 |
6453 |
324 |
intron |
1794 |
2191 |
398 |
intron |
3020 |
3536 |
517 |
intron |
4069 |
4270 |
202 |
intron |
4361 |
4864 |
504 |
intron |
4911 |
5184 |
274 |
intron |
5477 |
5869 |
393 |
intron |
5999 |
6129 |
131 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
ACH56225 |
cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus] |
9e-60 |
InterPro |
IPR013128 |
Peptidase C1A, papain |
|
InterPro |
IPR025661 |
Cysteine peptidase, asparagine active site |
|
InterPro |
IPR000668 |
Peptidase C1A, papain C-terminal |
|
InterPro |
IPR025660 |
Cysteine peptidase, histidine active site |
|
InterPro |
IPR003591 |
Leucine-rich repeat, typical subtype |
|
InterPro |
IPR001611 |
Leucine-rich repeat |
|
Gene Ontology(BP) |
GO:0006508 |
proteolysis |
|
Gene Ontology(MF) |
GO:0008234 |
cysteine-type peptidase activity |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Pfam |
PF13514.1 |
AAA domain |
0.022 |
Pfam |
PF09177.6 |
Syntaxin 6, N-terminal |
2.2 |
Pfam |
PF00560.28 |
Leucine Rich Repeat |
9.6e-14 |
Pfam |
PF09486.5 |
Bacterial type III secretion protein (HrpB7) |
4.4 |
Pfam |
PF13504.1 |
Leucine rich repeat |
3.3e-05 |
Pfam |
PF12799.2 |
Leucine Rich repeats (2 copies) |
3.1e-17 |
Pfam |
PF05531.7 |
Nucleopolyhedrovirus P10 protein |
2.4 |
Pfam |
PF08246.7 |
Cathepsin propeptide inhibitor domain (I29) |
0.014 |
Pfam |
PF13851.1 |
Growth-arrest specific micro-tubule binding |
0.59 |
Pfam |
PF00112.18 |
Papain family cysteine protease |
4.5e-68 |
Pfam |
PF13870.1 |
Domain of unknown function (DUF4201) |
0.55 |
Pfam |
PF13306.1 |
Leucine rich repeats (6 copies) |
1.9e-09 |
Pfam |
PF13855.1 |
Leucine rich repeat |
3.1e-27 |
Pfam |
PF08317.6 |
Spc7 kinetochore protein |
3.7 |
Pfam |
PF04636.8 |
PA26 p53-induced protein (sestrin) |
0.08 |
Pfam |
PF06818.10 |
Fez1 |
2.2 |
Pfam |
PF12128.3 |
Protein of unknown function (DUF3584) |
0.14 |
Pfam |
PF04977.10 |
Septum formation initiator |
0.046 |
Pfam |
PF08286.6 |
Spc24 subunit of Ndc80 |
7.7 |
Pfam |
PF13516.1 |
Leucine Rich repeat |
1.4e-06 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes