MidgeBase gene description page [Pn.07096]

Outline

Gene ID: Pn.07096
Type: Protein coding gene
Scaffold: PnScaf6348
Start: 1419
End: 6453
Direction: -

Sequence

Transcript: 2613 (bp)

 ATGCTGAGCCTGCGAACCCTCAGGAAGGACCTCAAGACACTCAGACGAACCTTCGACCGGAATGTGGAGCAAATCACGAGCTTCAGCGAGAACTTCAGGAATGCGACGTTCACGCTGACCGTCAACAGGTTCGCCATTCTGGACGACAAGGACCGACGCAGGCTGACGAGCTTCAGGTCCTCGATTCTCGGGAAGGTCGCGCACAAGGTCGAGACCGTCTCGATGAGGTCCGACACGCCTTGCCTGCCGAGGGAGCTGGACTGGCGGGAGTACGGCGCGGTGACGCCCGTTCAGGACCAAGGATTCTATTGCGGCTCTTGCTGGGCTTTCTCGGCGACGGGGCACCTGGAAGGCATCATTTCGATATCGACGGGCGAGCCAGCTGTGAAGCTCTCCGACCAGCAATTTATTGACTGCAACTACAACAAGCTTATCGGGAACTTTGGGTGTGATGGAGGCCAAATGTCGCTCGCCCTCTCCTACGCCCTCAAAAAGGACATAACAACGGCCGACACATACCCGTATGCCGACTCGAGGGGCGAATGCGCCTACAGCAAGCCACAGACGAGGTTCAATTTTTCGCAGCCGGTCATGCTGCCGGCGGGCGACGAGGAGGCTCTGAAGGTCGCTGTTGCCACCTCTGGACCGATCGCAGTAGCCATCGACGCATCGCGCGAGTCCTTCTTCTACTACTCCGAAGGTGTCTACTTCGACCCCAACTGCACGCAGTGGATCAACCATGCGGTACTGGTAGTCGGGTACGGAACTGATCCTGTTGGAGGTGACTACTGGCTCATCAAGAACTCCTGGAGCGAGTCCTGGGGCATGCAGGGCTACATGAAGCTCGCTCGCAACCGCGACAACCACTGCGGCATCTCCAAGCTCCATAAATCAGAGGATAAGAGCGTCATTTTTGCGAGCTATAAGGCGCGCAAGAGCGAGGCCGTCGCCGTAGTGCTCCGCAAAATGTTGGCGCATTTGATTGTGGCTTCGGCAGCCTTCGCGCTTGTGTCTTCCCAAGAGCAAATCGAGCTCGAGTGCGTGTTTGAGAGTCAGACCAACTTTGAGCGCGAGACGTACACTTGCGACGTCCAGTCGGACCTTCCGTCCCTCGAGCCGGCCTTCATTGCCGATGTCGGCGGGGCGCACACGGCGGGCCAGGATGTGAGCCAAGTCTTCGGCATCGCGATGGTAGACAGAGTCATTTACCGCATACCAAGCAACCTCCATCTGCAGTTTCCGGCTGCCACTGAGATTAAAATTGACGGGGCGCCCATGCGACATTTGCGGGCCAGCGACTTGGCTGGTTTCAAGGCAGTCCTGATTTTTTTCGCCCTGACCTGCGCGAGACTCGAGGTCATCGAGGCCGGCACTTTCGAGGGATTCGAGCGGCTGGAGAGAGTTGACTTGAGCAACAACAACATCACCCTCATCGAGGCAGGGACATTCTCGAACGTCCCGCAACTTGCTTTCCTCGACATCACCGCGAGCCTCTGCAACGACCCAGATCTTCTCACCGCCTCGTCGACCGCTGCCTCCATCGCTGACTTCCTCGCCCAACTCGAATCCTCTGCGTGCATCGCGTCGGCCGACGTCGTAGCTCGCGTGCTGCCCTCGATTCTCGAGAAGGTGGAGCTGGACGTGCAGATTCGGTCGCAAAACAGCGACCTTGAGGCGCAAATCGCCGAAATCGCGTCGCAAACCTCGCAAATTTCCGAAGCAACCGCGCAACTCGAGACGTGCACGAGTCGGGAGGCGGAGCTGACGGCGGCCATTGACAGCCTCAGCAACCGGCTCGCGACGCTCGAGGAGCTCGAGACCGCGAACGAGACGTTGACGGAGAATCTCGGCATCTGCCAGACGGCGCTCGACCAGCGCACGGCCGAGAACGAGCGCCTCGAAGACGAGCTGAGGGAGCTCGAGGGCGAGGCCGAGAAGTGCCGCGAGGCCAACGGCACGTGCCGCTTCGTCGACGACCCGACCTACGGCTACACGTGCC
TCGGCCACGACGTCCGTGTGAGCGCCGAGAGCGACCGCGTCGAGTGGGGCGGCACGCACTTGCGTGGTCGTGCGGACGACGCCGTGCGCGGCTTGATTTTGCGCGGACTCGAGGTCGTCTTCGTGCCGAGGAAGATTTCCGAGGTCTTTGGACGACTCGAGGCGCTCGTCGTCACCGGCTGCGGGCTGCGCAGCATCGAGAAGCGCGACCTCGACGGGCTCGACAGACTCACGACGCTGCAGGTCTCCGACAACCAAATTTCGAGCATCCAAGCGGGCAGCTTCGACGAGGTTTTGCTTCTGCAGACGCTCGACCTGTCCTTCAACGAAATTTCGTCGCTGCCGACGAGAGCTTTTGCGAACCTCGCGAGACTCGCGCACATCGACCTGAGCAACAACCGATTGACGAATATCAGATTCGACGCGATTCCGGCGACCAACGGCATCACCAGCTTCCTGGCCACGAGCAATCAGCTGAGAAGTGTCGACGTCTCGCTCGTGTGGCGACTCAATCGCGCATCGCTGATCGACTTCCGCGGAAATGCGTGCAACTTCAACTACGACAGCGGAAGTGGCAGCTTTTTGGCGTTTTACAACAGCATTCTTGCGAGCTGC 

Protein: 871 (aa)

 MLSLRTLRKDLKTLRRTFDRNVEQITSFSENFRNATFTLTVNRFAILDDKDRRRLTSFRSSILGKVAHKVETVSMRSDTPCLPRELDWREYGAVTPVQDQGFYCGSCWAFSATGHLEGIISISTGEPAVKLSDQQFIDCNYNKLIGNFGCDGGQMSLALSYALKKDITTADTYPYADSRGECAYSKPQTRFNFSQPVMLPAGDEEALKVAVATSGPIAVAIDASRESFFYYSEGVYFDPNCTQWINHAVLVVGYGTDPVGGDYWLIKNSWSESWGMQGYMKLARNRDNHCGISKLHKSEDKSVIFASYKARKSEAVAVVLRKMLAHLIVASAAFALVSSQEQIELECVFESQTNFERETYTCDVQSDLPSLEPAFIADVGGAHTAGQDVSQVFGIAMVDRVIYRIPSNLHLQFPAATEIKIDGAPMRHLRASDLAGFKAVLIFFALTCARLEVIEAGTFEGFERLERVDLSNNNITLIEAGTFSNVPQLAFLDITASLCNDPDLLTASSTAASIADFLAQLESSACIASADVVARVLPSILEKVELDVQIRSQNSDLEAQIAEIASQTSQISEATAQLETCTSREAELTAAIDSLSNRLATLEELETANETLTENLGICQTALDQRTAENERLEDELRELEGEAEKCREANGTCRFVDDPTYGYTCLGHDVRVSAESDRVEWGGTHLRGRADDAVRGLILRGLEVVFVPRKISEVFGRLEALVVTGCGLRSIEKRDLDGLDRLTTLQVSDNQISSIQAGSFDEVLLLQTLDLSFNEISSLPTRAFANLARLAHIDLSNNRLTNIRFDAIPATNGITSFLATSNQLRSVDVSLVWRLNRASLIDFRGNACNFNYDSGSGSFLAFYNSILASC 
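
The relationship between the transcript and the protein listed above can be spot-checked by translating the first few codons. The sketch below is a minimal illustration, assuming the standard genetic code (the page does not state which translation table was used); `CODON_TABLE` and `translate` are hypothetical names introduced here, and only the first 30 bases of the transcript are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed, not stated on the page).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string codon by codon; trailing partial codons are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the transcript shown above.
transcript_prefix = "ATGCTGAGCCTGCGAACCCTCAGGAAGGAC"
protein_prefix = translate(transcript_prefix)
print(protein_prefix)  # MLSLRTLRKD
```

The output matches the first ten residues of the 871 aa protein. The same function could be applied to the full 2613 bp transcript, which is exactly 3 x 871 bases long.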

Type Start End Length
CDS 1422 1793 372
CDS 2192 3019 828
CDS 3537 4068 532
CDS 4271 4360 90
CDS 4865 4910 46
CDS 5185 5476 292
CDS 5870 5998 129
CDS 6130 6453 324
intron 1794 2191 398
intron 3020 3536 517
intron 4069 4270 202
intron 4361 4864 504
intron 4911 5184 274
intron 5477 5869 393
intron 5999 6129 131
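
The coordinates in the table above can be cross-checked against the transcript and protein lengths reported on this page. The following is a small sketch, assuming 1-based inclusive coordinates (which is what the Length column implies); `span_length` and `tiles_without_gaps` are hypothetical helper names, not part of any MidgeBase tool.

```python
# Coordinates copied from the CDS and intron rows of the table above
# (assumed to be 1-based and inclusive).
CDS = [(1422, 1793), (2192, 3019), (3537, 4068), (4271, 4360),
       (4865, 4910), (5185, 5476), (5870, 5998), (6130, 6453)]
INTRONS = [(1794, 2191), (3020, 3536), (4069, 4270), (4361, 4864),
           (4911, 5184), (5477, 5869), (5999, 6129)]
GENE_START, GENE_END = 1419, 6453
TRANSCRIPT_BP, PROTEIN_AA = 2613, 871


def span_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def tiles_without_gaps(features):
    """True if the sorted intervals abut exactly, with no gap or overlap."""
    ordered = sorted(features)
    return all(nxt[0] == prev[1] + 1 for prev, nxt in zip(ordered, ordered[1:]))


cds_total = sum(span_length(s, e) for s, e in CDS)
intron_total = sum(span_length(s, e) for s, e in INTRONS)
gene_span = span_length(GENE_START, GENE_END)
uncovered = gene_span - (cds_total + intron_total)

print(cds_total == TRANSCRIPT_BP)          # CDS segments add up to the transcript
print(cds_total == 3 * PROTEIN_AA)         # three bases per residue
print(tiles_without_gaps(CDS + INTRONS))   # CDS and introns alternate without gaps
print(uncovered)                           # bases in the gene span outside CDS and introns
```

With these values the CDS segments sum to 2613 bp, the CDS and intron intervals tile 1422 to 6453 contiguously, and 3 bp of the gene span (1419 to 1421) fall outside any listed feature. On a minus-strand gene that would be consistent with a stop codon excluded from the CDS, though the page itself does not say so.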

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr ACH56225 cathepsin L-like cysteine proteinase [Bursaphelenchus xylophilus] 9e-60
InterPro IPR013128 Peptidase C1A, papain
InterPro IPR025661 Cysteine peptidase, asparagine active site
InterPro IPR000668 Peptidase C1A, papain C-terminal
InterPro IPR025660 Cysteine peptidase, histidine active site
InterPro IPR003591 Leucine-rich repeat, typical subtype
InterPro IPR001611 Leucine-rich repeat
Gene Ontology(BP) GO:0006508 proteolysis
Gene Ontology(MF) GO:0008234 cysteine-type peptidase activity
Gene Ontology(MF) GO:0005515 protein binding
Pfam PF13514.1 AAA domain 0.022
Pfam PF09177.6 Syntaxin 6, N-terminal 2.2
Pfam PF00560.28 Leucine Rich Repeat 9.6e-14
Pfam PF09486.5 Bacterial type III secretion protein (HrpB7) 4.4
Pfam PF13504.1 Leucine rich repeat 3.3e-05
Pfam PF12799.2 Leucine Rich repeats (2 copies) 3.1e-17
Pfam PF05531.7 Nucleopolyhedrovirus P10 protein 2.4
Pfam PF08246.7 Cathepsin propeptide inhibitor domain (I29) 0.014
Pfam PF13851.1 Growth-arrest specific micro-tubule binding 0.59
Pfam PF00112.18 Papain family cysteine protease 4.5e-68
Pfam PF13870.1 Domain of unknown function (DUF4201) 0.55
Pfam PF13306.1 Leucine rich repeats (6 copies) 1.9e-09
Pfam PF13855.1 Leucine rich repeat 3.1e-27
Pfam PF08317.6 Spc7 kinetochore protein 3.7
Pfam PF04636.8 PA26 p53-induced protein (sestrin) 0.08
Pfam PF06818.10 Fez1 2.2
Pfam PF12128.3 Protein of unknown function (DUF3584) 0.14
Pfam PF04977.10 Septum formation initiator 0.046
Pfam PF08286.6 Spc24 subunit of Ndc80 7.7
Pfam PF13516.1 Leucine Rich repeat 1.4e-06

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID