MidgeBase gene description page [Pn.02693]

Outline

Link to gbrowse

Gene ID Pn.02693
Type Protein coding gene
Scaffold PnScaf2316
Start 144
End 2352
Direction -

Sequence

Transcript: 1371 (bp)

 ATGCCGACCATGCGAGTCCTCCTCCTGCTGCTCGCTATCGTCACCATCTACGTGGCCGTGCCCACTTCAGCGCAGCGAAGAACAACCACCAAAGCACCCAGAACGGTGAAGGCGAGCGCCAAATCCCGAGCAACCAACGGCACCGCTGCGACCAAGAAGAAGAGCGGCGCGACGACCGTCAAAGTCCAGGCAAAAAATAGCAAAAAGGCTACAACGGCGGCCATCAGCATTGGTGGCCAGAAGCTGAACGTCCGCGACTCCAACCGGAATATAACGACGGCCGAGGGCCGAACAACGCGCACCTTTACGACCCGCACGTGGACACCTCGCAGCACGCGCCGCAGAACCACGCGCCGCAGAACCACGCGCCAGCCGCGCAACTGCCGCGGCACCGGCTGCCAGCGCGACGAGGAGGCCGCCTTCGACGAGTTCAAGGCGAACTTCACCAAGTCGTACGCGGACAACGAGGCCGAGTGCAGGGCGCTGCACAACTTCTGCAAGACCTACCGCGCCGTCGAGCGCAACAACCGCAAGCCCAAGCGCAGCTTCCTCATGTCGATCCTCGCCAACGCCGACGAGTCGGACGAGGAGAAGGCGGCGGCGCGCGCGAGCAAGGGCGGGCTCGTCCTCGACCCGGAGGTCATGAAGAAGGTTAAGAAGGTGACCGTCGAGGAGGCGCTGAAGTCGCCCTACGGCCTCAAGCAGGCCAACGCCAAGAAGGCCAAGAAGGTGAGGCGCCAGTCGGGCAGCGAGACGAGCTATGCCGACTTGAGCGGGCCGGTGAAAGACCAAGGCGGTTGCGGCTGCTGCTGGAGCTTCTGCGTCGTCGCTTTGCTGCAGCTCCTCAACTACAATGAAAATGGCGTGAATGTGAGTCTCTCCGAGCAGAATGTCGTCGACTGCAACGATGCTGGGTCGGAGTGCAATGGAGGCAATCCAGCAACGGCCTTCGACTATGCGCAGAACAATGGCATCGCGATGCAGGCGCACTACCCTTACCGCAAGTCGCAGGCCTCCTGCAAGCGCGACAGAGTCGAGAGCGTCTTCCAGCCCTCGTCCGGGATTTGCTACGGCGTCGTCGACACCGAGGACGAGCTCGAGAGCCTGGTCGACAACTTTGGGGCGGTCCCGATCGCCATCGGCCTGTGCGACAGCCTCATGAGCTACAGCTCGGGAGTGTTCGATGACCCGGAGTGCGACGTGCAGGTGACGCACTGCGTGACGCTGGTTGGCTACGGCACGGACGAGTTCGGCGACAGCTACTGGATTATCAAGAACTCCTGGGGTCGCTACTGGGGCTACGGAGGTTATGGACGATTGAAGAGAGGCATCAACTCCTGCAGACTGACCAGCATGTTTTTCACCCCCTGC 

Protein: 457 (aa)

 MPTMRVLLLLLAIVTIYVAVPTSAQRRTTTKAPRTVKASAKSRATNGTAATKKKSGATTVKVQAKNSKKATTAAISIGGQKLNVRDSNRNITTAEGRTTRTFTTRTWTPRSTRRRTTRRRTTRQPRNCRGTGCQRDEEAAFDEFKANFTKSYADNEAECRALHNFCKTYRAVERNNRKPKRSFLMSILANADESDEEKAAARASKGGLVLDPEVMKKVKKVTVEEALKSPYGLKQANAKKAKKVRRQSGSETSYADLSGPVKDQGGCGCCWSFCVVALLQLLNYNENGVNVSLSEQNVVDCNDAGSECNGGNPATAFDYAQNNGIAMQAHYPYRKSQASCKRDRVESVFQPSSGICYGVVDTEDELESLVDNFGAVPIAIGLCDSLMSYSSGVFDDPECDVQVTHCVTLVGYGTDEFGDSYWIIKNSWGRYWGYGGYGRLKRGINSCRLTSMFFTPC 
Type Start End Length
CDS 147 233 87
CDS 360 422 63
CDS 490 791 302
CDS 1434 2352 919
intron 234 359 126
intron 423 489 67
intron 792 1433 642

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr NP_566920 putative cysteine proteinase [Arabidopsis thaliana] gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana] emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana] gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana] 1e-37
InterPro IPR013128 Peptidase C1A, papain
InterPro IPR025661 Cysteine peptidase, asparagine active site
InterPro IPR025660 Cysteine peptidase, histidine active site
InterPro IPR000668 Peptidase C1A, papain C-terminal
InterPro IPR000169 Cysteine peptidase, cysteine active site
Gene Ontology(BP) GO:0006508 proteolysis
Gene Ontology(MF) GO:0008234 cysteine-type peptidase activity
Pfam PF12685.2 SpoIIIAH-like protein 0.029
Pfam PF03051.10 Peptidase C1-like family 0.046
Pfam PF14009.1 Domain of unknown function (DUF4228) 5.1
Pfam PF08246.7 Cathepsin propeptide inhibitor domain (I29) 0.00013
Pfam PF02163.17 Peptidase family M50 0.0072
Pfam PF00112.18 Papain family cysteine protease 2.4e-51
Pfam PF10312.4 Conserved mid region of cactin 0.02

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID