MidgeBase gene description page [Pn.09008]
Outline
Gene ID | Pn.09008 |
Type | Protein coding gene |
Scaffold | PnScaf8961 |
Start | 5434 |
End | 9118 |
Direction | - |
Sequence
Transcript: 1887 (bp)
ATGAAGCAGCTGATTTTGGGACTCGCTTTGATCGCGAGCGTTTCGTGTCAAATCTACACGAAAATTCAAGCCGATGGCTATCACTATGACAAGCCTCAGCCGCAGCAGCGACTCGACATCCCCGAGAAGCCAGCCTGCCCGAACGGTGGCAGCGGAGAATTTTGCTGCATAAACGGTGCCGACAACGCCGACTGCTGCACCAATGGCGGATCTGGACCCCACTGCTGCACCAACGGAGCAGACAACGCCGACTGCTGCGAGAACGGCGGAAGCGGCAAATTTTGCTGCGCTAATGGCGCAGATAATGCAGATTGCTGCACAAATGGAGGCAGCGGGCCTTTTTGCTGCACCAACGGTGCCGATAATGCCGACTGCTGCGAGAATGGCGGAAGTGGAAAGTTCTGCTGTGCGAATGGCGCGGACAACGTTGACTGCTGCGAAAACGGTGGCAGCGGCCCGCATTGCTGCACAAACGGAGCTACAAATCAGTTCTGCTGCACCAACGGCGCCGACAACGAAAACTGCGAAATTCCGCGACCTTTCGAGCCGGCACCGACGCCAGCCGAGCCGGAGCCCACACTGCCGCCTCTGCCTTCGTTCCTCCCTCAAATCAACCTCCCGAAAATCATTCCCGTGAAGCCGGCAGAGGAGCCGCAGCCCGAAATCGACATCCGAAGCAAGCCAGTGGTGGTGGAGCCCGCGCCCTTCTCACCGCAGCCAGCCCCACGAGTTTTCGTCTCCGAGAGCACCACAAAGTCGCCCAACGAGTACCTGCCGCCACTGGAGGACGAAGCTTCCGAGTGGGACGCGTGGAAGGCAAAATTCCAGAAGCGCTACGACACCGCCGAGGAGGACGAGCGCCGTCGCTTGATTTTCGAGGAAAACGTTAAGAAAATTAACCTCCACAACTCGGACTTCGCTGCCGGCTTGGTGGCTTTCGATCTCGATGTCAACAAATTTGCTGATTTAGAGCAAGACGAGTTCCTGACCATCCACACCGGCCTGCGTCGAAGAGGTCGCTCTGCAGCGCAGTCACCGCATCGCTTCTACGGCGCTCCCTACTCCTCTTCCTCTCGCGCTGGCTTCAACGTTGAGCTCGGCTTCGAACAAAATTTGCTCTCCGGCTCCTCCTCCTCCTCCTCTTCTTCTTCTTCCTCTGCGTCAGTCGCAGCGGCAGCGGCGGCCCAAGGACAAAACATGATGAACGTCTTCATGCCATCAGCTTCGCTCGCATCCGAAGTCAAAGATGAAGTCGATTGGAGAAAGGAAGGTGCAATCACACCGGTGAAGAATCAAGGTAATTGCGCCAGTTGCTGGGCGTTTTCAGCTAATGGCGCCCTCGAAGCTCACAACTTCCTTAGAAAGCGCACAGGACCGATTCCACTGAGCGAACAGAACCTCATCGATTGCGTCAAAGAAAATGACGGATGCGATGGCGGCTACATGACAAATGCCTACGAGTATGCAGCGAGAAATCCGGGTGTCGATACGGAGGCTTCATATCCGTATGAGGCGCGAAACAGCACGTGTCGATTCAAGCGCGAAAACGTCGGCGGCGAGTGCATGACGCACATGGAGATCCAGATCGGAAACGAGGAAGCGCTGCAGCAGGCCGTCGCCACCGTCGGTCCGGTCGCGGCTGGCATTGACGGTGCCCAGCGCTCGTTCCAGTTCTACAAGTCGGGCTATTACTACGAGCCGAAGTGCGAGCAAGACGTGAATCATGCTGTTCTGATTGTGGGCTATGGAAAGACGGAGAGCGGCGAGGAGTACTGGATCTGCAAGAACTCGTGGGACACCGACTGGGGCGAAGAAGGCTACATCCGAATGGCCAAGAACCGAAAGAACCACTGCGGCATCACCAACCTGGCCAGCTACCCGATTGTC
Protein: 629 (aa)
MKQLILGLALIASVSCQIYTKIQADGYHYDKPQPQQRLDIPEKPACPNGGSGEFCCINGADNADCCTNGGSGPHCCTNGADNADCCENGGSGKFCCANGADNADCCTNGGSGPFCCTNGADNADCCENGGSGKFCCANGADNVDCCENGGSGPHCCTNGATNQFCCTNGADNENCEIPRPFEPAPTPAEPEPTLPPLPSFLPQINLPKIIPVKPAEEPQPEIDIRSKPVVVEPAPFSPQPAPRVFVSESTTKSPNEYLPPLEDEASEWDAWKAKFQKRYDTAEEDERRRLIFEENVKKINLHNSDFAAGLVAFDLDVNKFADLEQDEFLTIHTGLRRRGRSAAQSPHRFYGAPYSSSSRAGFNVELGFEQNLLSGSSSSSSSSSSSASVAAAAAAQGQNMMNVFMPSASLASEVKDEVDWRKEGAITPVKNQGNCASCWAFSANGALEAHNFLRKRTGPIPLSEQNLIDCVKENDGCDGGYMTNAYEYAARNPGVDTEASYPYEARNSTCRFKRENVGGECMTHMEIQIGNEEALQQAVATVGPVAAGIDGAQRSFQFYKSGYYYEPKCEQDVNHAVLIVGYGKTESGEEYWICKNSWDTDWGEEGYIRMAKNRKNHCGITNLASYPIV
Type | Start | End | Length |
CDS |
5437 |
5548 |
112 |
CDS |
5696 |
5846 |
151 |
CDS |
5903 |
6109 |
207 |
CDS |
6175 |
6260 |
86 |
CDS |
6417 |
6450 |
34 |
CDS |
6575 |
7657 |
1083 |
CDS |
7720 |
7798 |
79 |
CDS |
7854 |
7985 |
132 |
CDS |
9116 |
9118 |
3 |
intron |
5549 |
5695 |
147 |
intron |
5847 |
5902 |
56 |
intron |
6110 |
6174 |
65 |
intron |
6261 |
6416 |
156 |
intron |
6451 |
6574 |
124 |
intron |
7658 |
7719 |
62 |
intron |
7799 |
7853 |
55 |
intron |
7986 |
9115 |
1130 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_002111005 |
expressed hypothetical protein [Trichoplax adhaerens] gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens] |
8e-77 |
InterPro |
IPR013128 |
Peptidase C1A, papain |
|
InterPro |
IPR013201 |
Proteinase inhibitor I29, cathepsin propeptide |
|
InterPro |
IPR025661 |
Cysteine peptidase, asparagine active site |
|
InterPro |
IPR025660 |
Cysteine peptidase, histidine active site |
|
InterPro |
IPR000668 |
Peptidase C1A, papain C-terminal |
|
Gene Ontology(BP) |
GO:0006508 |
proteolysis |
|
Gene Ontology(MF) |
GO:0008234 |
cysteine-type peptidase activity |
|
Pfam |
PF03051.10 |
Peptidase C1-like family |
0.002 |
Pfam |
PF08246.7 |
Cathepsin propeptide inhibitor domain (I29) |
3.2e-14 |
Pfam |
PF10500.4 |
Nuclear RNA-splicing-associated protein |
4.9 |
Pfam |
PF00112.18 |
Papain family cysteine protease |
1.6e-76 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
P. vanderplanki |
Pv.08169 |