MidgeBase gene description page [Pn.01180]

Outline

Link to gbrowse

Gene ID Pn.01180
Type Protein coding gene
Scaffold PnScaf1158
Start 2789
End 5056
Direction +

Sequence

Transcript: 1368 (bp)

 ATGCGAATTTGGGTCGCGTTTTGCTTGGTTTTGTTGGCTTTGGTGCAGATAAAAGTGGGGAGCGGCTATGAAATTGTGCAAGTCGCTTCGGTGAAGGGCTTTAGGCGGGGCGAGTGCCGGGAGTGGCAGAGGTTTAAGAAAAAGTACAACAAAGTCTACGAGTCGAAGCGCGAAAACATTCGCCGGCGCCAAATCTTCATCGCCAACCTGCGCACCATTCGAGCGCACAACCAGAGATTCCTCGCCGGTTCCGAGGCCTACAACTGCTCCATCAACAGACACAGCGACCTCACGTCCGCCGAGTTTTCCGCTCGGTTTTTGGGCCTCAACGGACGTCTCATCGGCGGCCTCAACGCCGAGCGCAGCTTCGTCGACATGAAGGGCAGGAAGGTGTTTCGGGCGGTCAAGTACGACCGGGTCGCGAACTTTGTCGACTGGAGGCGGACGGCAGTTAGTCCCGTGAAAAGGCAGGGCTTTTGCGGCAGCTGCTGGGCGTTCTCGGCTGCTGGCGCGCTCGAGGGTCAAATTTACATCAAAACTCAAAAGCTGATCAACGTTTCGGCGCAGTACTTTGTTGACTGCGTGAGGACTGAGATGTCTGAGGGGTGCATCGGCGGCTCGATGGACGACGCATTTCAGTACTCGGCCGCCAACGGGATCGTCCTCGACAGCTTGTACCCGTACGAGGAGTCGGAATCGTGGCTGGGCTGCCAGAGCAGCAAGGAGAAGCTCAAAGGGGTGAAAATCAAGGGATTCGTTCAGCTGGCGAAGCAAACCGAGGAGGAGCTGAAGGCGGCCGTGTCGAATGTCGGCCCCATAAGCGTCGGCATCGACGCCAGCCTCAGGTCCATGCAGTTCTACGACTACGGCGTGTACTTCGACAAGAGCTGCAACGCCAGCAACATCAACCATGGCGTGCTGGTTGTTGGGTACGGAAGCGACACGTCCTTCGATCCACCGCAAGACTACTGGATCGTCAAAAATTCTTGGGGGTCGACACACGGCGAGCACGGATATATCAGAATAGCGCGAAACCGCGACAACCACTGCGGAATTGCGACGATGGCCAGCTATCCTTTGGTTTTTTGCCTGAAAATTAATAAATTTTGTTGTTACTACGTTTTTATTAACTCAATTTCTTGTTTGTCTGTTTGTCCGTTTGTCTGTCTGTCTGTTTGTCTGTCTGTCTATCTATTTGTCTGTCTGTCTGTCTGTCTGTCTGTCTATTTGTCTATCTGTCTGTCTGTCTGTCTGTCTGTCAGTCTGATTGTCTGTCTGTTTTTCTGTCTGTCTGTCTGTCTGTCTGTCTATTTGTCTATCTGTCTAGCTGTCTATAAGTCTGTCTGTCTGTCTGTCTGTCTGTCAGTC 

Protein: 456 (aa)

 MRIWVAFCLVLLALVQIKVGSGYEIVQVASVKGFRRGECREWQRFKKKYNKVYESKRENIRRRQIFIANLRTIRAHNQRFLAGSEAYNCSINRHSDLTSAEFSARFLGLNGRLIGGLNAERSFVDMKGRKVFRAVKYDRVANFVDWRRTAVSPVKRQGFCGSCWAFSAAGALEGQIYIKTQKLINVSAQYFVDCVRTEMSEGCIGGSMDDAFQYSAANGIVLDSLYPYEESESWLGCQSSKEKLKGVKIKGFVQLAKQTEEELKAAVSNVGPISVGIDASLRSMQFYDYGVYFDKSCNASNINHGVLVVGYGSDTSFDPPQDYWIVKNSWGSTHGEHGYIRIARNRDNHCGIATMASYPLVFCLKINKFCCYYVFINSISCLSVCPFVCLSVCLSVYLFVCLSVCLSVYLSICLSVCLSVSLIVCLFFCLSVCLSVYLSICLAVYKSVCLSVCLSV 
Type Start End Length
CDS 2789 2926 138
CDS 3298 3764 467
CDS 4020 4336 317
CDS 4398 4446 49
CDS 4538 4542 5
CDS 4609 4714 106
CDS 4768 5053 286
intron 2927 3297 371
intron 3765 4019 255
intron 4337 4397 61
intron 4447 4537 91
intron 4543 4608 66
intron 4715 4767 53

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001655999 cathepsin l [Aedes aegypti] gb|EAT45919.1| cathepsin l [Aedes aegypti] 2e-73
InterPro IPR013128 Peptidase C1A, papain
InterPro IPR013201 Proteinase inhibitor I29, cathepsin propeptide
InterPro IPR025660 Cysteine peptidase, histidine active site
InterPro IPR000668 Peptidase C1A, papain C-terminal
InterPro IPR000169 Cysteine peptidase, cysteine active site
Gene Ontology(BP) GO:0006508 proteolysis
Gene Ontology(MF) GO:0008234 cysteine-type peptidase activity
Pfam PF03051.10 Peptidase C1-like family 0.00044
Pfam PF08246.7 Cathepsin propeptide inhibitor domain (I29) 4e-12
Pfam PF00112.18 Papain family cysteine protease 6.6e-72

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID