MidgeBase gene description page [Pn.04073]

Outline

Gene ID    Pn.04073
Type       Protein coding gene
Scaffold   PnScaf3334
Start      39842
End        41643
Direction  +

Sequence

Transcript: 1200 (bp)

 ATGAGTGCCTACGAGAACAATGAGGAGATACAGCTGTTCCGGGAATATCTACAGATTCCGAGCGTGCATCCAAACATCAACTACGAGCCGTGCATCGAATTTTTGAAGCGCCAAGCGAGCGACTTGAACCTCGACTACAAAGTCGAGTACCCGAAAAGTGCAAAGAAGCCTGTCTTCATTCTCACATGGCCTGGAACGCAGCCCGAATTGCCATCCATCATTCTGAACTCGCACATGGATGTCGTTCCGGTTTTCGAAGAGTTTTGGACGCACAAGCCCTTCTCGGCCGACATCGATGCCGAGGGAAAGATTTTCGCTCGAGGATCGCAGGACATGAAGTGCGTCGGCATGCAGTATCTCGCCGCCCTGAGACACTTCAGGAGAAACAACGTGCAGTTCAAGCGAACGATTCACGCGTGCTTTCTACCCGAGGAGGAAGTTGGTGGCGTGGAGGGCATGCGAGACTTCATCAACACCGAAGAATTCAAAAATCTAAACGCCGGATTTTCTCTCGACGAAGGCATCGCCAGCCCAGACGAAGTCTTCAACGTTTTCTACGCAGAGCGCTCGATTTGGCACGTTGAATTTGCAGTGCCCGGCAATCCGGGCCATGGCTCGCTTTTGCTAAAGAACACTGCTGGCGAGAAGCTCGAGCGCCTTCTCAATCGCTTCATCGAGTACCGCGACTCGCAGGTGAAGCGACTCGAAGACGATCCCAGTTTGCTGATCGGAGATGTGACGACTGTGAATGTCACGATGATTCACGGAGGCGTGCAATCGAACGTCATTCCACCGGAGTATAAGATGATGGTGGATATCCGACTGGCGCTCGATGTCGACCACGTTGAGTTCGAAAACATGTTCAAGAAGTGGTGCGAAGAGGCTGGCGAAGGAATCGCTTATTCCTTCGAACAAAAACAGCCGAAAGTCGCGGCCACAAAAACTGATGCCAGTAACCGTTACTTCACTGCGTTCAAGTCTGCTGTTGATGAGCTCGGACTCGACATCAAGCTCCAAGTTTTTCCGGGTGGCACTGACTCAAGGTACATCCGTGGAGTTGGAATTCCAGCAATTGGATTCAGTCCGATGAACCACACGCCCGTTTTGCTGCATGATCACGATGAGTTCTTGCGTGCTGACATTTACTTGAAGGGGATCGAAATTTACAAGAAGATTCTTGAAAAAGTTTGCAACCTGGAT 

Protein: 400 (aa)

 MSAYENNEEIQLFREYLQIPSVHPNINYEPCIEFLKRQASDLNLDYKVEYPKSAKKPVFILTWPGTQPELPSIILNSHMDVVPVFEEFWTHKPFSADIDAEGKIFARGSQDMKCVGMQYLAALRHFRRNNVQFKRTIHACFLPEEEVGGVEGMRDFINTEEFKNLNAGFSLDEGIASPDEVFNVFYAERSIWHVEFAVPGNPGHGSLLLKNTAGEKLERLLNRFIEYRDSQVKRLEDDPSLLIGDVTTVNVTMIHGGVQSNVIPPEYKMMVDIRLALDVDHVEFENMFKKWCEEAGEGIAYSFEQKQPKVAATKTDASNRYFTAFKSAVDELGLDIKLQVFPGGTDSRYIRGVGIPAIGFSPMNHTPVLLHDHDEFLRADIYLKGIEIYKKILEKVCNLD 

Type    Start  End    Length
CDS     39842  39926  85
CDS     40394  41303  910
CDS     41372  41458  87
CDS     41523  41640  118
intron  39927  40393  467
intron  41304  41371  68
intron  41459  41522  64
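
The Length column follows from the coordinates: both Start and End are included, so each length is End - Start + 1. As a minimal sketch (the `FEATURES` list and the helper names are illustrative and not part of MidgeBase), the following Python re-derives the lengths from the rows above and compares the CDS total with the 1200 bp transcript and the 400 aa protein reported on this page.

```python
# Feature rows copied from the table above: (type, start, end, reported_length).
FEATURES = [
    ("CDS", 39842, 39926, 85),
    ("CDS", 40394, 41303, 910),
    ("CDS", 41372, 41458, 87),
    ("CDS", 41523, 41640, 118),
    ("intron", 39927, 40393, 467),
    ("intron", 41304, 41371, 68),
    ("intron", 41459, 41522, 64),
]


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval whose start and end are both included."""
    return end - start + 1


def total_length(feature_type: str) -> int:
    """Sum the recomputed lengths of all features of one type."""
    return sum(span_length(s, e) for t, s, e, _ in FEATURES if t == feature_type)


# Rows whose reported Length differs from End - Start + 1 (expected: none).
mismatches = [row for row in FEATURES if span_length(row[1], row[2]) != row[3]]

cds_total = total_length("CDS")
intron_total = total_length("intron")

print("mismatched rows:", mismatches)
print("CDS total (bp):", cds_total)
print("codons:", cds_total // 3)
print("CDS + introns (bp):", cds_total + intron_total)
```

The four CDS segments sum to 1200 bp, which is 400 codons, matching the transcript and protein lengths given above. CDS and introns together cover 39842 to 41640 (1799 bp), three bases short of the gene End at 41643. This would be consistent with a stop codon lying outside the listed CDS, although the page does not state that.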

Auto annotation result

Program/Analysis   Accession     Description                                                                     Score/Expectation
BLASTP/NCBI-nr     XP_002012859  GL23825 [Drosophila persimilis] gb|EDW23845.1| GL23825 [Drosophila persimilis]  1e-153
InterPro           IPR002933     Peptidase M20
InterPro           IPR010159     N-acyl-L-amino-acid amidohydrolase
InterPro           IPR011650     Peptidase M20, dimerisation
Gene Ontology(BP)  GO:0008152    metabolic process
Gene Ontology(BP)  GO:0006520    cellular amino acid metabolic process
Gene Ontology(CC)  GO:0005737    cytoplasm
Gene Ontology(MF)  GO:0016787    hydrolase activity
Gene Ontology(MF)  GO:0004046    aminoacylase activity
Pfam               PF01546.23    Peptidase family M20/M25/M40                                                    2.4e-49
Pfam               PF07687.9     Peptidase dimerisation domain                                                   1.1e-12
Pfam               PF04389.12    Peptidase family M28                                                            0.0095

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.04072

Orthologous genes

Species              Gene ID
H. sapiens           ENSP00000419262
D. plexippus         DPOGS213685PA
P. vanderplanki      Pv.10101
H. sapiens           ENSP00000390557
H. melpomene         HMEL010757-PA
P. humanus           PHUM047660-PA
T. castaneum         TC012529
C. quinquefasciatus  CPIJ013671
D. melanogaster      FBgn0037818
H. sapiens           ENSP00000384296
P. vanderplanki      Pv.10102
H. sapiens           ENSP00000420487
N. vitripennis       NV16582-PA
C. quinquefasciatus  CPIJ013670
M. musculus          ENSMUSG00000023262
T. castaneum         TC012531
A. mellifera         GB12125-PA
A. gambiae           AGAP000679
T. castaneum         TC012530
D. melanogaster      FBgn0039050
D. plexippus         DPOGS209240PA
D. melanogaster      FBgn0039052
S. invicta           SI2.2.0_00964
H. sapiens           ENSP00000417056
A. aegypti           AAEL011206
B. mori              BGIBMGA010352-TA
D. melanogaster      FBgn0039053
C. quinquefasciatus  CPIJ009804
D. melanogaster      FBgn0039049
C. quinquefasciatus  CPIJ013669
D. melanogaster      FBgn0039051