MidgeBase gene description page [Pn.04102]

Outline

Link to gbrowse

Gene ID Pn.04102
Type Protein coding gene
Scaffold PnScaf3364
Start 13567
End 17530
Direction +

Sequence

Transcript: 1386 (bp)

 ATGGCTTTGGAAGATTCAAGATACTGCTATAGTCCAACGGCTGAGTCCAACAGCCCGGCACTGCCCAACTCGCCGAATAGTTTTTCACGAAACTCATCAACAGCTTCATCGCCGACTACAGTTTGTGATATTAAAATGAGCTCAAAGCGCGGCTTAGTCGTCGTCGACGATGAAGACATGGACATAAATGTCGACGACGACGACGACGATGAAATCTACGAAAAGAAGTTCAAAGCCAGCAAGTCCGACAGCCTCAACATCGGCAAGTTCAGCTTCTCGATAACGAACATCCTGAGCGATGCGTTCGGACCCAAGACATCGCCCACAGCTGCCAACAACCACAGTCAGCACGTCATCAAGACGGAGAGCTGCGACCCGAGCGACCGAATCTTTCGGCCCTTCGAGATCAAAAACTTCATCTGCAACAGTGCAAACAACTCAGCGAGCAACGGCCACAGCAACGCGCGAGCGTTCATGCAAAATCTCAGCAACCCATCTTCCGTGTTCTTGAACAGCTTTCGGCTCTCCGACATTTTCGACTACAGTACAAAGAGTTCAGCGTCAGAGAACAACCACAACAACAACAACAACAACAACAACAACAACAACAACAATAATAATAATAGTTTTAATAGTATTAGTAACAACAACAACAACAAAAGTGATAATAGCCTAAGAAACAGCCTCTACAGCAACTTCTCGTCCTACCCGAAGATCCAGGAGGAGATCTTCCAGAGCAGTCACCGAAAGTTCGCGCCATCGCCGGCGAGTTCGGCGCTCAAAATTCCGACAGCGATCGGCGGCCTCTGCAAGACGATCTCGCAGATCGGACAGGAAACCTCGCCGCCGCCACTATCGACATCGTCGGCATCGACGACTTCGTCAACGAAGAGCGGCTCAGTTGACACGCTCAAGCTGCAGCAGTCGTCGACCGACAGCCTCGACTCGGACGACTGTCAATCGGAGGCTAAGAAAGATGAAAGCAAGATGTGGCCGGCATGGATTTTCTGCACTCGCTATTCGGACCGGCCGAGTTCTGGTCCTCGCTACCGTCGGCCAAAGGAGAAGAAGGAAAAGGGAGCGGAGGACGAGAAGCGACCGAGAACGGCGTTTTCGAATGAGCAGCTAGCAAGACTAAAGAGAGAGTTTAACGAAAATCGATATTTGACTGAGAAGAGACGACAACAGTTGAGCTCGGAGCTGGGCTTAAATGAGGCGCAAATTAAAATATGGTTCCAGAACAAGCGGGCCAAGATCAAGAAGACATCTGGAACGAAGAATGCGCTCGCTCTTCAGCTGATGGCGCAAGGTCTATACAATCACACGACAGTGCCGCTGACAAAGGAAGAGGAAGAATTAGAGCTGAGGATGAATGGAAAGCTGCCG 

Protein: 462 (aa)

 MALEDSRYCYSPTAESNSPALPNSPNSFSRNSSTASSPTTVCDIKMSSKRGLVVVDDEDMDINVDDDDDDEIYEKKFKASKSDSLNIGKFSFSITNILSDAFGPKTSPTAANNHSQHVIKTESCDPSDRIFRPFEIKNFICNSANNSASNGHSNARAFMQNLSNPSSVFLNSFRLSDIFDYSTKSSASENNHNNNNNNNNNNNNNNNNSFNSISNNNNNKSDNSLRNSLYSNFSSYPKIQEEIFQSSHRKFAPSPASSALKIPTAIGGLCKTISQIGQETSPPPLSTSSASTTSSTKSGSVDTLKLQQSSTDSLDSDDCQSEAKKDESKMWPAWIFCTRYSDRPSSGPRYRRPKEKKEKGAEDEKRPRTAFSNEQLARLKREFNENRYLTEKRRQQLSSELGLNEAQIKIWFQNKRAKIKKTSGTKNALALQLMAQGLYNHTTVPLTKEEEELELRMNGKLP 
Type Start End Length
CDS 13567 13690 124
CDS 13985 14862 878
CDS 14941 14977 37
CDS 16904 17004 101
CDS 17184 17341 158
CDS 17440 17527 88
intron 13691 13984 294
intron 14863 14940 78
intron 14978 16903 1926
intron 17005 17183 179
intron 17342 17439 98

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001649162 engrailed [Aedes aegypti] gb|EAT33107.1| engrailed [Aedes aegypti] 1e-73
InterPro IPR019737 Homeobox engrailed-type, conserved site
InterPro IPR000047 Helix-turn-helix motif
InterPro IPR019549 Homeobox engrailed, C-terminal
InterPro IPR001356 Homeodomain
InterPro IPR017970 Homeobox, conserved site
InterPro IPR000747 Homeodomain engrailed
InterPro IPR020479 Homeodomain, metazoa
InterPro IPR009057 Homeodomain-like
Gene Ontology(BP) GO:0007275 multicellular organismal development
Gene Ontology(BP) GO:0006355 regulation of transcription, DNA-dependent
Gene Ontology(CC) GO:0005634 nucleus
Gene Ontology(MF) GO:0003677 DNA binding
Gene Ontology(MF) GO:0043565 sequence-specific DNA binding
Gene Ontology(MF) GO:0000976 transcription regulatory region sequence-specific DNA binding
Gene Ontology(MF) GO:0003700 sequence-specific DNA binding transcription factor activity
Pfam PF07150.6 Protein of unknown function (DUF1390) 0.066
Pfam PF10525.4 Engrailed homeobox C-terminal signature domain 2.7e-16
Pfam PF00046.24 Homeobox domain 5e-22

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
C. quinquefasciatus CPIJ019925
A. gambiae AGAP008023
D. melanogaster FBgn0000577
A. aegypti AAEL014635
B. mori BGIBMGA009644-TA
C. quinquefasciatus CPIJ010220
P. humanus PHUM030340-PA
H. sapiens ENSP00000297375
B. mori BGIBMGA009798-TA
M. musculus ENSMUSG00000039095
P. vanderplanki Pv.16904
S. invicta SI2.2.0_03235
P. humanus PHUM030230-PA