MidgeBase gene description page [Pn.04102]
Outline
Gene ID | Pn.04102 |
Type | Protein coding gene |
Scaffold | PnScaf3364 |
Start | 13567 |
End | 17530 |
Direction | + |
Sequence
Transcript: 1386 (bp)
ATGGCTTTGGAAGATTCAAGATACTGCTATAGTCCAACGGCTGAGTCCAACAGCCCGGCACTGCCCAACTCGCCGAATAGTTTTTCACGAAACTCATCAACAGCTTCATCGCCGACTACAGTTTGTGATATTAAAATGAGCTCAAAGCGCGGCTTAGTCGTCGTCGACGATGAAGACATGGACATAAATGTCGACGACGACGACGACGATGAAATCTACGAAAAGAAGTTCAAAGCCAGCAAGTCCGACAGCCTCAACATCGGCAAGTTCAGCTTCTCGATAACGAACATCCTGAGCGATGCGTTCGGACCCAAGACATCGCCCACAGCTGCCAACAACCACAGTCAGCACGTCATCAAGACGGAGAGCTGCGACCCGAGCGACCGAATCTTTCGGCCCTTCGAGATCAAAAACTTCATCTGCAACAGTGCAAACAACTCAGCGAGCAACGGCCACAGCAACGCGCGAGCGTTCATGCAAAATCTCAGCAACCCATCTTCCGTGTTCTTGAACAGCTTTCGGCTCTCCGACATTTTCGACTACAGTACAAAGAGTTCAGCGTCAGAGAACAACCACAACAACAACAACAACAACAACAACAACAACAACAACAATAATAATAATAGTTTTAATAGTATTAGTAACAACAACAACAACAAAAGTGATAATAGCCTAAGAAACAGCCTCTACAGCAACTTCTCGTCCTACCCGAAGATCCAGGAGGAGATCTTCCAGAGCAGTCACCGAAAGTTCGCGCCATCGCCGGCGAGTTCGGCGCTCAAAATTCCGACAGCGATCGGCGGCCTCTGCAAGACGATCTCGCAGATCGGACAGGAAACCTCGCCGCCGCCACTATCGACATCGTCGGCATCGACGACTTCGTCAACGAAGAGCGGCTCAGTTGACACGCTCAAGCTGCAGCAGTCGTCGACCGACAGCCTCGACTCGGACGACTGTCAATCGGAGGCTAAGAAAGATGAAAGCAAGATGTGGCCGGCATGGATTTTCTGCACTCGCTATTCGGACCGGCCGAGTTCTGGTCCTCGCTACCGTCGGCCAAAGGAGAAGAAGGAAAAGGGAGCGGAGGACGAGAAGCGACCGAGAACGGCGTTTTCGAATGAGCAGCTAGCAAGACTAAAGAGAGAGTTTAACGAAAATCGATATTTGACTGAGAAGAGACGACAACAGTTGAGCTCGGAGCTGGGCTTAAATGAGGCGCAAATTAAAATATGGTTCCAGAACAAGCGGGCCAAGATCAAGAAGACATCTGGAACGAAGAATGCGCTCGCTCTTCAGCTGATGGCGCAAGGTCTATACAATCACACGACAGTGCCGCTGACAAAGGAAGAGGAAGAATTAGAGCTGAGGATGAATGGAAAGCTGCCG
Protein: 462 (aa)
MALEDSRYCYSPTAESNSPALPNSPNSFSRNSSTASSPTTVCDIKMSSKRGLVVVDDEDMDINVDDDDDDEIYEKKFKASKSDSLNIGKFSFSITNILSDAFGPKTSPTAANNHSQHVIKTESCDPSDRIFRPFEIKNFICNSANNSASNGHSNARAFMQNLSNPSSVFLNSFRLSDIFDYSTKSSASENNHNNNNNNNNNNNNNNNNSFNSISNNNNNKSDNSLRNSLYSNFSSYPKIQEEIFQSSHRKFAPSPASSALKIPTAIGGLCKTISQIGQETSPPPLSTSSASTTSSTKSGSVDTLKLQQSSTDSLDSDDCQSEAKKDESKMWPAWIFCTRYSDRPSSGPRYRRPKEKKEKGAEDEKRPRTAFSNEQLARLKREFNENRYLTEKRRQQLSSELGLNEAQIKIWFQNKRAKIKKTSGTKNALALQLMAQGLYNHTTVPLTKEEEELELRMNGKLP
Type | Start | End | Length |
CDS |
13567 |
13690 |
124 |
CDS |
13985 |
14862 |
878 |
CDS |
14941 |
14977 |
37 |
CDS |
16904 |
17004 |
101 |
CDS |
17184 |
17341 |
158 |
CDS |
17440 |
17527 |
88 |
intron |
13691 |
13984 |
294 |
intron |
14863 |
14940 |
78 |
intron |
14978 |
16903 |
1926 |
intron |
17005 |
17183 |
179 |
intron |
17342 |
17439 |
98 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001649162 |
engrailed [Aedes aegypti] gb|EAT33107.1| engrailed [Aedes aegypti] |
1e-73 |
InterPro |
IPR019737 |
Homeobox engrailed-type, conserved site |
|
InterPro |
IPR000047 |
Helix-turn-helix motif |
|
InterPro |
IPR019549 |
Homeobox engrailed, C-terminal |
|
InterPro |
IPR001356 |
Homeodomain |
|
InterPro |
IPR017970 |
Homeobox, conserved site |
|
InterPro |
IPR000747 |
Homeodomain engrailed |
|
InterPro |
IPR020479 |
Homeodomain, metazoa |
|
InterPro |
IPR009057 |
Homeodomain-like |
|
Gene Ontology(BP) |
GO:0007275 |
multicellular organismal development |
|
Gene Ontology(BP) |
GO:0006355 |
regulation of transcription, DNA-dependent |
|
Gene Ontology(CC) |
GO:0005634 |
nucleus |
|
Gene Ontology(MF) |
GO:0003677 |
DNA binding |
|
Gene Ontology(MF) |
GO:0043565 |
sequence-specific DNA binding |
|
Gene Ontology(MF) |
GO:0000976 |
transcription regulatory region sequence-specific DNA binding |
|
Gene Ontology(MF) |
GO:0003700 |
sequence-specific DNA binding transcription factor activity |
|
Pfam |
PF07150.6 |
Protein of unknown function (DUF1390) |
0.066 |
Pfam |
PF10525.4 |
Engrailed homeobox C-terminal signature domain |
2.7e-16 |
Pfam |
PF00046.24 |
Homeobox domain |
5e-22 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
C. quinquefasciatus |
CPIJ019925 |
A. gambiae |
AGAP008023 |
D. melanogaster |
FBgn0000577 |
A. aegypti |
AAEL014635 |
B. mori |
BGIBMGA009644-TA |
C. quinquefasciatus |
CPIJ010220 |
P. humanus |
PHUM030340-PA |
H. sapiens |
ENSP00000297375 |
B. mori |
BGIBMGA009798-TA |
M. musculus |
ENSMUSG00000039095 |
P. vanderplanki |
Pv.16904 |
S. invicta |
SI2.2.0_03235 |
P. humanus |
PHUM030230-PA |