MidgeBase gene description page [Pn.12073]
Outline
Gene ID | Pn.12073 |
Type | Protein coding gene |
Scaffold | PnScaf15449 |
Start | 44 |
End | 3347 |
Direction | - |
Sequence
Transcript: 2787 (bp)
ATGTTCAATAAAACGAAATTTCCGGGCAAGCCATCCAAACTAGTGAATAAGAAGCGTGTTAGTGTTTTATCATTTGATAACAGTTTTGACAAAAGAGCGAACGATCGTTGTGAAATTGACAGTGAAAAAAATTTGGCTTCACCAAAACATGAACATGGCTACCAGACGGAAAGTGCAACAACAACTACGGCAAAACAGTCAGCGGAGGAGAGCGAAGAAAAGTCAAAACCAGTTATGACAACTGAGGAGACAGAAAATAAAGTTTTAACATCCACAACAACAACAACAAAAACGGCGGACGACGAGAAGAATGATATGGAAAACAAGGAGACAGCAAAAAGTTTAAATAAAAATGATTCAAATTGTGATAATTTAGATAAAAAGACGAAGAATTCGAAGAAGCATGTGACCTTTCGGCAAACGATAGAAACAAGTGATCAGTGCAAAGTGAAGCGTGTCTATAATCCGAATTTTAGCGGACCCATTGTTTCCATAATTAAGAAGGAGTCTCTCAAGTATCCTATATTGGTGTACAAAACTAAATGCATTGTCCGTGAGTCGAGACTAACGGAAATCGTACGAAACAGTGCTAATAACATTGATAAACTAAACTCATTGAAATTTGGGCGCGAATATGCTAATCATCATTCACATCAGACGACTGAAGGCGCTTCGAGCAAGTCTTCGTCGTCGTCGTCGTCGACGGCCTCAAATGTAATTGTTGCCAGTAAATTCAATCTCCCCAAGCTGAGTGCAAATTCGTCGCGAGTCATCAAGCCCAACAAGAGATTTCTCTTCGACACGGGCGAGGATCCATCGGCGGCCAGCAAGAAGAAGGTCATCAAGCCGAGTCCGTGGGGAGAGGGCGGGAGCGACCAGAGCGCCAGCTCATCCTCTTCACTACTTAAAGAGAAGAAAAATAATTTTGGCTTTAACTTTGTCGATGAACTAGATCTGAAATCAAAAAAGTCGAAGGCTCAATCACCATCGCTGACTTCATCGACATCCTCGTCAGCTTTGACGTCTGCATCCACCTCTTCACTGTCACTTTCGGCGGCGGCAGCTGCGACATCATCGACGTTGACCGGCAGCAAGAGTGACGACACTTCAAGCAAGCTCCTATCTCAACCCATTCTTCGAAAGCCTGTACTTCAGCTCTCTACAAACTTTAGTCTCTTCGGATCGCAGGATCGCGATGCGAATGCACTTAAAAGCCCGTTCTCTCTTAAGCTTAGCTCACCCAACCACTCCGGTTCCGCCTCTTCGACAGCGATTTTTACGAAATCTCTGGTGTGCAACGTCTGCAACTCCATAACGACGCGCAAGCAGCCGAGGAAGTATGGAGCCATTTGCTGTGAAATTTGCAAGAAATTCATGTCAAAAATTATTGAGCGTGTTAATAAGCATCCTGTACAAAATTGGCAATGTGATAAAGGAGATGGTTCATGTACGATAGAGTCGGTTGCTCTGAAGAACCTGAAGCCGTCGAAGATTGATATCAACATGAAGATCATCAGCAAGGGCCGTTGCTATGCATGCTGGCTGAAGAAATGTCTACTTACTTTTCAATTGCCATCTCCACTCAAAGCGCGTCTCACCAACGTTCTGCCGAAGAGTTTGATGGAGCCTTCGAAACCAGTGGTCTTATCAAACACAAACAATGATAACTCAAATATATTTTTAAGTTCATTGAAGTCATTTTCGTCGTTCAAGTTGTCGTCGAATCCGTTGGCCATCAACAACTTTACGTTCGGCTCCAAGCCCGACATCAAGTCGAGCTTCGACAATGTCCTGAAGAATGCATTTACGAGTGATAAACCCTTGCTAGTTGACATACCAAAAGAACCGGCAACGACGACGACGACAGCGGCGGCAACTACTCCACTTTTGTCCACAAACGAGACGAAGAAATCTCCGAAAGGATCTCCAAAATTATCGCCGAAAATCGTCCCTGCCGCCACGGTAACAAGCGAAGCAACTGCGCCAGTATCATCACCATC
TACATCACAACAAGCATCATCACAACAACAGAGTCAAATCATTCACCCGAGCAACGTTGCTACGTCTGACAGCCTGAGGCAGAGGAATCTGATTAAAGGGCCGCGCGTCAAGCACGTCTGTCGCTCGGCGTCGCTTGTGCTCGGGCTGCCGATTGCCGTCTTCCCAGGCGACTCCGAGAATCAGAATGCCGAAGCGGAGGCGGCGGTGGCGGCGACGGCGGCCGAGGATCCAACCTTGGACGAGAATTCCGAGAAGGAGAATCTCGCATTTGCGCCCACCGAAGATTCTGAGAAGGTCGACGAGAGCGCAAAGAAGGACGAGATTGAGTGCTCGAAGACGTCGCGAAAAGGCGAGCAAGAGTTTTTGGACGCCGAAACGGCTCCCAGTGAGGAGAAGGAGAATGCAAAGTCTAAGCCGGAGACGGCGGCCGTGGCGGACTTGCTCGCGCTGAGGAAGATCGAGAGCGTCGACATCTGCAAGCCGATCACGAGGAAGGTGACGCGGCCGACGCTGACGCTGTCGCAAACGCAGAACGCCACCATGAACCGAATATCGATGAAGCAGCTGCCGTCGTCGAAGCGGCTCCTCTTCAACCACCACATGCGCAACAACAGCAACATGCCGCCGATGGTGTCGATCGACTTCTGGGAGAACTACGACCCGGCCGAGGTGAGCCGCACAGGCTTCGGCCTCATCCTGAGCGAGCGGACGCCGATAAAGTCGGTGTGCTTCCTGTGCGGAAGCTACGGAAGCGACCCCTTAATCTTCTGCGTACAAACAGTTTCC
Protein: 929 (aa)
MFNKTKFPGKPSKLVNKKRVSVLSFDNSFDKRANDRCEIDSEKNLASPKHEHGYQTESATTTTAKQSAEESEEKSKPVMTTEETENKVLTSTTTTTKTADDEKNDMENKETAKSLNKNDSNCDNLDKKTKNSKKHVTFRQTIETSDQCKVKRVYNPNFSGPIVSIIKKESLKYPILVYKTKCIVRESRLTEIVRNSANNIDKLNSLKFGREYANHHSHQTTEGASSKSSSSSSSTASNVIVASKFNLPKLSANSSRVIKPNKRFLFDTGEDPSAASKKKVIKPSPWGEGGSDQSASSSSSLLKEKKNNFGFNFVDELDLKSKKSKAQSPSLTSSTSSSALTSASTSSLSLSAAAAATSSTLTGSKSDDTSSKLLSQPILRKPVLQLSTNFSLFGSQDRDANALKSPFSLKLSSPNHSGSASSTAIFTKSLVCNVCNSITTRKQPRKYGAICCEICKKFMSKIIERVNKHPVQNWQCDKGDGSCTIESVALKNLKPSKIDINMKIISKGRCYACWLKKCLLTFQLPSPLKARLTNVLPKSLMEPSKPVVLSNTNNDNSNIFLSSLKSFSSFKLSSNPLAINNFTFGSKPDIKSSFDNVLKNAFTSDKPLLVDIPKEPATTTTTAAATTPLLSTNETKKSPKGSPKLSPKIVPAATVTSEATAPVSSPSTSQQASSQQQSQIIHPSNVATSDSLRQRNLIKGPRVKHVCRSASLVLGLPIAVFPGDSENQNAEAEAAVAATAAEDPTLDENSEKENLAFAPTEDSEKVDESAKKDEIECSKTSRKGEQEFLDAETAPSEEKENAKSKPETAAVADLLALRKIESVDICKPITRKVTRPTLTLSQTQNATMNRISMKQLPSSKRLLFNHHMRNNSNMPPMVSIDFWENYDPAEVSRTGFGLILSERTPIKSVCFLCGSYGSDPLIFCVQTVS
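As a quick consistency check on the two sequences above, the following minimal sketch translates the first 30 bases of the transcript and compares the result with the start of the protein. It assumes the standard genetic code, since the page does not state which translation table was used. The names `CODON_TABLE` and `translate` are illustrative and are not part of MidgeBase.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
# Assumption: the page does not say which translation table was used.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the transcript and first 10 residues of the protein,
# both copied from the sequences above.
transcript_start = "ATGTTCAATAAAACGAAATTTCCGGGCAAG"
protein_start = "MFNKTKFPGK"

print(translate(transcript_start) == protein_start)
```

The same helper could be applied to the full 2787 bp transcript, which corresponds to 2787 / 3 = 929 codons and matches the stated protein length of 929 aa.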
Type | Start | End | Length |
CDS | 47 | 59 | 13 |
CDS | 121 | 1453 | 1333 |
CDS | 1520 | 2032 | 513 |
CDS | 2094 | 2856 | 763 |
CDS | 3183 | 3347 | 165 |
intron | 60 | 120 | 61 |
intron | 1454 | 1519 | 66 |
intron | 2033 | 2093 | 61 |
intron | 2857 | 3182 | 326 |
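The feature coordinates can be cross-checked against the sequence lengths reported above. The sketch below assumes the Start and End values are 1-based and inclusive on the scaffold, which is consistent with the Length column (for example, 59 - 47 + 1 = 13). The `FEATURES` list and `feature_length` helper are illustrative names, not part of MidgeBase.

```python
# Features copied from the table above: (type, start, end).
# Assumption: coordinates are 1-based and inclusive.
FEATURES = [
    ("CDS", 47, 59),
    ("CDS", 121, 1453),
    ("CDS", 1520, 2032),
    ("CDS", 2094, 2856),
    ("CDS", 3183, 3347),
    ("intron", 60, 120),
    ("intron", 1454, 1519),
    ("intron", 2033, 2093),
    ("intron", 2857, 3182),
]


def feature_length(start: int, end: int) -> int:
    """Length of an inclusive interval (hypothetical helper)."""
    return end - start + 1


# Total coding length should equal the transcript length (2787 bp).
cds_total = sum(feature_length(s, e) for kind, s, e in FEATURES if kind == "CDS")

# Sorted by start, CDS and intron features should tile the region without gaps.
ordered = sorted(FEATURES, key=lambda f: f[1])
contiguous = all(
    nxt[1] == cur[2] + 1 for cur, nxt in zip(ordered, ordered[1:])
)

print(cds_total)       # 2787
print(cds_total // 3)  # 929
print(contiguous)      # True
```

The five CDS segments sum to 2787 bp, matching the transcript length, and 2787 / 3 = 929 matches the protein length.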
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | NP_476769 | trithorax, isoform D [Drosophila melanogaster] ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster] sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName: Full=Lysine N-methyltransferase 2A gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster] gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster] | 3e-54 |
InterPro | IPR001628 | Zinc finger, nuclear hormone receptor-type | |
Gene Ontology(BP) | GO:0006355 | regulation of transcription, DNA-dependent | |
Gene Ontology(CC) | GO:0005634 | nucleus | |
Gene Ontology(MF) | GO:0043565 | sequence-specific DNA binding | |
Gene Ontology(MF) | GO:0008270 | zinc ion binding | |
Gene Ontology(MF) | GO:0003700 | sequence-specific DNA binding transcription factor activity | |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
T. castaneum | TC004768 |
P. vanderplanki | Pv.07627 |
C. quinquefasciatus | CPIJ013972 |
B. mori | BGIBMGA010221-TA |
S. invicta | SI2.2.0_07789 |
H. melpomene | HMEL013536-PA |
A. aegypti | AAEL000054 |
A. gambiae | AGAP002741 |
A. mellifera | GB16330-PA |
D. plexippus | DPOGS212487PA |
P. humanus | PHUM079870-PA |
P. vanderplanki | Pv.07626 |
D. melanogaster | FBgn0003862 |