MidgeBase gene description page [Pn.12073]

Outline

Gene ID Pn.12073
Type Protein coding gene
Scaffold PnScaf15449
Start 44
End 3347
Direction -

Sequence

Transcript: 2787 bp

 ATGTTCAATAAAACGAAATTTCCGGGCAAGCCATCCAAACTAGTGAATAAGAAGCGTGTTAGTGTTTTATCATTTGATAACAGTTTTGACAAAAGAGCGAACGATCGTTGTGAAATTGACAGTGAAAAAAATTTGGCTTCACCAAAACATGAACATGGCTACCAGACGGAAAGTGCAACAACAACTACGGCAAAACAGTCAGCGGAGGAGAGCGAAGAAAAGTCAAAACCAGTTATGACAACTGAGGAGACAGAAAATAAAGTTTTAACATCCACAACAACAACAACAAAAACGGCGGACGACGAGAAGAATGATATGGAAAACAAGGAGACAGCAAAAAGTTTAAATAAAAATGATTCAAATTGTGATAATTTAGATAAAAAGACGAAGAATTCGAAGAAGCATGTGACCTTTCGGCAAACGATAGAAACAAGTGATCAGTGCAAAGTGAAGCGTGTCTATAATCCGAATTTTAGCGGACCCATTGTTTCCATAATTAAGAAGGAGTCTCTCAAGTATCCTATATTGGTGTACAAAACTAAATGCATTGTCCGTGAGTCGAGACTAACGGAAATCGTACGAAACAGTGCTAATAACATTGATAAACTAAACTCATTGAAATTTGGGCGCGAATATGCTAATCATCATTCACATCAGACGACTGAAGGCGCTTCGAGCAAGTCTTCGTCGTCGTCGTCGTCGACGGCCTCAAATGTAATTGTTGCCAGTAAATTCAATCTCCCCAAGCTGAGTGCAAATTCGTCGCGAGTCATCAAGCCCAACAAGAGATTTCTCTTCGACACGGGCGAGGATCCATCGGCGGCCAGCAAGAAGAAGGTCATCAAGCCGAGTCCGTGGGGAGAGGGCGGGAGCGACCAGAGCGCCAGCTCATCCTCTTCACTACTTAAAGAGAAGAAAAATAATTTTGGCTTTAACTTTGTCGATGAACTAGATCTGAAATCAAAAAAGTCGAAGGCTCAATCACCATCGCTGACTTCATCGACATCCTCGTCAGCTTTGACGTCTGCATCCACCTCTTCACTGTCACTTTCGGCGGCGGCAGCTGCGACATCATCGACGTTGACCGGCAGCAAGAGTGACGACACTTCAAGCAAGCTCCTATCTCAACCCATTCTTCGAAAGCCTGTACTTCAGCTCTCTACAAACTTTAGTCTCTTCGGATCGCAGGATCGCGATGCGAATGCACTTAAAAGCCCGTTCTCTCTTAAGCTTAGCTCACCCAACCACTCCGGTTCCGCCTCTTCGACAGCGATTTTTACGAAATCTCTGGTGTGCAACGTCTGCAACTCCATAACGACGCGCAAGCAGCCGAGGAAGTATGGAGCCATTTGCTGTGAAATTTGCAAGAAATTCATGTCAAAAATTATTGAGCGTGTTAATAAGCATCCTGTACAAAATTGGCAATGTGATAAAGGAGATGGTTCATGTACGATAGAGTCGGTTGCTCTGAAGAACCTGAAGCCGTCGAAGATTGATATCAACATGAAGATCATCAGCAAGGGCCGTTGCTATGCATGCTGGCTGAAGAAATGTCTACTTACTTTTCAATTGCCATCTCCACTCAAAGCGCGTCTCACCAACGTTCTGCCGAAGAGTTTGATGGAGCCTTCGAAACCAGTGGTCTTATCAAACACAAACAATGATAACTCAAATATATTTTTAAGTTCATTGAAGTCATTTTCGTCGTTCAAGTTGTCGTCGAATCCGTTGGCCATCAACAACTTTACGTTCGGCTCCAAGCCCGACATCAAGTCGAGCTTCGACAATGTCCTGAAGAATGCATTTACGAGTGATAAACCCTTGCTAGTTGACATACCAAAAGAACCGGCAACGACGACGACGACAGCGGCGGCAACTACTCCACTTTTGTCCACAAACGAGACGAAGAAATCTCCGAAAGGATCTCCAAAATTATCGCCGAAAATCGTCCCTGCCGCCACGGTAACAAGCGAAGCAACTGCGCCAGTATCATCACCAT
CTACATCACAACAAGCATCATCACAACAACAGAGTCAAATCATTCACCCGAGCAACGTTGCTACGTCTGACAGCCTGAGGCAGAGGAATCTGATTAAAGGGCCGCGCGTCAAGCACGTCTGTCGCTCGGCGTCGCTTGTGCTCGGGCTGCCGATTGCCGTCTTCCCAGGCGACTCCGAGAATCAGAATGCCGAAGCGGAGGCGGCGGTGGCGGCGACGGCGGCCGAGGATCCAACCTTGGACGAGAATTCCGAGAAGGAGAATCTCGCATTTGCGCCCACCGAAGATTCTGAGAAGGTCGACGAGAGCGCAAAGAAGGACGAGATTGAGTGCTCGAAGACGTCGCGAAAAGGCGAGCAAGAGTTTTTGGACGCCGAAACGGCTCCCAGTGAGGAGAAGGAGAATGCAAAGTCTAAGCCGGAGACGGCGGCCGTGGCGGACTTGCTCGCGCTGAGGAAGATCGAGAGCGTCGACATCTGCAAGCCGATCACGAGGAAGGTGACGCGGCCGACGCTGACGCTGTCGCAAACGCAGAACGCCACCATGAACCGAATATCGATGAAGCAGCTGCCGTCGTCGAAGCGGCTCCTCTTCAACCACCACATGCGCAACAACAGCAACATGCCGCCGATGGTGTCGATCGACTTCTGGGAGAACTACGACCCGGCCGAGGTGAGCCGCACAGGCTTCGGCCTCATCCTGAGCGAGCGGACGCCGATAAAGTCGGTGTGCTTCCTGTGCGGAAGCTACGGAAGCGACCCCTTAATCTTCTGCGTACAAACAGTTTCC 

Protein: 929 aa

 MFNKTKFPGKPSKLVNKKRVSVLSFDNSFDKRANDRCEIDSEKNLASPKHEHGYQTESATTTTAKQSAEESEEKSKPVMTTEETENKVLTSTTTTTKTADDEKNDMENKETAKSLNKNDSNCDNLDKKTKNSKKHVTFRQTIETSDQCKVKRVYNPNFSGPIVSIIKKESLKYPILVYKTKCIVRESRLTEIVRNSANNIDKLNSLKFGREYANHHSHQTTEGASSKSSSSSSSTASNVIVASKFNLPKLSANSSRVIKPNKRFLFDTGEDPSAASKKKVIKPSPWGEGGSDQSASSSSSLLKEKKNNFGFNFVDELDLKSKKSKAQSPSLTSSTSSSALTSASTSSLSLSAAAAATSSTLTGSKSDDTSSKLLSQPILRKPVLQLSTNFSLFGSQDRDANALKSPFSLKLSSPNHSGSASSTAIFTKSLVCNVCNSITTRKQPRKYGAICCEICKKFMSKIIERVNKHPVQNWQCDKGDGSCTIESVALKNLKPSKIDINMKIISKGRCYACWLKKCLLTFQLPSPLKARLTNVLPKSLMEPSKPVVLSNTNNDNSNIFLSSLKSFSSFKLSSNPLAINNFTFGSKPDIKSSFDNVLKNAFTSDKPLLVDIPKEPATTTTTAAATTPLLSTNETKKSPKGSPKLSPKIVPAATVTSEATAPVSSPSTSQQASSQQQSQIIHPSNVATSDSLRQRNLIKGPRVKHVCRSASLVLGLPIAVFPGDSENQNAEAEAAVAATAAEDPTLDENSEKENLAFAPTEDSEKVDESAKKDEIECSKTSRKGEQEFLDAETAPSEEKENAKSKPETAAVADLLALRKIESVDICKPITRKVTRPTLTLSQTQNATMNRISMKQLPSSKRLLFNHHMRNNSNMPPMVSIDFWENYDPAEVSRTGFGLILSERTPIKSVCFLCGSYGSDPLIFCVQTVS 
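
As a quick sanity check, the first codons of the transcript can be translated and compared with the start of the protein sequence. The sketch below assumes the standard genetic code; the names CODON_TABLE and translate are illustrative and are not part of MidgeBase. Only the first 30 bases of the transcript are used.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering
# (assumption: this gene is translated with the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring a trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the transcript listed above.
transcript_start = "ATGTTCAATAAAACGAAATTTCCGGGCAAG"
protein_start = translate(transcript_start)
print(protein_start)  # MFNKTKFPGK
```

The result matches the first ten residues of the 929 aa protein sequence.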
Type    Start  End   Length
CDS     47     59    13
CDS     121    1453  1333
CDS     1520   2032  513
CDS     2094   2856  763
CDS     3183   3347  165
intron  60     120   61
intron  1454   1519  66
intron  2033   2093  61
intron  2857   3182  326
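
The coordinates in this table can be cross-checked against the stated transcript and protein lengths. The sketch below assumes the coordinates are 1-based and inclusive, so that length = End - Start + 1; that convention is inferred from the Length column, not stated on the page. The variable names are illustrative.

```python
# (type, start, end, listed_length) rows copied from the feature table.
features = [
    ("CDS", 47, 59, 13),
    ("CDS", 121, 1453, 1333),
    ("CDS", 1520, 2032, 513),
    ("CDS", 2094, 2856, 763),
    ("CDS", 3183, 3347, 165),
    ("intron", 60, 120, 61),
    ("intron", 1454, 1519, 66),
    ("intron", 2033, 2093, 61),
    ("intron", 2857, 3182, 326),
]

# Each listed length should equal end - start + 1 (inclusive coordinates).
for kind, start, end, listed in features:
    assert end - start + 1 == listed, (kind, start, end)

cds = sorted((s, e) for kind, s, e, _ in features if kind == "CDS")
introns = sorted((s, e) for kind, s, e, _ in features if kind == "intron")

# Total coding length and the number of codons it contains.
cds_total = sum(e - s + 1 for s, e in cds)
print(cds_total)       # 2787
print(cds_total // 3)  # 929

# Introns should exactly fill the gaps between consecutive CDS segments.
gaps = [(prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(cds, cds[1:])]
print(gaps == introns)  # True
```

The CDS segments sum to 2787 bp, which matches the transcript length, and 2787 / 3 = 929 matches the protein length, so the listed transcript does not appear to include a stop codon.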

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr NP_476769 trithorax, isoform D [Drosophila melanogaster] ref|NP_599109.1| trithorax, isoform A [Drosophila melanogaster] sp|P20659.4|TRX_DROME RecName: Full=Histone-lysine N-methyltransferase trithorax; AltName: Full=Lysine N-methyltransferase 2A gb|AAF55041.2| trithorax, isoform A [Drosophila melanogaster] gb|AAN13599.1| trithorax, isoform D [Drosophila melanogaster] 3e-54
InterPro IPR001628 Zinc finger, nuclear hormone receptor-type
Gene Ontology(BP) GO:0006355 regulation of transcription, DNA-dependent
Gene Ontology(CC) GO:0005634 nucleus
Gene Ontology(MF) GO:0043565 sequence-specific DNA binding
Gene Ontology(MF) GO:0008270 zinc ion binding
Gene Ontology(MF) GO:0003700 sequence-specific DNA binding transcription factor activity

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.15875

Orthologous genes

Species              Gene ID
T. castaneum         TC004768
P. vanderplanki      Pv.07627
C. quinquefasciatus  CPIJ013972
B. mori              BGIBMGA010221-TA
S. invicta           SI2.2.0_07789
H. melpomene         HMEL013536-PA
A. aegypti           AAEL000054
A. gambiae           AGAP002741
A. mellifera         GB16330-PA
D. plexippus         DPOGS212487PA
P. humanus           PHUM079870-PA
P. vanderplanki      Pv.07626
D. melanogaster      FBgn0003862