MidgeBase gene description page [Pn.08434]

Outline

Link to gbrowse

Gene ID Pn.08434
Type Protein coding gene
Scaffold PnScaf8095
Start 26962
End 32099
Direction +

Sequence

Transcript: 3933 (bp)

 ATGGAAGATATCGAGTCACAAATAAAGGACATTCAGGAGAAGAAACGACAGGCTCAGGCTCAGGACGAGCGTCAGAAAGGAATCGGACTGCTAGAGAGTGGATACTTTGACTCTGAAATTTATGAGGGAGGTCAGAAAGGAAAGTATGAGGGCTACGTTACCTCAATTGCGACAAATGAAGAAGATGAGGATGACGACGATGAACCAATTAGACCAGACAAAAAGTCATCCGCATTTAATGCTCCTATGCAGTTTATCAAAGAAATCACAAGAAATGAGCCGGAATATGACCCGTTTGAAGATCGACGTCAAAAGACCGTGGGCGAGAAGGAGGATGAATACCGACAGCGTCGTAGAAAATTAGTTATTTCGCCTGAGCGAGTCGATCCATTTGCCGATGGCGGCAAAACTCCCGATGTCAAGTCACGAACATTCACGGAAATTATGCGAGAGCAGCAGCTGCGAGGAGAAGAAGCAGAATTGCGCAAGAAGATTCAGGACAAGGCGAAGGACGGGACGCTTAAAGTTTCCAATGGCGACTCTAACAGAGCGGAAGCTAAGAAGCGAGGGAGATGGGATCAAACAGTGGACGAGCAGTTTGTCCCAGCTAAAAAATCTGCATCCGGTGCACTCACGCCCACTTGGGAAGTCGATAAAACACCGGGAGACCATCGATGGGACGAAACACCAGGAAGAGCGATTGGAAGTGAAACACCAGGCGCCACACCGGCTGCACGACACATTTGGGACGCGACTCCAGCCGTCTCTTCGCATTCGACTACACCAGGTCGCGACACGCCAGCACAGGACAAGTCAGTTCGTAAAAATCGCTGGGATGAGACGCCAAAAACTGAGCGAGAAACGCCCGGTCACAATTCCGGCTGGATGGAGACGCCACGAGCAGATCGCGGAGCTCCGGACCTCATCGATTCCACGCCGGGGGCATCGAAGCGTCGTTCGCGTTGGGACGAAACGCCGTCGGGTGCGACGCCATCGGCCATGACTCCTTCGAGCGCGATGACTCCAAGCATGACGCCACACGCAACTCCGGGACATGCCACGCCGATGCTCACTCCAAGTGGGACAACACCGATCGGCTCAAAGGCCATGGCGATGGCTACACCGACACCAGGACATCTGGCTTCGATGACTCCCGAACAACTGCAAGCCTACCGCTGGGAAAAGGAAATTGACGAGCGCAATCGACCGTTTACTGATGAGGAGTTGGACGCCTTGTTCCCGCCGGGCTACAAAATTTTGCCTGCACCTGCTGGTTATATTCCCATTCGCACGCCAGCCAGAAAACTGACAGCCACACCGACTCCGATCGCGGGGACACCAACGGGATTTTTCATTCAACAAGAGGACAAAACAGCTGCGAAATTCAATGACAACCAGCCGAAAGGAAATTTGCCTTTCATGAAGCCCGAGGACGCACAATATTTCGACAAACTTCTCGTCGACGTCGACGAGGAGGCGCTGACTCCCGAGGAGCAGCGCGAAAGAAAAATCATGAAGCTGCTGCTGAAAATCAAGAACGGAACGCCGCCGATGAGAAAAGCCGCACTTCGTCAAATCACCGACAAGGCACGAGAGTTCGGTGCCGGTCCGTTGTTTAATCAGATTCTGCCGCTTCTGATGTCGCCGACTCTCGAGGACCAGGAACGTCACTTGCTCGTCAAAGTCATCGATCGAATTCTCTACAAACTCGACGACCTCGTTCGTCCCTACGTTCACAAGATTCTGGTCGTTATCGAGCCGCTGCTGATTGACGAAGACTACTATGCTCGCGTCGAGGGACGAGAGATCATCTCGAACTTGGCAAAGGCAGCCGGTTTGGCGACAATGATCAGTACCATGCGTCCGGATATTGACAACATCGATGAATATGTGCGAAACACGACGGCAAGAGCGTTCGCCGTTGTGGCGTCGTCGCTCGGAATCCCGTCGCTTTTGCCGTTCTTAAAGGCCGTTTGCAAGAGCAAGAAATCGTGGCAGGCTCGTCACACGGGCATCAAGATCGTTCAGCAAATCGCCATCCTCATGGGCTGTGCTATTCTTCCGCATCTGAAGTCACTCGTCGAGATCATCGAGCACGGCTTAGTGGACGAGCAACAGAAGGTCCGAACCATCACGGCTCTCGCTTTGGCCGCGTTGGCAGAAGCGTCAACGCCGTACGGCATCGAGAGCTTCGACTCTGTCCTCAAGCCGCTGTGGAAAGGCATCCGAACGCACCGTGGAAAAGGCTTAGCTGCATTTCTAAAAGCTATCGGCTATTTGATTCCGCTCATGGACGCCGAATATGCCAACTACTATACGCGCGAAGTAATGCTGATTTTAATCCGCGAGTTTCAATCGCCCGACGAAGAAATGAAGAAGATTGTACTGAAAGTCGTGAAGCAATGTTGTGCAACCGACGGCGTTGAAGCTCAATACATAAAAGAGGAAATTTTGCCGCATTTCTTCAAGCACTTTTGGAATCATCGCATGGCATTGGATCGCCGTAATTATCGTCAATTGGTCGATACAACGGTAGAAATTGCTAACAAAGTCGGCTCATCCGAGATCATTAATCGTGTCGTTGACGACTTGAAAGATGAAAACGAACAATACCGAAAGATGGTTATGGAATCGGTGGAGAAGATTATGGGCAATCTCGGCGCGGCGGACATCGACTCGCGCTTGGAGGAGCAGCTTATCGACGGTATTCTGTATGCGTTCCAGGAGCAGACCACCGAGGACGTGGTAATGCTCAATGGCTTCGGAACGATTGTGAATCAGCTGAGCAAGCGAGTGAAGCCGTATTTGCCGCAGATTTGCGGAACAATTCTGTGGCGCTTGAACAACAAGTCGGCTAAAGTAAGACAGCAGGCAGCGGATTTGATTTCGCGCATTGCCGTCGTGATGAAGACGTGTCAGGAGGAGAAGCTGATGGGGCATTTGGGCGTCGTTTTGTACGAGTATCTCGGCGAGGAGTATCCCGAAGTGCTCGGTTCGATTTTGGGCGCACTGAAGGCGATTGTGAATGTCATTGGCATGACGAAAATGACGCCGCCCATCAAGGACTTGTTGCCGCGTTTGACGCCGATCCTGAAGAATCGACACGAGAAAGTACAAGAGAATTGTATTGATCTGGTCGGTCGAATTGCCGATCGCGGTCCCGAGTACGTATCGGCCCGCGAATGGATGCGCATTTGCTTTGAGCTGCTCGAGCTGCTGAAGGCCCACAAGAAGGCGATTCGTCGTGCGACGGTAAACACTTTCGGCTACATTGCGAAAGCCATTGGACCGCACGATGTGCTGGCGACTCTTCTCAACAACCTCAAGGTGCAGGAGCGACAAAACCGTGTGTGCACAACAGTTGCGATCGCCATCGTGGCAGAAACTTGCCGACCTTTCACCGTCCTCCCGGCACTAATGAACGAGTATCGAGTTCCCGAGCTGAACGTCCAGAACGGCGTGCTCAAGTCGCTTTCATTCCTCTTCGAGTACATCGGAGAGATGGGCAAAGATTACATCTATGCGGTCGTTCCTCTGCTCGAAGACGCCCTCATGGACCGCGACTTGGTGCACAGACAAACTGCTTGCGCCGCAATCAAGCACATGTCGCTCGGCGTCTACGGCTTTGGATGCGAGGATGCGCTCGTTCATCTGCTGAACTACGTGTGGCCAAACATTTTCGAAACATCACCTCATCTCGTGCAAGCATTCATGGACGCCGTCGAGGGACTTCGAGTGGCGCTCGGACCAATCAAGATTCTGCAATACACGCTGCAAGGACTGTTTCATCCGGCGCGAAAGGTTCGCGACGTTTACTGGAAGATTTACAATTCGCTCTACATAGGAGCCCAAGATGCACTAATTGCCGGCTACCCGCGAATTGAAAATGACCCTAAAAATGAGTACATTCGTTACGAGCTCGACTATAACTTG 

Protein: 1311 (aa)

 MEDIESQIKDIQEKKRQAQAQDERQKGIGLLESGYFDSEIYEGGQKGKYEGYVTSIATNEEDEDDDDEPIRPDKKSSAFNAPMQFIKEITRNEPEYDPFEDRRQKTVGEKEDEYRQRRRKLVISPERVDPFADGGKTPDVKSRTFTEIMREQQLRGEEAELRKKIQDKAKDGTLKVSNGDSNRAEAKKRGRWDQTVDEQFVPAKKSASGALTPTWEVDKTPGDHRWDETPGRAIGSETPGATPAARHIWDATPAVSSHSTTPGRDTPAQDKSVRKNRWDETPKTERETPGHNSGWMETPRADRGAPDLIDSTPGASKRRSRWDETPSGATPSAMTPSSAMTPSMTPHATPGHATPMLTPSGTTPIGSKAMAMATPTPGHLASMTPEQLQAYRWEKEIDERNRPFTDEELDALFPPGYKILPAPAGYIPIRTPARKLTATPTPIAGTPTGFFIQQEDKTAAKFNDNQPKGNLPFMKPEDAQYFDKLLVDVDEEALTPEEQRERKIMKLLLKIKNGTPPMRKAALRQITDKAREFGAGPLFNQILPLLMSPTLEDQERHLLVKVIDRILYKLDDLVRPYVHKILVVIEPLLIDEDYYARVEGREIISNLAKAAGLATMISTMRPDIDNIDEYVRNTTARAFAVVASSLGIPSLLPFLKAVCKSKKSWQARHTGIKIVQQIAILMGCAILPHLKSLVEIIEHGLVDEQQKVRTITALALAALAEASTPYGIESFDSVLKPLWKGIRTHRGKGLAAFLKAIGYLIPLMDAEYANYYTREVMLILIREFQSPDEEMKKIVLKVVKQCCATDGVEAQYIKEEILPHFFKHFWNHRMALDRRNYRQLVDTTVEIANKVGSSEIINRVVDDLKDENEQYRKMVMESVEKIMGNLGAADIDSRLEEQLIDGILYAFQEQTTEDVVMLNGFGTIVNQLSKRVKPYLPQICGTILWRLNNKSAKVRQQAADLISRIAVVMKTCQEEKLMGHLGVVLYEYLGEEYPEVLGSILGALKAIVNVIGMTKMTPPIKDLLPRLTPILKNRHEKVQENCIDLVGRIADRGPEYVSAREWMRICFELLELLKAHKKAIRRATVNTFGYIAKAIGPHDVLATLLNNLKVQERQNRVCTTVAIAIVAETCRPFTVLPALMNEYRVPELNVQNGVLKSLSFLFEYIGEMGKDYIYAVVPLLEDALMDRDLVHRQTACAAIKHMSLGVYGFGCEDALVHLLNYVWPNIFETSPHLVQAFMDAVEGLRVALGPIKILQYTLQGLFHPARKVRDVYWKIYNSLYIGAQDALIAGYPRIENDPKNEYIRYELDYNL 
Type Start End Length
CDS 26962 26968 7
CDS 27204 27469 266
CDS 27530 27656 127
CDS 28495 28748 254
CDS 28818 32096 3279
intron 26969 27203 235
intron 27470 27529 60
intron 27657 28494 838
intron 28749 28817 69

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001657046 U2 small nuclear ribonucleoprotein, putative [Aedes aegypti] gb|EAT45112.1| U2 small nuclear ribonucleoprotein, putative [Aedes aegypti] 0.0
InterPro IPR011989 Armadillo-like helical
InterPro IPR015016 Splicing factor 3B subunit 1
InterPro IPR016024 Armadillo-type fold
Gene Ontology(MF) GO:0005488 binding
Pfam PF12755.2 Vacuolar 14 Fab1-binding region 1.4e-06
Pfam PF00514.18 Armadillo/beta-catenin-like repeat 0.00042
Pfam PF02985.17 HEAT repeat 5.9e-13
Pfam PF08623.5 TATA-binding protein interacting (TIP20) 0.0061
Pfam PF08064.8 UME (NUC010) domain 0.053
Pfam PF12348.3 CLASP N terminal 0.0028
Pfam PF13513.1 HEAT-like repeat 5.7e-11
Pfam PF13646.1 HEAT repeats 1.4e-15
Pfam PF08920.5 Splicing factor 3B subunit 1 3.7e-44

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
N. vitripennis NV11085-PA
P. humanus PHUM257230-PA
D. melanogaster FBgn0031266
S. invicta SI2.2.0_80122
H. sapiens ENSP00000335321
D. plexippus DPOGS206070PA
N. vitripennis NV15361-PA
A. aegypti AAEL003605
A. mellifera GB16777-PA
P. vanderplanki Pv.08425
A. gambiae AGAP000178
H. melpomene HMEL005039-PA
B. mori BGIBMGA006851-TA
M. musculus ENSMUSG00000025982
C. quinquefasciatus CPIJ801626
P. vanderplanki Pv.11699
T. castaneum TC012382