MidgeBase gene description page [Pn.05164]

Outline

Link to gbrowse

Gene ID Pn.05164
Type Protein coding gene
Scaffold PnScaf4435
Start 49040
End 53272
Direction -

Sequence

Transcript: 2469 (bp)

 ATGAATGAGAATTCGACGTTCTTTTATTCAACGCAAATCAACGCTAATACCACTCAAAATCCGCTAATCAACTTCACGACAAATGATTCGAATGACTTTAAGACCGTGCCATGGTGGTGTTCGTGCTCGAGCGAACTCGAAGATTTGGAGGGAGAGGTCGAATGCCACTGCGAAGGAACGCCACTGAAAATTATTCCGCAGAATCTACTGAACTTTACAAGACTGTCCGTAGCAAATACGAAATTGAAAGTATTAAGGGAAGCTGAATTGAGAAAATATGCAGCACTCATAAAAGATATAGTTTTAATAAACCTCAGTGAGCTTGAAAGAATTGAGACTGAAGCTTTCAAAAATACCCGTCAATTAAGGACTCTATACATTTCACGAGCCCCGAAGCTTCGTTATGTTGCCAGAGAAACGTTTCAATACGTTTCACACTCATTTAAGATACTGCGTATCATCCATTCGGGGTTGCTCGAAATACCAGACCTGAGTTATTTGCAAACGAATGCTAGGATATTATTGCAACTAGATTTCGAGGGAAACCTAATCAGCGAGATTTTGGCGAACTCTGTTCGCATCCGAACAGAACAATTAATAATCGATAACAATGTTCTTGTAAGCGTTCATCGTGCAGCGTTTAATGGATCCGAAATCGGAAAATTAAGCCTAAAGGGTAACACAAAGTTACGCATTCTGCATGAGCGTGCATTCGAAGGAATAAAAAATATCCAGGAGCTGGATTTGTCTAGTACATTTATAGAGAGTCTGCCAACTCTCGGACTAGATAAGCTCGAAGTTCTCCGCATTCAAAATACTCGTTCACTGAAGCACATTCCGTCAGTGTACAACTTCCAGAATCTCGAGAAAGCTTGGCTCACTCATCCATTTCACTGTTGTGCATTCAAATTTCCCCAGCGTCACGATCCTTCAAGGCATCTTCGACACTTGCAGTTAATTGAGAGCTTCAAGAATGAGTGCAAGCTTAAGGGGTATCAATCAACTTTAGCAGCGGCAGCAGTAGTATCGGCGGTGTCTTCGTCTGAAGCCTTGAGCTCATTGAACAATAACAACTTGAATGGAGATTCTGAGCTTATGCCGCGGATGGAGAGACGACGAAGGAGGTCGACGTATTTGACAAAGACCAGCGATGTGATGAATAACAAGACGAGAGAGGTGTCGATTTACAATAATGAAGAGGATATAGTGCTCGACGAAGACTACTACTTGGATCCCGAGGGTGAATTTCATGAACCGACAATTCTAAAAAATGAAACGCTCGAGGCCATGTGCGGAAACATATCGCTAAAAGTGAGAGACGTTCAGTGTTTTCCGATGCCAGATGCGCTGAATCCTTGCGAGGACGTTATGGGAAGCAGCTGGCTGCGGGCGTCTGTATGGATAGTTGTTTCTCTGGCAGTTATCGGAAACGTGGCAGTATTAGTTGTTTTGTTGTCTAATCAATCGGACTTTACAGTGCCGAAGTTCTTGATGTGCAATTTGGCTTTAGCTGATTTCTGCATGGGAATCTATCTCCTGCTAATTGCGTCCATTGATCTGCACTCGATGGGTGAATACTTTAATTTCGCATTCGATTGGCAGTATGGAATGGGTTGCAAGACTGCTGGATGGCTCACCGTATTTGCTGGTCATCTTTCAATATTTACTCTTACAATCATTACGGTGGAGCGGTGGTTTGCCATAACGCATGCAATATATCTGAATAAACGATTGAAGCACCGCAACGCAATCTACATCATGGTTTGCGGGTGGTTTTACTCCATAATTATGGCGATAATGCCACTTTACGGCATTTCCAACTATTCATCCACAAGCATTTGTCTACCGATGGAGGCAAAAGACAACGCCGATGTGATATATTTGATAACAGTATTGGCAGTAAATGGCTTAGGATTTTTCATCGTAGTCATTTGCTATGCTCAAATTTACTTTTCTCTGGGGAAAGAAACCAGGCAACGTGGAATCGGAAGCGGGCGGTCGGAGATGACTGTCGCTAAGAAAATGGCACTTTTGGTCTTTACAAATTTCGCGTGCTGGGCACCTATAGCATTTTTCGGACTAACTGCTCTTGCGGGTTTTCCTCTGATAAACGTTACAAAGTCAAAAATCCTCCTCGTATTTTTCTATCCAATCAATTCCTGTGCAAATCCCTATCTTTATGCAATTCTTACTTCACAATATCGACGAGATTTGTTCCAATTACTATCGAAATTCGGAATATGTACGCAGCGTGCTCAAAAGTACCGAATGAATTACAGTAATCCCACTCACACGATTCCTCTGAATATGCTATCGAGTACTCGAAATTCAAACTCAATGAACTTCCAGTATCGACGTGCAAACAGCAACCCAAAGGCCTCAAACACAATAGAACAGCCGCTTGTTGCCGATCAACATGTAGCGACCTATACAAACGGAAAAATTGATGTTAATTTATGTGAAGATACT 

Protein: 823 (aa)

 MNENSTFFYSTQINANTTQNPLINFTTNDSNDFKTVPWWCSCSSELEDLEGEVECHCEGTPLKIIPQNLLNFTRLSVANTKLKVLREAELRKYAALIKDIVLINLSELERIETEAFKNTRQLRTLYISRAPKLRYVARETFQYVSHSFKILRIIHSGLLEIPDLSYLQTNARILLQLDFEGNLISEILANSVRIRTEQLIIDNNVLVSVHRAAFNGSEIGKLSLKGNTKLRILHERAFEGIKNIQELDLSSTFIESLPTLGLDKLEVLRIQNTRSLKHIPSVYNFQNLEKAWLTHPFHCCAFKFPQRHDPSRHLRHLQLIESFKNECKLKGYQSTLAAAAVVSAVSSSEALSSLNNNNLNGDSELMPRMERRRRRSTYLTKTSDVMNNKTREVSIYNNEEDIVLDEDYYLDPEGEFHEPTILKNETLEAMCGNISLKVRDVQCFPMPDALNPCEDVMGSSWLRASVWIVVSLAVIGNVAVLVVLLSNQSDFTVPKFLMCNLALADFCMGIYLLLIASIDLHSMGEYFNFAFDWQYGMGCKTAGWLTVFAGHLSIFTLTIITVERWFAITHAIYLNKRLKHRNAIYIMVCGWFYSIIMAIMPLYGISNYSSTSICLPMEAKDNADVIYLITVLAVNGLGFFIVVICYAQIYFSLGKETRQRGIGSGRSEMTVAKKMALLVFTNFACWAPIAFFGLTALAGFPLINVTKSKILLVFFYPINSCANPYLYAILTSQYRRDLFQLLSKFGICTQRAQKYRMNYSNPTHTIPLNMLSSTRNSNSMNFQYRRANSNPKASNTIEQPLVADQHVATYTNGKIDVNLCEDT 
Type Start End Length
CDS 49043 49477 435
CDS 49548 49690 143
CDS 49772 49827 56
CDS 50167 50395 229
CDS 50468 50668 201
CDS 50734 50828 95
CDS 50903 51354 452
CDS 51413 51530 118
CDS 51599 51673 75
CDS 51777 51845 69
CDS 51918 51983 66
CDS 52148 52225 78
CDS 52554 52631 78
CDS 52711 52785 75
CDS 52907 53055 149
CDS 53123 53272 150
intron 49478 49547 70
intron 49691 49771 81
intron 49828 50166 339
intron 50396 50467 72
intron 50669 50733 65
intron 50829 50902 74
intron 51355 51412 58
intron 51531 51598 68
intron 51674 51776 103
intron 51846 51917 72
intron 51984 52147 164
intron 52226 52553 328
intron 52632 52710 79
intron 52786 52906 121
intron 53056 53122 67

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_309589 AGAP004035-PA [Anopheles gambiae str. PEST] gb|EAA05376.3| AGAP004035-PA [Anopheles gambiae str. PEST] 0.0
InterPro IPR017452 GPCR, rhodopsin-like superfamily
InterPro IPR002131 Glycoprotein hormone receptor
InterPro IPR000276 GPCR, rhodopsin-like, 7TM
Gene Ontology(BP) GO:0007186 G-protein coupled receptor signaling pathway
Gene Ontology(CC) GO:0016021 integral to membrane
Gene Ontology(MF) GO:0016500 protein-hormone receptor activity
Pfam PF13855.1 Leucine rich repeat 4.4e-14
Pfam PF12799.2 Leucine Rich repeats (2 copies) 0.072
Pfam PF00560.28 Leucine Rich Repeat 1.2
Pfam PF00001.16 7 transmembrane receptor (rhodopsin family) 4.9e-38
Pfam PF13306.1 Leucine rich repeats (6 copies) 8.1e-06

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.10447

Orthologous genes

Species Gene ID
H. sapiens ENSP00000294954
A. aegypti AAEL004399
P. vanderplanki Pv.07556
H. sapiens ENSP00000384708
P. vanderplanki Pv.06926
H. sapiens ENSP00000385847
H. sapiens ENSP00000298171
D. plexippus DPOGS200833PA
C. quinquefasciatus CPIJ007712
H. sapiens ENSP00000306780
D. melanogaster FBgn0016650
M. musculus ENSMUSG00000032937
H. sapiens ENSP00000441235
T. castaneum TC009575
H. sapiens ENSP00000344301
H. sapiens ENSP00000385406
P. humanus PHUM452330-PA
H. sapiens ENSP00000444172
M. musculus ENSMUSG00000024107
A. gambiae AGAP004035
T. castaneum TC009127
H. sapiens ENSP00000333908
H. melpomene HMEL012637-PA
H. sapiens ENSP00000415504
B. mori BGIBMGA009887-TA
H. sapiens ENSP00000386033