MidgeBase gene description page [Pn.00759]

Outline

Link to gbrowse

Gene ID Pn.00759
Type Protein coding gene
Scaffold PnScaf791
Start 53676
End 58336
Direction +

Sequence

Transcript: 4116 (bp)

 ATGACAAACGACGTGTGTCTCGTTCTCGAGGATGGAACAGTTTTGCCGGGCCAGAAGTTCGGTGCCCACAACGACGTCGACGGAGAGGTTGTCTTCCAGACCGGCATGGTCGGGTACGTCGAATCCATGACTGATCCTTCATACCACGGCCAGATCTTGGTGCTCACATATCCGCTCATCGGAAACTATGGCGTGCCCACCGAGACGGAATTTGACGAGCACCAACTGATTAAGCACTTCGAGTCGAATGACAAGATTTGGATTTCGGGCCTGATTGTGGGTGAGCTTTGCGAGTCACCATCGCACTGGCGCTTGAAGTACAAACTGGCCGAGTGGATGCAGAAGCACAACGTGGTCGGCATCAGTGGCATCGACACACGAGCTCTAACGAAGAAGATTCGAGAGAACGGCACCGTTTTGGGGAAAATCATCCAGCAGCCTTCGGGACCCTTTCCCGGCGTCGAGTTCAAGGACCAGAACGAGAGAAACTTGGTCGCCGAAGTGTCGACAAAGTCAATCAAAACGTACAACCCGAAGGGTTCGCCCCGCATTTGTGCCATTGACTGCGGTCTGAAGCTCAATCAAATTCGTTGCTTCGTCAAGCGAGGAGCTCGTGTGGACGTCGTTCCGTGGGACCACGAACTCAAGCCGAAGGACTTTGACGGTTTGTTCTTGAGCAATGGGCCAGGTGATCCCGTGATGTGCCACAAAACGGTAAAAAACGTTCAACAGGTACTTGCCTCATCGGAGGCGAAGCCAATTTTCGGAATTTGTCTCGGCCATCAGCTGCTGTCGACTGCGGTCGGCTGCAAGACCTACAAGATGAAGTACGGAAACCGCGGCCACAACTTGCCGGCGCTCCATCACGGCACCAATCGCTGCTTCATGACCTCGCAGAATCACGGCTTTGCCGTCGACGCCGCGACCTTGGACAGCAAGAATTGGGAGCCACTTTTCACGAATTTGAACGACAACTCCAACGAGGGCATCGTGCACAAGGAGAAGCCTTACTTCAGTGTGCAGTTCCATCCGGAGCACACTGCCGGTCCCGAAGACCTCGAGTGTCTCTTCGACGTCTTCTTGGAGTCGGTCAACGACCACATGAAGGGAGTTTCGGGGCTTTCGATCAAAAATCGACTCATAAAGAAGCTCGTCTATGTGCCAAAAATTGTTTTCGACACCGCGCGACCGAAGAAGGTTCTCATTCTCGGCTCGGGTGGCTTATCCATTGGCCAAGCCGGCGAATTCGACTACTCGGGCTCGCAAGCCATCAAGGCGATGCAGGAAGAGAAAATTCAAACGGTTCTCATCAACCCAAATATCGCGACCGTCCAAACGTCGAAGGGTCTCGCCGACAAAGTCTACTTCTTGCCGCTCACACCCGAGTACGTGGAGCAAGTCATTAAGGCCGAGAGACCCTCGGGAATTCTACTGAGTTTCGGCGGCCAAACGGCTCTCAACTGCGGCGTCGAGCTGGACAAGAAGGGAATTCTGAAGAAGTACAAGATCAGTGTCCTTGGCACGCAAATCAGCTCGATTGTCGAGACCGAGGACAGGAAAATTTTCGCCGAACGCGTCAACGAGATAGGCGAGAAAGTGGCGCCCTCAGTTTGTGCCTGCTCGGTGGCGGAGGCTCTGAAGGCGGCCGAGAAAATCGGCTATCCAGTGATGGCGCGCGCAGCATTTTCGCTCGGCGGCTTGGGCTCGGGATTCGCAAGCAATCCGGAGGAGTTGAAGGCCCTCGCACACCAAGCGCTCGCCTACTCGACGCAGCTCATCATCGACAAGTCGCTGAAGGGCTGGAAGGAGGTCGAGTACGAGGTGGTGCGCGACGCTTACGACAACTGCATAACCGTGTGCAACATGGAGAACGTCGACCCGCTCGGCATCCACACCGGCGAGAGCATTGTCGTCGCGCCATCGCAGACGCTCTCAAACCGCGAGTACAACATGCTGCGCACCACTGCGCTTAAAGTCATCCGACACTTCAACATCGTCGGCGAGTGCAACATTCAATACGCGCTCAACCCGAATTCCGAGGAGTTTTACATCATCGAGGTGAATGCGCGTCTTAGTCGCAGCTCCGCACTCGCTTCGAAAGCGACCGGCTATCCGCTCGCCTACGTCGCCGCGAAGCTGTCGCTCGGCGTTCCGCTACCGGAGATCAGAAACTCGGTGACGGGTGTCACGACTGCGTGCTTCGAGCCCTCGCTCGACTACTGCGTAGTGAAGATTCCGCGCTGGGATCTCGCGAAATTCATCCGCGTCAGCAAGAACATCGGCAGCTCGATGAAGAGCGTCGGCGAGGTGATGGCCATCGGCAGAAAGTTCGAGGAGGCCTTCCAGAAGGCCCTCCGCATGGTCGACGAAAGTGTCAATGGTTTCGATCCGACTCTCAAACCGGTAAATGATGAGGAACTCAAGACGCCGACCGACAAGCGAATGTTCGTCCTCGCGGCCGCTCTCAAGGCCGGCTATTCGGTCGAGAAGCTCTACGACATGACGAAAATCGATCGCTGGTTCCTCTCGAAGATGAAGAACATCATCGACATCACGATCGAGCTCGAGAAGCTCAACTGTGCCATTTCCGAGGATCTCTTGCGACAGGCGAAGAACCACGGATTTTCCGATAAGCAAATCGCGAAGTTCATCAAGGGATCAGAGCTTGCCGTGCGAAAACAACGACGTGAATGTGATATCATTCCTTTCGTGAAACAAATTGACACCGTCGCTGGCGAGTGGCCCGCGTCAACAAACTATTTGTACCTGACATACAACGCCTCGGCGCATGACGTTGAGTTCACTGAACAGATGACGATGGTAATTGGATCGGGTGTTTATCGTATCGGGAGCTCTGTTGAGTTCGACTGGTGTGCGGTCGGTTGTTTACGCGAGCTCAGAAACCTCGGAAAGAAAACAATTATGGTAAACTACAATCCGGAGACGGTTTCGACCGACTACGACATGAGCGACCGGCTTTACTTTGAGGAAATTTCCTTCGAAACGGTGATGGACATTTACACGAACGAAGACCCGGAGGGAATTATTTTGAGCATGGGCGGACAGCTGCCGAATAACATCGCGATGGATTTGCATCGACAGCACGCGCGAATTCTCGGCACAAGTCCCGAGTCCGTCGACTCAGCCGAGAATCGCTTCAAGTTTTCGCGTCTGTTGGATCGCAAGGGAATTTTGCAGCCGCGCTGGAAGGAGCTCACCAACTTGCAGTCGGCAACGGACTTTTGCGAAGAGGTCGGCTATCCGTGCCTCGTTCGTCCCTCGTACGTTCTCTCGGGAGCCGCCATGAATGTGGCCTACTCGCACCAGGACTTGGAAACTTACTTGCATGCGGCTTCAGTCGTCAGCAAGGACCATCCGGTGGTCATCTCGAAGTTCTTGACCGAAGCGAAGGAAATCGACGTCGACGCCGTGGCCGATGACGGAGAGATTCTGTGCTTGGCCGTGTCAGAGCACGTGGAGAATGCGGGTGTACACAGCGGAGACGCGACACTTGTCACGCCACCGCAAGACATCAACAAGGAAACGCTCGAGAAAATCAAGGGAATCGCAAAGGACATCGCCGCCCTCCTCGACATTTCTGGGCCCTTCAACATGCAACTCATCGCGAAAAACAACGAGCTGAAAGTCATCGAGTGCAACGTCAGAGTTTCGCGCTCTTTTCCGTTCGTGTCGAAGACGCTCAATCACGACTTTGTGGCGATGGCAACGCGCGTCATAATTGGCGAGAAAGTTGCGCCCGTTGATGTGCTCTTCGGCGATAACTCGAAAGTCGGCGTGAAGGTTCCGCAATTCAGCTTCTCGCGCCTTGCCGGCGCAGAGGTCACACTCGGCGTCGAGATGTCTTCGACGGGTGAGGTGGCGTGTTTCGGCGATAATCGATACGAAGCCTACCTCAAGGCGATGATGTCGACGGGCTTTCAAATGCCGAAAAAATCCATTCTCATCAGCGTCGGTAGCATTCGGCACAAGAACGAACTGCTGACATCGATTCGTGACCTGTCACGAATGGGCTACAAGCTCTATGCCTCGATGGGAACGGCTGACTTTTACACTGAACACGGCATTCGGGTGAGTTTTCTTAAAATTAATTACTTA 

Protein: 1372 (aa)

 MTNDVCLVLEDGTVLPGQKFGAHNDVDGEVVFQTGMVGYVESMTDPSYHGQILVLTYPLIGNYGVPTETEFDEHQLIKHFESNDKIWISGLIVGELCESPSHWRLKYKLAEWMQKHNVVGISGIDTRALTKKIRENGTVLGKIIQQPSGPFPGVEFKDQNERNLVAEVSTKSIKTYNPKGSPRICAIDCGLKLNQIRCFVKRGARVDVVPWDHELKPKDFDGLFLSNGPGDPVMCHKTVKNVQQVLASSEAKPIFGICLGHQLLSTAVGCKTYKMKYGNRGHNLPALHHGTNRCFMTSQNHGFAVDAATLDSKNWEPLFTNLNDNSNEGIVHKEKPYFSVQFHPEHTAGPEDLECLFDVFLESVNDHMKGVSGLSIKNRLIKKLVYVPKIVFDTARPKKVLILGSGGLSIGQAGEFDYSGSQAIKAMQEEKIQTVLINPNIATVQTSKGLADKVYFLPLTPEYVEQVIKAERPSGILLSFGGQTALNCGVELDKKGILKKYKISVLGTQISSIVETEDRKIFAERVNEIGEKVAPSVCACSVAEALKAAEKIGYPVMARAAFSLGGLGSGFASNPEELKALAHQALAYSTQLIIDKSLKGWKEVEYEVVRDAYDNCITVCNMENVDPLGIHTGESIVVAPSQTLSNREYNMLRTTALKVIRHFNIVGECNIQYALNPNSEEFYIIEVNARLSRSSALASKATGYPLAYVAAKLSLGVPLPEIRNSVTGVTTACFEPSLDYCVVKIPRWDLAKFIRVSKNIGSSMKSVGEVMAIGRKFEEAFQKALRMVDESVNGFDPTLKPVNDEELKTPTDKRMFVLAAALKAGYSVEKLYDMTKIDRWFLSKMKNIIDITIELEKLNCAISEDLLRQAKNHGFSDKQIAKFIKGSELAVRKQRRECDIIPFVKQIDTVAGEWPASTNYLYLTYNASAHDVEFTEQMTMVIGSGVYRIGSSVEFDWCAVGCLRELRNLGKKTIMVNYNPETVSTDYDMSDRLYFEEISFETVMDIYTNEDPEGIILSMGGQLPNNIAMDLHRQHARILGTSPESVDSAENRFKFSRLLDRKGILQPRWKELTNLQSATDFCEEVGYPCLVRPSYVLSGAAMNVAYSHQDLETYLHAASVVSKDHPVVISKFLTEAKEIDVDAVADDGEILCLAVSEHVENAGVHSGDATLVTPPQDINKETLEKIKGIAKDIAALLDISGPFNMQLIAKNNELKVIECNVRVSRSFPFVSKTLNHDFVAMATRVIIGEKVAPVDVLFGDNSKVGVKVPQFSFSRLAGAEVTLGVEMSSTGEVACFGDNRYEAYLKAMMSTGFQMPKKSILISVGSIRHKNELLTSIRDLSRMGYKLYASMGTADFYTEHGIRVSFLKINYL 
Type Start End Length
CDS 53676 54363 688
CDS 54429 55328 900
CDS 55386 56536 1151
CDS 56862 58106 1245
CDS 58202 58333 132
intron 54364 54428 65
intron 55329 55385 57
intron 56537 56861 325
intron 58107 58201 95

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001848709 carbamoyl-phosphate synthase large chain [Culex quinquefasciatus] gb|EDS28904.1| carbamoyl-phosphate synthase large chain [Culex quinquefasciatus] 0.0
InterPro IPR005481 Carbamoyl-phosphate synthase, large subunit, N-terminal
InterPro IPR005479 Carbamoyl-phosphate synthetase large subunit-like, ATP-binding domain
InterPro IPR013815 ATP-grasp fold, subdomain 1
InterPro IPR011607 Methylglyoxal synthase-like domain
InterPro IPR016185 Pre-ATP-grasp fold
InterPro IPR005480 Carbamoyl-phosphate synthetase, large subunit oligomerisation domain
InterPro IPR002474 Carbamoyl-phosphate synthase, small subunit N-terminal domain
InterPro IPR006275 Carbamoyl-phosphate synthase, large subunit
InterPro IPR011761 ATP-grasp fold
InterPro IPR005483 Carbamoyl-phosphate synthase large subunit, CPSase domain
InterPro IPR013816 ATP-grasp fold, subdomain 2
InterPro IPR017926 Glutamine amidotransferase type 1
InterPro IPR006274 Carbamoyl-phosphate synthase, small subunit
Gene Ontology(BP) GO:0006807 nitrogen compound metabolic process
Gene Ontology(BP) GO:0008152 metabolic process
Gene Ontology(BP) GO:0006543 glutamine catabolic process
Gene Ontology(BP) GO:0070409 carbamoyl phosphate biosynthetic process
Gene Ontology(MF) GO:0046872 metal ion binding
Gene Ontology(MF) GO:0005524 ATP binding
Gene Ontology(MF) GO:0003824 catalytic activity
Pfam PF02787.14 Carbamoyl-phosphate synthetase large chain, oligomerisation domain 4.4e-43
Pfam PF08443.6 RimK-like ATP-grasp domain 2.4e-06
Pfam PF07722.8 Peptidase C26 1.1e-06
Pfam PF01965.19 DJ-1/PfpI family 0.028
Pfam PF02222.17 ATP-grasp domain 5.1e-18
Pfam PF00988.17 Carbamoyl-phosphate synthase small chain, CPSase domain 5.5e-51
Pfam PF00289.17 Carbamoyl-phosphate synthase L chain, N-terminal domain 2e-37
Pfam PF13535.1 ATP-grasp domain 2e-33
Pfam PF00117.23 Glutamine amidotransferase class-I 4e-44
Pfam PF07478.8 D-ala D-ala ligase C-terminus 5e-16
Pfam PF02655.9 ATP-grasp domain 2.8e-08
Pfam PF02786.12 Carbamoyl-phosphate synthase L chain, ATP binding domain 4.8e-107
Pfam PF02142.17 MGS-like domain 1.4e-06
Pfam PF01071.14 Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain 0.00018

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.00761

Orthologous genes

Species Gene ID
A. gambiae AGAP000300
A. mellifera GB16437-PA
P. vanderplanki Pv.15659
H. melpomene HMEL012118-PA
P. humanus PHUM000420-PA
M. musculus ENSMUSG00000025991
D. melanogaster FBgn0003189
T. castaneum TC012392
M. musculus ENSMUSG00000013629
H. sapiens ENSP00000264705
H. sapiens ENSP00000384510
A. aegypti AAEL009475
P. vanderplanki Pv.15661
C. quinquefasciatus CPIJ007153
H. sapiens ENSP00000406136
S. invicta SI2.2.0_08333
H. sapiens ENSP00000233072
D. plexippus DPOGS205071PA
B. mori BGIBMGA006816-TA
A. aegypti AAEL009490
H. sapiens ENSP00000402608
N. vitripennis NV12337-PA
S. invicta SI2.2.0_03623