MidgeBase gene description page [Pn.00759]
Outline
Gene ID | Pn.00759 |
Type | Protein coding gene |
Scaffold | PnScaf791 |
Start | 53676 |
End | 58336 |
Direction | + |
Sequence
Transcript: 4116 (bp)
ATGACAAACGACGTGTGTCTCGTTCTCGAGGATGGAACAGTTTTGCCGGGCCAGAAGTTCGGTGCCCACAACGACGTCGACGGAGAGGTTGTCTTCCAGACCGGCATGGTCGGGTACGTCGAATCCATGACTGATCCTTCATACCACGGCCAGATCTTGGTGCTCACATATCCGCTCATCGGAAACTATGGCGTGCCCACCGAGACGGAATTTGACGAGCACCAACTGATTAAGCACTTCGAGTCGAATGACAAGATTTGGATTTCGGGCCTGATTGTGGGTGAGCTTTGCGAGTCACCATCGCACTGGCGCTTGAAGTACAAACTGGCCGAGTGGATGCAGAAGCACAACGTGGTCGGCATCAGTGGCATCGACACACGAGCTCTAACGAAGAAGATTCGAGAGAACGGCACCGTTTTGGGGAAAATCATCCAGCAGCCTTCGGGACCCTTTCCCGGCGTCGAGTTCAAGGACCAGAACGAGAGAAACTTGGTCGCCGAAGTGTCGACAAAGTCAATCAAAACGTACAACCCGAAGGGTTCGCCCCGCATTTGTGCCATTGACTGCGGTCTGAAGCTCAATCAAATTCGTTGCTTCGTCAAGCGAGGAGCTCGTGTGGACGTCGTTCCGTGGGACCACGAACTCAAGCCGAAGGACTTTGACGGTTTGTTCTTGAGCAATGGGCCAGGTGATCCCGTGATGTGCCACAAAACGGTAAAAAACGTTCAACAGGTACTTGCCTCATCGGAGGCGAAGCCAATTTTCGGAATTTGTCTCGGCCATCAGCTGCTGTCGACTGCGGTCGGCTGCAAGACCTACAAGATGAAGTACGGAAACCGCGGCCACAACTTGCCGGCGCTCCATCACGGCACCAATCGCTGCTTCATGACCTCGCAGAATCACGGCTTTGCCGTCGACGCCGCGACCTTGGACAGCAAGAATTGGGAGCCACTTTTCACGAATTTGAACGACAACTCCAACGAGGGCATCGTGCACAAGGAGAAGCCTTACTTCAGTGTGCAGTTCCATCCGGAGCACACTGCCGGTCCCGAAGACCTCGAGTGTCTCTTCGACGTCTTCTTGGAGTCGGTCAACGACCACATGAAGGGAGTTTCGGGGCTTTCGATCAAAAATCGACTCATAAAGAAGCTCGTCTATGTGCCAAAAATTGTTTTCGACACCGCGCGACCGAAGAAGGTTCTCATTCTCGGCTCGGGTGGCTTATCCATTGGCCAAGCCGGCGAATTCGACTACTCGGGCTCGCAAGCCATCAAGGCGATGCAGGAAGAGAAAATTCAAACGGTTCTCATCAACCCAAATATCGCGACCGTCCAAACGTCGAAGGGTCTCGCCGACAAAGTCTACTTCTTGCCGCTCACACCCGAGTACGTGGAGCAAGTCATTAAGGCCGAGAGACCCTCGGGAATTCTACTGAGTTTCGGCGGCCAAACGGCTCTCAACTGCGGCGTCGAGCTGGACAAGAAGGGAATTCTGAAGAAGTACAAGATCAGTGTCCTTGGCACGCAAATCAGCTCGATTGTCGAGACCGAGGACAGGAAAATTTTCGCCGAACGCGTCAACGAGATAGGCGAGAAAGTGGCGCCCTCAGTTTGTGCCTGCTCGGTGGCGGAGGCTCTGAAGGCGGCCGAGAAAATCGGCTATCCAGTGATGGCGCGCGCAGCATTTTCGCTCGGCGGCTTGGGCTCGGGATTCGCAAGCAATCCGGAGGAGTTGAAGGCCCTCGCACACCAAGCGCTCGCCTACTCGACGCAGCTCATCATCGACAAGTCGCTGAAGGGCTGGAAGGAGGTCGAGTACGAGGTGGTGCGCGACGCTTACGACAACTGCATAACCGTGTGCAACATGGAGAACGTCGACCCGCTCGGCATCCACACCGGCGAGAGCATTGTCGTCGCGCCATCGCAGACGCTCTCAAACCGCGAGTACAACATGCTGCGCACCACTGCGCTTAAAGTCATCCGACACTTCAACATCGTCGGCGAGTGCAACATTCAATACGCGCTCAACCCGAATTCCGAGGAGTTTTACATCATCGAGGTGAATGCGCGTCTTAGTCGCAGCTCCGCACTCGCTTCGAAAGCGACCGGCTATCCGCTCGCCTACGTCGCCGCGAAGCTGTCGCTCGGCGTTCCGCTACCGGAGATCAGAAACTCGGTGACGGGTGTCACGACTGCGTGCTTCGAGCCCTCGCTCGACTACTGCGTAGTGAAGATTCCGCGCTGGGATCTCGCGAAATTCATCCGCGTCAGCAAGAACATCGGCAGCTCGATGAAGAGCGTCGGCGAGGTGATGGCCATCGGCAGAAAGTTCGAGGAGGCCTTCCAGAAGGCCCTCCGCATGGTCGACGAAAGTGTCAATGGTTTCGATCCGACTCTCAAACCGGTAAATGATGAGGAACTCAAGACGCCGACCGACAAGCGAATGTTCGTCCTCGCGGCCGCTCTCAAGGCCGGCTATTCGGTCGAGAAGCTCTACGACATGACGAAAATCGATCGCTGGTTCCTCTCGAAGATGAAGAACATCATCGACATCACGATCGAGCTCGAGAAGCTCAACTGTGCCATTTCCGAGGATCTCTTGCGACAGGCGAAGAACCACGGATTTTCCGATAAGCAAATCGCGAAGTTCATCAAGGGATCAGAGCTTGCCGTGCGAAAACAACGACGTGAATGTGATATCATTCCTTTCGTGAAACAAATTGACACCGTCGCTGGCGAGTGGCCCGCGTCAACAAACTATTTGTACCTGACATACAACGCCTCGGCGCATGACGTTGAGTTCACTGAACAGATGACGATGGTAATTGGATCGGGTGTTTATCGTATCGGGAGCTCTGTTGAGTTCGACTGGTGTGCGGTCGGTTGTTTACGCGAGCTCAGAAACCTCGGAAAGAAAACAATTATGGTAAACTACAATCCGGAGACGGTTTCGACCGACTACGACATGAGCGACCGGCTTTACTTTGAGGAAATTTCCTTCGAAACGGTGATGGACATTTACACGAACGAAGACCCGGAGGGAATTATTTTGAGCATGGGCGGACAGCTGCCGAATAACATCGCGATGGATTTGCATCGACAGCACGCGCGAATTCTCGGCACAAGTCCCGAGTCCGTCGACTCAGCCGAGAATCGCTTCAAGTTTTCGCGTCTGTTGGATCGCAAGGGAATTTTGCAGCCGCGCTGGAAGGAGCTCACCAACTTGCAGTCGGCAACGGACTTTTGCGAAGAGGTCGGCTATCCGTGCCTCGTTCGTCCCTCGTACGTTCTCTCGGGAGCCGCCATGAATGTGGCCTACTCGCACCAGGACTTGGAAACTTACTTGCATGCGGCTTCAGTCGTCAGCAAGGACCATCCGGTGGTCATCTCGAAGTTCTTGACCGAAGCGAAGGAAATCGACGTCGACGCCGTGGCCGATGACGGAGAGATTCTGTGCTTGGCCGTGTCAGAGCACGTGGAGAATGCGGGTGTACACAGCGGAGACGCGACACTTGTCACGCCACCGCAAGACATCAACAAGGAAACGCTCGAGAAAATCAAGGGAATCGCAAAGGACATCGCCGCCCTCCTCGACATTTCTGGGCCCTTCAACATGCAACTCATCGCGAAAAACAACGAGCTGAAAGTCATCGAGTGCAACGTCAGAGTTTCGCGCTCTTTTCCGTTCGTGTCGAAGACGCTCAATCACGACTTTGTGGCGATGGCAACGCGCGTCATAATTGGCGAGAAAGTTGCGCCCGTTGATGTGCTCTTCGGCGATAACTCGAAAGTCGGCGTGAAGGTTCCGCAATTCAGCTTCTCGCGCCTTGCCGGCGCAGAGGTCACACTCGGCGTCGAGATGTCTTCGACGGGTGAGGTGGCGTGTTTCGGCGATAATCGATACGAAGCCTACCTCAAGGCGATGATGTCGACGGGCTTTCAAATGCCGAAAAAATCCATTCTCATCAGCGTCGGTAGCATTCGGCACAAGAACGAACTGCTGACATCGATTCGTGACCTGTCACGAATGGGCTACAAGCTCTATGCCTCGATGGGAACGGCTGACTTTTACACTGAACACGGCATTCGGGTGAGTTTTCTTAAAATTAATTACTTA
Protein: 1372 (aa)
MTNDVCLVLEDGTVLPGQKFGAHNDVDGEVVFQTGMVGYVESMTDPSYHGQILVLTYPLIGNYGVPTETEFDEHQLIKHFESNDKIWISGLIVGELCESPSHWRLKYKLAEWMQKHNVVGISGIDTRALTKKIRENGTVLGKIIQQPSGPFPGVEFKDQNERNLVAEVSTKSIKTYNPKGSPRICAIDCGLKLNQIRCFVKRGARVDVVPWDHELKPKDFDGLFLSNGPGDPVMCHKTVKNVQQVLASSEAKPIFGICLGHQLLSTAVGCKTYKMKYGNRGHNLPALHHGTNRCFMTSQNHGFAVDAATLDSKNWEPLFTNLNDNSNEGIVHKEKPYFSVQFHPEHTAGPEDLECLFDVFLESVNDHMKGVSGLSIKNRLIKKLVYVPKIVFDTARPKKVLILGSGGLSIGQAGEFDYSGSQAIKAMQEEKIQTVLINPNIATVQTSKGLADKVYFLPLTPEYVEQVIKAERPSGILLSFGGQTALNCGVELDKKGILKKYKISVLGTQISSIVETEDRKIFAERVNEIGEKVAPSVCACSVAEALKAAEKIGYPVMARAAFSLGGLGSGFASNPEELKALAHQALAYSTQLIIDKSLKGWKEVEYEVVRDAYDNCITVCNMENVDPLGIHTGESIVVAPSQTLSNREYNMLRTTALKVIRHFNIVGECNIQYALNPNSEEFYIIEVNARLSRSSALASKATGYPLAYVAAKLSLGVPLPEIRNSVTGVTTACFEPSLDYCVVKIPRWDLAKFIRVSKNIGSSMKSVGEVMAIGRKFEEAFQKALRMVDESVNGFDPTLKPVNDEELKTPTDKRMFVLAAALKAGYSVEKLYDMTKIDRWFLSKMKNIIDITIELEKLNCAISEDLLRQAKNHGFSDKQIAKFIKGSELAVRKQRRECDIIPFVKQIDTVAGEWPASTNYLYLTYNASAHDVEFTEQMTMVIGSGVYRIGSSVEFDWCAVGCLRELRNLGKKTIMVNYNPETVSTDYDMSDRLYFEEISFETVMDIYTNEDPEGIILSMGGQLPNNIAMDLHRQHARILGTSPESVDSAENRFKFSRLLDRKGILQPRWKELTNLQSATDFCEEVGYPCLVRPSYVLSGAAMNVAYSHQDLETYLHAASVVSKDHPVVISKFLTEAKEIDVDAVADDGEILCLAVSEHVENAGVHSGDATLVTPPQDINKETLEKIKGIAKDIAALLDISGPFNMQLIAKNNELKVIECNVRVSRSFPFVSKTLNHDFVAMATRVIIGEKVAPVDVLFGDNSKVGVKVPQFSFSRLAGAEVTLGVEMSSTGEVACFGDNRYEAYLKAMMSTGFQMPKKSILISVGSIRHKNELLTSIRDLSRMGYKLYASMGTADFYTEHGIRVSFLKINYL
Type | Start | End | Length |
CDS |
53676 |
54363 |
688 |
CDS |
54429 |
55328 |
900 |
CDS |
55386 |
56536 |
1151 |
CDS |
56862 |
58106 |
1245 |
CDS |
58202 |
58333 |
132 |
intron |
54364 |
54428 |
65 |
intron |
55329 |
55385 |
57 |
intron |
56537 |
56861 |
325 |
intron |
58107 |
58201 |
95 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001848709 |
carbamoyl-phosphate synthase large chain [Culex quinquefasciatus] gb|EDS28904.1| carbamoyl-phosphate synthase large chain [Culex quinquefasciatus] |
0.0 |
InterPro |
IPR005481 |
Carbamoyl-phosphate synthase, large subunit, N-terminal |
|
InterPro |
IPR005479 |
Carbamoyl-phosphate synthetase large subunit-like, ATP-binding domain |
|
InterPro |
IPR013815 |
ATP-grasp fold, subdomain 1 |
|
InterPro |
IPR011607 |
Methylglyoxal synthase-like domain |
|
InterPro |
IPR016185 |
Pre-ATP-grasp fold |
|
InterPro |
IPR005480 |
Carbamoyl-phosphate synthetase, large subunit oligomerisation domain |
|
InterPro |
IPR002474 |
Carbamoyl-phosphate synthase, small subunit N-terminal domain |
|
InterPro |
IPR006275 |
Carbamoyl-phosphate synthase, large subunit |
|
InterPro |
IPR011761 |
ATP-grasp fold |
|
InterPro |
IPR005483 |
Carbamoyl-phosphate synthase large subunit, CPSase domain |
|
InterPro |
IPR013816 |
ATP-grasp fold, subdomain 2 |
|
InterPro |
IPR017926 |
Glutamine amidotransferase type 1 |
|
InterPro |
IPR006274 |
Carbamoyl-phosphate synthase, small subunit |
|
Gene Ontology(BP) |
GO:0006807 |
nitrogen compound metabolic process |
|
Gene Ontology(BP) |
GO:0008152 |
metabolic process |
|
Gene Ontology(BP) |
GO:0006543 |
glutamine catabolic process |
|
Gene Ontology(BP) |
GO:0070409 |
carbamoyl phosphate biosynthetic process |
|
Gene Ontology(MF) |
GO:0046872 |
metal ion binding |
|
Gene Ontology(MF) |
GO:0005524 |
ATP binding |
|
Gene Ontology(MF) |
GO:0003824 |
catalytic activity |
|
Pfam |
PF02787.14 |
Carbamoyl-phosphate synthetase large chain, oligomerisation domain |
4.4e-43 |
Pfam |
PF08443.6 |
RimK-like ATP-grasp domain |
2.4e-06 |
Pfam |
PF07722.8 |
Peptidase C26 |
1.1e-06 |
Pfam |
PF01965.19 |
DJ-1/PfpI family |
0.028 |
Pfam |
PF02222.17 |
ATP-grasp domain |
5.1e-18 |
Pfam |
PF00988.17 |
Carbamoyl-phosphate synthase small chain, CPSase domain |
5.5e-51 |
Pfam |
PF00289.17 |
Carbamoyl-phosphate synthase L chain, N-terminal domain |
2e-37 |
Pfam |
PF13535.1 |
ATP-grasp domain |
2e-33 |
Pfam |
PF00117.23 |
Glutamine amidotransferase class-I |
4e-44 |
Pfam |
PF07478.8 |
D-ala D-ala ligase C-terminus |
5e-16 |
Pfam |
PF02655.9 |
ATP-grasp domain |
2.8e-08 |
Pfam |
PF02786.12 |
Carbamoyl-phosphate synthase L chain, ATP binding domain |
4.8e-107 |
Pfam |
PF02142.17 |
MGS-like domain |
1.4e-06 |
Pfam |
PF01071.14 |
Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain |
0.00018 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
A. gambiae |
AGAP000300 |
A. mellifera |
GB16437-PA |
P. vanderplanki |
Pv.15659 |
H. melpomene |
HMEL012118-PA |
P. humanus |
PHUM000420-PA |
M. musculus |
ENSMUSG00000025991 |
D. melanogaster |
FBgn0003189 |
T. castaneum |
TC012392 |
M. musculus |
ENSMUSG00000013629 |
H. sapiens |
ENSP00000264705 |
H. sapiens |
ENSP00000384510 |
A. aegypti |
AAEL009475 |
P. vanderplanki |
Pv.15661 |
C. quinquefasciatus |
CPIJ007153 |
H. sapiens |
ENSP00000406136 |
S. invicta |
SI2.2.0_08333 |
H. sapiens |
ENSP00000233072 |
D. plexippus |
DPOGS205071PA |
B. mori |
BGIBMGA006816-TA |
A. aegypti |
AAEL009490 |
H. sapiens |
ENSP00000402608 |
N. vitripennis |
NV12337-PA |
S. invicta |
SI2.2.0_03623 |