MidgeBase gene description page [Pn.02651]
Outline
Gene ID | Pn.02651 |
Type | Protein coding gene |
Scaffold | PnScaf2280 |
Start | 204 |
End | 5812 |
Direction | - |
Sequence
Transcript: 4944 (bp)
ATGTCGAAAATAAAGGAGATCGGTGATGATGTGGTGATTTCGGGTGTGGCGGGTCGCTTCCCAAACTCGCGCAATCTGCTTGAGTTCGCCAGCAATCTCTACAACAAAGTCGACATGGTGGACGAGGAGGAGAAGCGGTGGCGGCACGTAAATCCGGAATGTCCCAAGCGGATGGGAAAAATCGGCGACATCGAGAAGTTCGACGCGTCCTTCTTCGGCGTGAGCTACCGGCAGGCTCGCACGATGGACCCGCAATGTCGGATGCTGCTGGAGCACGCATACGAGGCCGTCCTCGACAGCGGCATGTGCCCGAAGCAGCTGGCTGGAACGAAGACCGGCGTGTTCGTCGGCATGAGCTACAACGAGTCGGAGAAGAAGTGGATCTACGAGAACATCTCGAAGGAGGGCTTCGGCATCACCGGGTCAGCGAAGGCTCTGGTCGCGAATCGAATTTCTTTTGCCCTCGGCGTCACCGGTCCGAGTTTCGTTTGCGACACCGCGTGTAGCTCGTCGATGTACGCACTCGACATCGCGTACAAGATGATTTCTTCGGGCGAGTGCGACGCTGCCATCGTTGGCGGCACAAACCTCTGCCTGCATCCGCACCTGAGCTACCAGTTTGTGAAGCTCGGCGTCGTGTCCAAGGACGGATTTTGTCGGCCGTTTGACAAAGACGCGAGCGGTTTCACTCGGGCCGAGACCAGTTGTGCGCTGTTCCTGCAGCGGCGTATCGACGCGAAGCGAGTCTACGCGACCATTTTACACTCGAAGACGAACTGCGATGGCTTCAAGGAGGAGGGCATCAACTTTCCGAGCAGTAAGAGGCAGGCTGAGCTGCTGACCGAGTTCTACGAAGAGGTCGACATAAGGCCGAGTGATCCGAGCATCGGCTACTTCGAGGCTCACTGCACTGGGACGAAGGTCGGCGACGCAGAGGAGTGTCGAGCGATAGACGAAGTGATGTGCAAGAACCGCAAAGAGCCGCTGCTTGTCGGCTCGGTGAAGAGCAACATGGGACACTCTGAACCCGGTTCGGGAGTTTGTTCGATCAGCAAAGTCATTTTGGCTTTCGAAACAGGCAAGATTGCTCCCAATATCAACCTGAAAGCAGTCAAAGAGGAGATCACTGGTGTAGTTGAAGGTCGGATGAAGGTCGTGACCGAAACTCAAGATCTTCTGCAGCCCTTCGTGCCCGTCAACTCCTCAGGCTTCGGAGGAGCCAACGCACACGCTCTGCTGCAAATAAACTCCAAAGCGAAGGTCAATAAAGGAATTCCGAGCGACAAGATTCCGCGGCTTGTTCTCTGGTCGGGAAGGACCGAAGAGGCGGTTGATGCAATCTTCGACAGTGTCACACGGCAGCCACTTGATGCCGAGTTCGTTTCGCTGATTCAGAGCTCACAAGCAACCTCCAACAAGGCGAATCTCTACAGGGGCTTTGGTGTGTTCACGCAGCCTGAAAATCACGAAAATAACGCGGTTTGTAGTTTGCGCGAAGTGCAGCACTTCAACGGTCAAAGGAGGCCGCTGGTGTGGATCTACAGCGGCATGGGATCGCAGTGGAAAGCGATGGGAGCTGCTTTGATGAAGATCCCAACTTTTGCTTGTGCGATCGACAAATGTCACGGCATTCTGCTGGCAAAGAATGTAAATTTGAAGGAGATTTTAACCTCGACCGATTCGAGCACGTTCGACAACATTTTGCACTCATTTGTCGGCATCGCGGCCGTTCAAGTCGGCCTCACTGACGTTCTCAAGGCGGTTGGCTTGGAGCCCGACTTCGTCGTTGGCCATTCGCTCGGCGAACTCGGCTGCGCCTACGCTGATGGCTGCTGCAGTGCCGAGGAGATGATTCTGTCCGCTTACTCGCGTGGCATGGTGAGTTTCGAGACAAAAGTGGTTTTCGGCTCGATGGCGGCCGTCGGTTTGGGCTACAAGCAGCTCAAAGACATCGTGCCTGATGGCGTTGAGGTCGCCTGCCACAACAGCTCTGAATCGAGCACGATTTCGGGACCTGCCGACAAAGTTGCAGCATTTGTCACTGAGCTGAAAAAACAGGGAATTTTCGCGAAAGAAGTGGCCTGCTCGAACATTCCCTATCATAGCTCTTACATCGCCGACATGGGCGCAAATCTGCTTGCTCGTCTTCAAAAAATCCACAAGACCCCGGCGAAGCGCTCACCACGCTGGATCAGCTCGAGCATTCCCGAGTCGCAGTGGCATCTGCCAAGCAGTCAGTTCACATCGGCCGAGTATCAAACCAACAATGTTCTCAATCCAGTGCTCTTTGAAGAAGCCATTTCGCATCTTCCAGCGAACGCTGTGTGCATCGAAATCGCTCCTTATGGTCTGCTGCAGGCGATCTTAAAGCGCGCTCTAAAGGATTCGGTGCACATCTCTTTAACAAAGAAGGGGGAAGCGAACAACGACCAGTTTCTGTTGGGTGCTCTGGGAAGAATCTACTGCAACGGCTTCGACATGGACATCTCTACGCTCTATCCACCAATCAAGTATCCAGTTTCGCGAGGAACTCCAATGATTGCACCGCACATCAAGTGGGATCACAGCGAAGACCATTACGTTATGCGCTTCGAGGACTGCTCGGAAGGAAAGAGCGCAGAACGCAAGGTTCCGATTTCATTGACCGACGTCGAGTACGAGTTCGTGCAAGGGCATTGCATTGACGGTCGTTGTCTGTTTCCCGCTACTGGCTACATCTTTCTGGCGTGGGAAACCTTTGCGATGACCAAAGGAAAGATGTACTTCGACATGAGCGTCGAGTTCGAGGATTTGAAGTTTTTGCGAGCGACTTCACTGTCGAAGGACGTGGAAGTGGAGTTTACGATCGTCATCCATCCGGGCAGCGGAAAGTTTGAGATTTCCGAGGGAAGTTCGACGCTCGTGACTGGTTTTATAAAGCAGGTCGAAAATCCAAAGCTGAGGGATGTTGAAAATGTGACAAATGACAAAATACTCAGGTCTAAGGACTTCTACAAGGAGCTTCGGCTGCGCGGCTATCACTACACTGGCATCTTCCAGTCGGTCGTCGAGGGAAGCTTCGACGGCAACTATGGCAAGATTCGATGGGACTCCAACTGGATTGCCTTCCTCGACTGCCTCTTGCAGCTGAAAATCATTGGAAAAGACACGCGGTCGCTTGTCCTCCCGACCGCCATTCAAAAGTTCACAATTCACACGAGCAAGCACCTGGAAATGCTCGAAAACTTCTCTGAAGGCGATGAGATCTTCTTCGAAGCTCGCATTAGCGATGAGCTGAAGATCATTAGATGCGGCGGAGTCGAAATCGTGAATCTCCAGTCGAGTTCGGTGGGAAGAAGGAAGGCGCCTGGAGTGCCGGTTATTGAATCTTACGAGTTTGTTCCGTACGACTGTGAAGACACAGTGCTGAGTGTTGAGAACGCCACTCGTATAATTTCGCAGCTGCTGATAGAAAATCTCCAAACTACGAAGGTGAAGGGTGTTGAAATTGACACAATCGAAGGAGTTGAGACAATTTTGGGTGGACTCAAGAGTGCTTTGATGGATCTTCCTCTGATAACTGCGGAGATGACGCTTTTGAGTGCGAGAAATTCTGAAGATTTAGGAGAAATTGCGGTAGCTAATGAAGTGATCAACTTGCACAAGGAATGTTCGGTTCTTGTTGCGTCGGCCGAAGTGAAAGTTGAATATGTCGAGAGTCTAGTTGAGGGCGGATTTTTGGTGATGAGAAATTTGAGGGACAATCAAAGTCTTCCTGAAGAGTTGACGAAAATTTCGACACTGCGACTCAAAGATGAGATCTTGACGGTCTTCCAGTGCAGAAAGGAGAAAAGTGATCGAACAAAGACGGCACTTTTGATCTCAAGCGACGACACATCGTTCGCATGGGCCGAAACGGCCAAGAAACTTCTGAAAGCCTCAAATCTTGTGATTGTCGCTGAAAATGATCCGACAAGTGGCGTCCTTGGCCTCGTCAATTGCCTCAGAAAGGAAGCCGACAGCAAGTCAGTGTCCTGCGTGCTTATCGACGACTCGAAAGCCCCAAAGTTTGACCTAAACTGCGAGTTTTACAAAAGTCAGCTCAAGAAAACGCTCGCAATCAACGTCTTTCGCAATGGCAGATGGGGAAGCTACCGACACTTGCCCATTGTGCAAAGTTTCGAAGAAAAATCGACGTTGGATCACTGCTTTGCCAATGCTCTCGTCAAGTCTGATTTGTCCTCCTTGAAGTGGCTGCGAGGACCTCTCAACACCGCCGAAAATGACGTCGTTTGCGTGCACTACGCACCGCTGAACTTCAAGGATGTCATGCTGGCCACTGGAAAGATCTCAGCCGAAATTTTCGTCAGCACACGTCTCGACCTCGAGTGCGTGCTCGGCTTCGAGTATTCGGGCGTGTCGAGCGGAGCCCAGCGTGTCATGGGCTTGATTCCGTCGGGTGCTTTGGCGACCTTCGTCAAAATGGATCCGGCGCTTACATGGCGGTGCCCGAATGAGTGGAGTTTGGCAGAGGCGGCGACAGTTCCGTGCGTCTACGCTACCGTCTACTACGCGTTCTTCGTCAAGACACAAGTCCAGCCGGGCAAGTCGATTCTCATTCACGCCGGCAGCGGTGGCATTGGACTGGCGGCGATTCGCGTCGCTTTCGCCTACGGTTTGGAGGTATTCACGACCGTCAGCACGGAGGAGAAGAAGAACTTTTTGCTCAACGAATTTCCTCAACTCAAGCGCGAAAATATCGGCAATTCACGCGACACTTCCTTCGAGGACATGATTGCAGTGCGAACGAACGGCAAGGGCGTCGATTACGTGCTGAACTCGCTCGCCGAGGAGAAGCTTCACGCATCAATCAGATGCTTGGGCAAGGGCGGAAAATTCCTGGAAATCGGAAAATTCGACATGGAGAAGGACACGAAAATCGGCATGAATGCTCGAGATCGACAT
Protein: 1648 (aa)
MSKIKEIGDDVVISGVAGRFPNSRNLLEFASNLYNKVDMVDEEEKRWRHVNPECPKRMGKIGDIEKFDASFFGVSYRQARTMDPQCRMLLEHAYEAVLDSGMCPKQLAGTKTGVFVGMSYNESEKKWIYENISKEGFGITGSAKALVANRISFALGVTGPSFVCDTACSSSMYALDIAYKMISSGECDAAIVGGTNLCLHPHLSYQFVKLGVVSKDGFCRPFDKDASGFTRAETSCALFLQRRIDAKRVYATILHSKTNCDGFKEEGINFPSSKRQAELLTEFYEEVDIRPSDPSIGYFEAHCTGTKVGDAEECRAIDEVMCKNRKEPLLVGSVKSNMGHSEPGSGVCSISKVILAFETGKIAPNINLKAVKEEITGVVEGRMKVVTETQDLLQPFVPVNSSGFGGANAHALLQINSKAKVNKGIPSDKIPRLVLWSGRTEEAVDAIFDSVTRQPLDAEFVSLIQSSQATSNKANLYRGFGVFTQPENHENNAVCSLREVQHFNGQRRPLVWIYSGMGSQWKAMGAALMKIPTFACAIDKCHGILLAKNVNLKEILTSTDSSTFDNILHSFVGIAAVQVGLTDVLKAVGLEPDFVVGHSLGELGCAYADGCCSAEEMILSAYSRGMVSFETKVVFGSMAAVGLGYKQLKDIVPDGVEVACHNSSESSTISGPADKVAAFVTELKKQGIFAKEVACSNIPYHSSYIADMGANLLARLQKIHKTPAKRSPRWISSSIPESQWHLPSSQFTSAEYQTNNVLNPVLFEEAISHLPANAVCIEIAPYGLLQAILKRALKDSVHISLTKKGEANNDQFLLGALGRIYCNGFDMDISTLYPPIKYPVSRGTPMIAPHIKWDHSEDHYVMRFEDCSEGKSAERKVPISLTDVEYEFVQGHCIDGRCLFPATGYIFLAWETFAMTKGKMYFDMSVEFEDLKFLRATSLSKDVEVEFTIVIHPGSGKFEISEGSSTLVTGFIKQVENPKLRDVENVTNDKILRSKDFYKELRLRGYHYTGIFQSVVEGSFDGNYGKIRWDSNWIAFLDCLLQLKIIGKDTRSLVLPTAIQKFTIHTSKHLEMLENFSEGDEIFFEARISDELKIIRCGGVEIVNLQSSSVGRRKAPGVPVIESYEFVPYDCEDTVLSVENATRIISQLLIENLQTTKVKGVEIDTIEGVETILGGLKSALMDLPLITAEMTLLSARNSEDLGEIAVANEVINLHKECSVLVASAEVKVEYVESLVEGGFLVMRNLRDNQSLPEELTKISTLRLKDEILTVFQCRKEKSDRTKTALLISSDDTSFAWAETAKKLLKASNLVIVAENDPTSGVLGLVNCLRKEADSKSVSCVLIDDSKAPKFDLNCEFYKSQLKKTLAINVFRNGRWGSYRHLPIVQSFEEKSTLDHCFANALVKSDLSSLKWLRGPLNTAENDVVCVHYAPLNFKDVMLATGKISAEIFVSTRLDLECVLGFEYSGVSSGAQRVMGLIPSGALATFVKMDPALTWRCPNEWSLAEAATVPCVYATVYYAFFVKTQVQPGKSILIHAGSGGIGLAAIRVAFAYGLEVFTTVSTEEKKNFLLNEFPQLKRENIGNSRDTSFEDMIAVRTNGKGVDYVLNSLAEEKLHASIRCLGKGGKFLEIGKFDMEKDTKIGMNARDRH
Type | Start | End | Length |
CDS |
207 |
223 |
17 |
CDS |
313 |
2553 |
2241 |
CDS |
2614 |
2843 |
230 |
CDS |
3146 |
4394 |
1249 |
CDS |
4451 |
4736 |
286 |
CDS |
4836 |
5334 |
499 |
CDS |
5391 |
5812 |
422 |
intron |
224 |
312 |
89 |
intron |
2554 |
2613 |
60 |
intron |
2844 |
3145 |
302 |
intron |
4395 |
4450 |
56 |
intron |
4737 |
4835 |
99 |
intron |
5335 |
5390 |
56 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
XP_001849863 |
fatty acid synthase S-acetyl transferase [Culex quinquefasciatus] gb|EDS30986.1| fatty acid synthase S-acetyl transferase [Culex quinquefasciatus] |
0.0 |
InterPro |
IPR014031 |
Beta-ketoacyl synthase, C-terminal |
|
InterPro |
IPR016035 |
Acyl transferase/acyl hydrolase/lysophospholipase |
|
InterPro |
IPR014030 |
Beta-ketoacyl synthase, N-terminal |
|
InterPro |
IPR014043 |
Acyl transferase |
|
InterPro |
IPR020801 |
Polyketide synthase, acyl transferase domain |
|
InterPro |
IPR013149 |
Alcohol dehydrogenase, C-terminal |
|
InterPro |
IPR016038 |
Thiolase-like, subgroup |
|
InterPro |
IPR020841 |
Polyketide synthase, beta-ketoacyl synthase domain |
|
InterPro |
IPR016039 |
Thiolase-like |
|
InterPro |
IPR011032 |
GroES-like |
|
InterPro |
IPR020843 |
Polyketide synthase, enoylreductase |
|
InterPro |
IPR001227 |
Acyl transferase domain |
|
InterPro |
IPR016040 |
NAD(P)-binding domain |
|
InterPro |
IPR016036 |
Malonyl-CoA ACP transacylase, ACP-binding |
|
Gene Ontology(BP) |
GO:0008152 |
metabolic process |
|
Gene Ontology(BP) |
GO:0055114 |
oxidation-reduction process |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Gene Ontology(MF) |
GO:0000166 |
nucleotide binding |
|
Gene Ontology(MF) |
GO:0008270 |
zinc ion binding |
|
Gene Ontology(MF) |
GO:0016747 |
transferase activity, transferring acyl groups other than amino-acyl groups |
|
Gene Ontology(MF) |
GO:0016740 |
transferase activity |
|
Gene Ontology(MF) |
GO:0003824 |
catalytic activity |
|
Gene Ontology(MF) |
GO:0016491 |
oxidoreductase activity |
|
Pfam |
PF00109.21 |
Beta-ketoacyl synthase, N-terminal domain |
2.2e-62 |
Pfam |
PF00698.16 |
Acyl transferase domain |
6.8e-63 |
Pfam |
PF00107.21 |
Zinc-binding dehydrogenase |
3.1e-14 |
Pfam |
PF00108.18 |
Thiolase, N-terminal domain |
0.0014 |
Pfam |
PF08545.5 |
3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III |
0.0019 |
Pfam |
PF02801.17 |
Beta-ketoacyl synthase, C-terminal domain |
1e-31 |
Pfam |
PF12242.3 |
NAD(P)H binding domain of trans-2-enoyl-CoA reductase |
0.075 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Gene ID |
Pn.11656 |
Pn.06426 |
Pn.03933 |
Pn.04187 |
Pn.11493 |
Pn.04992 |
Pn.08717 |
Pn.06151 |
Pn.15816 |
Pn.00600 |
Pn.01298 |
Pn.14865 |
Pn.04993 |
Orthologous genes
Species |
Gene ID |
P. vanderplanki |
Pv.14921 |
B. mori |
BGIBMGA008581-TA |
B. mori |
BGIBMGA013153-TA |
S. invicta |
SI2.2.0_01047 |
S. invicta |
SI2.2.0_07564 |
D. plexippus |
DPOGS212360PA |
C. quinquefasciatus |
CPIJ008367 |
A. aegypti |
AAEL002237 |
S. invicta |
SI2.2.0_11351 |
P. vanderplanki |
Pv.12209 |
P. vanderplanki |
Pv.14772 |
A. aegypti |
AAEL002204 |
P. vanderplanki |
Pv.01475 |
D. melanogaster |
FBgn0040001 |
S. invicta |
SI2.2.0_11327 |
S. invicta |
SI2.2.0_10014 |
S. invicta |
SI2.2.0_00900 |
S. invicta |
SI2.2.0_08502 |
S. invicta |
SI2.2.0_00383 |
S. invicta |
SI2.2.0_15127 |
S. invicta |
SI2.2.0_12057 |
S. invicta |
SI2.2.0_13910 |
S. invicta |
SI2.2.0_14664 |
P. vanderplanki |
Pv.10571 |
N. vitripennis |
NV22399-PA |
N. vitripennis |
NV10926-PA |
P. vanderplanki |
Pv.02828 |
P. vanderplanki |
Pv.12689 |
B. mori |
BGIBMGA013046-TA |
P. vanderplanki |
Pv.03372 |
A. aegypti |
AAEL008160 |
S. invicta |
SI2.2.0_06311 |
S. invicta |
SI2.2.0_14684 |
P. vanderplanki |
Pv.00511 |
P. vanderplanki |
Pv.16863 |
P. vanderplanki |
Pv.14617 |
N. vitripennis |
NV14455-PA |
P. vanderplanki |
Pv.00790 |
S. invicta |
SI2.2.0_05482 |
A. mellifera |
GB12198-PA |
P. vanderplanki |
Pv.14852 |
T. castaneum |
TC007689 |
C. quinquefasciatus |
CPIJ005595 |
P. vanderplanki |
Pv.02964 |
P. vanderplanki |
Pv.02965 |
A. aegypti |
AAEL002227 |
P. vanderplanki |
Pv.05860 |
T. castaneum |
TC015399 |
D. plexippus |
DPOGS206960PA |
H. melpomene |
HMEL005305-PA |
B. mori |
BGIBMGA008582-TA |
A. aegypti |
AAEL002228 |
S. invicta |
SI2.2.0_15862 |
S. invicta |
SI2.2.0_06588 |
B. mori |
BGIBMGA004655-TA |
N. vitripennis |
NV17124-PA |
A. mellifera |
GB16883-PA |
T. castaneum |
TC015340 |
S. invicta |
SI2.2.0_10115 |
T. castaneum |
TC015339 |
S. invicta |
SI2.2.0_13311 |
H. melpomene |
HMEL008318-PA |
S. invicta |
SI2.2.0_16052 |
N. vitripennis |
NV10927-PA |
H. melpomene |
HMEL015510-PA |
N. vitripennis |
NV10111-PA |
S. invicta |
SI2.2.0_14578 |
S. invicta |
SI2.2.0_13320 |
P. vanderplanki |
Pv.14263 |
M. musculus |
ENSMUSG00000025153 |
B. mori |
BGIBMGA013047-TA |
B. mori |
BGIBMGA008579-TA |
H. melpomene |
HMEL015513-PA |
P. vanderplanki |
Pv.10570 |
T. castaneum |
TC000238 |
P. vanderplanki |
Pv.12440 |
P. vanderplanki |
Pv.10251 |
A. gambiae |
AGAP009176 |
P. vanderplanki |
Pv.02966 |
A. aegypti |
AAEL001194 |
N. vitripennis |
NV14456-PA |
A. gambiae |
AGAP008468 |
S. invicta |
SI2.2.0_00168 |
C. quinquefasciatus |
CPIJ003494 |
B. mori |
BGIBMGA008602-TA |
S. invicta |
SI2.2.0_09698 |
B. mori |
BGIBMGA013281-TA |
S. invicta |
SI2.2.0_01792 |
P. vanderplanki |
Pv.00897 |
P. vanderplanki |
Pv.00898 |
P. vanderplanki |
Pv.07928 |
S. invicta |
SI2.2.0_16094 |
H. melpomene |
HMEL004144-PA |
S. invicta |
SI2.2.0_15874 |
D. melanogaster |
FBgn0042627 |
B. mori |
BGIBMGA013151-TA |
P. vanderplanki |
Pv.11254 |
T. castaneum |
TC015400 |
H. sapiens |
ENSP00000304592 |
D. plexippus |
DPOGS207082PA |
A. gambiae |
AGAP001899 |
C. quinquefasciatus |
CPIJ003495 |
S. invicta |
SI2.2.0_02241 |
T. castaneum |
TC015337 |
S. invicta |
SI2.2.0_06292 |
S. invicta |
SI2.2.0_05241 |
D. melanogaster |
FBgn0027571 |
T. castaneum |
TC011522 |
S. invicta |
SI2.2.0_14661 |
S. invicta |
SI2.2.0_03932 |
S. invicta |
SI2.2.0_08063 |
N. vitripennis |
NV50551-PA |
P. vanderplanki |
Pv.09028 |
P. humanus |
PHUM080440-PA |
A. aegypti |
AAEL002200 |
P. humanus |
PHUM448390-PA |
P. humanus |
PHUM565830-PA |
P. vanderplanki |
Pv.03790 |