MidgeBase gene description page [Pn.04992]

Outline

Gene ID    Pn.04992
Type       Protein coding gene
Scaffold   PnScaf4273
Start      1982
End        5756
Direction  +

Sequence

Transcript: 3723 (bp)

 ATGCCCGCACGCTCCGAAGATCTAGCGAATGACACGCGACGTAACAGCGCGGTGCCCTCATCCGTGATGGACCTAGACACGAATGAGCGTCACTTCGCACAGCCGCCGCGCGTCGTTCAGGACGACATCTGCATCACAGGCTTCAGCGGTCGTCTGCCCGAGAGCTCCAGCATCGAGGAGTTCAAGCGGAACCTCTTCGACGGCGTCGACATGGTGAACGACGATCCGCGCCGGTGGGCGGCGGGCCTGCACGACCTGCCCACGCGCACCGGCAAAATCAAAACCGAGGACCTGCAGAACCTCGACGCCGCCTTCTTCAAGCTCCACCAGAAGCAGGCCGAGTGCATGGACCCGCAACTCCGCATGCTCCTCGAGTGCACCTACGAGGCCATCGTCGATGCCGGCATCAACCCGCAGGACATCCGCGGCTCGCGCACCGGCGTCTACGTCGGCGTCTCCAACTCGGACAGCGAGGAGTTCTGGTGCCGCGACCCCGACGTCGTCAACGGCTACGGCCTCACCGGATGTGCGCGCTCCATGTTCGCCAACCGCATCTCCTTCACGTTCGACTTCAAGGGCCCGAGCTATGCCGTCGACACGGCCTGCTCCTCATCGCTCTTCGCCATGGACCACGCGTTCAGGGACATCAAGTCGGGCCGCACCGATGCCGCCATTGTCGCCGGCGTCGGGCTCATCTTCAAGCCGACCATGTCGCTGCAGTTCAAGCGCCTGAACATGCTCAGCCCGGACGGAATGTGCAAGGCCTTCGACGAGAGCGGCAACGGCTACGTCCGCTCCGACGGCTGCGTCGTGACCTTCCTGCAAAAGTCGAAGGACTCGCGTCGCATCTACGCGACCGTGCTGAACGTCCGCACGAACACGGACGGCGCCAAGGACCAGGGCATCACGTTCCCGAACGGACAGATGCAGAACCGGCTGATTCGCGAGACCTACGAGGAGATCGGCCTCGATCCGCGCGAAGTCACGTACGTGGAGGCGCACGGGACGGGAACGAAGGTCGGCGACCCGCAGGAGGTCAACTCGATCTGCGACTTCTTCTGCAAGGACCGCTCGACGCCGCTCCTCATTGGCAGCGTGAAGTCGAACATGGGACACTCGGAGCCGGCGTCGGGCGTGTGCTCGGTCGCGAAAATCCTCATCGCTATGGAGGCGGGCATGATTCCCGCCAATTTGCACTTCAAAAACCCCAATCCCGACCTCTACGGCATCATGGACGGACGAATGAAGGTCGTCGACAAGAACACGCCGTGGAGTGGCGGCATTGTCGGCCTCAACAGCTTCGGCTTCGGCGGCGCCAACGCCCACGTCATCCTCAAGTCCAACCCGAAGCCGAAGGCGATCGGCTCGATCGGCGAAATACCTCGACTCGTCGCCTGTTCGGGACGCACCGAGGAAGCGGTGGAGCAGCTTTTGGCCGAAATCGAGGCTAATCGCAACGACGAGGAGTTTCTCGGCCTCATCAACGAGATCCATGCGAAGAACATTCCGATGCACCATTTCCGCGGCTACACGGTGATGGGAGCCGACGGCAGCAACCAGCGCGAGGTTGGCGAGCTGCGCGACGATAAGCGCCCGGTGTGGTTCGTCTACTCGGGCATGGGCAGCCAGTGGGCGAGTATGGCGCGCGAGATGATGCGCGTCGACGTCTTCCGGCAAGCGATCACAAAATGCGCCGACGTGCTGAAACCGGAGGGCGTCGATCTCATCGACATCCTCACCAAGTCGGACGAGGCGCGCTTCGACAACATCCTCAACTCTTTCGTGTCGATTGCGGCCGTGCAGGTCGCGCTCACGGACGTCCTCACGAGTGTCGGCATCAGCCCTGACGGCATGGTAGGCCACTCGGTCGGCGAACTCGGCTGCGCCTATGCCGACGGCTGCTTCACCGCGGAGCAGACCGTGCTGGCCGCGTACTGGAGAGGCCGCTCCATCCTCGACACCGACCTCATCGCCGGCCAGATGGCGGCCGTCGGTCTCA
GCTGGGAGGACTGCCAGAAGCGCCTTCCGAAGGACGTCATTGCGGCGTGCCACAACGGCGCCGACAGCGTCACCATCTCCGGGCCGGTCGCGTCGGTCGAGAGGGTCGTGAAGGAGCTGACGGCGGAGGGCATTTTCGCGAAGGCGGTCAAGTCGAGCGGAATCGCCTTCCACAGCAAGTACATCGCCGAGGCGGCGCCGAAGCTGAGAAAGTCCCTCGACAAGATCATCCCGACGCCGAAGGCGCGCTCGGAGCGATGGATCAGCTCCTCGATTCCCGAGTCCGCGTGGTCCGCGCCACTCGCCATGCACAGCTCCTCCGCCTACCACGTCAACAACTTGCTCTCGCCGGTGCTCTTTCACTCGGCCATCCAGCACGTCCCGAGCAACGCGATCTGCGTCGAAATCGCCCCGCACGGCTTGCTGCAGGCCATTCTGAAGCGCTCGCTCGGCACCGACTGCACCAACATCAGTCTCATGAAGCGACAGCATGCCGACAACGTGCTGTTTATGCTGTCGAATGTCGGCAAGCTCTACGCCGCCGGCGCGCAGCCGCAACTGTCGAAGCTCTACCGGCCGGTGACCTTCCCGGTCGGTCGCTCCACTCCGATGCTCAGCTCGAAGGTCGGCTGGGATCACTCGCAAAAGTGGATTCTCCTGGACGTGGGCGGCGAGAGCTCGGGCGAGACTGTTGTCGAGGTGAATTTGGGCAAAGAGGGCGACGCCTTCCTCGCCGGTCACGCCATCGACGGCCGCGTGCTCTTCCCGGCCACCGGCTACATGACGCTCGCGTGGAAGACCTTCGCGAAAATGAAGGGAACGACCCACGACCGACTGCCGGTGGTGCTCGAGAATGTCGTGTTCCATCGTGCGACCATTCTGCCGAAGGACGGATCCGTGAAGTTCGGGCTGAACTTCTTCGATGGCAGCGGGAGGTTTGAGATCTGTGAGGGCGGCTCGCTGGCCGTCTCGGGAACGATCCGAGTGCCCGAGAGCATCGAGAGCGAGGAGCTCCCGCTCGATCCGATCGACTGTGACACGAAGTCGGGGCTGCTGCTGGAGCGCGGCGACATTTACAAGGAGCTGAGACTGCGCGGCTACGACTACGGCGGACTTTTCCGCGGCATCAACAAGAGCGACTCGCGCGCGAACGCCGGCGAGCTCGAGTGGACCGGCAACTGGGTGAGCTTCATGGACACGATGCTGCAGTTCTCGATCCTGGGCAAGGACCTGCGCGAGCTCTACCTGCCGACGAGAATCGAGCGAGTCGTGCTGANNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNACACCGTCCTCGAGAACAGCGGCGGTGCGCTGAAGCTGAAAGTGGTCGAGTACCTCGACGACCGGCTTGCCGAGTCGAGCAACTCGCTGCTCATCCAGAGCGTCGTCGAGAGCGAGCCGACGCTGGCGAGCGACGTCGCAATCGTCACCGTTCTGAAGCCCGACACCTACCAAAACTTTGTCGGCGAGTCGGGCGTGCGTGTGGTCGTGAAGGACGCCACGAAGGGACCGGTCGAGAGCAACTCGTCCTCGCAAACC 

Protein: 1241 (aa)

 MPARSEDLANDTRRNSAVPSSVMDLDTNERHFAQPPRVVQDDICITGFSGRLPESSSIEEFKRNLFDGVDMVNDDPRRWAAGLHDLPTRTGKIKTEDLQNLDAAFFKLHQKQAECMDPQLRMLLECTYEAIVDAGINPQDIRGSRTGVYVGVSNSDSEEFWCRDPDVVNGYGLTGCARSMFANRISFTFDFKGPSYAVDTACSSSLFAMDHAFRDIKSGRTDAAIVAGVGLIFKPTMSLQFKRLNMLSPDGMCKAFDESGNGYVRSDGCVVTFLQKSKDSRRIYATVLNVRTNTDGAKDQGITFPNGQMQNRLIRETYEEIGLDPREVTYVEAHGTGTKVGDPQEVNSICDFFCKDRSTPLLIGSVKSNMGHSEPASGVCSVAKILIAMEAGMIPANLHFKNPNPDLYGIMDGRMKVVDKNTPWSGGIVGLNSFGFGGANAHVILKSNPKPKAIGSIGEIPRLVACSGRTEEAVEQLLAEIEANRNDEEFLGLINEIHAKNIPMHHFRGYTVMGADGSNQREVGELRDDKRPVWFVYSGMGSQWASMAREMMRVDVFRQAITKCADVLKPEGVDLIDILTKSDEARFDNILNSFVSIAAVQVALTDVLTSVGISPDGMVGHSVGELGCAYADGCFTAEQTVLAAYWRGRSILDTDLIAGQMAAVGLSWEDCQKRLPKDVIAACHNGADSVTISGPVASVERVVKELTAEGIFAKAVKSSGIAFHSKYIAEAAPKLRKSLDKIIPTPKARSERWISSSIPESAWSAPLAMHSSSAYHVNNLLSPVLFHSAIQHVPSNAICVEIAPHGLLQAILKRSLGTDCTNISLMKRQHADNVLFMLSNVGKLYAAGAQPQLSKLYRPVTFPVGRSTPMLSSKVGWDHSQKWILLDVGGESSGETVVEVNLGKEGDAFLAGHAIDGRVLFPATGYMTLAWKTFAKMKGTTHDRLPVVLENVVFHRATILPKDGSVKFGLNFFDGSGRFEICEGGSLAVSGTIRVPESIESEELPLDPIDCDTKSGLLLERGDIYKELRLRGYDYGGLFRGINKSDSRANAGELEWTGNWVSFMDTMLQFSILGKDLRELYLPTRIERVVLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTVLENSGGALKLKVVEYLDDRLAESSNSLLIQSVVESEPTLASDVAIVTVLKPDTYQNFVGESGVRVVVKDATKGPVESNSSSQT 

Type    Start  End   Length
CDS     1982   5690  3709
CDS     5740   5753  14
intron  5691   5739  49
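
The lengths in the table above are consistent with 1-based, inclusive coordinates (Length = End - Start + 1), and the two CDS segments together account for the transcript and protein lengths reported under Sequence. A minimal Python sketch of that check follows; the names features and span_length are hypothetical and introduced only for illustration, and the inclusive-coordinate convention is an assumption inferred from the numbers, not something the page states.

```python
# Feature rows copied from the table above: (type, start, end, reported length).
features = [
    ("CDS", 1982, 5690, 3709),
    ("CDS", 5740, 5753, 14),
    ("intron", 5691, 5739, 49),
]


def span_length(start, end):
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Each reported length should equal End - Start + 1.
for feature_type, start, end, reported in features:
    assert span_length(start, end) == reported, (feature_type, start, end)

# Summing the CDS segments gives the spliced coding length in bp;
# dividing by 3 gives the number of codons (amino acids).
cds_total = sum(span_length(s, e) for t, s, e, _ in features if t == "CDS")
protein_length = cds_total // 3

print(cds_total, protein_length)
```

With the values on this page, the CDS total comes to 3723 bp and the codon count to 1241, matching the transcript and protein lengths listed above.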

Auto annotation result

Program/Analysis   Accession     Description                                                                             Score/Expectation
BLASTP/NCBI-nr     XP_001658180  fatty acid synthase [Aedes aegypti] gb|EAT47728.1| fatty acid synthase [Aedes aegypti]  0.0
InterPro           IPR014031     Beta-ketoacyl synthase, C-terminal
InterPro           IPR016035     Acyl transferase/acyl hydrolase/lysophospholipase
InterPro           IPR014030     Beta-ketoacyl synthase, N-terminal
InterPro           IPR014043     Acyl transferase
InterPro           IPR020807     Polyketide synthase, dehydratase domain
InterPro           IPR020801     Polyketide synthase, acyl transferase domain
InterPro           IPR016038     Thiolase-like, subgroup
InterPro           IPR018201     Beta-ketoacyl synthase, active site
InterPro           IPR020841     Polyketide synthase, beta-ketoacyl synthase domain
InterPro           IPR016039     Thiolase-like
InterPro           IPR001227     Acyl transferase domain
InterPro           IPR016036     Malonyl-CoA ACP transacylase, ACP-binding
Gene Ontology(BP)  GO:0008152    metabolic process
Gene Ontology(MF)  GO:0005515    protein binding
Gene Ontology(MF)  GO:0016740    transferase activity
Gene Ontology(MF)  GO:0003824    catalytic activity
Pfam               PF00109.21    Beta-ketoacyl synthase, N-terminal domain                                               7.5e-68
Pfam               PF00698.16    Acyl transferase domain                                                                 2.7e-72
Pfam               PF00108.18    Thiolase, N-terminal domain                                                             0.0036
Pfam               PF02801.17    Beta-ketoacyl synthase, C-terminal domain                                               1.2e-40
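
The Pfam expectation values above span many orders of magnitude, so it can help to separate strongly supported domains from marginal ones. The Python sketch below does this with a cutoff of 1e-5; that threshold is an arbitrary assumption chosen for illustration, not a value used by MidgeBase, and the names pfam_hits, E_VALUE_CUTOFF, strong and weak are hypothetical.

```python
# Pfam rows copied from the table above: (accession, description, E-value).
pfam_hits = [
    ("PF00109.21", "Beta-ketoacyl synthase, N-terminal domain", 7.5e-68),
    ("PF00698.16", "Acyl transferase domain", 2.7e-72),
    ("PF00108.18", "Thiolase, N-terminal domain", 0.0036),
    ("PF02801.17", "Beta-ketoacyl synthase, C-terminal domain", 1.2e-40),
]

# Assumed cutoff for illustration only; smaller E-values mean stronger hits.
E_VALUE_CUTOFF = 1e-5

# Hits at or below the cutoff, ordered from most to least significant.
strong = sorted(
    (hit for hit in pfam_hits if hit[2] <= E_VALUE_CUTOFF),
    key=lambda hit: hit[2],
)
# Hits above the cutoff, kept in table order.
weak = [hit for hit in pfam_hits if hit[2] > E_VALUE_CUTOFF]

for accession, description, e_value in strong:
    print(f"strong  {accession}  {description}  {e_value:.1e}")
for accession, description, e_value in weak:
    print(f"weak    {accession}  {description}  {e_value:.1e}")
```

Under this assumed cutoff, only the thiolase N-terminal domain hit (0.0036) is flagged as weak; the other three Pfam hits pass.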

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.02651
Pn.11656
Pn.06426
Pn.03933
Pn.04187
Pn.11493
Pn.08717
Pn.06151
Pn.15816
Pn.00600
Pn.01298
Pn.14865
Pn.04993

Orthologous genes

Species Gene ID
P. vanderplanki Pv.14921
B. mori BGIBMGA008581-TA
B. mori BGIBMGA013153-TA
S. invicta SI2.2.0_01047
S. invicta SI2.2.0_07564
D. plexippus DPOGS212360PA
C. quinquefasciatus CPIJ008367
A. aegypti AAEL002237
S. invicta SI2.2.0_11351
P. vanderplanki Pv.12209
P. vanderplanki Pv.14772
A. aegypti AAEL002204
P. vanderplanki Pv.01475
D. melanogaster FBgn0040001
S. invicta SI2.2.0_11327
S. invicta SI2.2.0_10014
S. invicta SI2.2.0_00900
S. invicta SI2.2.0_08502
S. invicta SI2.2.0_00383
S. invicta SI2.2.0_15127
S. invicta SI2.2.0_12057
S. invicta SI2.2.0_13910
S. invicta SI2.2.0_14664
P. vanderplanki Pv.10571
N. vitripennis NV22399-PA
N. vitripennis NV10926-PA
P. vanderplanki Pv.02828
P. vanderplanki Pv.12689
B. mori BGIBMGA013046-TA
P. vanderplanki Pv.03372
A. aegypti AAEL008160
S. invicta SI2.2.0_06311
S. invicta SI2.2.0_14684
P. vanderplanki Pv.00511
P. vanderplanki Pv.16863
P. vanderplanki Pv.14617
N. vitripennis NV14455-PA
P. vanderplanki Pv.00790
S. invicta SI2.2.0_05482
A. mellifera GB12198-PA
P. vanderplanki Pv.14852
T. castaneum TC007689
C. quinquefasciatus CPIJ005595
P. vanderplanki Pv.02964
P. vanderplanki Pv.02965
A. aegypti AAEL002227
P. vanderplanki Pv.05860
T. castaneum TC015399
D. plexippus DPOGS206960PA
H. melpomene HMEL005305-PA
B. mori BGIBMGA008582-TA
A. aegypti AAEL002228
S. invicta SI2.2.0_15862
S. invicta SI2.2.0_06588
B. mori BGIBMGA004655-TA
N. vitripennis NV17124-PA
A. mellifera GB16883-PA
T. castaneum TC015340
S. invicta SI2.2.0_10115
T. castaneum TC015339
S. invicta SI2.2.0_13311
H. melpomene HMEL008318-PA
S. invicta SI2.2.0_16052
N. vitripennis NV10927-PA
H. melpomene HMEL015510-PA
N. vitripennis NV10111-PA
S. invicta SI2.2.0_14578
S. invicta SI2.2.0_13320
P. vanderplanki Pv.14263
M. musculus ENSMUSG00000025153
B. mori BGIBMGA013047-TA
B. mori BGIBMGA008579-TA
H. melpomene HMEL015513-PA
P. vanderplanki Pv.10570
T. castaneum TC000238
P. vanderplanki Pv.12440
P. vanderplanki Pv.10251
A. gambiae AGAP009176
P. vanderplanki Pv.02966
A. aegypti AAEL001194
N. vitripennis NV14456-PA
A. gambiae AGAP008468
S. invicta SI2.2.0_00168
C. quinquefasciatus CPIJ003494
B. mori BGIBMGA008602-TA
S. invicta SI2.2.0_09698
B. mori BGIBMGA013281-TA
S. invicta SI2.2.0_01792
P. vanderplanki Pv.00897
P. vanderplanki Pv.00898
P. vanderplanki Pv.07928
S. invicta SI2.2.0_16094
H. melpomene HMEL004144-PA
S. invicta SI2.2.0_15874
D. melanogaster FBgn0042627
B. mori BGIBMGA013151-TA
P. vanderplanki Pv.11254
T. castaneum TC015400
H. sapiens ENSP00000304592
D. plexippus DPOGS207082PA
A. gambiae AGAP001899
C. quinquefasciatus CPIJ003495
S. invicta SI2.2.0_02241
T. castaneum TC015337
S. invicta SI2.2.0_06292
S. invicta SI2.2.0_05241
D. melanogaster FBgn0027571
T. castaneum TC011522
S. invicta SI2.2.0_14661
S. invicta SI2.2.0_03932
S. invicta SI2.2.0_08063
N. vitripennis NV50551-PA
P. vanderplanki Pv.09028
P. humanus PHUM080440-PA
A. aegypti AAEL002200
P. humanus PHUM448390-PA
P. humanus PHUM565830-PA
P. vanderplanki Pv.03790