MidgeBase gene description page [Pn.00600]

Outline

Link to gbrowse

Gene ID Pn.00600
Type Protein coding gene
Scaffold PnScaf660
Start 109
End 5987
Direction -

Sequence

Transcript: 4299 (bp)

 ATGGAAAGTTGCCCACTGTCGGCTGAAGCTCAACCAAAGCAAAAGAAAGTCTTTTCTCGCATCTACCCATCGACTCCCGACGATGAGATCGTTATTAGCGGCATTTCTGGGCGATTCCCCAGTTCTCGTAACATGCACGATTTTGCTCACAACCTTTACAATAAGATTGATATGGTTGATGATGATGAAAGACGGTGGAAGCACACCAATCCGGAAATTCCGCGACGCATGGGAAAGATTAATAATCTGGAGAAATTCGACGCTACCTTCTTTGGCGTACATTTCAAGCAAGCTCACACAATGGATCCACAATGTCGTATGCTTCTAGAACATGCTTACGAGGCGGTCCTCGATGCCGGCGTCAATCCGAGAACGTTGAGGGGAAGTCGCACGGGCGTCTACATCGGCGCTTGCTTCGCCGAGTCTGAAAAGACTTGGTTCTATGAAAAAGTGTCGACTGGCGGCTTCGGTATCACTGGGTGTGCGCGTGCCATGTTGGCGAACCGAATTTCCTTCACTTTAGGCTTAACCGGACCTTCTTTCTTGCTCGATACTGCTTGCTCCTCCTCGATGTATGCTCTCGATTGTGCCTTCAATGCTATTCGCTGTGGTGAAATTGATGCGGCTCTCGTTGGTGGCTCCAACTTGTTGCTTCATCCCTACGTAACGCTTCAATTCGCTCGTCTCGGAGTGCTCGCCCAGAACGGCTACTGTCGACCGTTCGACAAGGACGGCAGCGGCTACACCAGAGCGGAAGCCATTTGCGTGATGTACTTGCAGAAGGCGAAGAACGCGAAGCGCATCTACGCGAACCTCCTTTACTCGAAGACCAACTGCGACGGCTACAAGGAGGAGGGCATCACGTACCCGAGCGGCAAGATGCAGATGAAACTGCTGAAGGAGTTCTACGACGACCTCGACATTCCGCCGAGCACGGTCGACTATGTTGAGGCCCACAGCACGGGCACTATTGTCGGTGATCCTGAAGAGGTCAGAGCCATCGACACGGTTTACTGCACGGGCCGCGAGAAGCCGCTGCCAGTCGGCTCCGTAAAGTCCAACATGGGCCACTCGGAGAGCACAGCCGGCGCGTGCTCGATCGCGAAAATAATTCTAGCCTTCGAGACGCAGAAAATTCCGCCGAACATCAACTTCGAGTCGATTCGGCCGGGTCTCGAGGGACTGGAGTCGGGACGCTTGCGAGTCGTCGCGAACACCGAGACGCTCAGCGGGCCTCTCATCTCGATCAACTCGTTCGGCTTTGGCGGCGGCAACGCTCACGCGCTCTTCCGCCAGCATCCGAAGGAGAAGGTCAACAGCGGCATCCCGAAGGACGACATCCCGCGGCTGATTTTGTGGTCGAGCCGAACGGAGGAGGGCGTCAATTCGATCCTCGAGAGCGTCCTCAAGCAGCCGCTCGATGCCGAGTACGTCGGCCTGCTGCACAACTGCGTCGCCGGCGAGTCCTCCTCCGCCAACATCTACCGCGGCTTCGGCGTGTTCGCGCAGACCGAGCAGAGCGTCAACGCAACGTGCATCAACCGGGACGTGAAGCATTTCGTCGGATTCAAGCGGCCGGTGGTGTGGGTGTACAGCGGCATGGGCTCGCAATGGAACACCATGGGCAGTGACCTCATGCGCATTCCGATCTTCGCCGAGTCGATCGAGCGGAGCCACAAGATCCTGGAGAGGAAGGGCCTCAACCTCAAGGGCATCCTCACGTCGCAGGAGCCGAAGCTCTTCGACAACATCCTCAACTCGTTCGTGGGCATCGCGGCCATTCAGATCGCGCTCACCGACATCCTCAAGGCGCTCGAGCTCGAACCCGACTACATTATCGGCCACTCGGTGGGCGAGCTGGGCTGCGCCTACGCCGACGGCTGCTTTACCGCCGAGGAGATGATCCTGTCGTCGTACTCGCGCGGCATGGCCAGTCTCGAGACGAACGTCGTGGTCGGCTCGATGGCCGCCGTCGGCATGAGCTTCAAGAAGCTGCGCCCGATCATCCCCGACGGCATCGAGATTGCGTGCCACAACTCGGCCGACTCGTGCACCATTTCGGGACCGGCCGAAAATGTCGCGAAATACGTGGCGGAGCTGAAGGCGCAGAACATTTTCGCGAAGGAGGTGAATTGCTCGAAAATTCCGTACCACAGCTCGTACATCCAGGAGATGGGGCCGAACCTGCTTGCGCGCCTCACGGACGTCATCAAGTGCCCGAAGAAGCGCTCGCCACGCTGGCTGAGCTCGAGCGTGCCGAAGCCGCGGTGGGAGGAGGAGCAATTCCAGTACTCGTCGCCATTTTATCACACGAATAACTTGCTGAGCTCGGTTTTGTTCGAAGAGACCTCGAGCTTGCTGCCGCGAAATGCACTCACCATCGAGATTGCGCCGCACAGCTTGCTGCAGGCGATTCTGAGAAAGTCGATGCCTGAAGCGGTTCACATCGGGCTGACGCAGCGCGGCAACAAGAGCAACTCGACTTTCTTCTTGAGCGCTCTCGGATCAATTCATGAAAACGGAATTGACTTTGATGTGAGCAAGCTCTATCCTCCGGTTGACTTCCCGGTCTCAAGAGGCACACCGATGATCTCACCGCGCATTAAATGGGAACACAGCGAGGACTGGTTTGTGACTCGATTTGAGAGCCAAAAGTCGAATCGCAGTGGCGAACGTCATGTTGTCATCAATATTGCTGATCAAGAGTATGAATACATCATTGGCCACGTAATTGATGGAAGGATCCTCTTCCCAGCGACGGCATATCTCTACATTGTGTGGGAGACTTTGGGTCTCATGATGGGCGTTTATTTCTTCGAGGTCGGCGTTGTTTTCGAAGACGTCAAGTTCATGAGAGCTACTGCTCTGCCCAAGAATCAGGACGTCGAGTTTATCGTCATGATTCAGCCAGGCACTGGAAGATTTGAGATCACGGAGGGCACGAGCGTGCTCGCGACCGGCTACGTGAAGATCGTGGACAACGTGAAGCTCACGGACATCGAGAAACCGAAGGAGAACGGCTACCCGACACTCCTCCAGAAGGACTTCTACAAGGAGCTCCGCTTGCGCGGCTACCACTACCACAGCCTCTTCCGGTGCGTCGAGGAGGCGCGCGGCGACGGCATGGTCGGCAAGATCAAGTGGAACTCCAACTGGGTTTCGTTCATGGACTGCCTTCTGCAGGTGCACATCGTCGGCCAGGATTCGCGCGCCCTCCTCCTCCCGACCGGCATCCAGAAGCTCTCGATTAACCCGAAAATCCATCAATCGATGGTGCACTCGTTCGAGAACGAAAACATCACGCTTGAGGTCTTCACCAACAAGGAACTCAACCTGCTGCGCTGCGGCGGCATCGAGATCCGCGGGCTGCAGGCCAACCCGGTCGCTCGCCGTCGTCCACCCGGCATTCCCGTTCTCGAGACCTATCAGTTCATGCCGCACTTGCCAACGCCGTTCCTCAACAAAATCAACGCCGCTCGTTTCTGCGTGCAGCTGGCGCTCGAGAACGTGCCGACCAACAAGGTGCTCTCGATCGAAATCGACGGAAACGACGGAAAAGAACCGATGAGCGACTTTATTGCGCAGGCACTTGGCGATTTGCCGCTGGTCACCGCTGAGCTCAATTACTTGACCACCAAGGCCATGGACCTCGGCAACATCATCGTCAGCGACTCGAAATTTTCGGCTTTCAAGAACGCCTTTGTTGTCATCAACGGCAACAGCCTCAGCGACAAGACCTTCCTCGAAAAGGTCGCCAATCATCTGCAGGACGGCGGCTTTGTTATTCTCAGGGAATCCAACGAAATTAAGCTGCAGCTGCTGAACGAGCTGCCCTCGCATCACCAACTGATTGCGATAATTCCGAGCGAGAACGAGACGATCATCATGCTGCAGTTCCACAAGAAGCTGCCAGCTCAGCCGCAGAAAATTGTGAAAGTTAGCTCGCAAAATTACGACTGGATGGATGAGCTCAAGCGAGCAGTCAAGGAGGGCCCGACACTCGCATATGCCGAGAAGGAGGAACTTTCGGGAATCATTGGTTTGGTTAACTGCATTCGCAAGGAACCGAATGGCCTCAACCTCAAATGCGTCTTCGTCGACGACTACAGGGCGCCGAAGTTCAGCGAGGACAACGAGTTCTATAAGTCTTTCCTCAAGCAGGGCCTCGCCATCAACGTCTTCAAGAACGGCCAGTGGGGCTCCTATCGCCATCTTCTTCTCACACCAAAGTACGAAAGTGCGCGTCGCGTCGACCACTGCTACGCTAACTCGCTTGTCCGCGGCGACTTTACGCGTCGC 

Protein: 1433 (aa)

 MESCPLSAEAQPKQKKVFSRIYPSTPDDEIVISGISGRFPSSRNMHDFAHNLYNKIDMVDDDERRWKHTNPEIPRRMGKINNLEKFDATFFGVHFKQAHTMDPQCRMLLEHAYEAVLDAGVNPRTLRGSRTGVYIGACFAESEKTWFYEKVSTGGFGITGCARAMLANRISFTLGLTGPSFLLDTACSSSMYALDCAFNAIRCGEIDAALVGGSNLLLHPYVTLQFARLGVLAQNGYCRPFDKDGSGYTRAEAICVMYLQKAKNAKRIYANLLYSKTNCDGYKEEGITYPSGKMQMKLLKEFYDDLDIPPSTVDYVEAHSTGTIVGDPEEVRAIDTVYCTGREKPLPVGSVKSNMGHSESTAGACSIAKIILAFETQKIPPNINFESIRPGLEGLESGRLRVVANTETLSGPLISINSFGFGGGNAHALFRQHPKEKVNSGIPKDDIPRLILWSSRTEEGVNSILESVLKQPLDAEYVGLLHNCVAGESSSANIYRGFGVFAQTEQSVNATCINRDVKHFVGFKRPVVWVYSGMGSQWNTMGSDLMRIPIFAESIERSHKILERKGLNLKGILTSQEPKLFDNILNSFVGIAAIQIALTDILKALELEPDYIIGHSVGELGCAYADGCFTAEEMILSSYSRGMASLETNVVVGSMAAVGMSFKKLRPIIPDGIEIACHNSADSCTISGPAENVAKYVAELKAQNIFAKEVNCSKIPYHSSYIQEMGPNLLARLTDVIKCPKKRSPRWLSSSVPKPRWEEEQFQYSSPFYHTNNLLSSVLFEETSSLLPRNALTIEIAPHSLLQAILRKSMPEAVHIGLTQRGNKSNSTFFLSALGSIHENGIDFDVSKLYPPVDFPVSRGTPMISPRIKWEHSEDWFVTRFESQKSNRSGERHVVINIADQEYEYIIGHVIDGRILFPATAYLYIVWETLGLMMGVYFFEVGVVFEDVKFMRATALPKNQDVEFIVMIQPGTGRFEITEGTSVLATGYVKIVDNVKLTDIEKPKENGYPTLLQKDFYKELRLRGYHYHSLFRCVEEARGDGMVGKIKWNSNWVSFMDCLLQVHIVGQDSRALLLPTGIQKLSINPKIHQSMVHSFENENITLEVFTNKELNLLRCGGIEIRGLQANPVARRRPPGIPVLETYQFMPHLPTPFLNKINAARFCVQLALENVPTNKVLSIEIDGNDGKEPMSDFIAQALGDLPLVTAELNYLTTKAMDLGNIIVSDSKFSAFKNAFVVINGNSLSDKTFLEKVANHLQDGGFVILRESNEIKLQLLNELPSHHQLIAIIPSENETIIMLQFHKKLPAQPQKIVKVSSQNYDWMDELKRAVKEGPTLAYAEKEELSGIIGLVNCIRKEPNGLNLKCVFVDDYRAPKFSEDNEFYKSFLKQGLAINVFKNGQWGSYRHLLLTPKYESARRVDHCYANSLVRGDFTRR 
Type Start End Length
CDS 112 121 10
CDS 192 371 180
CDS 516 1713 1198
CDS 1907 2080 174
CDS 2190 2299 110
CDS 2487 2606 120
CDS 2702 4218 1517
CDS 4294 4307 14
CDS 4392 4684 293
CDS 4899 4992 94
CDS 5280 5389 110
CDS 5451 5601 151
CDS 5660 5987 328
intron 122 191 70
intron 372 515 144
intron 1714 1906 193
intron 2081 2189 109
intron 2300 2486 187
intron 2607 2701 95
intron 4219 4293 75
intron 4308 4391 84
intron 4685 4898 214
intron 4993 5279 287
intron 5390 5450 61
intron 5602 5659 58

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001659008 fatty acid synthase [Aedes aegypti] gb|EAT40085.1| fatty acid synthase [Aedes aegypti] 0.0
InterPro IPR014031 Beta-ketoacyl synthase, C-terminal
InterPro IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase
InterPro IPR014043 Acyl transferase
InterPro IPR014030 Beta-ketoacyl synthase, N-terminal
InterPro IPR020801 Polyketide synthase, acyl transferase domain
InterPro IPR016038 Thiolase-like, subgroup
InterPro IPR018201 Beta-ketoacyl synthase, active site
InterPro IPR016039 Thiolase-like
InterPro IPR020841 Polyketide synthase, beta-ketoacyl synthase domain
InterPro IPR001227 Acyl transferase domain
InterPro IPR016036 Malonyl-CoA ACP transacylase, ACP-binding
Gene Ontology(BP) GO:0008152 metabolic process
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0016740 transferase activity
Gene Ontology(MF) GO:0003824 catalytic activity
Pfam PF00109.21 Beta-ketoacyl synthase, N-terminal domain 1.4e-61
Pfam PF00698.16 Acyl transferase domain 2e-57
Pfam PF00108.18 Thiolase, N-terminal domain 0.00023
Pfam PF08545.5 3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III 0.025
Pfam PF02801.17 Beta-ketoacyl synthase, C-terminal domain 7.6e-37

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.02651
Pn.11656
Pn.06426
Pn.03933
Pn.04187
Pn.11493
Pn.04992
Pn.08717
Pn.06151
Pn.15816
Pn.01298
Pn.14865
Pn.04993

Orthologous genes

Species Gene ID
P. vanderplanki Pv.14921
B. mori BGIBMGA008581-TA
B. mori BGIBMGA013153-TA
S. invicta SI2.2.0_01047
S. invicta SI2.2.0_07564
D. plexippus DPOGS212360PA
C. quinquefasciatus CPIJ008367
A. aegypti AAEL002237
S. invicta SI2.2.0_11351
P. vanderplanki Pv.12209
P. vanderplanki Pv.14772
A. aegypti AAEL002204
P. vanderplanki Pv.01475
D. melanogaster FBgn0040001
S. invicta SI2.2.0_11327
S. invicta SI2.2.0_10014
S. invicta SI2.2.0_00900
S. invicta SI2.2.0_08502
S. invicta SI2.2.0_00383
S. invicta SI2.2.0_15127
S. invicta SI2.2.0_12057
S. invicta SI2.2.0_13910
S. invicta SI2.2.0_14664
P. vanderplanki Pv.10571
N. vitripennis NV22399-PA
N. vitripennis NV10926-PA
P. vanderplanki Pv.02828
P. vanderplanki Pv.12689
B. mori BGIBMGA013046-TA
P. vanderplanki Pv.03372
A. aegypti AAEL008160
S. invicta SI2.2.0_06311
S. invicta SI2.2.0_14684
P. vanderplanki Pv.00511
P. vanderplanki Pv.16863
P. vanderplanki Pv.14617
N. vitripennis NV14455-PA
P. vanderplanki Pv.00790
S. invicta SI2.2.0_05482
A. mellifera GB12198-PA
P. vanderplanki Pv.14852
T. castaneum TC007689
C. quinquefasciatus CPIJ005595
P. vanderplanki Pv.02964
P. vanderplanki Pv.02965
A. aegypti AAEL002227
P. vanderplanki Pv.05860
T. castaneum TC015399
D. plexippus DPOGS206960PA
H. melpomene HMEL005305-PA
B. mori BGIBMGA008582-TA
A. aegypti AAEL002228
S. invicta SI2.2.0_15862
S. invicta SI2.2.0_06588
B. mori BGIBMGA004655-TA
N. vitripennis NV17124-PA
A. mellifera GB16883-PA
T. castaneum TC015340
S. invicta SI2.2.0_10115
T. castaneum TC015339
S. invicta SI2.2.0_13311
H. melpomene HMEL008318-PA
S. invicta SI2.2.0_16052
N. vitripennis NV10927-PA
H. melpomene HMEL015510-PA
N. vitripennis NV10111-PA
S. invicta SI2.2.0_14578
S. invicta SI2.2.0_13320
P. vanderplanki Pv.14263
M. musculus ENSMUSG00000025153
B. mori BGIBMGA013047-TA
B. mori BGIBMGA008579-TA
H. melpomene HMEL015513-PA
P. vanderplanki Pv.10570
T. castaneum TC000238
P. vanderplanki Pv.12440
P. vanderplanki Pv.10251
A. gambiae AGAP009176
P. vanderplanki Pv.02966
A. aegypti AAEL001194
N. vitripennis NV14456-PA
A. gambiae AGAP008468
S. invicta SI2.2.0_00168
C. quinquefasciatus CPIJ003494
B. mori BGIBMGA008602-TA
S. invicta SI2.2.0_09698
B. mori BGIBMGA013281-TA
S. invicta SI2.2.0_01792
P. vanderplanki Pv.00897
P. vanderplanki Pv.00898
P. vanderplanki Pv.07928
S. invicta SI2.2.0_16094
H. melpomene HMEL004144-PA
S. invicta SI2.2.0_15874
D. melanogaster FBgn0042627
B. mori BGIBMGA013151-TA
P. vanderplanki Pv.11254
T. castaneum TC015400
H. sapiens ENSP00000304592
D. plexippus DPOGS207082PA
A. gambiae AGAP001899
C. quinquefasciatus CPIJ003495
S. invicta SI2.2.0_02241
T. castaneum TC015337
S. invicta SI2.2.0_06292
S. invicta SI2.2.0_05241
D. melanogaster FBgn0027571
T. castaneum TC011522
S. invicta SI2.2.0_14661
S. invicta SI2.2.0_03932
S. invicta SI2.2.0_08063
N. vitripennis NV50551-PA
P. vanderplanki Pv.09028
P. humanus PHUM080440-PA
A. aegypti AAEL002200
P. humanus PHUM448390-PA
P. humanus PHUM565830-PA
P. vanderplanki Pv.03790