MidgeBase gene description page [Pn.00600]
Outline
Gene ID | Pn.00600 |
Type | Protein coding gene |
Scaffold | PnScaf660 |
Start | 109 |
End | 5987 |
Direction | - |
Sequence
Transcript: 4299 (bp)
ATGGAAAGTTGCCCACTGTCGGCTGAAGCTCAACCAAAGCAAAAGAAAGTCTTTTCTCGCATCTACCCATCGACTCCCGACGATGAGATCGTTATTAGCGGCATTTCTGGGCGATTCCCCAGTTCTCGTAACATGCACGATTTTGCTCACAACCTTTACAATAAGATTGATATGGTTGATGATGATGAAAGACGGTGGAAGCACACCAATCCGGAAATTCCGCGACGCATGGGAAAGATTAATAATCTGGAGAAATTCGACGCTACCTTCTTTGGCGTACATTTCAAGCAAGCTCACACAATGGATCCACAATGTCGTATGCTTCTAGAACATGCTTACGAGGCGGTCCTCGATGCCGGCGTCAATCCGAGAACGTTGAGGGGAAGTCGCACGGGCGTCTACATCGGCGCTTGCTTCGCCGAGTCTGAAAAGACTTGGTTCTATGAAAAAGTGTCGACTGGCGGCTTCGGTATCACTGGGTGTGCGCGTGCCATGTTGGCGAACCGAATTTCCTTCACTTTAGGCTTAACCGGACCTTCTTTCTTGCTCGATACTGCTTGCTCCTCCTCGATGTATGCTCTCGATTGTGCCTTCAATGCTATTCGCTGTGGTGAAATTGATGCGGCTCTCGTTGGTGGCTCCAACTTGTTGCTTCATCCCTACGTAACGCTTCAATTCGCTCGTCTCGGAGTGCTCGCCCAGAACGGCTACTGTCGACCGTTCGACAAGGACGGCAGCGGCTACACCAGAGCGGAAGCCATTTGCGTGATGTACTTGCAGAAGGCGAAGAACGCGAAGCGCATCTACGCGAACCTCCTTTACTCGAAGACCAACTGCGACGGCTACAAGGAGGAGGGCATCACGTACCCGAGCGGCAAGATGCAGATGAAACTGCTGAAGGAGTTCTACGACGACCTCGACATTCCGCCGAGCACGGTCGACTATGTTGAGGCCCACAGCACGGGCACTATTGTCGGTGATCCTGAAGAGGTCAGAGCCATCGACACGGTTTACTGCACGGGCCGCGAGAAGCCGCTGCCAGTCGGCTCCGTAAAGTCCAACATGGGCCACTCGGAGAGCACAGCCGGCGCGTGCTCGATCGCGAAAATAATTCTAGCCTTCGAGACGCAGAAAATTCCGCCGAACATCAACTTCGAGTCGATTCGGCCGGGTCTCGAGGGACTGGAGTCGGGACGCTTGCGAGTCGTCGCGAACACCGAGACGCTCAGCGGGCCTCTCATCTCGATCAACTCGTTCGGCTTTGGCGGCGGCAACGCTCACGCGCTCTTCCGCCAGCATCCGAAGGAGAAGGTCAACAGCGGCATCCCGAAGGACGACATCCCGCGGCTGATTTTGTGGTCGAGCCGAACGGAGGAGGGCGTCAATTCGATCCTCGAGAGCGTCCTCAAGCAGCCGCTCGATGCCGAGTACGTCGGCCTGCTGCACAACTGCGTCGCCGGCGAGTCCTCCTCCGCCAACATCTACCGCGGCTTCGGCGTGTTCGCGCAGACCGAGCAGAGCGTCAACGCAACGTGCATCAACCGGGACGTGAAGCATTTCGTCGGATTCAAGCGGCCGGTGGTGTGGGTGTACAGCGGCATGGGCTCGCAATGGAACACCATGGGCAGTGACCTCATGCGCATTCCGATCTTCGCCGAGTCGATCGAGCGGAGCCACAAGATCCTGGAGAGGAAGGGCCTCAACCTCAAGGGCATCCTCACGTCGCAGGAGCCGAAGCTCTTCGACAACATCCTCAACTCGTTCGTGGGCATCGCGGCCATTCAGATCGCGCTCACCGACATCCTCAAGGCGCTCGAGCTCGAACCCGACTACATTATCGGCCACTCGGTGGGCGAGCTGGGCTGCGCCTACGCCGACGGCTGCTTTACCGCCGAGGAGATGATCCTGTCGTCGTACTCGCGCGGCATGGCCAGTCTCGAGACGAACGTCGTGGTCGGCTCGATGGCCGCCGTCGGCATGAGCTTCAAGAAGCTGCGCCC
GATCATCCCCGACGGCATCGAGATTGCGTGCCACAACTCGGCCGACTCGTGCACCATTTCGGGACCGGCCGAAAATGTCGCGAAATACGTGGCGGAGCTGAAGGCGCAGAACATTTTCGCGAAGGAGGTGAATTGCTCGAAAATTCCGTACCACAGCTCGTACATCCAGGAGATGGGGCCGAACCTGCTTGCGCGCCTCACGGACGTCATCAAGTGCCCGAAGAAGCGCTCGCCACGCTGGCTGAGCTCGAGCGTGCCGAAGCCGCGGTGGGAGGAGGAGCAATTCCAGTACTCGTCGCCATTTTATCACACGAATAACTTGCTGAGCTCGGTTTTGTTCGAAGAGACCTCGAGCTTGCTGCCGCGAAATGCACTCACCATCGAGATTGCGCCGCACAGCTTGCTGCAGGCGATTCTGAGAAAGTCGATGCCTGAAGCGGTTCACATCGGGCTGACGCAGCGCGGCAACAAGAGCAACTCGACTTTCTTCTTGAGCGCTCTCGGATCAATTCATGAAAACGGAATTGACTTTGATGTGAGCAAGCTCTATCCTCCGGTTGACTTCCCGGTCTCAAGAGGCACACCGATGATCTCACCGCGCATTAAATGGGAACACAGCGAGGACTGGTTTGTGACTCGATTTGAGAGCCAAAAGTCGAATCGCAGTGGCGAACGTCATGTTGTCATCAATATTGCTGATCAAGAGTATGAATACATCATTGGCCACGTAATTGATGGAAGGATCCTCTTCCCAGCGACGGCATATCTCTACATTGTGTGGGAGACTTTGGGTCTCATGATGGGCGTTTATTTCTTCGAGGTCGGCGTTGTTTTCGAAGACGTCAAGTTCATGAGAGCTACTGCTCTGCCCAAGAATCAGGACGTCGAGTTTATCGTCATGATTCAGCCAGGCACTGGAAGATTTGAGATCACGGAGGGCACGAGCGTGCTCGCGACCGGCTACGTGAAGATCGTGGACAACGTGAAGCTCACGGACATCGAGAAACCGAAGGAGAACGGCTACCCGACACTCCTCCAGAAGGACTTCTACAAGGAGCTCCGCTTGCGCGGCTACCACTACCACAGCCTCTTCCGGTGCGTCGAGGAGGCGCGCGGCGACGGCATGGTCGGCAAGATCAAGTGGAACTCCAACTGGGTTTCGTTCATGGACTGCCTTCTGCAGGTGCACATCGTCGGCCAGGATTCGCGCGCCCTCCTCCTCCCGACCGGCATCCAGAAGCTCTCGATTAACCCGAAAATCCATCAATCGATGGTGCACTCGTTCGAGAACGAAAACATCACGCTTGAGGTCTTCACCAACAAGGAACTCAACCTGCTGCGCTGCGGCGGCATCGAGATCCGCGGGCTGCAGGCCAACCCGGTCGCTCGCCGTCGTCCACCCGGCATTCCCGTTCTCGAGACCTATCAGTTCATGCCGCACTTGCCAACGCCGTTCCTCAACAAAATCAACGCCGCTCGTTTCTGCGTGCAGCTGGCGCTCGAGAACGTGCCGACCAACAAGGTGCTCTCGATCGAAATCGACGGAAACGACGGAAAAGAACCGATGAGCGACTTTATTGCGCAGGCACTTGGCGATTTGCCGCTGGTCACCGCTGAGCTCAATTACTTGACCACCAAGGCCATGGACCTCGGCAACATCATCGTCAGCGACTCGAAATTTTCGGCTTTCAAGAACGCCTTTGTTGTCATCAACGGCAACAGCCTCAGCGACAAGACCTTCCTCGAAAAGGTCGCCAATCATCTGCAGGACGGCGGCTTTGTTATTCTCAGGGAATCCAACGAAATTAAGCTGCAGCTGCTGAACGAGCTGCCCTCGCATCACCAACTGATTGCGATAATTCCGAGCGAGAACGAGACGATCATCATGCTGCAGTTCCACAAGAAGCTGCCAGCTCAGCCGCAGAAAATTGTGAAAGTTAGCTCGCAAAATTACGACTGGATGGATGAGCTCAAGCGAGCAGTCAAGGAGGGCCCGACAC
TCGCATATGCCGAGAAGGAGGAACTTTCGGGAATCATTGGTTTGGTTAACTGCATTCGCAAGGAACCGAATGGCCTCAACCTCAAATGCGTCTTCGTCGACGACTACAGGGCGCCGAAGTTCAGCGAGGACAACGAGTTCTATAAGTCTTTCCTCAAGCAGGGCCTCGCCATCAACGTCTTCAAGAACGGCCAGTGGGGCTCCTATCGCCATCTTCTTCTCACACCAAAGTACGAAAGTGCGCGTCGCGTCGACCACTGCTACGCTAACTCGCTTGTCCGCGGCGACTTTACGCGTCGC
Protein: 1433 (aa)
MESCPLSAEAQPKQKKVFSRIYPSTPDDEIVISGISGRFPSSRNMHDFAHNLYNKIDMVDDDERRWKHTNPEIPRRMGKINNLEKFDATFFGVHFKQAHTMDPQCRMLLEHAYEAVLDAGVNPRTLRGSRTGVYIGACFAESEKTWFYEKVSTGGFGITGCARAMLANRISFTLGLTGPSFLLDTACSSSMYALDCAFNAIRCGEIDAALVGGSNLLLHPYVTLQFARLGVLAQNGYCRPFDKDGSGYTRAEAICVMYLQKAKNAKRIYANLLYSKTNCDGYKEEGITYPSGKMQMKLLKEFYDDLDIPPSTVDYVEAHSTGTIVGDPEEVRAIDTVYCTGREKPLPVGSVKSNMGHSESTAGACSIAKIILAFETQKIPPNINFESIRPGLEGLESGRLRVVANTETLSGPLISINSFGFGGGNAHALFRQHPKEKVNSGIPKDDIPRLILWSSRTEEGVNSILESVLKQPLDAEYVGLLHNCVAGESSSANIYRGFGVFAQTEQSVNATCINRDVKHFVGFKRPVVWVYSGMGSQWNTMGSDLMRIPIFAESIERSHKILERKGLNLKGILTSQEPKLFDNILNSFVGIAAIQIALTDILKALELEPDYIIGHSVGELGCAYADGCFTAEEMILSSYSRGMASLETNVVVGSMAAVGMSFKKLRPIIPDGIEIACHNSADSCTISGPAENVAKYVAELKAQNIFAKEVNCSKIPYHSSYIQEMGPNLLARLTDVIKCPKKRSPRWLSSSVPKPRWEEEQFQYSSPFYHTNNLLSSVLFEETSSLLPRNALTIEIAPHSLLQAILRKSMPEAVHIGLTQRGNKSNSTFFLSALGSIHENGIDFDVSKLYPPVDFPVSRGTPMISPRIKWEHSEDWFVTRFESQKSNRSGERHVVINIADQEYEYIIGHVIDGRILFPATAYLYIVWETLGLMMGVYFFEVGVVFEDVKFMRATALPKNQDVEFIVMIQPGTGRFEITEGTSVLATGYVKIVDNVKLTDIEKPKENGYPTLLQKDFYKELRLRGYHYHSLFRCVEEARGDGMVGKIKWNSNWVSFMDCLLQVHIVGQDSRALLLPTGIQKLSINPKIHQSMVHSFENENITLEVFTNKELNLLRCGGIEIRGLQANPVARRRPPGIPVLETYQFMPHLPTPFLNKINAARFCVQLALENVPTNKVLSIEIDGNDGKEPMSDFIAQALGDLPLVTAELNYLTTKAMDLGNIIVSDSKFSAFKNAFVVINGNSLSDKTFLEKVANHLQDGGFVILRESNEIKLQLLNELPSHHQLIAIIPSENETIIMLQFHKKLPAQPQKIVKVSSQNYDWMDELKRAVKEGPTLAYAEKEELSGIIGLVNCIRKEPNGLNLKCVFVDDYRAPKFSEDNEFYKSFLKQGLAINVFKNGQWGSYRHLLLTPKYESARRVDHCYANSLVRGDFTRR
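The transcript and protein entries above can be cross-checked against each other. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1). It translates the first 30 bases of the transcript and compares the result with the first 10 residues of the protein. The names `CODON_TABLE` and `translate` are illustrative and not part of MidgeBase.

```python
# Standard genetic code (assumed: NCBI translation table 1), built from the
# conventional T, C, A, G ordering of codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the transcript and first 10 residues of the protein,
# copied from the Sequence section of this page.
transcript_prefix = "ATGGAAAGTTGCCCACTGTCGGCTGAAGCT"
protein_prefix = "MESCPLSAEA"

print(translate(transcript_prefix) == protein_prefix)

# The reported lengths are also consistent: 4299 bp is exactly 1433 codons.
print(4299 == 1433 * 3)
```

Running the same function over the full 4299 bp transcript should reproduce the whole 1433 aa protein, provided the standard code applies.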
Type | Start | End | Length |
CDS | 112 | 121 | 10 |
CDS | 192 | 371 | 180 |
CDS | 516 | 1713 | 1198 |
CDS | 1907 | 2080 | 174 |
CDS | 2190 | 2299 | 110 |
CDS | 2487 | 2606 | 120 |
CDS | 2702 | 4218 | 1517 |
CDS | 4294 | 4307 | 14 |
CDS | 4392 | 4684 | 293 |
CDS | 4899 | 4992 | 94 |
CDS | 5280 | 5389 | 110 |
CDS | 5451 | 5601 | 151 |
CDS | 5660 | 5987 | 328 |
intron | 122 | 191 | 70 |
intron | 372 | 515 | 144 |
intron | 1714 | 1906 | 193 |
intron | 2081 | 2189 | 109 |
intron | 2300 | 2486 | 187 |
intron | 2607 | 2701 | 95 |
intron | 4219 | 4293 | 75 |
intron | 4308 | 4391 | 84 |
intron | 4685 | 4898 | 214 |
intron | 4993 | 5279 | 287 |
intron | 5390 | 5450 | 61 |
intron | 5602 | 5659 | 58 |
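The feature table uses 1-based, inclusive scaffold coordinates, so each Length equals End - Start + 1. A minimal sketch of that arithmetic follows, using the CDS coordinates copied from the table. It derives the introns as the gaps between consecutive CDS segments and checks that the CDS lengths add up to the 4299 bp transcript. The helper names are illustrative only.

```python
# CDS segments (start, end) on scaffold PnScaf660, copied from the table above.
cds_segments = [
    (112, 121), (192, 371), (516, 1713), (1907, 2080), (2190, 2299),
    (2487, 2606), (2702, 4218), (4294, 4307), (4392, 4684), (4899, 4992),
    (5280, 5389), (5451, 5601), (5660, 5987),
]


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def introns_between(segments):
    """Return the gaps between consecutive segments as (start, end) intervals."""
    ordered = sorted(segments)
    return [
        (prev_end + 1, next_start - 1)
        for (_, prev_end), (next_start, _) in zip(ordered, ordered[1:])
    ]


cds_total = sum(span_length(s, e) for s, e in cds_segments)
introns = introns_between(cds_segments)
intron_total = sum(span_length(s, e) for s, e in introns)

print("CDS total:", cds_total)        # compare with Transcript: 4299 (bp)
print("Introns:", introns)            # compare with the intron rows
print("Intron total:", intron_total)
print("Span covered:", span_length(cds_segments[0][0], cds_segments[-1][1]))
```

The derived intervals can be compared row by row with the intron entries in the table. The CDS and intron totals together account for the full span from position 112 to 5987.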
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_001659008 | fatty acid synthase [Aedes aegypti] gb|EAT40085.1| fatty acid synthase [Aedes aegypti] | 0.0 |
InterPro | IPR014031 | Beta-ketoacyl synthase, C-terminal | |
InterPro | IPR016035 | Acyl transferase/acyl hydrolase/lysophospholipase | |
InterPro | IPR014043 | Acyl transferase | |
InterPro | IPR014030 | Beta-ketoacyl synthase, N-terminal | |
InterPro | IPR020801 | Polyketide synthase, acyl transferase domain | |
InterPro | IPR016038 | Thiolase-like, subgroup | |
InterPro | IPR018201 | Beta-ketoacyl synthase, active site | |
InterPro | IPR016039 | Thiolase-like | |
InterPro | IPR020841 | Polyketide synthase, beta-ketoacyl synthase domain | |
InterPro | IPR001227 | Acyl transferase domain | |
InterPro | IPR016036 | Malonyl-CoA ACP transacylase, ACP-binding | |
Gene Ontology(BP) | GO:0008152 | metabolic process | |
Gene Ontology(MF) | GO:0005515 | protein binding | |
Gene Ontology(MF) | GO:0016740 | transferase activity | |
Gene Ontology(MF) | GO:0003824 | catalytic activity | |
Pfam | PF00109.21 | Beta-ketoacyl synthase, N-terminal domain | 1.4e-61 |
Pfam | PF00698.16 | Acyl transferase domain | 2e-57 |
Pfam | PF00108.18 | Thiolase, N-terminal domain | 0.00023 |
Pfam | PF08545.5 | 3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III | 0.025 |
Pfam | PF02801.17 | Beta-ketoacyl synthase, C-terminal domain | 7.6e-37 |
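The Pfam expectation values above span many orders of magnitude. As a rough illustration, the hits can be separated into stronger and weaker matches with a cutoff. The threshold of 1e-5 used here is an arbitrary assumption chosen for the example, not a value taken from MidgeBase or Pfam.

```python
# Pfam hits (accession, description, E-value) copied from the table above.
pfam_hits = [
    ("PF00109.21", "Beta-ketoacyl synthase, N-terminal domain", 1.4e-61),
    ("PF00698.16", "Acyl transferase domain", 2e-57),
    ("PF00108.18", "Thiolase, N-terminal domain", 0.00023),
    ("PF08545.5", "3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III", 0.025),
    ("PF02801.17", "Beta-ketoacyl synthase, C-terminal domain", 7.6e-37),
]

# Hypothetical cutoff for this example only; pick a value suited to the analysis.
EVALUE_CUTOFF = 1e-5

strong = [hit for hit in pfam_hits if hit[2] <= EVALUE_CUTOFF]
weak = [hit for hit in pfam_hits if hit[2] > EVALUE_CUTOFF]

# List hits from most to least significant (smallest E-value first).
for accession, description, evalue in sorted(pfam_hits, key=lambda hit: hit[2]):
    label = "strong" if evalue <= EVALUE_CUTOFF else "weak"
    print(f"{accession}\t{evalue:.1e}\t{label}\t{description}")
```

With this cutoff, the two beta-ketoacyl synthase domains and the acyl transferase domain fall in the stronger group, while the thiolase and synthase III matches fall in the weaker one.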
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Gene ID |
Pn.02651 |
Pn.11656 |
Pn.06426 |
Pn.03933 |
Pn.04187 |
Pn.11493 |
Pn.04992 |
Pn.08717 |
Pn.06151 |
Pn.15816 |
Pn.01298 |
Pn.14865 |
Pn.04993 |
Orthologous genes
Species | Gene ID |
P. vanderplanki | Pv.14921 |
B. mori | BGIBMGA008581-TA |
B. mori | BGIBMGA013153-TA |
S. invicta | SI2.2.0_01047 |
S. invicta | SI2.2.0_07564 |
D. plexippus | DPOGS212360PA |
C. quinquefasciatus | CPIJ008367 |
A. aegypti | AAEL002237 |
S. invicta | SI2.2.0_11351 |
P. vanderplanki | Pv.12209 |
P. vanderplanki | Pv.14772 |
A. aegypti | AAEL002204 |
P. vanderplanki | Pv.01475 |
D. melanogaster | FBgn0040001 |
S. invicta | SI2.2.0_11327 |
S. invicta | SI2.2.0_10014 |
S. invicta | SI2.2.0_00900 |
S. invicta | SI2.2.0_08502 |
S. invicta | SI2.2.0_00383 |
S. invicta | SI2.2.0_15127 |
S. invicta | SI2.2.0_12057 |
S. invicta | SI2.2.0_13910 |
S. invicta | SI2.2.0_14664 |
P. vanderplanki | Pv.10571 |
N. vitripennis | NV22399-PA |
N. vitripennis | NV10926-PA |
P. vanderplanki | Pv.02828 |
P. vanderplanki | Pv.12689 |
B. mori | BGIBMGA013046-TA |
P. vanderplanki | Pv.03372 |
A. aegypti | AAEL008160 |
S. invicta | SI2.2.0_06311 |
S. invicta | SI2.2.0_14684 |
P. vanderplanki | Pv.00511 |
P. vanderplanki | Pv.16863 |
P. vanderplanki | Pv.14617 |
N. vitripennis | NV14455-PA |
P. vanderplanki | Pv.00790 |
S. invicta | SI2.2.0_05482 |
A. mellifera | GB12198-PA |
P. vanderplanki | Pv.14852 |
T. castaneum | TC007689 |
C. quinquefasciatus | CPIJ005595 |
P. vanderplanki | Pv.02964 |
P. vanderplanki | Pv.02965 |
A. aegypti | AAEL002227 |
P. vanderplanki | Pv.05860 |
T. castaneum | TC015399 |
D. plexippus | DPOGS206960PA |
H. melpomene | HMEL005305-PA |
B. mori | BGIBMGA008582-TA |
A. aegypti | AAEL002228 |
S. invicta | SI2.2.0_15862 |
S. invicta | SI2.2.0_06588 |
B. mori | BGIBMGA004655-TA |
N. vitripennis | NV17124-PA |
A. mellifera | GB16883-PA |
T. castaneum | TC015340 |
S. invicta | SI2.2.0_10115 |
T. castaneum | TC015339 |
S. invicta | SI2.2.0_13311 |
H. melpomene | HMEL008318-PA |
S. invicta | SI2.2.0_16052 |
N. vitripennis | NV10927-PA |
H. melpomene | HMEL015510-PA |
N. vitripennis | NV10111-PA |
S. invicta | SI2.2.0_14578 |
S. invicta | SI2.2.0_13320 |
P. vanderplanki | Pv.14263 |
M. musculus | ENSMUSG00000025153 |
B. mori | BGIBMGA013047-TA |
B. mori | BGIBMGA008579-TA |
H. melpomene | HMEL015513-PA |
P. vanderplanki | Pv.10570 |
T. castaneum | TC000238 |
P. vanderplanki | Pv.12440 |
P. vanderplanki | Pv.10251 |
A. gambiae | AGAP009176 |
P. vanderplanki | Pv.02966 |
A. aegypti | AAEL001194 |
N. vitripennis | NV14456-PA |
A. gambiae | AGAP008468 |
S. invicta | SI2.2.0_00168 |
C. quinquefasciatus | CPIJ003494 |
B. mori | BGIBMGA008602-TA |
S. invicta | SI2.2.0_09698 |
B. mori | BGIBMGA013281-TA |
S. invicta | SI2.2.0_01792 |
P. vanderplanki | Pv.00897 |
P. vanderplanki | Pv.00898 |
P. vanderplanki | Pv.07928 |
S. invicta | SI2.2.0_16094 |
H. melpomene | HMEL004144-PA |
S. invicta | SI2.2.0_15874 |
D. melanogaster | FBgn0042627 |
B. mori | BGIBMGA013151-TA |
P. vanderplanki | Pv.11254 |
T. castaneum | TC015400 |
H. sapiens | ENSP00000304592 |
D. plexippus | DPOGS207082PA |
A. gambiae | AGAP001899 |
C. quinquefasciatus | CPIJ003495 |
S. invicta | SI2.2.0_02241 |
T. castaneum | TC015337 |
S. invicta | SI2.2.0_06292 |
S. invicta | SI2.2.0_05241 |
D. melanogaster | FBgn0027571 |
T. castaneum | TC011522 |
S. invicta | SI2.2.0_14661 |
S. invicta | SI2.2.0_03932 |
S. invicta | SI2.2.0_08063 |
N. vitripennis | NV50551-PA |
P. vanderplanki | Pv.09028 |
P. humanus | PHUM080440-PA |
A. aegypti | AAEL002200 |
P. humanus | PHUM448390-PA |
P. humanus | PHUM565830-PA |
P. vanderplanki | Pv.03790 |