MidgeBase gene description page [Pn.02651]

Outline

Link to gbrowse

Gene ID Pn.02651
Type Protein coding gene
Scaffold PnScaf2280
Start 204
End 5812
Direction -

Sequence

Transcript: 4944 (bp)

 ATGTCGAAAATAAAGGAGATCGGTGATGATGTGGTGATTTCGGGTGTGGCGGGTCGCTTCCCAAACTCGCGCAATCTGCTTGAGTTCGCCAGCAATCTCTACAACAAAGTCGACATGGTGGACGAGGAGGAGAAGCGGTGGCGGCACGTAAATCCGGAATGTCCCAAGCGGATGGGAAAAATCGGCGACATCGAGAAGTTCGACGCGTCCTTCTTCGGCGTGAGCTACCGGCAGGCTCGCACGATGGACCCGCAATGTCGGATGCTGCTGGAGCACGCATACGAGGCCGTCCTCGACAGCGGCATGTGCCCGAAGCAGCTGGCTGGAACGAAGACCGGCGTGTTCGTCGGCATGAGCTACAACGAGTCGGAGAAGAAGTGGATCTACGAGAACATCTCGAAGGAGGGCTTCGGCATCACCGGGTCAGCGAAGGCTCTGGTCGCGAATCGAATTTCTTTTGCCCTCGGCGTCACCGGTCCGAGTTTCGTTTGCGACACCGCGTGTAGCTCGTCGATGTACGCACTCGACATCGCGTACAAGATGATTTCTTCGGGCGAGTGCGACGCTGCCATCGTTGGCGGCACAAACCTCTGCCTGCATCCGCACCTGAGCTACCAGTTTGTGAAGCTCGGCGTCGTGTCCAAGGACGGATTTTGTCGGCCGTTTGACAAAGACGCGAGCGGTTTCACTCGGGCCGAGACCAGTTGTGCGCTGTTCCTGCAGCGGCGTATCGACGCGAAGCGAGTCTACGCGACCATTTTACACTCGAAGACGAACTGCGATGGCTTCAAGGAGGAGGGCATCAACTTTCCGAGCAGTAAGAGGCAGGCTGAGCTGCTGACCGAGTTCTACGAAGAGGTCGACATAAGGCCGAGTGATCCGAGCATCGGCTACTTCGAGGCTCACTGCACTGGGACGAAGGTCGGCGACGCAGAGGAGTGTCGAGCGATAGACGAAGTGATGTGCAAGAACCGCAAAGAGCCGCTGCTTGTCGGCTCGGTGAAGAGCAACATGGGACACTCTGAACCCGGTTCGGGAGTTTGTTCGATCAGCAAAGTCATTTTGGCTTTCGAAACAGGCAAGATTGCTCCCAATATCAACCTGAAAGCAGTCAAAGAGGAGATCACTGGTGTAGTTGAAGGTCGGATGAAGGTCGTGACCGAAACTCAAGATCTTCTGCAGCCCTTCGTGCCCGTCAACTCCTCAGGCTTCGGAGGAGCCAACGCACACGCTCTGCTGCAAATAAACTCCAAAGCGAAGGTCAATAAAGGAATTCCGAGCGACAAGATTCCGCGGCTTGTTCTCTGGTCGGGAAGGACCGAAGAGGCGGTTGATGCAATCTTCGACAGTGTCACACGGCAGCCACTTGATGCCGAGTTCGTTTCGCTGATTCAGAGCTCACAAGCAACCTCCAACAAGGCGAATCTCTACAGGGGCTTTGGTGTGTTCACGCAGCCTGAAAATCACGAAAATAACGCGGTTTGTAGTTTGCGCGAAGTGCAGCACTTCAACGGTCAAAGGAGGCCGCTGGTGTGGATCTACAGCGGCATGGGATCGCAGTGGAAAGCGATGGGAGCTGCTTTGATGAAGATCCCAACTTTTGCTTGTGCGATCGACAAATGTCACGGCATTCTGCTGGCAAAGAATGTAAATTTGAAGGAGATTTTAACCTCGACCGATTCGAGCACGTTCGACAACATTTTGCACTCATTTGTCGGCATCGCGGCCGTTCAAGTCGGCCTCACTGACGTTCTCAAGGCGGTTGGCTTGGAGCCCGACTTCGTCGTTGGCCATTCGCTCGGCGAACTCGGCTGCGCCTACGCTGATGGCTGCTGCAGTGCCGAGGAGATGATTCTGTCCGCTTACTCGCGTGGCATGGTGAGTTTCGAGACAAAAGTGGTTTTCGGCTCGATGGCGGCCGTCGGTTTGGGCTACAAGCAGCTCAAAGACATCGTGCCTGATGGCGTTGAGGTCGCCTGCCACAACAGCTCTGAATCGAGCACGATTTCGGGACCTGCCGACAAAGTTGCAGCATTTGTCACTGAGCTGAAAAAACAGGGAATTTTCGCGAAAGAAGTGGCCTGCTCGAACATTCCCTATCATAGCTCTTACATCGCCGACATGGGCGCAAATCTGCTTGCTCGTCTTCAAAAAATCCACAAGACCCCGGCGAAGCGCTCACCACGCTGGATCAGCTCGAGCATTCCCGAGTCGCAGTGGCATCTGCCAAGCAGTCAGTTCACATCGGCCGAGTATCAAACCAACAATGTTCTCAATCCAGTGCTCTTTGAAGAAGCCATTTCGCATCTTCCAGCGAACGCTGTGTGCATCGAAATCGCTCCTTATGGTCTGCTGCAGGCGATCTTAAAGCGCGCTCTAAAGGATTCGGTGCACATCTCTTTAACAAAGAAGGGGGAAGCGAACAACGACCAGTTTCTGTTGGGTGCTCTGGGAAGAATCTACTGCAACGGCTTCGACATGGACATCTCTACGCTCTATCCACCAATCAAGTATCCAGTTTCGCGAGGAACTCCAATGATTGCACCGCACATCAAGTGGGATCACAGCGAAGACCATTACGTTATGCGCTTCGAGGACTGCTCGGAAGGAAAGAGCGCAGAACGCAAGGTTCCGATTTCATTGACCGACGTCGAGTACGAGTTCGTGCAAGGGCATTGCATTGACGGTCGTTGTCTGTTTCCCGCTACTGGCTACATCTTTCTGGCGTGGGAAACCTTTGCGATGACCAAAGGAAAGATGTACTTCGACATGAGCGTCGAGTTCGAGGATTTGAAGTTTTTGCGAGCGACTTCACTGTCGAAGGACGTGGAAGTGGAGTTTACGATCGTCATCCATCCGGGCAGCGGAAAGTTTGAGATTTCCGAGGGAAGTTCGACGCTCGTGACTGGTTTTATAAAGCAGGTCGAAAATCCAAAGCTGAGGGATGTTGAAAATGTGACAAATGACAAAATACTCAGGTCTAAGGACTTCTACAAGGAGCTTCGGCTGCGCGGCTATCACTACACTGGCATCTTCCAGTCGGTCGTCGAGGGAAGCTTCGACGGCAACTATGGCAAGATTCGATGGGACTCCAACTGGATTGCCTTCCTCGACTGCCTCTTGCAGCTGAAAATCATTGGAAAAGACACGCGGTCGCTTGTCCTCCCGACCGCCATTCAAAAGTTCACAATTCACACGAGCAAGCACCTGGAAATGCTCGAAAACTTCTCTGAAGGCGATGAGATCTTCTTCGAAGCTCGCATTAGCGATGAGCTGAAGATCATTAGATGCGGCGGAGTCGAAATCGTGAATCTCCAGTCGAGTTCGGTGGGAAGAAGGAAGGCGCCTGGAGTGCCGGTTATTGAATCTTACGAGTTTGTTCCGTACGACTGTGAAGACACAGTGCTGAGTGTTGAGAACGCCACTCGTATAATTTCGCAGCTGCTGATAGAAAATCTCCAAACTACGAAGGTGAAGGGTGTTGAAATTGACACAATCGAAGGAGTTGAGACAATTTTGGGTGGACTCAAGAGTGCTTTGATGGATCTTCCTCTGATAACTGCGGAGATGACGCTTTTGAGTGCGAGAAATTCTGAAGATTTAGGAGAAATTGCGGTAGCTAATGAAGTGATCAACTTGCACAAGGAATGTTCGGTTCTTGTTGCGTCGGCCGAAGTGAAAGTTGAATATGTCGAGAGTCTAGTTGAGGGCGGATTTTTGGTGATGAGAAATTTGAGGGACAATCAAAGTCTTCCTGAAGAGTTGACGAAAATTTCGACACTGCGACTCAAAGATGAGATCTTGACGGTCTTCCAGTGCAGAAAGGAGAAAAGTGATCGAACAAAGACGGCACTTTTGATCTCAAGCGACGACACATCGTTCGCATGGGCCGAAACGGCCAAGAAACTTCTGAAAGCCTCAAATCTTGTGATTGTCGCTGAAAATGATCCGACAAGTGGCGTCCTTGGCCTCGTCAATTGCCTCAGAAAGGAAGCCGACAGCAAGTCAGTGTCCTGCGTGCTTATCGACGACTCGAAAGCCCCAAAGTTTGACCTAAACTGCGAGTTTTACAAAAGTCAGCTCAAGAAAACGCTCGCAATCAACGTCTTTCGCAATGGCAGATGGGGAAGCTACCGACACTTGCCCATTGTGCAAAGTTTCGAAGAAAAATCGACGTTGGATCACTGCTTTGCCAATGCTCTCGTCAAGTCTGATTTGTCCTCCTTGAAGTGGCTGCGAGGACCTCTCAACACCGCCGAAAATGACGTCGTTTGCGTGCACTACGCACCGCTGAACTTCAAGGATGTCATGCTGGCCACTGGAAAGATCTCAGCCGAAATTTTCGTCAGCACACGTCTCGACCTCGAGTGCGTGCTCGGCTTCGAGTATTCGGGCGTGTCGAGCGGAGCCCAGCGTGTCATGGGCTTGATTCCGTCGGGTGCTTTGGCGACCTTCGTCAAAATGGATCCGGCGCTTACATGGCGGTGCCCGAATGAGTGGAGTTTGGCAGAGGCGGCGACAGTTCCGTGCGTCTACGCTACCGTCTACTACGCGTTCTTCGTCAAGACACAAGTCCAGCCGGGCAAGTCGATTCTCATTCACGCCGGCAGCGGTGGCATTGGACTGGCGGCGATTCGCGTCGCTTTCGCCTACGGTTTGGAGGTATTCACGACCGTCAGCACGGAGGAGAAGAAGAACTTTTTGCTCAACGAATTTCCTCAACTCAAGCGCGAAAATATCGGCAATTCACGCGACACTTCCTTCGAGGACATGATTGCAGTGCGAACGAACGGCAAGGGCGTCGATTACGTGCTGAACTCGCTCGCCGAGGAGAAGCTTCACGCATCAATCAGATGCTTGGGCAAGGGCGGAAAATTCCTGGAAATCGGAAAATTCGACATGGAGAAGGACACGAAAATCGGCATGAATGCTCGAGATCGACAT 

Protein: 1648 (aa)

 MSKIKEIGDDVVISGVAGRFPNSRNLLEFASNLYNKVDMVDEEEKRWRHVNPECPKRMGKIGDIEKFDASFFGVSYRQARTMDPQCRMLLEHAYEAVLDSGMCPKQLAGTKTGVFVGMSYNESEKKWIYENISKEGFGITGSAKALVANRISFALGVTGPSFVCDTACSSSMYALDIAYKMISSGECDAAIVGGTNLCLHPHLSYQFVKLGVVSKDGFCRPFDKDASGFTRAETSCALFLQRRIDAKRVYATILHSKTNCDGFKEEGINFPSSKRQAELLTEFYEEVDIRPSDPSIGYFEAHCTGTKVGDAEECRAIDEVMCKNRKEPLLVGSVKSNMGHSEPGSGVCSISKVILAFETGKIAPNINLKAVKEEITGVVEGRMKVVTETQDLLQPFVPVNSSGFGGANAHALLQINSKAKVNKGIPSDKIPRLVLWSGRTEEAVDAIFDSVTRQPLDAEFVSLIQSSQATSNKANLYRGFGVFTQPENHENNAVCSLREVQHFNGQRRPLVWIYSGMGSQWKAMGAALMKIPTFACAIDKCHGILLAKNVNLKEILTSTDSSTFDNILHSFVGIAAVQVGLTDVLKAVGLEPDFVVGHSLGELGCAYADGCCSAEEMILSAYSRGMVSFETKVVFGSMAAVGLGYKQLKDIVPDGVEVACHNSSESSTISGPADKVAAFVTELKKQGIFAKEVACSNIPYHSSYIADMGANLLARLQKIHKTPAKRSPRWISSSIPESQWHLPSSQFTSAEYQTNNVLNPVLFEEAISHLPANAVCIEIAPYGLLQAILKRALKDSVHISLTKKGEANNDQFLLGALGRIYCNGFDMDISTLYPPIKYPVSRGTPMIAPHIKWDHSEDHYVMRFEDCSEGKSAERKVPISLTDVEYEFVQGHCIDGRCLFPATGYIFLAWETFAMTKGKMYFDMSVEFEDLKFLRATSLSKDVEVEFTIVIHPGSGKFEISEGSSTLVTGFIKQVENPKLRDVENVTNDKILRSKDFYKELRLRGYHYTGIFQSVVEGSFDGNYGKIRWDSNWIAFLDCLLQLKIIGKDTRSLVLPTAIQKFTIHTSKHLEMLENFSEGDEIFFEARISDELKIIRCGGVEIVNLQSSSVGRRKAPGVPVIESYEFVPYDCEDTVLSVENATRIISQLLIENLQTTKVKGVEIDTIEGVETILGGLKSALMDLPLITAEMTLLSARNSEDLGEIAVANEVINLHKECSVLVASAEVKVEYVESLVEGGFLVMRNLRDNQSLPEELTKISTLRLKDEILTVFQCRKEKSDRTKTALLISSDDTSFAWAETAKKLLKASNLVIVAENDPTSGVLGLVNCLRKEADSKSVSCVLIDDSKAPKFDLNCEFYKSQLKKTLAINVFRNGRWGSYRHLPIVQSFEEKSTLDHCFANALVKSDLSSLKWLRGPLNTAENDVVCVHYAPLNFKDVMLATGKISAEIFVSTRLDLECVLGFEYSGVSSGAQRVMGLIPSGALATFVKMDPALTWRCPNEWSLAEAATVPCVYATVYYAFFVKTQVQPGKSILIHAGSGGIGLAAIRVAFAYGLEVFTTVSTEEKKNFLLNEFPQLKRENIGNSRDTSFEDMIAVRTNGKGVDYVLNSLAEEKLHASIRCLGKGGKFLEIGKFDMEKDTKIGMNARDRH 
Type Start End Length
CDS 207 223 17
CDS 313 2553 2241
CDS 2614 2843 230
CDS 3146 4394 1249
CDS 4451 4736 286
CDS 4836 5334 499
CDS 5391 5812 422
intron 224 312 89
intron 2554 2613 60
intron 2844 3145 302
intron 4395 4450 56
intron 4737 4835 99
intron 5335 5390 56

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001849863 fatty acid synthase S-acetyl transferase [Culex quinquefasciatus] gb|EDS30986.1| fatty acid synthase S-acetyl transferase [Culex quinquefasciatus] 0.0
InterPro IPR014031 Beta-ketoacyl synthase, C-terminal
InterPro IPR016035 Acyl transferase/acyl hydrolase/lysophospholipase
InterPro IPR014030 Beta-ketoacyl synthase, N-terminal
InterPro IPR014043 Acyl transferase
InterPro IPR020801 Polyketide synthase, acyl transferase domain
InterPro IPR013149 Alcohol dehydrogenase, C-terminal
InterPro IPR016038 Thiolase-like, subgroup
InterPro IPR020841 Polyketide synthase, beta-ketoacyl synthase domain
InterPro IPR016039 Thiolase-like
InterPro IPR011032 GroES-like
InterPro IPR020843 Polyketide synthase, enoylreductase
InterPro IPR001227 Acyl transferase domain
InterPro IPR016040 NAD(P)-binding domain
InterPro IPR016036 Malonyl-CoA ACP transacylase, ACP-binding
Gene Ontology(BP) GO:0008152 metabolic process
Gene Ontology(BP) GO:0055114 oxidation-reduction process
Gene Ontology(MF) GO:0005515 protein binding
Gene Ontology(MF) GO:0000166 nucleotide binding
Gene Ontology(MF) GO:0008270 zinc ion binding
Gene Ontology(MF) GO:0016747 transferase activity, transferring acyl groups other than amino-acyl groups
Gene Ontology(MF) GO:0016740 transferase activity
Gene Ontology(MF) GO:0003824 catalytic activity
Gene Ontology(MF) GO:0016491 oxidoreductase activity
Pfam PF00109.21 Beta-ketoacyl synthase, N-terminal domain 2.2e-62
Pfam PF00698.16 Acyl transferase domain 6.8e-63
Pfam PF00107.21 Zinc-binding dehydrogenase 3.1e-14
Pfam PF00108.18 Thiolase, N-terminal domain 0.0014
Pfam PF08545.5 3-Oxoacyl-[acyl-carrier-protein (ACP)] synthase III 0.0019
Pfam PF02801.17 Beta-ketoacyl synthase, C-terminal domain 1e-31
Pfam PF12242.3 NAD(P)H binding domain of trans-2-enoyl-CoA reductase 0.075

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.11656
Pn.06426
Pn.03933
Pn.04187
Pn.11493
Pn.04992
Pn.08717
Pn.06151
Pn.15816
Pn.00600
Pn.01298
Pn.14865
Pn.04993

Orthologous genes

Species Gene ID
P. vanderplanki Pv.14921
B. mori BGIBMGA008581-TA
B. mori BGIBMGA013153-TA
S. invicta SI2.2.0_01047
S. invicta SI2.2.0_07564
D. plexippus DPOGS212360PA
C. quinquefasciatus CPIJ008367
A. aegypti AAEL002237
S. invicta SI2.2.0_11351
P. vanderplanki Pv.12209
P. vanderplanki Pv.14772
A. aegypti AAEL002204
P. vanderplanki Pv.01475
D. melanogaster FBgn0040001
S. invicta SI2.2.0_11327
S. invicta SI2.2.0_10014
S. invicta SI2.2.0_00900
S. invicta SI2.2.0_08502
S. invicta SI2.2.0_00383
S. invicta SI2.2.0_15127
S. invicta SI2.2.0_12057
S. invicta SI2.2.0_13910
S. invicta SI2.2.0_14664
P. vanderplanki Pv.10571
N. vitripennis NV22399-PA
N. vitripennis NV10926-PA
P. vanderplanki Pv.02828
P. vanderplanki Pv.12689
B. mori BGIBMGA013046-TA
P. vanderplanki Pv.03372
A. aegypti AAEL008160
S. invicta SI2.2.0_06311
S. invicta SI2.2.0_14684
P. vanderplanki Pv.00511
P. vanderplanki Pv.16863
P. vanderplanki Pv.14617
N. vitripennis NV14455-PA
P. vanderplanki Pv.00790
S. invicta SI2.2.0_05482
A. mellifera GB12198-PA
P. vanderplanki Pv.14852
T. castaneum TC007689
C. quinquefasciatus CPIJ005595
P. vanderplanki Pv.02964
P. vanderplanki Pv.02965
A. aegypti AAEL002227
P. vanderplanki Pv.05860
T. castaneum TC015399
D. plexippus DPOGS206960PA
H. melpomene HMEL005305-PA
B. mori BGIBMGA008582-TA
A. aegypti AAEL002228
S. invicta SI2.2.0_15862
S. invicta SI2.2.0_06588
B. mori BGIBMGA004655-TA
N. vitripennis NV17124-PA
A. mellifera GB16883-PA
T. castaneum TC015340
S. invicta SI2.2.0_10115
T. castaneum TC015339
S. invicta SI2.2.0_13311
H. melpomene HMEL008318-PA
S. invicta SI2.2.0_16052
N. vitripennis NV10927-PA
H. melpomene HMEL015510-PA
N. vitripennis NV10111-PA
S. invicta SI2.2.0_14578
S. invicta SI2.2.0_13320
P. vanderplanki Pv.14263
M. musculus ENSMUSG00000025153
B. mori BGIBMGA013047-TA
B. mori BGIBMGA008579-TA
H. melpomene HMEL015513-PA
P. vanderplanki Pv.10570
T. castaneum TC000238
P. vanderplanki Pv.12440
P. vanderplanki Pv.10251
A. gambiae AGAP009176
P. vanderplanki Pv.02966
A. aegypti AAEL001194
N. vitripennis NV14456-PA
A. gambiae AGAP008468
S. invicta SI2.2.0_00168
C. quinquefasciatus CPIJ003494
B. mori BGIBMGA008602-TA
S. invicta SI2.2.0_09698
B. mori BGIBMGA013281-TA
S. invicta SI2.2.0_01792
P. vanderplanki Pv.00897
P. vanderplanki Pv.00898
P. vanderplanki Pv.07928
S. invicta SI2.2.0_16094
H. melpomene HMEL004144-PA
S. invicta SI2.2.0_15874
D. melanogaster FBgn0042627
B. mori BGIBMGA013151-TA
P. vanderplanki Pv.11254
T. castaneum TC015400
H. sapiens ENSP00000304592
D. plexippus DPOGS207082PA
A. gambiae AGAP001899
C. quinquefasciatus CPIJ003495
S. invicta SI2.2.0_02241
T. castaneum TC015337
S. invicta SI2.2.0_06292
S. invicta SI2.2.0_05241
D. melanogaster FBgn0027571
T. castaneum TC011522
S. invicta SI2.2.0_14661
S. invicta SI2.2.0_03932
S. invicta SI2.2.0_08063
N. vitripennis NV50551-PA
P. vanderplanki Pv.09028
P. humanus PHUM080440-PA
A. aegypti AAEL002200
P. humanus PHUM448390-PA
P. humanus PHUM565830-PA
P. vanderplanki Pv.03790