MidgeBase gene description page [Pn.12034]
Outline
Gene ID | Pn.12034 |
Type | Protein coding gene |
Scaffold | PnScaf15298 |
Start | 24900 |
End | 27762 |
Direction | - |
Sequence
Transcript: 2652 (bp)
ATGTTCATCGCATTATTTCGTCGCTACAAGCTGCTCTTTCTCCTTGGCTTACTTGTTCTTCTCATTCAAGTGTTTCTCGCTTACAAATCAATCAAAATTCCTATTGACTCGGGCAATAAGCTGCTCGTACAAAAATTCAGCAGCCTCAAGGACCGTCTACAGAACAATCATAATCTTGAAGAAAAGCAGCTATCATTGAATGACGACGAGGACATTATCAACTCGCAGGAAGGAGGAGCTTCGTCGTCGGTGAAGAGTGATAAGGACAAGGCAGCAACTCTACTCAGTGAATTGAAATTCAAACCAAAATGTGATATTTTAAAGGATAAAGAAGTCATATCAGCAGTTCAACGAGCACGAACACAGCAGTGTAAACAGCACATAATTAATATCGCGTGCCAGATTAAAACGGGCGTATTATACCCAAAGAGACTCCCAAACACATGTCCAAGCGGCAACTATATTGAGAACCGCTCGCTCGGCTGCTTCAAAGATACCAAGAAGCATCGACTCCTCTCGAGCTTGTACTCTAACTTTAAAGAGACAAACTCGCCCAAGAAATGTATACAGATATGCCTCCAATCGGGCTTCGTTTATGCCGGAGTGCAGTATTCGAGCGAATGCTTCTGTGGCAATCATCAACCGACGATTGAAGCAAAGCTAGCGGATTCGAGCTGCAACATGAAATGCCCTGCCGAACCTAAATCAACATGTGGAGGCTATTTCACAATGAACATTTACGAGACTGGCTTAGCGAAGTACACATCACAACAAATTACCGGTTCTCAACCAAAACTAAATGTCGTTGATAGCAATAAGAAGGTCAAAATTGTTTTCCTCCTGACATTAAACGGACGAGCTTTAAGGCAAATTTATCGTTTAATTAAATCCTTGTACAGCGTTGAGCACTATTATTATATTCACGTCGATTCGCGACAAGATTATCTTTATCGAGAGCTTTTGAAACTGGAGGCGATTTTTCCGAACGTACGACTATCACGAAGGAGGCTTGCAACGATATGGGGTGGAGCATCGCTTCTCGAAATGCTGCTTATGTGCATGAGTGATTTGTTAGAGTCCGACTGGGAGTGGGATTTTGTGTTGAATCTTAGTGAGAGCGATTTTCCAATCAAAACAGTGGACAAGCTAACAAACTTTTTGTCAGCAAATCGGGACAAGAATTTCGTAAAGTCGCACGGTAGAGAGACGCAACGATTTATCCAGAAGCAGGGCTTAGACAAGACATTTGTCGAATGTGATACACACATGTGGCGCATTGGTGATCGTGTCTTGCCGGACGGCATACAAATCGATGGAGGAAGTGATTGGATATGCTTGTCAAGAAAGTTCGTTTCGTATGTCACAGCCGAGTTGCGGGACGAACTGGTCGAGGGACTGTTGAAGATCTTCCAGTTTACTCTTCTTCCAGCCGAATCATTCTTTCACACTGCCATACGAAATTCACTATTTTGTGACACATACGTTGATAATAACCTTCACATTACGAATTGGAAGAGGAAACTCGGATGCAAGTGTCAGTATCGACACATTTGCGATTGGTGTGGCTGCAGTCCAAACGATTTCAAAAACGACGACTGGCCGCGTCTACAAGCTACTGAACACAAGCAGCTATTTTTCGGTAGAAAATTTGAACCTGTGGTAAATCAACTGGTGATCCTTCAACTGGAGGAGTGGATGACGGGTCCCTATTCACAAGACTACTTAAATCTAAACAGCTATTGGCAGAGTACCTATCACTTTGAGGATAAAAGTCCTTTGCCGAATCAAGCGCTCTTGCTTGTTGCCAATAGTCTGATTCGAATCAACTCGAAATCGAATTCAGTTCAACAGTTTTACGAGCCGGTGAGAGTACTTGAAGTAACCGATTACTTTGATTTAGATGTCTATAAAGGATTTTTAATACGACATGAGGCAAAAATTAATGTGAATCTTACTGTTGAACTTGAAATGTGGAGCCGTCCAAACCACCAGCATGCGCA
AGTGTCAAAAACTAATAAAATTGCCAAGAAAATTATGCAGTTAGAGGTGAGCACGGACTTTGATCAAAAGGAGCAAATGTATAGGAACTTTCCGCGAATAATTGGTCAGCAGTCGGAACCGGTTTTGGTGCTCAAACTGTCGGGAACATCGCATGTTGAAAACTCAACACTCACACTTACGGTTGTTTGGATTGATCCGAACGAAAAAGTGGAGGAAGTTGGGGAGCTGACTATCGAAGACATTACTGTTACATCAATAAATTTTTCAAAATCAAATCTAAAACATCCGCTAATGAGTGGTGTTTGGACAGTTAAACTGCTCCAAAAAAAATCACTGATAGGATTAACAAAATTTCTTGTCCTTCCCACTCTGAGTGACGCATTACCGTCATCATTGACCAAAGAATCAAATGCAAGTCAAAATCAATTAGATAAGTCAATAGCAAATTTTTATTTAATAAAAGACACGTGCATATCCTACAACCAGAAAAATATTCGAGACATAATAGGCACATATTTAGCGAACGATGCTAATGGCAATTCCAAAAATATTATAAAATTTAGCGAATGCAAGAAATCGATGTGGAGCTCTTTTTCGCCCGATCCCAAAAGTGAATTAATATCTGATTTTGGAAATTTTGATGGTTCATCA
Protein: 884 (aa)
MFIALFRRYKLLFLLGLLVLLIQVFLAYKSIKIPIDSGNKLLVQKFSSLKDRLQNNHNLEEKQLSLNDDEDIINSQEGGASSSVKSDKDKAATLLSELKFKPKCDILKDKEVISAVQRARTQQCKQHIINIACQIKTGVLYPKRLPNTCPSGNYIENRSLGCFKDTKKHRLLSSLYSNFKETNSPKKCIQICLQSGFVYAGVQYSSECFCGNHQPTIEAKLADSSCNMKCPAEPKSTCGGYFTMNIYETGLAKYTSQQITGSQPKLNVVDSNKKVKIVFLLTLNGRALRQIYRLIKSLYSVEHYYYIHVDSRQDYLYRELLKLEAIFPNVRLSRRRLATIWGGASLLEMLLMCMSDLLESDWEWDFVLNLSESDFPIKTVDKLTNFLSANRDKNFVKSHGRETQRFIQKQGLDKTFVECDTHMWRIGDRVLPDGIQIDGGSDWICLSRKFVSYVTAELRDELVEGLLKIFQFTLLPAESFFHTAIRNSLFCDTYVDNNLHITNWKRKLGCKCQYRHICDWCGCSPNDFKNDDWPRLQATEHKQLFFGRKFEPVVNQLVILQLEEWMTGPYSQDYLNLNSYWQSTYHFEDKSPLPNQALLLVANSLIRINSKSNSVQQFYEPVRVLEVTDYFDLDVYKGFLIRHEAKINVNLTVELEMWSRPNHQHAQVSKTNKIAKKIMQLEVSTDFDQKEQMYRNFPRIIGQQSEPVLVLKLSGTSHVENSTLTLTVVWIDPNEKVEEVGELTIEDITVTSINFSKSNLKHPLMSGVWTVKLLQKKSLIGLTKFLVLPTLSDALPSSLTKESNASQNQLDKSIANFYLIKDTCISYNQKNIRDIIGTYLANDANGNSKNIIKFSECKKSMWSSFSPDPKSELISDFGNFDGSS
Type | Start | End | Length |
CDS | 24903 | 26530 | 1628 |
CDS | 26590 | 26680 | 91 |
CDS | 26736 | 26911 | 176 |
CDS | 27006 | 27762 | 757 |
intron | 26531 | 26589 | 59 |
intron | 26681 | 26735 | 55 |
intron | 26912 | 27005 | 94 |
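The coordinates in the feature table above can be cross-checked against the transcript and protein lengths reported in the Sequence section. The following is a minimal sketch, not part of MidgeBase itself: it assumes the coordinates are 1-based and inclusive (which is what the listed lengths imply), and the variable and function names are illustrative only.

```python
# CDS coordinates copied from the feature table (assumed 1-based, inclusive).
cds = [(24903, 26530), (26590, 26680), (26736, 26911), (27006, 27762)]

# Values reported elsewhere on the page.
GENE_START, GENE_END = 24900, 27762
TRANSCRIPT_BP = 2652
PROTEIN_AA = 884


def feature_length(start, end):
    """Length of a feature with inclusive start and end coordinates."""
    return end - start + 1


cds_lengths = [feature_length(s, e) for s, e in cds]

# Introns are the gaps between consecutive CDS segments.
introns = [
    (prev_end + 1, next_start - 1)
    for (_, prev_end), (next_start, _) in zip(cds, cds[1:])
]
intron_lengths = [feature_length(s, e) for s, e in introns]

total_cds = sum(cds_lengths)
gene_span = feature_length(GENE_START, GENE_END)
uncovered = gene_span - total_cds - sum(intron_lengths)

print("CDS lengths:   ", cds_lengths)
print("Introns:       ", introns)
print("Intron lengths:", intron_lengths)
print("Total CDS bp:  ", total_cds, "(matches transcript:", total_cds == TRANSCRIPT_BP, ")")
print("Codons:        ", total_cds // 3, "(matches protein:", total_cds // 3 == PROTEIN_AA, ")")
print("Gene span bp:  ", gene_span, "uncovered by features:", uncovered)
```

Under these assumptions the four CDS segments sum to 2652 bp, which equals the transcript length and exactly 3 x 884 codons, and the derived intron coordinates match the three intron rows. The gene span (24900 to 27762) is 3 bp longer than the features combined; positions 24900 to 24902 are not covered by any listed feature. For a minus-strand gene this would be consistent with a stop codon excluded from the CDS, but the page does not state that.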
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_001658334 | xylosyltransferase [Aedes aegypti] gb|EAT40893.1| xylosyltransferase [Aedes aegypti] | 0.0 |
InterPro | IPR013994 | Carbohydrate-binding WSC, subgroup | |
InterPro | IPR024448 | Xylosyltransferase, metazoan | |
InterPro | IPR003406 | Glycosyl transferase, family 14 | |
InterPro | IPR002889 | Carbohydrate-binding WSC | |
Gene Ontology(BP) | GO:0006024 | glycosaminoglycan biosynthetic process | |
Gene Ontology(CC) | GO:0016020 | membrane | |
Gene Ontology(MF) | GO:0008375 | acetylglucosaminyltransferase activity | |
Gene Ontology(MF) | GO:0030158 | protein xylosyltransferase activity | |
Pfam | PF01822.14 | WSC domain | 1.3e-18 |
Pfam | PF12529.3 | Xylosyltransferase C terminal | 1.1e-48 |
Pfam | PF02485.16 | Core-2/I-Branching enzyme | 1e-43 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
D. plexippus | DPOGS211641PA |
P. humanus | PHUM187000-PA |
D. melanogaster | FBgn0015360 |
H. sapiens | ENSP00000017003 |
A. gambiae | AGAP005811 |
H. melpomene | HMEL006645-PA |
B. mori | BGIBMGA011693-TA |
H. sapiens | ENSP00000426501 |
P. vanderplanki | Pv.17580 |
C. quinquefasciatus | CPIJ019812 |
M. musculus | ENSMUSG00000020868 |
H. sapiens | ENSP00000261381 |
A. aegypti | AAEL007409 |
H. sapiens | ENSP00000365733 |
A. mellifera | GB18395-PA |
T. castaneum | TC002371 |
M. musculus | ENSMUSG00000030657 |
A. gambiae | AGAP005810 |
N. vitripennis | NV12679-PA |
S. invicta | SI2.2.0_06529 |