MidgeBase gene description page [Pn.08039]

Outline

Gene ID Pn.08039
Type Protein-coding gene
Scaffold PnScaf7576
Start 10090
End 12880
Direction +

Sequence

Transcript: 1989 (bp)

 ATGTATACGTCAAAGTTTAGAGTTTGCAGAATCTTGTTTATTGCCGGCTCGGTGTGGATCTGCACGGTGCTGACGCTCTTCTACTACTGGGAACTCATCATGGAGCACCAGCTGCGGGCGCAGAAATTCTCAAAAAACCCAACCGATGATAACTTAGTCAATTTTCAAAGGTTGGAGGCCGAGCCGCGTGCTGAGCCAGCGCCCACCACTAAAATTCCCTTTTACGTCACGAAAAAGCGGCAAGTCATGACTTTGAAGGAGCACACAACGACACGGGAAGCGACCACCACCTCAAGCACTACGTCACCTGCCACTACAACGACGACCTCGACGACTGTCAGAAAAACAACAACAACGACGCCTAAGCGACGAATGTACGAGCGCGTTGTCGCGACAATTTCGCCCGAGGTCTACAAGACACTGGGACTGGGCGAGAACCCAGGCGAGGCCGGTCGTCCCATCAAGATCTCAGACCCGCCGGCCGACATCAAGAAGAAGATCGAAGACGGCTGGGCACGCCACGAATTCAACGAGTTCCTCTCGGACCTGATCTCAGTCAACCGGAGCGTGCCCGATCCGCGGTCCATGTACTGCCGAAAGGAGGGGCTCTACCTCAACGAGCTGCCCACGACCTCGGTCATAATTATCTTCCACAACGAGGCATGGTCGACCTTGTTGAGGAGTGTTCACTCCGTCATTAACAGATCTCCTCCCCACCTAGTTAAGGAAATTATTTTAGTCGATGATTATTCAAATATGTTGCATTTGAAGGAGGCCCTCGAGGACTACATGTCTGACTTTCCGAAAGTTCGAATCGTCCGCATGGAGAAGCGCGTCGGACTCATCAAGGCGAGAATTGCAGGCACAAACGCTGCAACTGCACCGACGCTTACGTTCCTCGACTCGCACATTGAGTGCGCCGAAGGCTGGCTCGAACCGCTGCTCGACCGAATCGCGAGGGACAAAACGAAGGTCGTTTGCCCCGTGATCGACGTAATTGACGATGACACGCTGGGCTTTAGCTATCAGGATTCGAGCGGCCTGCAAGTCGGCGGCTTCGACTGGGAAATGACGTTCGACTGGCACATGGTGCCGTCGCGGGAGCGAAAGCGAAAGAAGGATCAGTCGGAGCCGACATTCTCGCCGACAATGGCGGGCGGATTGTTCTCGATAGACAAGGCCTTTTTCGAGCGATTAGGAACGTACGATGATGGATTCGATATTTGGGGTGCGGAGAATCTCGAGCTGAGCTTCAAGACGTGGATGTGCGGCGGAACTTTGGAGATAATTCCGTGCTCGCACGTCGGGCACATCTTCCGGAAGAAATCCCCTTACAAATGGCGCCCAGGTGTCGATGTGCTCAAAATTAATACCGTGCGCTTGGTGGAGGTTTGGCTTGACGAATACTCGCGCTACTACTATGTGCGGCGCGGCAGCCACAAGGGCGACTTTGGCGACATTTCGAAGCGCGTCGAGCTGCGCAAGAACCTCAACTGCAAGAGCTTCAAGTGGTACCTCGAAAATGTGTGGCCCGAAATGACCGTGCCTGACAACATCGCCGAGGGCTGGATTCGAAGCAAGGGCATCAACAATCCGACGTGCTTTGATGCCGCATTTGAAAATCACGATTCGACCTCGAGAATCGCCTTCTACGGCTGTCACGACTACGGTGGCAATCAGTTCTTTGAGTTCTCGACGAAGCACGAGATCAAGAGAAAGCGACATTGCCTCGATTATTCGAACAGCACCAACGAGTTGAAGTTTGTGCTTTGCCACGGTGCCAAAGGCGACCAGTACTGGGACATTGATGTGGAAACGCGACAAATTTACCACCGGGCATCCAACAAGTGCCTCGAGATCTTCAAAAACAACTACCAGTACGAGCCGACGATGCAGGAGTGCGACGAGAAAAACAACAATCAAAAGTGGGACTTTCAGTATTTGTACGAGGAGAAGTTGTTGGGAAGCAATAAAACGACCGTTGCGGTT 

Protein: 663 (aa)

 MYTSKFRVCRILFIAGSVWICTVLTLFYYWELIMEHQLRAQKFSKNPTDDNLVNFQRLEAEPRAEPAPTTKIPFYVTKKRQVMTLKEHTTTREATTTSSTTSPATTTTTSTTVRKTTTTTPKRRMYERVVATISPEVYKTLGLGENPGEAGRPIKISDPPADIKKKIEDGWARHEFNEFLSDLISVNRSVPDPRSMYCRKEGLYLNELPTTSVIIIFHNEAWSTLLRSVHSVINRSPPHLVKEIILVDDYSNMLHLKEALEDYMSDFPKVRIVRMEKRVGLIKARIAGTNAATAPTLTFLDSHIECAEGWLEPLLDRIARDKTKVVCPVIDVIDDDTLGFSYQDSSGLQVGGFDWEMTFDWHMVPSRERKRKKDQSEPTFSPTMAGGLFSIDKAFFERLGTYDDGFDIWGAENLELSFKTWMCGGTLEIIPCSHVGHIFRKKSPYKWRPGVDVLKINTVRLVEVWLDEYSRYYYVRRGSHKGDFGDISKRVELRKNLNCKSFKWYLENVWPEMTVPDNIAEGWIRSKGINNPTCFDAAFENHDSTSRIAFYGCHDYGGNQFFEFSTKHEIKRKRHCLDYSNSTNELKFVLCHGAKGDQYWDIDVETRQIYHRASNKCLEIFKNNYQYEPTMQECDEKNNNQKWDFQYLYEEKLLGSNKTTVAV 
Type Start End Length
CDS 10090 10793 704
CDS 11038 11093 56
CDS 11163 11476 314
CDS 11873 12026 154
CDS 12117 12877 761
intron 10794 11037 244
intron 11094 11162 69
intron 11477 11872 396
intron 12027 12116 90
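
The coordinates in the table above can be cross-checked against the stated transcript and protein lengths. The following is a minimal sketch, assuming the Start and End columns are 1-based and inclusive (which matches the Length column); the names `FEATURES`, `feature_length` and `total_length` are illustrative and not part of MidgeBase.

```python
# Feature rows copied from the table above: (type, start, end).
# Assumption: coordinates are 1-based and inclusive.
FEATURES = [
    ("CDS", 10090, 10793),
    ("CDS", 11038, 11093),
    ("CDS", 11163, 11476),
    ("CDS", 11873, 12026),
    ("CDS", 12117, 12877),
    ("intron", 10794, 11037),
    ("intron", 11094, 11162),
    ("intron", 11477, 11872),
    ("intron", 12027, 12116),
]


def feature_length(start, end):
    """Length of a 1-based, inclusive interval."""
    return end - start + 1


def total_length(features, feature_type):
    """Sum the lengths of all features of the given type."""
    return sum(
        feature_length(start, end)
        for ftype, start, end in features
        if ftype == feature_type
    )


cds_total = total_length(FEATURES, "CDS")        # summed CDS segments
intron_total = total_length(FEATURES, "intron")  # summed introns
codons = cds_total // 3                          # whole codons in the CDS

print(f"CDS total: {cds_total} bp")
print(f"Intron total: {intron_total} bp")
print(f"Codons: {codons}")
```

The CDS segments sum to 1989 bp, matching the transcript length, and 1989 / 3 gives the 663 residues of the protein. The last CDS ends at 12877 while the gene End is 12880; the remaining 3 bp presumably correspond to a stop codon, although the page does not state this.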

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr NP_725602 CG30463, isoform A [Drosophila melanogaster] ref|NP_001097342.1| CG30463, isoform D [Drosophila melanogaster] sp|Q8MRC9.2|GALT9_DROME RecName: Full=Putative polypeptide N-acetylgalactosaminyltransferase 9; Short=pp-GaNTase 9; AltName: Full=Protein-UDP acetylgalactosaminyltransferase 9; AltName: Full=UDP-GalNAc:polypeptide N-acetylgalactosaminyltransferase 9 gb|AAF57966.2| CG30463, isoform A [Drosophila melanogaster] gb|ABV53823.1| CG30463, isoform D [Drosophila melanogaster] 0.0
InterPro IPR000772 Ricin B lectin domain
InterPro IPR001173 Glycosyl transferase, family 2
Pfam PF02709.9 N-terminal domain of galactosyltransferase 3.6e-07
Pfam PF00535.21 Glycosyl transferase family 2 9.3e-31
Pfam PF13333.1 Integrase core domain 0.055
Pfam PF10111.4 Glycosyltransferase like family 2 0.0003
Pfam PF00652.17 Ricin-type beta-trefoil lectin domain 2.9e-24
Pfam PF14200.1 Ricin-type beta-trefoil lectin domain-like 0.00054
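
The Pfam rows above vary widely in strength, so it can help to separate strong hits from marginal ones. The sketch below assumes the Score/Expectation column holds E-values for the Pfam rows; the cutoff `E_VALUE_CUTOFF = 1e-5` and the helper `confident_hits` are hypothetical choices for illustration, not thresholds used by MidgeBase.

```python
# Pfam rows copied from the table above: (accession, description, e_value).
# Assumption: the Score/Expectation column is an E-value for these rows.
PFAM_HITS = [
    ("PF02709.9", "N-terminal domain of galactosyltransferase", 3.6e-07),
    ("PF00535.21", "Glycosyl transferase family 2", 9.3e-31),
    ("PF13333.1", "Integrase core domain", 0.055),
    ("PF10111.4", "Glycosyltransferase like family 2", 0.0003),
    ("PF00652.17", "Ricin-type beta-trefoil lectin domain", 2.9e-24),
    ("PF14200.1", "Ricin-type beta-trefoil lectin domain-like", 0.00054),
]

# Hypothetical cutoff chosen only for this example.
E_VALUE_CUTOFF = 1e-5


def confident_hits(hits, cutoff):
    """Return hits with E-value <= cutoff, strongest (smallest E-value) first."""
    kept = [hit for hit in hits if hit[2] <= cutoff]
    return sorted(kept, key=lambda hit: hit[2])


for accession, description, e_value in confident_hits(PFAM_HITS, E_VALUE_CUTOFF):
    print(f"{accession}\t{e_value:.1e}\t{description}")
```

With this cutoff the glycosyl transferase family 2 and ricin-type lectin domains remain, consistent with the two InterPro entries listed, while the integrase core domain hit (0.055) is dropped.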

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.10791

Orthologous genes

Species Gene ID
P. vanderplanki Pv.10199
B. mori BGIBMGA005279-TA
P. humanus PHUM170220-PA
A. aegypti AAEL001121
C. quinquefasciatus CPIJ005699
P. vanderplanki Pv.13649
C. quinquefasciatus CPIJ005695
A. aegypti AAEL008252
T. castaneum TC005479
N. vitripennis NV13453-PA
C. quinquefasciatus CPIJ005698
C. quinquefasciatus CPIJ005697
P. vanderplanki Pv.13831
S. invicta SI2.2.0_06269
H. melpomene HMEL003641-PA
D. plexippus DPOGS204937PA
A. aegypti AAEL001151
A. mellifera GB13681-PA
D. melanogaster FBgn0050463
A. gambiae AGAP012253
A. aegypti AAEL001146
A. gambiae AGAP006881
A. aegypti AAEL001122