MidgeBase gene description page [Pn.09888]
Outline
Gene ID | Pn.09888 |
Type | Protein coding gene |
Scaffold | PnScaf10345 |
Start | 5954 |
End | 11553 |
Direction | - |
Sequence
Transcript: 3516 (bp)
ATGAGCTATTCCTGCGGTCTGACGATAAACAACACTGTCGGCGCCGAGATCACGAATATAAGCGGCACGCATCTTCTGGGCTACTCCGACGCGGACGTGCTGGCAATTGTGCGGGTCAGCGGTCAGACGGCGATAGTTCCGCGCGCCATCTGCTTGCGATTTCCCAACATCCAAGCGGCCGCATTCTTCGGCTCAGGGCTGACGACCGTCACGGGCTCCTCTTTCGGCGGCTGCCGCTCGCTCGCGTGGCTCGAGCTCCTCAACAACCGCATCGCCTCAGTCGCTGCCGACGCCTTTCTCAACAACACGCAGCTCGACTTCCTCGACCTCGGCCAGAACCGCCTGACGACCCTGCCTGTGACGGTTTTTGCACCGCTCGCGAGTCTCCGCTCGGTCGACATCCGCGAAAATCCGTTCGCCAGCATCCCAAGCGGAATTTTCGCGAACCTCACCGAACTGCGCAACGTGTACATGCAGAGCGTCGGCATTTCCGAGGTGAACCCGCACTGGTTCAGTGCCGGCAACAGCGTTCGGAGCTTCTACGTTGGCGGCAACAACCTGCGCACAGTCCGGGCCGGTTCTTTCGCGACGCTCGGCACGCTGCTGATTCTCAGCCTCATCGACAGCAACGTGCGCTCGATAGAGGTTGGAGCCTTCGACGGCCTCGCCTTCGTCACCTACATGTACTTGGAGCGCAATGAGATCGAGGAGATCCCTGCAGGTGTCTTCGACGGCCTTGCACGCCTCTCAACGCTCGACCTCTCCTCAAACAACATCCGGACGCTGACCAACGGAAGCTTCAGATCGCTCAGTAGTCTGCGCGAGCTCACGATAAACAGCTGCGGGGTTCGCGACATCGAGGTCGGTGCCTTCGAGGGGCTCTCGAGCCTCGGCATTTTGGACGCGAATTTGAACTCGATCGAGTACATCGCGCCGGCAGTTTTTGCGCCGCTCGTCAACCTCGGGCATCTGGGCTTGCGGAGCAACAGGCTGCGCGTCCTCTACCGCAACTCGTTTGGCGCGAAAGTGAGCCTGATCGAGTCGCTGGACTTTGACGCGAACGTCATCAACGCGGTCGAGCGGCAGCTGATCGAGGACGCCGCGAGGCTGAACATGATGAAGTTCAAGGACAACATTTGCTCGAGCGGCACTTTCGGCTCCTTCATGGTCAATCGCGAGTGGTACATGATTGTGCTGAGCCAGTGCTTCAGCAACTTCGAAAATACCATCAGCACGATAACCGACAACAATGCACAATACCAGTTCTTTGCTGCAAATTTACCGGGATTTTTCGCCCGCGTGCAGGCAGTCGACACCATCGAGATCGCGCTCAGTCCAGTCAACGGCACAGCCTCGCAGCTGATCGAGGTTCTGATCGGAACGGCCAACAACACGCTGTCGGTGATCCGCGCCAATCAGCAGACCGACGTAGCCATCGTGCCGTCGCCGGGCATCATCAGAAGCGGCGAGTCAAACACTTTCAGAGTGTCGTGGGTCAACGGTGTGATTTTGGCGTTCAGAGAAAACGAAGAGTTCCCTTTTCTGGTGCACACCATGCTTGAGCCGTTTGACGTCAACTTTTTTGGACTCCGGACGAGCCACGGCTGCCTCAATAAAAAGAAACAAGAAAATAATAAAGAAAGAAGAAGAAGAAACTTATCGAACGCCAAAAACCCAATCTTATCGCCGACATTTTCACGTTTCCGCGCGATTTCAGCGCCTGCTTCGCTCTCAGTCTTTCCCCTCACCTCGACTGCACGCGACATGAAGCTCCCAGCTCTCGCGACCTTCCTCTCGCTCTTCGTCCTCCGAGCGACGGCCGAAACGGTGACCTGCGAGTTCGGCAGCCGCACCATCGGCGGCGTCTCGCTCTACAGCTGCGTCCTCCAAATCAACGGCACCGCCGGCGCGGAGGTCACCGAAATCGGCGGCACGCACCTCGCCGGCCGCACCGACGCCGACGTGCTCGGCATCGTGCGCCTCAGCGGCAGCCTCACCGCCATCCCGGCCGTCATCTGCGCGCGCTTCCCCAACCTCCAGCGAGTGGCGCTCCCCAATGCCGGCCTCACCGTCCTCGCCGACGACGCCCTGCGTGGCTGCCCGCGCCTCACGTGGCTCTCGCTCCTCAGCAACCGCATCGGCTCCGTGTCGGCCGGCGCCTTCGCCAACAACCCCGAGCTCACCTTCCTCGACCTCGACTCCAACAGCCTCAGCACCCTGCCCGAGGCCGTCTTTCGCAACCTCAGCCGCCTCGAGAGCCTCGACCTCCGCTCCAACCCCCTCACCGCGCCCCTTCCCGACGCCATCTTCCGCGATCTCACGTCGCTGTCGCGGCTCTTCCTGCAGTCGACCGGCATGCGCGCCATCAACCCGCTCTGGTTCAACACGACCACGCGCCTCGTCGCCCTCTACCTCGGCAACAACCAGATCGAGACCATCAGCGCCAACAATGTTGCCACGCTCACGTCCCTCGAGGTGCGTACACTTCTGAGTCTCTTCGGCAACCGTCTGGGCTTCGTGGGCGGCAACACGTTTGCAGCGCTGCGAAACCTCCGCTTCCTCGACATTGCCTACAACGGCATCAGCCTGATCCATCCGCTGCGCTTCGCCGGCCTCGAGATGCTGACGACGCTCGACATGTCGGGCAACAACCTCACGGCCATCGGTGCCGGCGCCTTCCGGCAACTGGCGAATCTGGACAATCTTTACCTGAGCGGCGCCGGCATCCGAAGGCTCGACCCAGCCGCCTTCGAGGGCCTCACGCAACTCACCTTCCTGAACCTCAACTTCAATGAGATCGAGGACCTGCCGGCCGGCGTGTTTGCGCCAATGGCGAGCCTCAGCTCGGTGAACCTGTGGCAGAACCGCCTGAAGACCGTTCGACGCGACATCTTCGGCGCCAACGTCGCCTCGCTGACGTCGCTCGACCTCGACGAAAATGTCATTAATGGCGTCGAGCGACGACTCATTGACGACGCCGCGAGCCTCCAGCGGCTCTTCTTCCTCGAGAACATTTGCGCGGACGTCATGCTCGTCGGCATCGAGAGGAACCGCACCGAGCTCATGGAGAGGCTCGGCAGGTGCTTCCGCAATTATGAGCTGACCGTTGAAACGACGACCGACAACAACGCTCAGTACCAGTTCTACCCGGCCAGTCTGCCCGGTCTTGTGACCCGCGTGCAGGCAGTCGACACCATCGAGATCGCGCTCAGTCCAGTCAACGGCACAGCCTCGCAGCTGATCGAGGTTCTGATCGGAACGGCCAACAACACGCTGTCGGTAATTCGCGCCAATCAGCAGACCGACGTAGCCGTCGTGCCGTCGCCGGGCATCATCAGCGAGGACGAGTCGCTGACGCTGAGAGTCGCGTGGGTCAACAACGTGGTGCTGGTGTTCAGAGGAAACGACCAGTGGCCTTTCTTGGTGCACACCATGGCTGAGCCGTTTGACGTCAACTTTTATGGTCTCCGGTCGCAACGAAGCAGAGCATCTTGGATTGTGCAGCCGATCGGAATC
Protein: 1172 (aa)
MSYSCGLTINNTVGAEITNISGTHLLGYSDADVLAIVRVSGQTAIVPRAICLRFPNIQAAAFFGSGLTTVTGSSFGGCRSLAWLELLNNRIASVAADAFLNNTQLDFLDLGQNRLTTLPVTVFAPLASLRSVDIRENPFASIPSGIFANLTELRNVYMQSVGISEVNPHWFSAGNSVRSFYVGGNNLRTVRAGSFATLGTLLILSLIDSNVRSIEVGAFDGLAFVTYMYLERNEIEEIPAGVFDGLARLSTLDLSSNNIRTLTNGSFRSLSSLRELTINSCGVRDIEVGAFEGLSSLGILDANLNSIEYIAPAVFAPLVNLGHLGLRSNRLRVLYRNSFGAKVSLIESLDFDANVINAVERQLIEDAARLNMMKFKDNICSSGTFGSFMVNREWYMIVLSQCFSNFENTISTITDNNAQYQFFAANLPGFFARVQAVDTIEIALSPVNGTASQLIEVLIGTANNTLSVIRANQQTDVAIVPSPGIIRSGESNTFRVSWVNGVILAFRENEEFPFLVHTMLEPFDVNFFGLRTSHGCLNKKKQENNKERRRRNLSNAKNPILSPTFSRFRAISAPASLSVFPLTSTARDMKLPALATFLSLFVLRATAETVTCEFGSRTIGGVSLYSCVLQINGTAGAEVTEIGGTHLAGRTDADVLGIVRLSGSLTAIPAVICARFPNLQRVALPNAGLTVLADDALRGCPRLTWLSLLSNRIGSVSAGAFANNPELTFLDLDSNSLSTLPEAVFRNLSRLESLDLRSNPLTAPLPDAIFRDLTSLSRLFLQSTGMRAINPLWFNTTTRLVALYLGNNQIETISANNVATLTSLEVRTLLSLFGNRLGFVGGNTFAALRNLRFLDIAYNGISLIHPLRFAGLEMLTTLDMSGNNLTAIGAGAFRQLANLDNLYLSGAGIRRLDPAAFEGLTQLTFLNLNFNEIEDLPAGVFAPMASLSSVNLWQNRLKTVRRDIFGANVASLTSLDLDENVINGVERRLIDDAASLQRLFFLENICADVMLVGIERNRTELMERLGRCFRNYELTVETTTDNNAQYQFYPASLPGLVTRVQAVDTIEIALSPVNGTASQLIEVLIGTANNTLSVIRANQQTDVAVVPSPGIISEDESLTLRVAWVNNVVLVFRGNDQWPFLVHTMAEPFDVNFYGLRSQRSRASWIVQPIGI
Type | Start | End | Length |
CDS |
5957 |
5996 |
40 |
CDS |
6178 |
6544 |
367 |
CDS |
6719 |
7343 |
625 |
CDS |
7676 |
8561 |
886 |
CDS |
9013 |
9379 |
367 |
CDS |
9740 |
10249 |
510 |
CDS |
10569 |
11211 |
643 |
CDS |
11476 |
11553 |
78 |
intron |
5997 |
6177 |
181 |
intron |
6545 |
6718 |
174 |
intron |
7344 |
7675 |
332 |
intron |
8562 |
9012 |
451 |
intron |
9380 |
9739 |
360 |
intron |
10250 |
10568 |
319 |
intron |
11212 |
11475 |
264 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
EFW47854 |
hypothetical protein CAOG_05792 [Capsaspora owczarzaki ATCC 30864] |
5e-33 |
InterPro |
IPR022041 |
Farnesoic acid O-methyl transferase |
|
InterPro |
IPR003591 |
Leucine-rich repeat, typical subtype |
|
InterPro |
IPR001611 |
Leucine-rich repeat |
|
Gene Ontology(MF) |
GO:0005515 |
protein binding |
|
Pfam |
PF13855.1 |
Leucine rich repeat |
2.4e-96 |
Pfam |
PF12799.2 |
Leucine Rich repeats (2 copies) |
7.3e-52 |
Pfam |
PF12248.3 |
Farnesoic acid 0-methyl transferase |
1.2e-28 |
Pfam |
PF13504.1 |
Leucine rich repeat |
2.8e-11 |
Pfam |
PF00560.28 |
Leucine Rich Repeat |
5.6e-29 |
Pfam |
PF13306.1 |
Leucine rich repeats (6 copies) |
3.3e-42 |
Pfam |
PF13516.1 |
Leucine Rich repeat |
3.2e-14 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
P. vanderplanki |
Pv.06946 |
P. vanderplanki |
Pv.06948 |