MidgeBase gene description page [Pn.09888]

Outline

Link to gbrowse

Gene ID Pn.09888
Type Protein coding gene
Scaffold PnScaf10345
Start 5954
End 11553
Direction -

Sequence

Transcript: 3516 (bp)

 ATGAGCTATTCCTGCGGTCTGACGATAAACAACACTGTCGGCGCCGAGATCACGAATATAAGCGGCACGCATCTTCTGGGCTACTCCGACGCGGACGTGCTGGCAATTGTGCGGGTCAGCGGTCAGACGGCGATAGTTCCGCGCGCCATCTGCTTGCGATTTCCCAACATCCAAGCGGCCGCATTCTTCGGCTCAGGGCTGACGACCGTCACGGGCTCCTCTTTCGGCGGCTGCCGCTCGCTCGCGTGGCTCGAGCTCCTCAACAACCGCATCGCCTCAGTCGCTGCCGACGCCTTTCTCAACAACACGCAGCTCGACTTCCTCGACCTCGGCCAGAACCGCCTGACGACCCTGCCTGTGACGGTTTTTGCACCGCTCGCGAGTCTCCGCTCGGTCGACATCCGCGAAAATCCGTTCGCCAGCATCCCAAGCGGAATTTTCGCGAACCTCACCGAACTGCGCAACGTGTACATGCAGAGCGTCGGCATTTCCGAGGTGAACCCGCACTGGTTCAGTGCCGGCAACAGCGTTCGGAGCTTCTACGTTGGCGGCAACAACCTGCGCACAGTCCGGGCCGGTTCTTTCGCGACGCTCGGCACGCTGCTGATTCTCAGCCTCATCGACAGCAACGTGCGCTCGATAGAGGTTGGAGCCTTCGACGGCCTCGCCTTCGTCACCTACATGTACTTGGAGCGCAATGAGATCGAGGAGATCCCTGCAGGTGTCTTCGACGGCCTTGCACGCCTCTCAACGCTCGACCTCTCCTCAAACAACATCCGGACGCTGACCAACGGAAGCTTCAGATCGCTCAGTAGTCTGCGCGAGCTCACGATAAACAGCTGCGGGGTTCGCGACATCGAGGTCGGTGCCTTCGAGGGGCTCTCGAGCCTCGGCATTTTGGACGCGAATTTGAACTCGATCGAGTACATCGCGCCGGCAGTTTTTGCGCCGCTCGTCAACCTCGGGCATCTGGGCTTGCGGAGCAACAGGCTGCGCGTCCTCTACCGCAACTCGTTTGGCGCGAAAGTGAGCCTGATCGAGTCGCTGGACTTTGACGCGAACGTCATCAACGCGGTCGAGCGGCAGCTGATCGAGGACGCCGCGAGGCTGAACATGATGAAGTTCAAGGACAACATTTGCTCGAGCGGCACTTTCGGCTCCTTCATGGTCAATCGCGAGTGGTACATGATTGTGCTGAGCCAGTGCTTCAGCAACTTCGAAAATACCATCAGCACGATAACCGACAACAATGCACAATACCAGTTCTTTGCTGCAAATTTACCGGGATTTTTCGCCCGCGTGCAGGCAGTCGACACCATCGAGATCGCGCTCAGTCCAGTCAACGGCACAGCCTCGCAGCTGATCGAGGTTCTGATCGGAACGGCCAACAACACGCTGTCGGTGATCCGCGCCAATCAGCAGACCGACGTAGCCATCGTGCCGTCGCCGGGCATCATCAGAAGCGGCGAGTCAAACACTTTCAGAGTGTCGTGGGTCAACGGTGTGATTTTGGCGTTCAGAGAAAACGAAGAGTTCCCTTTTCTGGTGCACACCATGCTTGAGCCGTTTGACGTCAACTTTTTTGGACTCCGGACGAGCCACGGCTGCCTCAATAAAAAGAAACAAGAAAATAATAAAGAAAGAAGAAGAAGAAACTTATCGAACGCCAAAAACCCAATCTTATCGCCGACATTTTCACGTTTCCGCGCGATTTCAGCGCCTGCTTCGCTCTCAGTCTTTCCCCTCACCTCGACTGCACGCGACATGAAGCTCCCAGCTCTCGCGACCTTCCTCTCGCTCTTCGTCCTCCGAGCGACGGCCGAAACGGTGACCTGCGAGTTCGGCAGCCGCACCATCGGCGGCGTCTCGCTCTACAGCTGCGTCCTCCAAATCAACGGCACCGCCGGCGCGGAGGTCACCGAAATCGGCGGCACGCACCTCGCCGGCCGCACCGACGCCGACGTGCTCGGCATCGTGCGCCTCAGCGGCAGCCTCACCGCCATCCCGGCCGTCATCTGCGCGCGCTTCCCCAACCTCCAGCGAGTGGCGCTCCCCAATGCCGGCCTCACCGTCCTCGCCGACGACGCCCTGCGTGGCTGCCCGCGCCTCACGTGGCTCTCGCTCCTCAGCAACCGCATCGGCTCCGTGTCGGCCGGCGCCTTCGCCAACAACCCCGAGCTCACCTTCCTCGACCTCGACTCCAACAGCCTCAGCACCCTGCCCGAGGCCGTCTTTCGCAACCTCAGCCGCCTCGAGAGCCTCGACCTCCGCTCCAACCCCCTCACCGCGCCCCTTCCCGACGCCATCTTCCGCGATCTCACGTCGCTGTCGCGGCTCTTCCTGCAGTCGACCGGCATGCGCGCCATCAACCCGCTCTGGTTCAACACGACCACGCGCCTCGTCGCCCTCTACCTCGGCAACAACCAGATCGAGACCATCAGCGCCAACAATGTTGCCACGCTCACGTCCCTCGAGGTGCGTACACTTCTGAGTCTCTTCGGCAACCGTCTGGGCTTCGTGGGCGGCAACACGTTTGCAGCGCTGCGAAACCTCCGCTTCCTCGACATTGCCTACAACGGCATCAGCCTGATCCATCCGCTGCGCTTCGCCGGCCTCGAGATGCTGACGACGCTCGACATGTCGGGCAACAACCTCACGGCCATCGGTGCCGGCGCCTTCCGGCAACTGGCGAATCTGGACAATCTTTACCTGAGCGGCGCCGGCATCCGAAGGCTCGACCCAGCCGCCTTCGAGGGCCTCACGCAACTCACCTTCCTGAACCTCAACTTCAATGAGATCGAGGACCTGCCGGCCGGCGTGTTTGCGCCAATGGCGAGCCTCAGCTCGGTGAACCTGTGGCAGAACCGCCTGAAGACCGTTCGACGCGACATCTTCGGCGCCAACGTCGCCTCGCTGACGTCGCTCGACCTCGACGAAAATGTCATTAATGGCGTCGAGCGACGACTCATTGACGACGCCGCGAGCCTCCAGCGGCTCTTCTTCCTCGAGAACATTTGCGCGGACGTCATGCTCGTCGGCATCGAGAGGAACCGCACCGAGCTCATGGAGAGGCTCGGCAGGTGCTTCCGCAATTATGAGCTGACCGTTGAAACGACGACCGACAACAACGCTCAGTACCAGTTCTACCCGGCCAGTCTGCCCGGTCTTGTGACCCGCGTGCAGGCAGTCGACACCATCGAGATCGCGCTCAGTCCAGTCAACGGCACAGCCTCGCAGCTGATCGAGGTTCTGATCGGAACGGCCAACAACACGCTGTCGGTAATTCGCGCCAATCAGCAGACCGACGTAGCCGTCGTGCCGTCGCCGGGCATCATCAGCGAGGACGAGTCGCTGACGCTGAGAGTCGCGTGGGTCAACAACGTGGTGCTGGTGTTCAGAGGAAACGACCAGTGGCCTTTCTTGGTGCACACCATGGCTGAGCCGTTTGACGTCAACTTTTATGGTCTCCGGTCGCAACGAAGCAGAGCATCTTGGATTGTGCAGCCGATCGGAATC 

Protein: 1172 (aa)

 MSYSCGLTINNTVGAEITNISGTHLLGYSDADVLAIVRVSGQTAIVPRAICLRFPNIQAAAFFGSGLTTVTGSSFGGCRSLAWLELLNNRIASVAADAFLNNTQLDFLDLGQNRLTTLPVTVFAPLASLRSVDIRENPFASIPSGIFANLTELRNVYMQSVGISEVNPHWFSAGNSVRSFYVGGNNLRTVRAGSFATLGTLLILSLIDSNVRSIEVGAFDGLAFVTYMYLERNEIEEIPAGVFDGLARLSTLDLSSNNIRTLTNGSFRSLSSLRELTINSCGVRDIEVGAFEGLSSLGILDANLNSIEYIAPAVFAPLVNLGHLGLRSNRLRVLYRNSFGAKVSLIESLDFDANVINAVERQLIEDAARLNMMKFKDNICSSGTFGSFMVNREWYMIVLSQCFSNFENTISTITDNNAQYQFFAANLPGFFARVQAVDTIEIALSPVNGTASQLIEVLIGTANNTLSVIRANQQTDVAIVPSPGIIRSGESNTFRVSWVNGVILAFRENEEFPFLVHTMLEPFDVNFFGLRTSHGCLNKKKQENNKERRRRNLSNAKNPILSPTFSRFRAISAPASLSVFPLTSTARDMKLPALATFLSLFVLRATAETVTCEFGSRTIGGVSLYSCVLQINGTAGAEVTEIGGTHLAGRTDADVLGIVRLSGSLTAIPAVICARFPNLQRVALPNAGLTVLADDALRGCPRLTWLSLLSNRIGSVSAGAFANNPELTFLDLDSNSLSTLPEAVFRNLSRLESLDLRSNPLTAPLPDAIFRDLTSLSRLFLQSTGMRAINPLWFNTTTRLVALYLGNNQIETISANNVATLTSLEVRTLLSLFGNRLGFVGGNTFAALRNLRFLDIAYNGISLIHPLRFAGLEMLTTLDMSGNNLTAIGAGAFRQLANLDNLYLSGAGIRRLDPAAFEGLTQLTFLNLNFNEIEDLPAGVFAPMASLSSVNLWQNRLKTVRRDIFGANVASLTSLDLDENVINGVERRLIDDAASLQRLFFLENICADVMLVGIERNRTELMERLGRCFRNYELTVETTTDNNAQYQFYPASLPGLVTRVQAVDTIEIALSPVNGTASQLIEVLIGTANNTLSVIRANQQTDVAVVPSPGIISEDESLTLRVAWVNNVVLVFRGNDQWPFLVHTMAEPFDVNFYGLRSQRSRASWIVQPIGI 
Type Start End Length
CDS 5957 5996 40
CDS 6178 6544 367
CDS 6719 7343 625
CDS 7676 8561 886
CDS 9013 9379 367
CDS 9740 10249 510
CDS 10569 11211 643
CDS 11476 11553 78
intron 5997 6177 181
intron 6545 6718 174
intron 7344 7675 332
intron 8562 9012 451
intron 9380 9739 360
intron 10250 10568 319
intron 11212 11475 264

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr EFW47854 hypothetical protein CAOG_05792 [Capsaspora owczarzaki ATCC 30864] 5e-33
InterPro IPR022041 Farnesoic acid O-methyl transferase
InterPro IPR003591 Leucine-rich repeat, typical subtype
InterPro IPR001611 Leucine-rich repeat
Gene Ontology(MF) GO:0005515 protein binding
Pfam PF13855.1 Leucine rich repeat 2.4e-96
Pfam PF12799.2 Leucine Rich repeats (2 copies) 7.3e-52
Pfam PF12248.3 Farnesoic acid 0-methyl transferase 1.2e-28
Pfam PF13504.1 Leucine rich repeat 2.8e-11
Pfam PF00560.28 Leucine Rich Repeat 5.6e-29
Pfam PF13306.1 Leucine rich repeats (6 copies) 3.3e-42
Pfam PF13516.1 Leucine Rich repeat 3.2e-14

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
P. vanderplanki Pv.06946
P. vanderplanki Pv.06948