MidgeBase gene description page [Pn.12591]

Outline

Link to gbrowse

Gene ID Pn.12591
Type Protein coding gene
Scaffold PnScaf17048
Start 13540
End 15672
Direction -

Sequence

Transcript: 1287 (bp)

 ATGCCTTACACAAAATACTGCTTACGCTTACCTGGACGTGACGGCTTACCCGGCTTGGAAGGCAGAAAGGGAGAAAGAGGATTCCCAGGACCAAAGGGAGATCAAGGTTTACCAGGCCCGATTGGCTTGCAAGGCGAGAAGGGCGATCAAGGTTTCCCCGGTAGAAATGGTGCTAACGGAATACCCGGTATCAAGGGTGATAAGGGATTGCAAGGACTACCCGGACTTCCAGGACCTGTAGGTTATCCAGGCGACAAAGGAATGCAAGGTCCTCGTGGAAATGATGGTTTGCAGGGTTTGCCTGGTACGCCGGGTGAAAAGGGAGAGCCAGGTCTTCAAGCACCACCGCCTATTATTGGACCACCAGGAAAGCCGGGATTGCCTGGACAAAAGGGAGACAGAGGACTTCCAGGTGCTCCGGGTCTGATCGGTCTGCAGGGTGAGAGAGGAGAACAAGGAGAGATTGGTTTGATCGGAGTTGAAGGTCAAAGAGGTTTACCTGGTCCTAGAGGTGAAATCGGTTTGTCTGGCCCGCCAGGTAGAGACGGTGCCCCAGGTTTGCCTGGCGCGAAAGGAAATGCTGGACTTCCTTGCTCGGCAGCTCAAGACTACCTAACGGGTCTCCTCTTGGTTAAGCACAGTCAGTCGGAAGATATTCCACAGTGTGAACCGGGACACGTAAAGCTATGGGACGGCTACTCACTTATGTATGTCGATGGCAACGATTATCCAGCCAATCAGGACTTGGGCTCGCCCGGTTCATGCGTCCGCAAATTCTCCACAATGCCGGTCATGGCTTGCGGCCAAAACAACGTCTGCAATTATGCATCACGCAACGATCGTACCTTCTGGCTTTCGACTTCAAAGGAAATTCCAATGATGCCTGTGTCAGACTTCGAAATGCGTCCATATATCTCGCGCTGCGCCGTCTGCGAAGTTCCTTCAAACGTCATCGCAATTCACAGCCAATCGCAGCAAGTTCCTGAATGTCCACAAGGCTGGGACTCACTCTGGATCGGTTACACATTCATGATGCATACCGCCGTTGGTCATGGTGGTGGCGGCCAAGCGCTCTCTGGACCTGGCTCGTGCTTGCAAGATTTCCGCGCCACACCATTCATCGAATGTAACGGAGGCAAGGGTCAATGTCATTACTACGAGACAATGACGAGCTTCTGGATGGTAACAATCGACCAACAAAATCAATTCCGTACACCCGAGCAGCAGACTCTCAAGGCAGGAACTCTCCATACAAAAGTTTCAAGATGTAACGTGTGCATACGAATA 

Protein: 429 (aa)

 MPYTKYCLRLPGRDGLPGLEGRKGERGFPGPKGDQGLPGPIGLQGEKGDQGFPGRNGANGIPGIKGDKGLQGLPGLPGPVGYPGDKGMQGPRGNDGLQGLPGTPGEKGEPGLQAPPPIIGPPGKPGLPGQKGDRGLPGAPGLIGLQGERGEQGEIGLIGVEGQRGLPGPRGEIGLSGPPGRDGAPGLPGAKGNAGLPCSAAQDYLTGLLLVKHSQSEDIPQCEPGHVKLWDGYSLMYVDGNDYPANQDLGSPGSCVRKFSTMPVMACGQNNVCNYASRNDRTFWLSTSKEIPMMPVSDFEMRPYISRCAVCEVPSNVIAIHSQSQQVPECPQGWDSLWIGYTFMMHTAVGHGGGGQALSGPGSCLQDFRATPFIECNGGKGQCHYYETMTSFWMVTIDQQNQFRTPEQQTLKAGTLHTKVSRCNVCIRI 
Type Start End Length
CDS 13543 13794 252
CDS 13856 14379 524
CDS 14451 14523 73
CDS 14587 14639 53
CDS 14698 14750 53
CDS 14805 14886 82
CDS 14955 15000 46
CDS 15058 15110 53
CDS 15171 15214 44
CDS 15278 15359 82
CDS 15648 15672 25
intron 13795 13855 61
intron 14380 14450 71
intron 14524 14586 63
intron 14640 14697 58
intron 14751 14804 54
intron 14887 14954 68
intron 15001 15057 57
intron 15111 15170 60
intron 15215 15277 63
intron 15360 15647 288

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001962982 GF15711 [Drosophila ananassae] gb|EDV32203.1| GF15711 [Drosophila ananassae] 1e-151
InterPro IPR016187 C-type lectin fold
InterPro IPR001442 Collagen IV, non-collagenous
InterPro IPR008160 Collagen triple helix repeat
Gene Ontology(CC) GO:0005581 collagen
Gene Ontology(MF) GO:0005201 extracellular matrix structural constituent
Pfam PF01391.13 Collagen triple helix repeat (20 copies) 4.5e-33
Pfam PF01413.14 C-terminal tandem repeated domain in type 4 procollagen 2.2e-85

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID
Pn.09808

Orthologous genes

Species Gene ID
H. sapiens ENSP00000380382
D. melanogaster FBgn0016075
P. vanderplanki Pv.17128
P. humanus PHUM136040-PA
A. mellifera GB14564-PA
B. mori BGIBMGA014039-TA
H. melpomene HMEL002288-PA
T. castaneum TC013472
S. invicta SI2.2.0_02661
H. sapiens ENSP00000445236
H. sapiens ENSP00000353654
H. sapiens ENSP00000443707
N. vitripennis NV11106-PA
M. musculus ENSMUSG00000031273
D. plexippus DPOGS206535PA
H. melpomene HMEL002285-PA
A. gambiae AGAP009200
P. vanderplanki Pv.11194
B. mori BGIBMGA014040-TA
D. plexippus DPOGS206549PA
H. sapiens ENSP00000378340
D. melanogaster FBgn0000299
C. quinquefasciatus CPIJ802363
M. musculus ENSMUSG00000031502
H. sapiens ENSP00000334733
H. sapiens ENSP00000443348
S. invicta SI2.2.0_09523
A. mellifera GB13353-PA
P. humanus PHUM136030-PA
H. sapiens ENSP00000364979
A. gambiae AGAP009201
H. sapiens ENSP00000361290
P. vanderplanki Pv.17126
T. castaneum TC014326
C. quinquefasciatus CPIJ802355