MidgeBase gene description page [Pn.02701]
Outline
Gene ID | Pn.02701 |
Type | Protein coding gene |
Scaffold | PnScaf2322 |
Start | 10923 |
End | 18730 |
Direction | - |
Sequence
Transcript: 2583 (bp)
ATGTTCCGGATAAGATTCGTGTTAATTTTACTAATTTTGACCACCCACGACGTTTACTCGCAAATCGGTCAATCGATCAAAGATTCTATTTTAGAGTACGACCTCCTGTCAGCGGCAGTGGACTTGGAGAATATCAACAGCGGAGGGGTGACGTATATCGAAGAAGTCAATGGACCAGCGTTTTATGAGTTCCGATCGAATGCGAACATCAGAAATCCTTACAAAGTCATCGCTCCGAATTTCTTGAAGAACTTCGTCCTAATAGCGACTGTGAAAGTGACGCATCGTGCCGACTCCTACTTATTTTCGATAGTAAATTCACTCGAGACGGTAGCACAGTTCGGAGTGCGAGTATCACCGGTGCAGCGAAGCTACCTCAACATCACGCTAATCTACAACGATCTTCAGAAAGCGACAGACTCATCTGTCTCGTTCACGCTAACATATGACTCGAAGAATTGGATTAACTTTGCGATACAGATGATTGATGAGCGCGTGTCGCTCTATCACAATTGCCTCAAAATCGACGAGCGCAACGTCTCATCGCGGCAAACGCTGACATTTGAGTCGGCTTCCATTTTCTACCTCGCACAGGCAGGCTCAATTTTGAAGGGAAAATTCGAGGTTATATCGACAAGAGCTAAAGGGATGATTGGTGCATTTATTTCACTAATACTTTTATCGACGGTCCTCGTGACGGCCTCAACCAAGGGCTGGTGGTTTGGCTTGAACAAAAATAATGGCGAGCATGTCGCTGCTCGCATTCAGGGCGCAATTCAATTTCTGAAGCTCTACGACTATCCGGAATTTCTGTCCGTAATTTGTAATAAAAGCATCGCGGCAAACAGCGAGTCAGATTTTAATGATTACAGTAACGAGAACGAGAGTCCCGTTCTGCAAGCCCCGCCAGACTTTAAAAATTACGGCGGTTACCGCATGAAGGGAGAGAAGGGTGACAAAGGCGCCCGAGGAATTCCAGGAGATTCAATAAGAGGACCGCCGGGTCCAAAGGGAGAATGCCAGATAGTTAATGTGAATAACAATACCAACAACAATTACAATAATAACAATAATTACAAACCGACGGAACAAAAATTAGCACCAGTTTGTGCATGCAATTACGACAATATTATTGATATTCTCCATAATGAGTCAGTTAGGAGTGTTTTAAAGGGTCCGCCGGGCTTACCAGGTCTAACGGGTCCGCCCGGAGAAAAGGGTCAGAAGGGCGAAATGGGTGATAAGGGCTCTGACGGCATCGACGGAATCCCAGGCTTACCAGGAACCCCGGGCGAGAATTACGAGGAATCGATGATGGGAAATATCCGATCAAAAGAGTCACGTGGCGATAAGGGCGACAAAGGTGACATGGGCATGAAAGGACAGAAGGGAGACGGTGGCTTGAAAGGAGAGAAAGGAGCTTGCATAACAGTTCCCGAAGTGCAGACCAACAATTGCGGCTGTCCATTCAACGACACGTACAAAGGAATGAAGGGCGACAAGGGCCTTCGTGGGAAGCGAGGAAAGACTGGCGCTCAAGGAGAGAAGGGGCAGAAAGGCGACAGTGGTGGAGCGGCAGGAGACAAGGGAGACAAAGGCGAACGTGGACCACCAGGTCTTCCTGGACCCCCCTACAGCTCTTTTGCCGACGACTCGATGAACTATCAGCGACCGTCAGGCGTCGGCACGATAATTACGTTTCAGAACACTGACACTATGATAAAACAATCATCCACATATCCTGCAGGCACCATCGCATACGTTGTCGATGAAGAGGCGCTGCTCGTGAAAGTTTCAAAGGGCTGGCAATACATCGCGTTGGGAACACTGCTGCCATTCACAACCCCCTATGTCACGACATCCGTATCGCCAACCTCCTACATGGACTTGCAAGCTTCAAACCTGCTCAACAGCAACATTTTGAAATCTCCCGAGAGCTATACCTTTACGACTCCGCCCGAGTACGAAACATGGAACCCGAAAATGCTGAGATTGATTGCATTGAACGAGCCGTATTCTGGCAATTTGCAAGGACTGCGAAATGCCGACTTGAATTGTCATCGGCAGGCAAGAAGAGCGGGCTTGATGGGCAACTTCCGTGCTTTCCTCTCGACGCGAATTCAAAACTTGGACTCATTAATAAAGCCCGAGGACAGGGAGTTGCCAATTACGAATTTGCGTGGAGATGTGCTCTTCAACTCATTCAATGCCATTTTCAATAATAATGCTCAAGGGATATTTATGTCCTCCAATTCGCCGCGAATCATCAGCTTTAGCGGAAAGAACGTCATGAACGACAACAGTTGGCCTCATAAAGTCGTCTGGCATGGCGCACGTGCCGACTCAATAGACACAAACTGCGAAGGCTGGCACAGTAATTTCGCAGACAAAATTGGCTTGGGAAGCAGTTTGCTAGGAAATAAGTTACTCGCTCAAGAGATGTACAGTTGTCAACAGAAGAATATCGTATTATGCATTGAGGTATTATCGCACAGTAGTAGCGGCAGCAGCAGCAGCGATATCGCGAATCGTCGCAAGCGCGAAATGATGGCGAACGCCGACGACATATATGACATCGAAAAG
Protein: 861 (aa)
MFRIRFVLILLILTTHDVYSQIGQSIKDSILEYDLLSAAVDLENINSGGVTYIEEVNGPAFYEFRSNANIRNPYKVIAPNFLKNFVLIATVKVTHRADSYLFSIVNSLETVAQFGVRVSPVQRSYLNITLIYNDLQKATDSSVSFTLTYDSKNWINFAIQMIDERVSLYHNCLKIDERNVSSRQTLTFESASIFYLAQAGSILKGKFEVISTRAKGMIGAFISLILLSTVLVTASTKGWWFGLNKNNGEHVAARIQGAIQFLKLYDYPEFLSVICNKSIAANSESDFNDYSNENESPVLQAPPDFKNYGGYRMKGEKGDKGARGIPGDSIRGPPGPKGECQIVNVNNNTNNNYNNNNNYKPTEQKLAPVCACNYDNIIDILHNESVRSVLKGPPGLPGLTGPPGEKGQKGEMGDKGSDGIDGIPGLPGTPGENYEESMMGNIRSKESRGDKGDKGDMGMKGQKGDGGLKGEKGACITVPEVQTNNCGCPFNDTYKGMKGDKGLRGKRGKTGAQGEKGQKGDSGGAAGDKGDKGERGPPGLPGPPYSSFADDSMNYQRPSGVGTIITFQNTDTMIKQSSTYPAGTIAYVVDEEALLVKVSKGWQYIALGTLLPFTTPYVTTSVSPTSYMDLQASNLLNSNILKSPESYTFTTPPEYETWNPKMLRLIALNEPYSGNLQGLRNADLNCHRQARRAGLMGNFRAFLSTRIQNLDSLIKPEDRELPITNLRGDVLFNSFNAIFNNNAQGIFMSSNSPRIISFSGKNVMNDNSWPHKVVWHGARADSIDTNCEGWHSNFADKIGLGSSLLGNKLLAQEMYSCQQKNIVLCIEVLSHSSSGSSSSDIANRRKREMMANADDIYDIEK
Type | Start | End | Length |
CDS |
10926 |
11071 |
146 |
CDS |
11195 |
11266 |
72 |
CDS |
11359 |
11417 |
59 |
CDS |
11509 |
11697 |
189 |
CDS |
11779 |
11909 |
131 |
CDS |
11983 |
12024 |
42 |
CDS |
12085 |
12210 |
126 |
CDS |
12289 |
12381 |
93 |
CDS |
12459 |
12535 |
77 |
CDS |
12610 |
12642 |
33 |
CDS |
12753 |
12786 |
34 |
CDS |
12855 |
12941 |
87 |
CDS |
13006 |
13082 |
77 |
CDS |
13156 |
13183 |
28 |
CDS |
13503 |
13528 |
26 |
CDS |
13605 |
13662 |
58 |
CDS |
14147 |
14178 |
32 |
CDS |
14253 |
14280 |
28 |
CDS |
14343 |
14350 |
8 |
CDS |
14419 |
14455 |
37 |
CDS |
14523 |
14642 |
120 |
CDS |
14815 |
14891 |
77 |
CDS |
15127 |
15137 |
11 |
CDS |
15230 |
15269 |
40 |
CDS |
15343 |
15375 |
33 |
CDS |
15480 |
15558 |
79 |
CDS |
15688 |
15759 |
72 |
CDS |
15964 |
16107 |
144 |
CDS |
17372 |
17656 |
285 |
CDS |
17781 |
17806 |
26 |
CDS |
17969 |
18136 |
168 |
CDS |
18461 |
18523 |
63 |
CDS |
18580 |
18647 |
68 |
CDS |
18717 |
18730 |
14 |
intron |
11072 |
11194 |
123 |
intron |
11267 |
11358 |
92 |
intron |
11418 |
11508 |
91 |
intron |
11698 |
11778 |
81 |
intron |
11910 |
11982 |
73 |
intron |
12025 |
12084 |
60 |
intron |
12211 |
12288 |
78 |
intron |
12382 |
12458 |
77 |
intron |
12536 |
12609 |
74 |
intron |
12643 |
12752 |
110 |
intron |
12787 |
12854 |
68 |
intron |
12942 |
13005 |
64 |
intron |
13083 |
13155 |
73 |
intron |
13184 |
13502 |
319 |
intron |
13529 |
13604 |
76 |
intron |
13663 |
14146 |
484 |
intron |
14179 |
14252 |
74 |
intron |
14281 |
14342 |
62 |
intron |
14351 |
14418 |
68 |
intron |
14456 |
14522 |
67 |
intron |
14643 |
14814 |
172 |
intron |
14892 |
15126 |
235 |
intron |
15138 |
15229 |
92 |
intron |
15270 |
15342 |
73 |
intron |
15376 |
15479 |
104 |
intron |
15559 |
15687 |
129 |
intron |
15760 |
15963 |
204 |
intron |
16108 |
17371 |
1264 |
intron |
17657 |
17780 |
124 |
intron |
17807 |
17968 |
162 |
intron |
18137 |
18460 |
324 |
intron |
18524 |
18579 |
56 |
intron |
18648 |
18716 |
69 |
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr |
AEL31279 |
multiplexin collagen isoform Afl1 [Drosophila melanogaster] |
1e-127 |
InterPro |
IPR016187 |
C-type lectin fold |
|
InterPro |
IPR010515 |
Collagenase NC10/endostatin |
|
InterPro |
IPR008985 |
Concanavalin A-like lectin/glucanase |
|
InterPro |
IPR008160 |
Collagen triple helix repeat |
|
InterPro |
IPR016186 |
C-type lectin-like |
|
Gene Ontology(BP) |
GO:0007155 |
cell adhesion |
|
Gene Ontology(CC) |
GO:0031012 |
extracellular matrix |
|
Gene Ontology(MF) |
GO:0005198 |
structural molecule activity |
|
Pfam |
PF01391.13 |
Collagen triple helix repeat (20 copies) |
1.8e-17 |
Pfam |
PF06482.6 |
Collagenase NC10 and Endostatin |
1.1e-77 |
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species |
Gene ID |
H. sapiens |
ENSP00000364140 |
M. musculus |
ENSMUSG00000001435 |
A. aegypti |
AAEL002256 |
B. mori |
BGIBMGA012062-TA |
C. quinquefasciatus |
CPIJ005619 |
H. sapiens |
ENSP00000339118 |
H. sapiens |
ENSP00000347665 |
H. melpomene |
HMEL008255-PA |
P. humanus |
PHUM321580-PA |
A. mellifera |
GB18123-PA |
A. gambiae |
AGAP006516 |
H. sapiens |
ENSP00000383191 |
D. melanogaster |
FBgn0260660 |
S. invicta |
SI2.2.0_00485 |
C. quinquefasciatus |
CPIJ005620 |
H. melpomene |
HMEL007561-PA |
H. sapiens |
ENSP00000415692 |
N. vitripennis |
NV16687-PA |
D. plexippus |
DPOGS207247PA |
P. vanderplanki |
Pv.02449 |
H. sapiens |
ENSP00000352798 |
T. castaneum |
TC003787 |
D. plexippus |
DPOGS207250PA |