MidgeBase gene description page [Pn.12066]
Outline
Gene ID | Pn.12066 |
Type | Protein coding gene |
Scaffold | PnScaf15426 |
Start | 614 |
End | 6107 |
Direction | - |
Sequence
Transcript: 3936 (bp)
ATGTATGGCGAGAGAGATATCAATATCGATCAAATTGTCGTACAATCAGAAGGTGGCAACGCATGGACATCAGCGATATCAGATTTCCAACAGTACTTGATTATTGATTTGGGCGCCACGAAAAACATTACGCGCATCTCCATTCAGGGACGTCCGCATCACAGCGAGTATGTCTCTGAGTTCAGCATCAGCTACGGCTACAATGGCCTCGACTACGCGGACTACAAAGAGCCCGGCGGAAACACAAAGACACTTAGCCGATCAGCATGGACTCCGGTTGAGAATACGTACAACCACTATCTGTCGATAGATCTTGGTGGAAGGAAAATGATAAGAAAAATTTCCACGCTCGGTCGTCCAAACACTCATGAATATGTAACTGAATATATTATAACATACTCGGACGATGGTGAATTGTGGAAGGCTTACACTAACAACGATGCTCAGGATAAGCTTTTTAAGGGAAACGACAACGGCGACGACGTGAAGCACAATGTCTTCGATGTGCCGATAATTGCCCAGTGGGTTCGTGTCAATCCGACGAGATGGAATGACAGAATCTCCCTTCGCGTCGAGCTCTACGGATGCGACTACTATGCCGAAACGGTCTACTTCAACGGAAGCAGCTTGCTGAGTCTCGACCTGCTTCGCGAGCCGATTTCTGCTTCGCGAGAGACGCTTCAGTTCAGATTCAAGACCTCACACGCGAACGGCATTCTCATGTACTCGAAGGGCACGCAGGGAGACTTTTTTGCGCTGCAGCTGTACGAAAACCGAATGATTCTCAACATCAACCTCGGCTCACATTCGATGAGCTCGTTCTCGGTCGGAAGCCTGCTCGATGACAACGTGTGGCACGATGTCGTCATTTCGAGAAATCGCCGGGACATTTTGTTCTCCGTCGATCGTGTGGTGGTCGATGGAAAGATTAAGGGAGACTTCGACAAGCTGGATCTCAATCGAATGCTTTTTGTGGGCGGCGTTCCGTCGAGAGACGAAGGCCTCGTTGTTACGCAAAACTTCACTGGCTGCATCGAGAACTTGTACTTGAACACGACGAACTTCATTCGCGAAATGAAGGAGGCCTATTCGGAGGGTCAATACAGTCGCTTCGATAAGGTCAACACTCTGTACAACTGCCCCGATCCGCCCATCATGCCCGTCACCTTCCTCACGCGGACGTCCTACGCGAAGCTAAAGGGCTATGAGGGCGTGAAGAGCCTGAACGTCTCGCTTTCGTTCAGAACTTACGAGGACAAGGGACTGATTATTTACCACGAGTTTACTTCCAAGGGATACGTGAAGGTCTTCCTGGAGGACGGACGCGTGAAGACCGAAATAAAAACGGACGAAAAGGAATTTCATAGCACCGGCGATCATCGTCGCGGCATCGTTTTGGACAACTATGACGAGCAGTTTAATGACGGCAGATGGCACTCGCTGATTTTGACAATCAAGGAGAACAGCTTGATCATTGAGATCGATCAGCGACCTATGAGGACGGAAAAGTTGTTTAAGATCTTGACTGGAGCTTATTACTACATCGGAGGTTCGAAGACCAAAGAGATTCACAACATCATGGGCAATCGGGACGGATTTCTCGGATGCATGCGTCAGATTTCCGTCGACGGCAACTTCAAGCTTCCGCACGATTGGAAAGACGAGGACTTCTGCTGCAAAAATGAAATTCTCATGGACGCCTGTCACATGGTTGACCGTTGCAATCCGAACCCCTGCAAGCACAACGGTGTTTGTCGGCAAAATTCCTACGAATTTTTTTGTGACTGCGGCAATAGTGGATATTCTGGCGCTGTGTGCCATACTTCTTTGAACCCGCTATCTTGTCAAGCTTTCAAAAACGTGCAGACAGTCGGCCAAAAAGCAAACATCAAGATAGACGTCGACGGTTCCGGTCCACTCGATCCATTCGATGTCACGTGCGAGTTCTTGACAGACGGGCGTGTGCTCACCGTTCTGGGTCATTCGTCTGAGCATTCAAC
CGTGGTCGACAGTTTCCAGGAGCCAGGTTCCTACGATCAAACCATCGAGTACAATGCGAGAATGCCACAAATTGAAGCTCTCTTGAATCGCTCGCGAGAGTGCTCGCAAAGATTGATCTACAGCTGCCGTAATGCGAGACTTTTCAATTCGCCATCTGACGAGCTGAGCTTCCGACCATTTGGCTGGTGGCTGTCGAGACAAAATCAGATGATGGACTATTGGGCGGGCGCCTTGAAGGGCTCGCGCAAGTGCCAGTGCGGCATCGTCGGAAACTGTGTCGACCCGACCAAATGGTGCAATTGCGATGCCAACACGTACGATTGGCTCGAAGACAGCGGCGAAATAAAAGACAAGGAATACCTGCCGGTCAGAGGTCTCCGATTTGGCGACACGGGAACGGCGCTCGATGAGAAGCAAGCGAAATACACGCTCGGTCCTCTCATCTGCGAGGGAGACGATCTTTTCAATAATGTGGTGACATTCAGAATCACTGACGCAACAATCAACTTGCCTCGCTTCGACATGGGACACTCTGGCGACATTTACTTGGAGTTTAAGACGACGCAGGAAAATGCCGTCATTTTCCACGCTACTGGCCACACTGACTACATAAAGCTGTCAATAATCGATGGAAACAAGCTGAAATTCCAATATAGGGCTGGTTCGGGTCCGTTGTCGGTTGACGTTCTCACATCGTATCCTCTGAATGATAACAACTGGCACTCGGTGAGTGTCGAGAGAAACCGAAAAGAAGCTCGTTTGGTTGTGGACGGAGCCACAAAATCAGAAGTTCGTGAACCGCCAGGACCTGTTCGTGCTCTCTACTTGACCTCTGAGCTATCGATTGGCGCTACGCTCGACTATCAAGACGGCTTCGTTGGATGCATTCGCGCTCTGCTCTTGAACGGCAATCCTATAGACCTCAAATCGTATGCCGAGAGGAAGCTGTACGGCGTCTCGGCAGGCTGCGTTGGTCGATGCGAGAGCAGTCCTTGTCTCAACAACGGAACATGTTTTGAAAGATATGATGGCTTCACTTGCGATTGCCGCTGGAGCTCCTTCAAGGGACCAATCTGCGCTGACGAAATCGGTGTCAACCTGAGATCGGACTCGATAATCAAATACGACTTCTTGGGATCGTGGCGCTCAACAATTTCGGAAAATATCCGCGTCGGCTTCACAACGACAAACCAGAAAGGCTTCCTCCTCGGCTTCAGCTCAAACATCACTGGAGAGTATCTCACCATACTCGTTTCGAACTCGGGTGCTTTGAAGTTTGTCTTTGATTTTGGCTTCGAAAGACAGGAGCTTTCCTTCCCGGGCGTTCACTTCGGCCTTGGTCAATTTCACGACGTTCGCTTCATGAGAAAAAATTCCGGATCAACTGTCGTCATTATTGTCGACAACTACGAGCCGAAGGAGTTTCATTTCGACATCAAGGACTCGGCAGACGCCCAATTCAACAACATACAGTACATGTACATTGGCAAGAACGAGAGTATGACTGACGGTTTTGTTGGATGCGTTTCCCGAGTTGAATTCGATGACATCTTCCCGCTGAAGCTTCTCTTCCAACAAAATCCTCCGCCAAACGTGAAATCGATGGGTCCATCTCTCCTAACGGAAGACTTTTGCGGCGTTGAACCGGTCACCTTGCCTCCGGTCATCAAGGAAACCAGACCACCGCCGATTATTGACGAGGATAAGCTGAGAAGTTACGACGGAGTCAGTGCAGGATTCCTAGGAAGTCTCTTGTTCATCATTCTTTTGTTGCTGCTGATAATGGCAATTCTGATTTATCGTCACATGTCTCGCCATAAGGGTGAATATTTGACCCAAGAGGATAAGGGAGCAGATGACGCATTAGATCCAGATGATGCCGTTGTGCACTCAACGACCGGTCACCATGTCACCAAAAAGAAAGAATGGTTCATT
Protein: 1312 (aa)
MYGERDINIDQIVVQSEGGNAWTSAISDFQQYLIIDLGATKNITRISIQGRPHHSEYVSEFSISYGYNGLDYADYKEPGGNTKTLSRSAWTPVENTYNHYLSIDLGGRKMIRKISTLGRPNTHEYVTEYIITYSDDGELWKAYTNNDAQDKLFKGNDNGDDVKHNVFDVPIIAQWVRVNPTRWNDRISLRVELYGCDYYAETVYFNGSSLLSLDLLREPISASRETLQFRFKTSHANGILMYSKGTQGDFFALQLYENRMILNINLGSHSMSSFSVGSLLDDNVWHDVVISRNRRDILFSVDRVVVDGKIKGDFDKLDLNRMLFVGGVPSRDEGLVVTQNFTGCIENLYLNTTNFIREMKEAYSEGQYSRFDKVNTLYNCPDPPIMPVTFLTRTSYAKLKGYEGVKSLNVSLSFRTYEDKGLIIYHEFTSKGYVKVFLEDGRVKTEIKTDEKEFHSTGDHRRGIVLDNYDEQFNDGRWHSLILTIKENSLIIEIDQRPMRTEKLFKILTGAYYYIGGSKTKEIHNIMGNRDGFLGCMRQISVDGNFKLPHDWKDEDFCCKNEILMDACHMVDRCNPNPCKHNGVCRQNSYEFFCDCGNSGYSGAVCHTSLNPLSCQAFKNVQTVGQKANIKIDVDGSGPLDPFDVTCEFLTDGRVLTVLGHSSEHSTVVDSFQEPGSYDQTIEYNARMPQIEALLNRSRECSQRLIYSCRNARLFNSPSDELSFRPFGWWLSRQNQMMDYWAGALKGSRKCQCGIVGNCVDPTKWCNCDANTYDWLEDSGEIKDKEYLPVRGLRFGDTGTALDEKQAKYTLGPLICEGDDLFNNVVTFRITDATINLPRFDMGHSGDIYLEFKTTQENAVIFHATGHTDYIKLSIIDGNKLKFQYRAGSGPLSVDVLTSYPLNDNNWHSVSVERNRKEARLVVDGATKSEVREPPGPVRALYLTSELSIGATLDYQDGFVGCIRALLLNGNPIDLKSYAERKLYGVSAGCVGRCESSPCLNNGTCFERYDGFTCDCRWSSFKGPICADEIGVNLRSDSIIKYDFLGSWRSTISENIRVGFTTTNQKGFLLGFSSNITGEYLTILVSNSGALKFVFDFGFERQELSFPGVHFGLGQFHDVRFMRKNSGSTVVIIVDNYEPKEFHFDIKDSADAQFNNIQYMYIGKNESMTDGFVGCVSRVEFDDIFPLKLLFQQNPPPNVKSMGPSLLTEDFCGVEPVTLPPVIKETRPPPIIDEDKLRSYDGVSAGFLGSLLFIILLLLLIMAILIYRHMSRHKGEYLTQEDKGADDALDPDDAVVHSTTGHHVTKKKEWFI
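The protein above should be the conceptual translation of the transcript. The sketch below checks this on the first 36 bases only. It assumes the standard genetic code (NCBI translation table 1), which this page does not state, and `translate` is an illustrative helper, not a MidgeBase tool. The transcript as listed wraps onto a second line, so the two parts would need to be joined before translating the full sequence.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons in the order TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence codon by codon, ignoring a trailing partial codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First 36 bases of the transcript listed above.
prefix = "ATGTATGGCGAGAGAGATATCAATATCGATCAAATT"
print(translate(prefix))  # MYGERDINIDQI, the first 12 residues of the protein
```

A 3936 bp coding sequence gives 1312 codons, which matches the stated protein length if the stop codon is not part of the listed transcript.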
Type | Start | End | Length |
CDS | 617 | 804 | 188 |
CDS | 878 | 1012 | 135 |
CDS | 1080 | 1618 | 539 |
CDS | 1720 | 1987 | 268 |
CDS | 2048 | 2191 | 144 |
CDS | 2255 | 2761 | 507 |
CDS | 2824 | 3153 | 330 |
CDS | 3277 | 4648 | 1372 |
CDS | 4786 | 4982 | 197 |
CDS | 5268 | 5274 | 7 |
CDS | 5465 | 5661 | 197 |
CDS | 6056 | 6107 | 52 |
intron | 805 | 877 | 73 |
intron | 1013 | 1079 | 67 |
intron | 1619 | 1719 | 101 |
intron | 1988 | 2047 | 60 |
intron | 2192 | 2254 | 63 |
intron | 2762 | 2823 | 62 |
intron | 3154 | 3276 | 123 |
intron | 4649 | 4785 | 137 |
intron | 4983 | 5267 | 285 |
intron | 5275 | 5464 | 190 |
intron | 5662 | 6055 | 394 |
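The feature coordinates can be cross-checked against the stated transcript and protein lengths. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the Length column but not stated on the page. The helper names are illustrative.

```python
# (start, end) pairs copied from the feature table above.
CDS = [(617, 804), (878, 1012), (1080, 1618), (1720, 1987), (2048, 2191),
       (2255, 2761), (2824, 3153), (3277, 4648), (4786, 4982), (5268, 5274),
       (5465, 5661), (6056, 6107)]
INTRONS = [(805, 877), (1013, 1079), (1619, 1719), (1988, 2047), (2192, 2254),
           (2762, 2823), (3154, 3276), (4649, 4785), (4983, 5267), (5275, 5464),
           (5662, 6055)]


def length(interval):
    """Length of a 1-based, inclusive interval."""
    start, end = interval
    return end - start + 1


def tiles_without_gaps(intervals):
    """True if the sorted intervals follow each other with no gap or overlap."""
    ordered = sorted(intervals)
    return all(prev[1] + 1 == nxt[0] for prev, nxt in zip(ordered, ordered[1:]))


cds_total = sum(length(iv) for iv in CDS)
intron_total = sum(length(iv) for iv in INTRONS)

print(cds_total)                          # 3936, the transcript length in bp
print(cds_total // 3)                     # 1312, the protein length in aa
print(intron_total)                       # 1555
print(tiles_without_gaps(CDS + INTRONS))  # True
```

CDS and intron features together cover positions 617 to 6107 (5491 bp), while the gene Start is given as 614. The page does not explain the remaining three bases at 614 to 616. On a minus-strand gene they could be a stop codon excluded from the CDS, but that is only one possible reading.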
Auto annotation result
Program/Analysis | Accession | Description | Score/Expectation |
BLASTP/NCBI-nr | XP_001861636 | neurexin-4 [Culex quinquefasciatus] (GenBank EDS35896.1) | 0.0 |
InterPro | IPR000421 | Coagulation factor 5/8 C-terminal type domain | |
InterPro | IPR006210 | Epidermal growth factor-like | |
InterPro | IPR000742 | Epidermal growth factor-like domain | |
InterPro | IPR013320 | Concanavalin A-like lectin/glucanase, subgroup | |
InterPro | IPR008985 | Concanavalin A-like lectin/glucanase | |
InterPro | IPR001791 | Laminin G domain | |
InterPro | IPR008979 | Galactose-binding domain-like | |
Gene Ontology(BP) | GO:0007155 | cell adhesion | |
Gene Ontology(MF) | GO:0005515 | protein binding | |
Pfam | PF02210.19 | Laminin G domain | 4.5e-84 |
Pfam | PF00054.18 | Laminin G domain | 3.5e-53 |
Pfam | PF00008.22 | EGF-like domain | 0.0012 |
Pfam | PF00754.20 | F5/8 type C domain | 4.6e-36 |
Pfam | PF13385.1 | Concanavalin A-like lectin/glucanases superfamily | 1.2e-14 |
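The Pfam rows can be ranked by the value in the Score/Expectation column. The sketch below assumes those values are E-values (smaller is stronger). The `1e-5` cutoff is an arbitrary illustration, not a threshold used by MidgeBase, and `rank_hits` is a hypothetical helper.

```python
# (accession, description, value) copied from the Pfam rows above.
PFAM_HITS = [
    ("PF02210.19", "Laminin G domain", 4.5e-84),
    ("PF00054.18", "Laminin G domain", 3.5e-53),
    ("PF00008.22", "EGF-like domain", 0.0012),
    ("PF00754.20", "F5/8 type C domain", 4.6e-36),
    ("PF13385.1", "Concanavalin A-like lectin/glucanases superfamily", 1.2e-14),
]


def rank_hits(hits, max_evalue=1e-5):
    """Keep hits at or below the cutoff and sort them from strongest to weakest."""
    kept = [hit for hit in hits if hit[2] <= max_evalue]
    return sorted(kept, key=lambda hit: hit[2])


for accession, description, evalue in rank_hits(PFAM_HITS):
    print(f"{accession}\t{evalue:.1e}\t{description}")
```

With this cutoff the two Laminin G hits, the F5/8 type C hit and the Concanavalin A-like hit are kept. The EGF-like hit (0.0012) is dropped, although the InterPro rows report EGF-like domains independently.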
Expression level (RPKM)
Paralog/Ortholog genes
Paralogous genes
Orthologous genes
Species | Gene ID |
H. sapiens | ENSP00000320728 |
M. musculus | ENSMUSG00000031772 |
M. musculus | ENSMUSG00000039419 |
N. vitripennis | NV13304-PA |
A. mellifera | GB14382-PA |
H. sapiens | ENSP00000297668 |
H. sapiens | ENSP00000350863 |
H. sapiens | ENSP00000432863 |
H. sapiens | ENSP00000439733 |
H. sapiens | ENSP00000306893 |
H. sapiens | ENSP00000354778 |
M. musculus | ENSMUSG00000038048 |
H. sapiens | ENSP00000264638 |
H. sapiens | ENSP00000417628 |
H. sapiens | ENSP00000366787 |
T. castaneum | TC011252 |
H. sapiens | ENSP00000418741 |
A. gambiae | AGAP007545 |
H. sapiens | ENSP00000466571 |
H. sapiens | ENSP00000440732 |
H. sapiens | ENSP00000399013 |
H. sapiens | ENSP00000340890 |
H. sapiens | ENSP00000366881 |
H. sapiens | ENSP00000276974 |
P. humanus | PHUM259620-PA |
H. sapiens | ENSP00000366884 |
M. musculus | ENSMUSG00000067028 |
P. vanderplanki | Pv.04956 |
H. sapiens | ENSP00000366784 |
M. musculus | ENSMUSG00000017167 |
M. musculus | ENSMUSG00000033063 |
S. invicta | SI2.2.0_05420 |
H. sapiens | ENSP00000405700 |
M. musculus | ENSMUSG00000070695 |
D. melanogaster | FBgn0013997 |
B. mori | BGIBMGA002547-TA |
D. plexippus | DPOGS210220PA |
C. quinquefasciatus | CPIJ011417 |
A. aegypti | AAEL005321 |
H. sapiens | ENSP00000432883 |
H. sapiens | ENSP00000366887 |
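To summarise the ortholog list per species, the rows can be parsed and counted. The sketch below assumes one `Species | Gene ID |` row per line and uses only the first six rows as an excerpt. `parse_rows` is an illustrative helper.

```python
from collections import Counter

# Excerpt: the first six ortholog rows, one "Species | Gene ID |" pair per line.
ROWS = """\
H. sapiens | ENSP00000320728 |
M. musculus | ENSMUSG00000031772 |
M. musculus | ENSMUSG00000039419 |
N. vitripennis | NV13304-PA |
A. mellifera | GB14382-PA |
H. sapiens | ENSP00000297668 |
"""


def parse_rows(text):
    """Return (species, gene_id) pairs, skipping lines that do not hold two cells."""
    pairs = []
    for line in text.splitlines():
        cells = [cell.strip() for cell in line.split("|")]
        cells = [cell for cell in cells if cell]
        if len(cells) == 2:
            pairs.append((cells[0], cells[1]))
    return pairs


counts = Counter(species for species, _ in parse_rows(ROWS))
for species, n in counts.most_common():
    print(f"{species}\t{n}")
```

Running the same function over the full list would give the number of listed orthologs per species.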