MidgeBase gene description page [Pn.12066]

Outline

Link to gbrowse

Gene ID Pn.12066
Type Protein coding gene
Scaffold PnScaf15426
Start 614
End 6107
Direction -

Sequence

Transcript: 3936 (bp)

 ATGTATGGCGAGAGAGATATCAATATCGATCAAATTGTCGTACAATCAGAAGGTGGCAACGCATGGACATCAGCGATATCAGATTTCCAACAGTACTTGATTATTGATTTGGGCGCCACGAAAAACATTACGCGCATCTCCATTCAGGGACGTCCGCATCACAGCGAGTATGTCTCTGAGTTCAGCATCAGCTACGGCTACAATGGCCTCGACTACGCGGACTACAAAGAGCCCGGCGGAAACACAAAGACACTTAGCCGATCAGCATGGACTCCGGTTGAGAATACGTACAACCACTATCTGTCGATAGATCTTGGTGGAAGGAAAATGATAAGAAAAATTTCCACGCTCGGTCGTCCAAACACTCATGAATATGTAACTGAATATATTATAACATACTCGGACGATGGTGAATTGTGGAAGGCTTACACTAACAACGATGCTCAGGATAAGCTTTTTAAGGGAAACGACAACGGCGACGACGTGAAGCACAATGTCTTCGATGTGCCGATAATTGCCCAGTGGGTTCGTGTCAATCCGACGAGATGGAATGACAGAATCTCCCTTCGCGTCGAGCTCTACGGATGCGACTACTATGCCGAAACGGTCTACTTCAACGGAAGCAGCTTGCTGAGTCTCGACCTGCTTCGCGAGCCGATTTCTGCTTCGCGAGAGACGCTTCAGTTCAGATTCAAGACCTCACACGCGAACGGCATTCTCATGTACTCGAAGGGCACGCAGGGAGACTTTTTTGCGCTGCAGCTGTACGAAAACCGAATGATTCTCAACATCAACCTCGGCTCACATTCGATGAGCTCGTTCTCGGTCGGAAGCCTGCTCGATGACAACGTGTGGCACGATGTCGTCATTTCGAGAAATCGCCGGGACATTTTGTTCTCCGTCGATCGTGTGGTGGTCGATGGAAAGATTAAGGGAGACTTCGACAAGCTGGATCTCAATCGAATGCTTTTTGTGGGCGGCGTTCCGTCGAGAGACGAAGGCCTCGTTGTTACGCAAAACTTCACTGGCTGCATCGAGAACTTGTACTTGAACACGACGAACTTCATTCGCGAAATGAAGGAGGCCTATTCGGAGGGTCAATACAGTCGCTTCGATAAGGTCAACACTCTGTACAACTGCCCCGATCCGCCCATCATGCCCGTCACCTTCCTCACGCGGACGTCCTACGCGAAGCTAAAGGGCTATGAGGGCGTGAAGAGCCTGAACGTCTCGCTTTCGTTCAGAACTTACGAGGACAAGGGACTGATTATTTACCACGAGTTTACTTCCAAGGGATACGTGAAGGTCTTCCTGGAGGACGGACGCGTGAAGACCGAAATAAAAACGGACGAAAAGGAATTTCATAGCACCGGCGATCATCGTCGCGGCATCGTTTTGGACAACTATGACGAGCAGTTTAATGACGGCAGATGGCACTCGCTGATTTTGACAATCAAGGAGAACAGCTTGATCATTGAGATCGATCAGCGACCTATGAGGACGGAAAAGTTGTTTAAGATCTTGACTGGAGCTTATTACTACATCGGAGGTTCGAAGACCAAAGAGATTCACAACATCATGGGCAATCGGGACGGATTTCTCGGATGCATGCGTCAGATTTCCGTCGACGGCAACTTCAAGCTTCCGCACGATTGGAAAGACGAGGACTTCTGCTGCAAAAATGAAATTCTCATGGACGCCTGTCACATGGTTGACCGTTGCAATCCGAACCCCTGCAAGCACAACGGTGTTTGTCGGCAAAATTCCTACGAATTTTTTTGTGACTGCGGCAATAGTGGATATTCTGGCGCTGTGTGCCATACTTCTTTGAACCCGCTATCTTGTCAAGCTTTCAAAAACGTGCAGACAGTCGGCCAAAAAGCAAACATCAAGATAGACGTCGACGGTTCCGGTCCACTCGATCCATTCGATGTCACGTGCGAGTTCTTGACAGACGGGCGTGTGCTCACCGTTCTGGGTCATTCGTCTGAGCATTCAACCGTGGTCGACAGTTTCCAGGAGCCAGGTTCCTACGATCAAACCATCGAGTACAATGCGAGAATGCCACAAATTGAAGCTCTCTTGAATCGCTCGCGAGAGTGCTCGCAAAGATTGATCTACAGCTGCCGTAATGCGAGACTTTTCAATTCGCCATCTGACGAGCTGAGCTTCCGACCATTTGGCTGGTGGCTGTCGAGACAAAATCAGATGATGGACTATTGGGCGGGCGCCTTGAAGGGCTCGCGCAAGTGCCAGTGCGGCATCGTCGGAAACTGTGTCGACCCGACCAAATGGTGCAATTGCGATGCCAACACGTACGATTGGCTCGAAGACAGCGGCGAAATAAAAGACAAGGAATACCTGCCGGTCAGAGGTCTCCGATTTGGCGACACGGGAACGGCGCTCGATGAGAAGCAAGCGAAATACACGCTCGGTCCTCTCATCTGCGAGGGAGACGATCTTTTCAATAATGTGGTGACATTCAGAATCACTGACGCAACAATCAACTTGCCTCGCTTCGACATGGGACACTCTGGCGACATTTACTTGGAGTTTAAGACGACGCAGGAAAATGCCGTCATTTTCCACGCTACTGGCCACACTGACTACATAAAGCTGTCAATAATCGATGGAAACAAGCTGAAATTCCAATATAGGGCTGGTTCGGGTCCGTTGTCGGTTGACGTTCTCACATCGTATCCTCTGAATGATAACAACTGGCACTCGGTGAGTGTCGAGAGAAACCGAAAAGAAGCTCGTTTGGTTGTGGACGGAGCCACAAAATCAGAAGTTCGTGAACCGCCAGGACCTGTTCGTGCTCTCTACTTGACCTCTGAGCTATCGATTGGCGCTACGCTCGACTATCAAGACGGCTTCGTTGGATGCATTCGCGCTCTGCTCTTGAACGGCAATCCTATAGACCTCAAATCGTATGCCGAGAGGAAGCTGTACGGCGTCTCGGCAGGCTGCGTTGGTCGATGCGAGAGCAGTCCTTGTCTCAACAACGGAACATGTTTTGAAAGATATGATGGCTTCACTTGCGATTGCCGCTGGAGCTCCTTCAAGGGACCAATCTGCGCTGACGAAATCGGTGTCAACCTGAGATCGGACTCGATAATCAAATACGACTTCTTGGGATCGTGGCGCTCAACAATTTCGGAAAATATCCGCGTCGGCTTCACAACGACAAACCAGAAAGGCTTCCTCCTCGGCTTCAGCTCAAACATCACTGGAGAGTATCTCACCATACTCGTTTCGAACTCGGGTGCTTTGAAGTTTGTCTTTGATTTTGGCTTCGAAAGACAGGAGCTTTCCTTCCCGGGCGTTCACTTCGGCCTTGGTCAATTTCACGACGTTCGCTTCATGAGAAAAAATTCCGGATCAACTGTCGTCATTATTGTCGACAACTACGAGCCGAAGGAGTTTCATTTCGACATCAAGGACTCGGCAGACGCCCAATTCAACAACATACAGTACATGTACATTGGCAAGAACGAGAGTATGACTGACGGTTTTGTTGGATGCGTTTCCCGAGTTGAATTCGATGACATCTTCCCGCTGAAGCTTCTCTTCCAACAAAATCCTCCGCCAAACGTGAAATCGATGGGTCCATCTCTCCTAACGGAAGACTTTTGCGGCGTTGAACCGGTCACCTTGCCTCCGGTCATCAAGGAAACCAGACCACCGCCGATTATTGACGAGGATAAGCTGAGAAGTTACGACGGAGTCAGTGCAGGATTCCTAGGAAGTCTCTTGTTCATCATTCTTTTGTTGCTGCTGATAATGGCAATTCTGATTTATCGTCACATGTCTCGCCATAAGGGTGAATATTTGACCCAAGAGGATAAGGGAGCAGATGACGCATTAGATCCAGATGATGCCGTTGTGCACTCAACGACCGGTCACCATGTCACCAAAAAGAAAGAATGGTTCATT 

Protein: 1312 (aa)

 MYGERDINIDQIVVQSEGGNAWTSAISDFQQYLIIDLGATKNITRISIQGRPHHSEYVSEFSISYGYNGLDYADYKEPGGNTKTLSRSAWTPVENTYNHYLSIDLGGRKMIRKISTLGRPNTHEYVTEYIITYSDDGELWKAYTNNDAQDKLFKGNDNGDDVKHNVFDVPIIAQWVRVNPTRWNDRISLRVELYGCDYYAETVYFNGSSLLSLDLLREPISASRETLQFRFKTSHANGILMYSKGTQGDFFALQLYENRMILNINLGSHSMSSFSVGSLLDDNVWHDVVISRNRRDILFSVDRVVVDGKIKGDFDKLDLNRMLFVGGVPSRDEGLVVTQNFTGCIENLYLNTTNFIREMKEAYSEGQYSRFDKVNTLYNCPDPPIMPVTFLTRTSYAKLKGYEGVKSLNVSLSFRTYEDKGLIIYHEFTSKGYVKVFLEDGRVKTEIKTDEKEFHSTGDHRRGIVLDNYDEQFNDGRWHSLILTIKENSLIIEIDQRPMRTEKLFKILTGAYYYIGGSKTKEIHNIMGNRDGFLGCMRQISVDGNFKLPHDWKDEDFCCKNEILMDACHMVDRCNPNPCKHNGVCRQNSYEFFCDCGNSGYSGAVCHTSLNPLSCQAFKNVQTVGQKANIKIDVDGSGPLDPFDVTCEFLTDGRVLTVLGHSSEHSTVVDSFQEPGSYDQTIEYNARMPQIEALLNRSRECSQRLIYSCRNARLFNSPSDELSFRPFGWWLSRQNQMMDYWAGALKGSRKCQCGIVGNCVDPTKWCNCDANTYDWLEDSGEIKDKEYLPVRGLRFGDTGTALDEKQAKYTLGPLICEGDDLFNNVVTFRITDATINLPRFDMGHSGDIYLEFKTTQENAVIFHATGHTDYIKLSIIDGNKLKFQYRAGSGPLSVDVLTSYPLNDNNWHSVSVERNRKEARLVVDGATKSEVREPPGPVRALYLTSELSIGATLDYQDGFVGCIRALLLNGNPIDLKSYAERKLYGVSAGCVGRCESSPCLNNGTCFERYDGFTCDCRWSSFKGPICADEIGVNLRSDSIIKYDFLGSWRSTISENIRVGFTTTNQKGFLLGFSSNITGEYLTILVSNSGALKFVFDFGFERQELSFPGVHFGLGQFHDVRFMRKNSGSTVVIIVDNYEPKEFHFDIKDSADAQFNNIQYMYIGKNESMTDGFVGCVSRVEFDDIFPLKLLFQQNPPPNVKSMGPSLLTEDFCGVEPVTLPPVIKETRPPPIIDEDKLRSYDGVSAGFLGSLLFIILLLLLIMAILIYRHMSRHKGEYLTQEDKGADDALDPDDAVVHSTTGHHVTKKKEWFI 
Type Start End Length
CDS 617 804 188
CDS 878 1012 135
CDS 1080 1618 539
CDS 1720 1987 268
CDS 2048 2191 144
CDS 2255 2761 507
CDS 2824 3153 330
CDS 3277 4648 1372
CDS 4786 4982 197
CDS 5268 5274 7
CDS 5465 5661 197
CDS 6056 6107 52
intron 805 877 73
intron 1013 1079 67
intron 1619 1719 101
intron 1988 2047 60
intron 2192 2254 63
intron 2762 2823 62
intron 3154 3276 123
intron 4649 4785 137
intron 4983 5267 285
intron 5275 5464 190
intron 5662 6055 394

Auto annotation result

Program/Analysis Accession Description Score/Expectation
BLASTP/NCBI-nr XP_001861636 neurexin-4 [Culex quinquefasciatus] gb|EDS35896.1| neurexin-4 [Culex quinquefasciatus] 0.0
InterPro IPR000421 Coagulation factor 5/8 C-terminal type domain
InterPro IPR006210 Epidermal growth factor-like
InterPro IPR000742 Epidermal growth factor-like domain
InterPro IPR013320 Concanavalin A-like lectin/glucanase, subgroup
InterPro IPR008985 Concanavalin A-like lectin/glucanase
InterPro IPR001791 Laminin G domain
InterPro IPR008979 Galactose-binding domain-like
Gene Ontology(BP) GO:0007155 cell adhesion
Gene Ontology(MF) GO:0005515 protein binding
Pfam PF02210.19 Laminin G domain 4.5e-84
Pfam PF00054.18 Laminin G domain 3.5e-53
Pfam PF00008.22 EGF-like domain 0.0012
Pfam PF00754.20 F5/8 type C domain 4.6e-36
Pfam PF13385.1 Concanavalin A-like lectin/glucanases superfamily 1.2e-14

Expression level (RPKM)

Paralog/Ortholog genes

Paralogous genes

Gene ID

Orthologous genes

Species Gene ID
H. sapiens ENSP00000320728
M. musculus ENSMUSG00000031772
M. musculus ENSMUSG00000039419
N. vitripennis NV13304-PA
A. mellifera GB14382-PA
H. sapiens ENSP00000297668
H. sapiens ENSP00000350863
H. sapiens ENSP00000432863
H. sapiens ENSP00000439733
H. sapiens ENSP00000306893
H. sapiens ENSP00000354778
M. musculus ENSMUSG00000038048
H. sapiens ENSP00000264638
H. sapiens ENSP00000417628
H. sapiens ENSP00000366787
T. castaneum TC011252
H. sapiens ENSP00000418741
A. gambiae AGAP007545
H. sapiens ENSP00000466571
H. sapiens ENSP00000440732
H. sapiens ENSP00000399013
H. sapiens ENSP00000340890
H. sapiens ENSP00000366881
H. sapiens ENSP00000276974
P. humanus PHUM259620-PA
H. sapiens ENSP00000366884
M. musculus ENSMUSG00000067028
P. vanderplanki Pv.04956
H. sapiens ENSP00000366784
M. musculus ENSMUSG00000017167
M. musculus ENSMUSG00000033063
S. invicta SI2.2.0_05420
H. sapiens ENSP00000405700
M. musculus ENSMUSG00000070695
D. melanogaster FBgn0013997
B. mori BGIBMGA002547-TA
D. plexippus DPOGS210220PA
C. quinquefasciatus CPIJ011417
A. aegypti AAEL005321
H. sapiens ENSP00000432883
H. sapiens ENSP00000366887