Basic Information

Insect
Gerris buenoi
Gene Symbol
nfxl1
Assembly
GCA_001010745.2
Location
KZ651493.1:238064-246636[+]
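The Location value appears to pack a scaffold accession, a start and end coordinate, and a strand into one string. A minimal sketch for splitting it into fields follows; the names `Location` and `parse_location` are hypothetical, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int
    strand: str

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


# Layout assumed from the record: <seqid>:<start>-<end>[<strand>]
LOCATION_RE = re.compile(
    r"(?P<seqid>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]"
)


def parse_location(text: str) -> Location:
    """Parse a string such as 'KZ651493.1:238064-246636[+]'."""
    match = LOCATION_RE.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        seqid=match["seqid"],
        start=int(match["start"]),
        end=int(match["end"]),
        strand=match["strand"],
    )


loc = parse_location("KZ651493.1:238064-246636[+]")
print(loc.seqid, loc.start, loc.end, loc.strand, loc.length)
```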

Transcription Factor Domain

TF Family
zf-NF-X1
Domain
zf-NF-X1 domain
PFAM
PF01422
TF Group
Zinc-Coordinating Group
Description
This domain is presumed to be a zinc-binding domain. The zinc finger is described by the following pattern: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
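The pattern in the description can be approximated with a regular expression. The sketch below is one possible reading, not the Pfam model itself: it assumes that "C(H/C)" means a Cys followed by one His-or-Cys position, and that "X3" means exactly three residues. The names `NFX1_PATTERN` and `find_motifs` are hypothetical.

```python
import re

# Assumed reading of C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C:
#   C, 1-6 any, H, 1 any, C, 3 any, C, H-or-C, 3-4 any, H-or-C, 1-10 any, C
NFX1_PATTERN = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_motifs(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for non-overlapping matches."""
    sequence = protein.upper().rstrip("*")
    return [(m.start(), m.group()) for m in NFX1_PATTERN.finditer(sequence)]


# Synthetic example built to satisfy the pattern, padded with Gly on each side.
example = "GG" + "CAAHACAAACHAAACAAC" + "GG"
print(find_motifs(example))
```

A regular expression is a much cruder test than the profile HMM behind PF01422, so matches found this way on a real protein should be treated as candidates only.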
Hmmscan Out
# of c-Evalue i-Evalue score bias hmm_from hmm_to ali_from ali_to env_from env_to acc
1 20 0.54 1.5e+04 -2.3 0.6 15 18 123 126 122 127 0.81
2 20 2 5.5e+04 -5.0 1.9 15 19 165 169 165 169 0.85
3 20 0.69 1.9e+04 -2.6 0.9 4 10 200 206 200 206 0.91
4 20 1.8e-08 0.00049 21.6 16.9 1 19 214 231 214 231 0.99
5 20 1.7 4.6e+04 -3.9 0.8 6 18 256 260 255 261 0.49
6 20 1.7e-07 0.0048 18.5 16.1 3 18 268 283 266 284 0.91
7 20 1.4 3.7e+04 -3.6 1.0 6 10 309 313 308 313 0.88
8 20 7.2e-05 2 10.1 15.8 1 19 319 338 319 338 0.91
9 20 4.4e-06 0.12 14.0 13.3 1 18 373 390 373 391 0.97
10 20 1.8e-05 0.49 12.0 15.8 1 18 425 442 425 443 0.94
11 20 8.2e-10 2.3e-05 25.9 12.0 1 19 452 470 452 470 0.98
12 20 0.59 1.6e+04 -2.4 1.0 5 10 497 502 497 502 0.94
13 20 0.00064 18 7.1 4.2 9 19 517 527 515 527 0.89
14 20 0.0024 67 5.2 7.6 1 12 536 546 536 546 0.95
15 20 2 5.5e+04 -6.3 3.1 14 18 575 579 575 579 0.79
16 20 2 5.5e+04 -4.2 0.4 9 11 597 599 597 600 0.73
17 20 9.7e-05 2.7 9.7 6.5 1 12 616 627 606 627 0.91
18 20 0.02 5.6e+02 2.3 12.5 4 18 652 666 650 667 0.92
19 20 2.6e-05 0.7 11.5 14.5 1 17 712 727 712 732 0.90
20 20 0.35 9.6e+03 -1.7 0.7 5 10 761 766 761 766 0.95
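The rows above are whitespace-separated with 13 fields each, so they can be loaded and filtered by independent E-value. The following is a minimal sketch; the `DomainHit` and function names are hypothetical, and the 0.01 cutoff is an illustrative assumption rather than a threshold used by the database.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    index: int
    total: int
    c_evalue: float
    i_evalue: float
    score: float
    bias: float
    hmm_from: int
    hmm_to: int
    ali_from: int
    ali_to: int
    env_from: int
    env_to: int
    acc: float


INT_COLUMNS = {0, 1, 6, 7, 8, 9, 10, 11}


def parse_domain_rows(text: str) -> list[DomainHit]:
    """Parse domain rows; lines that do not start with a row number are skipped."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 13 or not fields[0].isdigit():
            continue  # header or blank line
        values = [
            int(f) if i in INT_COLUMNS else float(f) for i, f in enumerate(fields)
        ]
        hits.append(DomainHit(*values))
    return hits


def significant(hits: list[DomainHit], max_i_evalue: float = 0.01) -> list[DomainHit]:
    # Assumed cutoff for illustration only.
    return [h for h in hits if h.i_evalue <= max_i_evalue]


rows = """\
1 20 0.54 1.5e+04 -2.3 0.6 15 18 123 126 122 127 0.81
4 20 1.8e-08 0.00049 21.6 16.9 1 19 214 231 214 231 0.99
11 20 8.2e-10 2.3e-05 25.9 12.0 1 19 452 470 452 470 0.98
"""
for hit in significant(parse_domain_rows(rows)):
    print(hit.index, hit.ali_from, hit.ali_to, hit.i_evalue)
```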

Sequence Information

Coding Sequence
ATGGAGAGGAATAGAAATAATCCGTGGAACAGGAACCCTCCGAAGGGTAACAGATTGTCCGGAGCGAATGGGAACTTGGCTGTTAGAAACCCTTCTGGTGGAGAGGCAAAGTTTAGAGAAGCCCAGGAAAGAATGCACCAATCTGTTAAAAAACATTTGGAAAATGACTATGAGTCTTCCGAAGAAGAGGAAGAGCTTGAAACTCAGCCTATTcttgAGTCCGTATTGAAATCATATTCACAGCTTGGTGGACACATTGGAGATTTAGAAAGAACTCAGTCTCATTTGGAAAATACTTTTCAATCTGGATCAGCAGTTTGTCTCATTTGTATTGCAACTGTAAGGAGAGCTGACAAGATTTGGAGTTGTCAATCATGTTATGGATTCTTTCACCTCAACTGTATTCAAAGATGGGCAAAAGATAGCTTAGCACAACAGAAATTAATTGCTGAAGACCGCCCTAATTTGAAatcattagtttttaaatggttatgtccCAAATGTAGACAAGAGTACCAACCAGATGTTATTcctaaaaaatatacatgtttcTGTGGTCAAACAGAAGATCCAGAATATCAGCAGTGGCTAGTACCACATTCATGTGGAGAGAGATGTAATCGAGCATTGCAGCCAGTTTGTGGTCATACATGTCTTCTACTTTGTCATCCAGGTCCTTGTCCACCATGTCCAAAAGTTATCCAGGCAAAATGCTTCTGTAAACGTCAAACAAAGCAAGCAAGATGTAGTCATAAGTCATGGTCTTGCGGCAAGCCTTGTTCAACTCTCCTAGCTTGTAAAAAGCACAAGTGTCAGTCTCAATGTCATGACGGACCATGTCCTCCTTGTGATAAAACTGAAGCATTCACCTGCCTTTGTGGGAATAAGTCTAAACGTATTCCATGTGCTCAGCTATCATGGCAATGTGACAAGGTTTGTGGTAAGCCATTCTCCTGTGGTTTGCATAAGTGTGATATCATTTGTCATCCAGGTGGTGGGTGTGGCCCATGCCCAAGGACTCAGCCTAGGTCTTGTCCTTGTGGAAAATCTGTCCAAATACTACCTTGTGAACAAGATATACCAACTTGCGGTGGAACATGTGATAAAATACTTGACTGTGGTTTACACAGGTGTCCTCAACGATGTCATAACTCTCCTTGCGGATCGTGTCTGGAAATAATTGAGAAAAGATGCCGTTGTGGTTTCAATTCAAAGGAAGTATCTTGTCGCAAGGATTATTTGTGTGAAATAAAGTGTAAAAGAACTAGAGACTGCAATATACACCCATGTAACAGAAAGTGCTGTGATGGTAACTGTCCACCATGTGAGAAACCCTGTGGAAAGACTTTAAGTTGTGGTCAGCATAAATGTAATTCTGTATGCCACAGAGGTTTGTGCTACCCTTGTCAACAAACCAAGCCTCTATCTTGCAGATGTGGCATGACATCTGTCATTGTACCttgtgttaaaagaaaattacgTCCACCTAAATGTAACAAATTGTGCAGAATACCGCCTGATTGTGATCATCCTAGTCGAGAGAAGCATAGATGTCACTTTGGTTCATGTCCAACTTGTCGACAGGTCTGTGGTAAAACTCTGTCATGCGGTCATACCTGTCCATCAAACTGTCATACAGCTGTACTTTCCTATTCAGAGCCCAATGCAAAACCAgctacaccttgcgatttgGTAAATATGTCAGTGGAAATGAAAAAATTGCCTTGTCCTTTGTGCATCGTTCCAGTAGAAGTTACATGTCTTGGTAAGCACGAAACTATTCATGTTCCATGTCACAGAGCTACACCCACAAGCTGTGGGAGGCCATGTGGAAGACGGCTTGACTGTACTAATCACACTTGTCAGCTTCCATGCCACAAAGTACTTGATGAGAATGGGAAGAAGAAATGTGAAGAGTGTGATAAAAGTTGTCAACTTAAGAGGACTTGTGATCATCAGTGCAAACGATGGTGCCATCCACCTCCCTGTGCTGATTGTAA
CATAAGCATCAAGATGCCTTGCCACTGTAATGTCACATTTGTCTTTGCAAAGTGCCACGAATGGCAAGCTGCAAACAATGAACAGAAAAATCTTCTTCTTTCTTGCAGCAATAATTGCCCTAAACTTCTGAGCTGTGGACACAGATGTGGTAAAAAATGCCATGAAGGAGAATGTACCCCCCAGTCATCTTGTAAAAGGAAAACTAAAGTAACTTGCCCATGTAAACGTCTTAAGAAAGAATTTCAATGTTCTCTTGTCACTTCAGGATCTACTAATTTAACTTGTGATGAACTTTGCAAtgaaatcaaacaaaataaaattaagGAGGTAGAGAGAAGCCAGGAGCAAGCTCGTATAGCAGAAGAGAAAAAGAACAAAGAAGAATTAGAGAAATACGAGAAGAAGATGCAAGGAAGAAAAAGGTCAAGGAAGAGAATAGACAGCCAGCAGAATACTGATGATAATTtcttaatgaaatataaatatttaattttagccacTAGTTCCATTGTCACTGCCGTAAttgcatttatatatttcaagtaA
Protein Sequence
MERNRNNPWNRNPPKGNRLSGANGNLAVRNPSGGEAKFREAQERMHQSVKKHLENDYESSEEEEELETQPILESVLKSYSQLGGHIGDLERTQSHLENTFQSGSAVCLICIATVRRADKIWSCQSCYGFFHLNCIQRWAKDSLAQQKLIAEDRPNLKSLVFKWLCPKCRQEYQPDVIPKKYTCFCGQTEDPEYQQWLVPHSCGERCNRALQPVCGHTCLLLCHPGPCPPCPKVIQAKCFCKRQTKQARCSHKSWSCGKPCSTLLACKKHKCQSQCHDGPCPPCDKTEAFTCLCGNKSKRIPCAQLSWQCDKVCGKPFSCGLHKCDIICHPGGGCGPCPRTQPRSCPCGKSVQILPCEQDIPTCGGTCDKILDCGLHRCPQRCHNSPCGSCLEIIEKRCRCGFNSKEVSCRKDYLCEIKCKRTRDCNIHPCNRKCCDGNCPPCEKPCGKTLSCGQHKCNSVCHRGLCYPCQQTKPLSCRCGMTSVIVPCVKRKLRPPKCNKLCRIPPDCDHPSREKHRCHFGSCPTCRQVCGKTLSCGHTCPSNCHTAVLSYSEPNAKPATPCDLVNMSVEMKKLPCPLCIVPVEVTCLGKHETIHVPCHRATPTSCGRPCGRRLDCTNHTCQLPCHKVLDENGKKKCEECDKSCQLKRTCDHQCKRWCHPPPCADCNISIKMPCHCNVTFVFAKCHEWQAANNEQKNLLLSCSNNCPKLLSCGHRCGKKCHEGECTPQSSCKRKTKVTCPCKRLKKEFQCSLVTSGSTNLTCDELCNEIKQNKIKEVERSQEQARIAEEKKNKEELEKYEKKMQGRKRSRKRIDSQQNTDDNFLMKYKYLILATSSIVTAVIAFIYFK*
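The coding sequence and the protein sequence can be checked against each other by translating the former. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. It removes whitespace and upper-cases the input first, because the record wraps the coding sequence across lines and mixes upper- and lower-case letters. The name `translate` is hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; '*' marks a stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    usable = len(seq) - len(seq) % 3    # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for codons with non-ACGT bases
        for i in range(0, usable, 3)
    )


# First 18 bases of the coding sequence above.
print(translate("ATGGAGAGGAATAGAAAT"))
```

Comparing `translate(full_cds)` with the listed protein, including its trailing `*`, is a quick consistency check once both sequences have been copied out of the record.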

Similar Transcription Factors

Sequence clustering based on sequence similarity using MMseqs2

100% Identity
iTF_00744284;
90% Identity
iTF_00744284;
80% Identity
-