Vcin023062.1
Basic Information
- Insect
- Villa cingulata
- Gene Symbol
- nfxl1
- Assembly
- GCA_951394055.1
- Location
- OX596021.1:3160458-3163541[+]
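The Location string above packs a sequence accession, a coordinate range, and a strand into one field. A minimal sketch of splitting it apart follows. The `Locus` type and the `parse_locus` and `span_length` helpers are hypothetical names introduced only for illustration, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """Hypothetical container for one Location field."""
    seqid: str
    start: int
    end: int
    strand: str


# Assumed layout: "seqid:start-end[strand]", e.g. "OX596021.1:3160458-3163541[+]".
_LOCUS_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_locus(text: str) -> Locus:
    """Split a Location string into accession, start, end, and strand."""
    m = _LOCUS_RE.match(text.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Locus(m["seqid"], int(m["start"]), int(m["end"]), m["strand"])


def span_length(locus: Locus) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return locus.end - locus.start + 1


locus = parse_locus("OX596021.1:3160458-3163541[+]")
print(locus.seqid, locus.strand, span_length(locus))
```

Under the inclusive-coordinate assumption the span covers 3084 bases on the plus strand of OX596021.1.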
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
 #  of  c-Evalue  i-Evalue  score  bias  hmm_from  hmm_to  ali_from  ali_to  env_from  env_to   acc
 1  21         2   4.3e+04   -5.1   1.9        15      19       205     209       205     209  0.85
 2  21       1.6   3.4e+04   -3.8   1.3         4      10       240     246       240     246  0.86
 3  21   1.9e-08   0.00041   21.5  16.9         1      19       254     271       254     271  0.99
 4  21   6.8e-09   0.00014   23.0  13.2         1      18       307     325       307     326  0.97
 5  21       1.4   3.1e+04   -3.7   1.8         6      10       351     355       350     355  0.89
 6  21   2.9e-10   6.3e-06   27.3  11.4         1      19       361     379       361     379  0.98
 7  21      0.95     2e+04   -3.1   0.5         1       7       404     409       403     411  0.49
 8  21   7.4e-06      0.16   13.2   9.5         1      18       414     431       414     432  0.97
 9  21      0.19   4.1e+03   -0.9   0.5         1      10       441     450       441     450  0.75
10  21   3.3e-05      0.71   11.2  15.8         1      18       466     483       466     484  0.91
11  21   5.1e-08    0.0011   20.1  12.3         3      18       495     510       493     511  0.91
12  21         2   4.3e+04   -4.3   1.5         5      10       539     544       539     544  0.83
13  21       1.7   3.6e+04   -3.9   0.5         9      12       549     552       548     554  0.72
14  21     0.037   7.9e+02    1.4   5.1        10      18       558     566       555     567  0.91
15  21     0.028     6e+02    1.8   5.7         4      12       581     589       579     589  0.89
16  21      0.34   7.3e+03   -1.7   0.4         6      10       647     651       646     653  0.89
17  21   0.00063        14    7.1   5.9         1      12       657     668       651     668  0.79
18  21         2   4.3e+04   -4.5   1.9         5      10       683     688       683     688  0.89
19  21    0.0075   1.6e+02    3.6  16.0         1      18       696     712       696     713  0.93
20  21   1.2e-05      0.26   12.5   9.6         1      18       764     783       764     784  0.85
21  21         2   4.3e+04   -4.2   1.3         6      10       814     818       813     818  0.85
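Most of the 21 hmmscan domain rows above have large E-values and negative scores, so a reader will usually want only the better-supported repeats. The sketch below parses the rows as whitespace-separated fields and keeps those under an independent E-value cutoff. The cutoff of 0.01 is an arbitrary assumption made for illustration, not a threshold used by this database, and `DomainHit`, `parse_rows`, and `confident_hits` are hypothetical names.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """Hypothetical record holding a subset of the hmmscan domain columns."""
    index: int
    c_evalue: float
    i_evalue: float
    score: float
    ali_from: int
    ali_to: int


# Field order: #, of, c-Evalue, i-Evalue, score, bias, hmm from, hmm to,
# ali from, ali to, env from, env to, acc (values copied from the record).
HMMSCAN_ROWS = """\
1 21 2 4.3e+04 -5.1 1.9 15 19 205 209 205 209 0.85
2 21 1.6 3.4e+04 -3.8 1.3 4 10 240 246 240 246 0.86
3 21 1.9e-08 0.00041 21.5 16.9 1 19 254 271 254 271 0.99
4 21 6.8e-09 0.00014 23.0 13.2 1 18 307 325 307 326 0.97
5 21 1.4 3.1e+04 -3.7 1.8 6 10 351 355 350 355 0.89
6 21 2.9e-10 6.3e-06 27.3 11.4 1 19 361 379 361 379 0.98
7 21 0.95 2e+04 -3.1 0.5 1 7 404 409 403 411 0.49
8 21 7.4e-06 0.16 13.2 9.5 1 18 414 431 414 432 0.97
9 21 0.19 4.1e+03 -0.9 0.5 1 10 441 450 441 450 0.75
10 21 3.3e-05 0.71 11.2 15.8 1 18 466 483 466 484 0.91
11 21 5.1e-08 0.0011 20.1 12.3 3 18 495 510 493 511 0.91
12 21 2 4.3e+04 -4.3 1.5 5 10 539 544 539 544 0.83
13 21 1.7 3.6e+04 -3.9 0.5 9 12 549 552 548 554 0.72
14 21 0.037 7.9e+02 1.4 5.1 10 18 558 566 555 567 0.91
15 21 0.028 6e+02 1.8 5.7 4 12 581 589 579 589 0.89
16 21 0.34 7.3e+03 -1.7 0.4 6 10 647 651 646 653 0.89
17 21 0.00063 14 7.1 5.9 1 12 657 668 651 668 0.79
18 21 2 4.3e+04 -4.5 1.9 5 10 683 688 683 688 0.89
19 21 0.0075 1.6e+02 3.6 16.0 1 18 696 712 696 713 0.93
20 21 1.2e-05 0.26 12.5 9.6 1 18 764 783 764 784 0.85
21 21 2 4.3e+04 -4.2 1.3 6 10 814 818 813 818 0.85
"""


def parse_rows(text: str) -> List[DomainHit]:
    """Parse whitespace-separated domain rows; skip lines without 13 fields."""
    hits = []
    for line in text.splitlines():
        f = line.split()
        if len(f) != 13:
            continue
        hits.append(DomainHit(int(f[0]), float(f[2]), float(f[3]),
                              float(f[4]), int(f[8]), int(f[9])))
    return hits


def confident_hits(hits: List[DomainHit], max_i_evalue: float = 0.01) -> List[DomainHit]:
    """Keep hits whose independent E-value is below the (assumed) cutoff."""
    return [h for h in hits if h.i_evalue < max_i_evalue]


hits = parse_rows(HMMSCAN_ROWS)
kept = confident_hits(hits)
for h in kept:
    print(f"domain {h.index}: residues {h.ali_from}-{h.ali_to}, "
          f"score {h.score}, i-Evalue {h.i_evalue:g}")
```

With this cutoff, domains 3, 4, 6, and 11 are retained. Filtering on the conditional E-value instead would keep more rows, so the choice of column and threshold should follow whatever criterion the analysis actually requires.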
Sequence Information
- Coding Sequence
- atgtcaaataaaaatgtatggaAAAATATTTCAACGACAACACAAAATTCTCATCCACAAAAGAAGCCGAATCCAAATCATCAAAAAGGAAAGgggaaaaataatacaaatacgaCAAAGGAATCGATGGGTGTGAATAATGATGGACTGTCGAAAAATTCTGTTCCCGTACCAGTAGTAACCAGTAAAAATAATGATGCAacattagaaaaatttgaagaaattcagaagaaaaatttaCGAAAAGCAAAAGAATTTGCCGAAGAATACAATTCTAGCTCAGATGAAGAAGATTTAAACACAAATGATCTTTTAAgtACGATTTTCAAAAAGTATTTTGGAGAAACTACACAGGTATCAAAGACACAAACATTTATTGAGAATTTTCTTCAGTCTGGATCAATTGTATGTCTCATCTGTATTGGTAGTGTTAAAAGGAACGATGGTGTTTGGTCATGTAAAAATTGTTactgtatttttcatttgaattgtaTTAAACGATGGGGAAATGATAGTATAGCACAACAAAAACTGTATTCAGACCAAGAACAGGGCTATTATAATAATGACGGTGAATATAtaccaaagaaaaataaaccGATACGATGGTATTGTCCAAAATGTCGAAAAGAAAATTCTCCCGCTGATATACCAAAATTTTATGAGTGTTTTTGCGGAAAAGAAATTAATCCACAAAATCATCCATGGCTAATACCACATTCTTGCGGTGAACAATGCGGAAAATATTTAGATCCAAATTGTGGGCATACGTGTTTATTATTATGTCATCCAGGACCATGCCCACCTTGTCCTCAGACAGTTTCGATATCTTGTAAATGCGGACGTTCTCCTCCAAAGTTTGTAAGATGTTTTCAGAAAATTTGGAGTTGTAATTCAAAATGTCTTCAAACACTGCCCTGCGgaatacataaatgtgaaaGCATCTGTCACGAAACAGGAAAATGTCCACCATGTAATaagaaaagtaaacaaaaatgtCAATGCGAAGCAGAAGTAGACGAAAGAAATTGCTGTGAATCGATTTGGCAATGTAAGAAAGTCTGCAATAAATTGCTGCCGTGTGGCATTCACAAATGTAAAAAGATTTGTCATGCTGGTGACTGCGGAAGTTGTTCTTTGGGACTGCCAAGAACATGTCCTTGTGGCAAAACAAAATCAGTTGCACCATGCACTGAAGCAATTGAAACTTGCGGGGATACATGCCAAAAATTATTAGCTTGCGGTGAACATTATTGCACAATGCGTTGTCATAAGAACGAATGTAGTCCATGCCTCACAATTGTACAAAAGAAATGTCGTTGTGGTTTACATACAAAGGAATTACCTTGTTCAAAGGTATTTTCGTGCGAGACAAAATGTAAACGATTAAGAGAATGCGAGAAGCATGCTTGCAATAAAAAATGTTGTGATGGTCAATGTCCTCCTTGTGACAAGATATGCAACAAGACTTTATCTTGTAAAAAGCATAAATGCAAATCAATATGTCACGATGGTCCTTGTTACCCGTGTGAATTGAAATCACAAATAAAATGTAGATGTGGATCTGCCGTAATAATTGTACCGTGTGGAAGAGAACGGAGAACGCGACCTCCAAAATGCCGCAAACCATGCAGAATACCATCAAAATGTCATCATCAAAATGCACACAATTGTCACATGGATGAATGCCCTCCatgtaataaaaaatgtgaACTTCGAAATGACACAACAAATTGTGATCATCCCTGTGAAGCTAAATGTCATGACGCTGTTAAGGTTTCTGTTAAAAAGAAGTCAAATAACATTTGGGATTGGAATCAAGAaaactttgaatATAAAAAGTTACCACATCCAAGATGCGAAATAAAAGTGATGGTTACTTGTCTGGGTGGACACGAAACTTGTTTATGGCCCTGTTGGAATTCAAAACCGACATCATGTCAGAAAATTTGCGGTCGTttattgaattgtaaaaatcATTATTGTCAAAAAATATGT
CATACCGTATCTAATCTAGAGGATATGAATGAACAGATAGAATGTGCTCGATGTGAAGAAGAATGTAAATTACTCCGACCAGAAGGATGTAGTCATCTATGTAAAAAACCTTGCCATCCGCCACCAtgtcaaaaatgtaattttacgATAAGAACAAATTGTCATTGCGGTTTATCGCAAGTGTTTTACAAATGTTCTGAATTCTATGATTTAAATTTGACTAAGGATGAACTTCAAATGATGCAAGAAAAACTAAAATGCTGTGGAAATAGATGCATTCGAAATTATCCTTGTGGACATCGATGTACGTTAATTTGTCATTCTGGTATATGCTCGAATCCAGAAAGttgcaaaaagaaaatgaaaatttattgcgAATGTAAAAATCTAAAAACTGATATTACATGTGAAAAATATCGTAATGGTTTCATAAATCTTCAATGTGATGAAACATGTTTACGTAAAAAAGAACATGAGGATAGAATTCGTCAACAACGTGAAGAGATTCAACAAAAAGCAGAAGAGGAAAAAAATCGTTTAGAATTAGAAATGTTTGAAAAGAAATTTggtaaaaagaaatataaagaacgtaaaaatgttgaaaaaatacaagaaaaaaatgattatcATAAACTATTATGGATTTGCAGTGGAGCACTTTTGGttataatatcaattttaatatttatttattcagcgcaataa
- Protein Sequence
- MSNKNVWKNISTTTQNSHPQKKPNPNHQKGKGKNNTNTTKESMGVNNDGLSKNSVPVPVVTSKNNDATLEKFEEIQKKNLRKAKEFAEEYNSSSDEEDLNTNDLLSTIFKKYFGETTQVSKTQTFIENFLQSGSIVCLICIGSVKRNDGVWSCKNCYCIFHLNCIKRWGNDSIAQQKLYSDQEQGYYNNDGEYIPKKNKPIRWYCPKCRKENSPADIPKFYECFCGKEINPQNHPWLIPHSCGEQCGKYLDPNCGHTCLLLCHPGPCPPCPQTVSISCKCGRSPPKFVRCFQKIWSCNSKCLQTLPCGIHKCESICHETGKCPPCNKKSKQKCQCEAEVDERNCCESIWQCKKVCNKLLPCGIHKCKKICHAGDCGSCSLGLPRTCPCGKTKSVAPCTEAIETCGDTCQKLLACGEHYCTMRCHKNECSPCLTIVQKKCRCGLHTKELPCSKVFSCETKCKRLRECEKHACNKKCCDGQCPPCDKICNKTLSCKKHKCKSICHDGPCYPCELKSQIKCRCGSAVIIVPCGRERRTRPPKCRKPCRIPSKCHHQNAHNCHMDECPPCNKKCELRNDTTNCDHPCEAKCHDAVKVSVKKKSNNIWDWNQENFEYKKLPHPRCEIKVMVTCLGGHETCLWPCWNSKPTSCQKICGRLLNCKNHYCQKICHTVSNLEDMNEQIECARCEEECKLLRPEGCSHLCKKPCHPPPCQKCNFTIRTNCHCGLSQVFYKCSEFYDLNLTKDELQMMQEKLKCCGNRCIRNYPCGHRCTLICHSGICSNPESCKKKMKIYCECKNLKTDITCEKYRNGFINLQCDETCLRKKEHEDRIRQQREEIQQKAEEEKNRLELEMFEKKFGKKKYKERKNVEKIQEKNDYHKLLWICSGALLVIISILIFIYSAQ
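The zinc finger pattern quoted in the Description field, C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, can be approximated as a regular expression and scanned against a protein sequence such as the one above. The sketch below is one literal reading of that pattern, in which "C(H/C)" is taken to mean a Cys followed by a His-or-Cys position; that reading is an assumption. `find_motifs` is a hypothetical helper, and the test fragment is a short stretch copied from the protein sequence in this record.

```python
import re
from typing import List, Tuple

# Literal translation of the described pattern (an assumed reading):
#   C  X(1-6)  H  X  C  X3  C  [HC]  X(3-4)  [HC]  X(1-10)  C
ZF_NFX1_RE = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_motifs(protein: str) -> List[Tuple[int, int, str]]:
    """Return (start, end, match) for each non-overlapping hit, 1-based inclusive."""
    return [(m.start() + 1, m.end(), m.group(0))
            for m in ZF_NFX1_RE.finditer(protein.upper())]


# Short fragment copied from the protein sequence of this record.
fragment = "YLDPNCGHTCLLLCHPGPCPPCPQT"
for start, end, text in find_motifs(fragment):
    print(start, end, text)
```

`re.finditer` reports only non-overlapping matches, so closely spaced repeats can be undercounted, and a regular expression is stricter than the profile HMM behind the hmmscan hits, so the two sets of positions need not coincide.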
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -