Csim032371.2
Basic Information
- Insect
- Calliopum simillimum
- Gene Symbol
- nfxl1
- Assembly
- GCA_951812925.1
- Location
- OX638371.1:100115983-100123187[-]
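The Location field packs the sequence accession, start, end, and strand into one string. A minimal sketch of how it could be split into parts is shown here. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of this database, and the length calculation assumes the coordinates are 1-based and inclusive, which the record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for the parts of a Location string."""
    seqid: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: <accession>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<seqid>[^:]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a Location string into accession, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(
        match["seqid"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


if __name__ == "__main__":
    loc = parse_location("OX638371.1:100115983-100123187[-]")
    print(loc.seqid, loc.start, loc.end, loc.strand, span_length(loc))
```

The span covers the whole locus on the genome, so it is expected to be longer than the coding sequence whenever the gene contains introns.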
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The following pattern describes the zinc finger: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
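| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 19 | 3 | 4.4e+04 | -8.1 | 6.4 | 15 | 19 | 176 | 180 | 175 | 180 | 0.82 |
| 2 | 19 | 0.2 | 2.9e+03 | -0.4 | 0.4 | 4 | 10 | 211 | 217 | 210 | 217 | 0.95 |
| 3 | 19 | 4.7e-08 | 0.00068 | 20.8 | 17.0 | 1 | 19 | 225 | 242 | 225 | 242 | 0.99 |
| 4 | 19 | 3 | 4.4e+04 | -4.9 | 2.3 | 6 | 10 | 268 | 272 | 268 | 272 | 0.96 |
| 5 | 19 | 0.0035 | 51 | 5.3 | 14.6 | 1 | 18 | 278 | 296 | 278 | 297 | 0.87 |
| 6 | 19 | 3.3e-07 | 0.0049 | 18.1 | 10.1 | 1 | 19 | 332 | 350 | 332 | 350 | 0.99 |
| 7 | 19 | 2.2e-07 | 0.0032 | 18.7 | 7.8 | 1 | 18 | 385 | 402 | 385 | 403 | 0.96 |
| 8 | 19 | 0.46 | 6.7e+03 | -1.5 | 0.7 | 1 | 5 | 412 | 416 | 412 | 421 | 0.75 |
| 9 | 19 | 3 | 4.4e+04 | -5.0 | 2.3 | 5 | 10 | 426 | 431 | 426 | 431 | 0.91 |
| 10 | 19 | 1e-05 | 0.15 | 13.4 | 14.7 | 1 | 18 | 437 | 454 | 437 | 455 | 0.98 |
| 11 | 19 | 9.5e-08 | 0.0014 | 19.9 | 13.1 | 3 | 18 | 466 | 481 | 458 | 482 | 0.85 |
| 12 | 19 | 0.48 | 7e+03 | -1.6 | 1.2 | 5 | 10 | 509 | 514 | 509 | 514 | 0.95 |
| 13 | 19 | 0.53 | 7.7e+03 | -1.7 | 12.1 | 9 | 18 | 527 | 536 | 520 | 537 | 0.83 |
| 14 | 19 | 0.0046 | 68 | 4.9 | 6.2 | 1 | 11 | 549 | 558 | 549 | 559 | 0.95 |
| 15 | 19 | 0.53 | 7.7e+03 | -1.7 | 1.2 | 6 | 10 | 619 | 623 | 618 | 623 | 0.85 |
| 16 | 19 | 6.3e-06 | 0.092 | 14.0 | 4.9 | 1 | 12 | 629 | 640 | 629 | 640 | 0.96 |
| 17 | 19 | 0.079 | 1.2e+03 | 0.9 | 18.9 | 3 | 18 | 669 | 684 | 668 | 685 | 0.91 |
| 18 | 19 | 3 | 4.4e+04 | -6.8 | 3.1 | 10 | 13 | 692 | 695 | 691 | 695 | 0.68 |
| 19 | 19 | 0.0027 | 40 | 5.6 | 11.2 | 4 | 15 | 738 | 749 | 726 | 755 | 0.90 |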
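The zinc finger pattern quoted in the Description, C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, can be approximated with a regular expression. The sketch below is one literal reading of that pattern, assuming that "C(H/C)" means a Cys followed by one His-or-Cys position. It is not how the domain hits in this record were produced (those come from hmmscan against the PF01422 profile), and `NFX1_PATTERN` and `find_fingers` are hypothetical names. The test fragment is copied from the Protein Sequence of this entry.

```python
import re

# Literal reading of the Description pattern (an assumption, see lead-in):
#   C  X(1-6)  H  X  C  X3  C  (H/C)  X(3-4)  (H/C)  X(1-10)  C
NFX1_PATTERN = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_fingers(protein: str) -> list[tuple[int, int, str]]:
    """Return (start, end, text) for non-overlapping pattern matches.

    Coordinates are 1-based and inclusive, to match the style of the
    hmmscan table; overlapping candidates are not reported.
    """
    sequence = "".join(protein.split()).upper()
    return [
        (m.start() + 1, m.end(), m.group())
        for m in NFX1_PATTERN.finditer(sequence)
    ]


if __name__ == "__main__":
    # Fragment taken from the Protein Sequence field of this record.
    fragment = "CGHKCLLLCHPGPCPPC"
    for start, end, text in find_fingers(fragment):
        print(start, end, text)
```

A regular expression only tests the spacing of the Cys and His residues, so it can both miss divergent repeats and report chance matches; the profile scores in the table above remain the better evidence for each repeat.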
Sequence Information
- Coding Sequence
- atgccgaTTGAGAATACTTCCTTCAGTAAAAAGTCtgataataatgaaaaaagcaaagttAAGTTAAAAAACAACACTACGGCTTCTTCAACTACGGCACAGAAATTTGAAGAAATCAAAAACAGAAACACAGAGAACGTGAAAAAATTGGTGGAAGGCTATGAATCAAGTTCTGATGAAGAAGAACTGGATGAAGGGAACATTTTAGATAAACTCTATAAGAACTATAAAGATAATGCCAGTGGACAACAGCAGCAACTTTTATCCAAAACATCAAGTTTCCTTGAAAATGTTTTACAGTCGGGAGCTGCTACCTGCCTCATCTGTATCGCGAGTGTAAAACGTGCCGATGAAATTTGGTCATGTCGACACTGCTTCtgcttttttcatttaaactGCATAAAACGTTGGGCTAATGATAGTATTGCTCAGGTGAAATCCAATCAGCAGCATTCTGACCAAGGTTATTACAATAATTTAGGTACATATATAtcaccaaaaaaaccaaaaagtccCAAATGGTGTTGTCCGCAATGCCGAACGGATTATGAACCGAATCAGCGACCTTTAAACTATGAGTGTTTCTGTGGGAAAGAGCTTAATCCCGCCCAACAAGCCTGGCTGGTGCCACATTCTTGCGGCGAAATTTGTGGCAAAAACTTATTGCCGGAATGTGGCCACAAATGTCTGCTGCTGTGTCATCCAGGACCGTGTCCACCTTGTCCTCAAAATGTGTACACGTCATGCAAGTGTGGAAAATCTAAAGCAAAATCAGTGAGATGTTTTACGAAGACATGGATGTGTGAGGAGAAGtgtaccaaattcctATCTTGTCAAAAACATCGATGTGGTCAAATATGTCACGAGCAAGAAAAATGTCCACCATGCAATCGCACCAgtaaacaaaaatgtatttgtaaAAGTGAGGAATCAATTCGCAAATGCTCAGAATTATTGTGGCAGTGCAAAAATattTGTAATCGCAAGTATGCTTGTGGAATTCATTTTTGCAAGAGAATTTGTCACTCTGATCCATGTGGTGAATGTCCGCTGGGATTCCAACGCTCTTGTCCTTGTGGAAAAACaaaaaaaatcggaccTTGTGATGAAACAATAGACTCGTGCGGTGATACATGTCAGAAAAAATTACCTTGTGGACTTCACTTTTGTACACAGAGGTGTCATAAAGGCGACTGTAGTTTGtGTTTGGTTATTaccaaaaaaatgtgtcgctgCGGAAAACATGAAAAAGAATTGCCATGTTGGAAATCATTTACATGTGAAACTAAGTGCAAGCAAATTCGTGAATGTGGAAAGCATACATGCAACAAAAAGtgcTGTGATAGTCTTTGCCCACCCTGCGATAAGGTTTGTGGAAAAAGCTTATCATGCAAGAAGCATAAATGCAAATCCATTTGCCATGATGGCCCTTGTTATCCTTGTGATTTGACTTCGCAAATCAATTGTCGTTGCGGtagtacaacaaaaaaagttccttGCGGTAGTGAGCGTACGGCGCGCGTAACATGTAATGAACCATGCagAATCCCATCTAGATGTCATCATACAAATCGTCATCGTTGTCATAAAAATGAATGTCCGCCATGTAATCAAAAATGTGGTCTTGTTAATGATTTCAGTGGATGTGGCCATTTATGTGAAGCTAAATGTCATGCAGCAGTAAAAACGTTTTTAAATGCAACAGCACAAAAGAAAACAGTATGGGATTATAACTCCAAAAATTTTGAGTTCAAAAAACTCCAACATCCGCAATGCGAGGTAAAAGTTATGGTAATATGCATTGGTGGTCATGAAAAGGCGGAATGGCCCTGCTGGAATTCAAAACCAACATCATGTCAACGACCATGtaatcgttttttaaaatgcGGCTGGCATAAATGTGAAAAAGTCTGTCATTCGGTGTTAGATAAAAGCAACATGCAGGAACAAGACGGATGCGCCAGTTGCACGAAAGGATGTACAATACCACGTCCTCAA
GGTTGCACACATCCATGTCAGCGCGCTTGTCATACCCCACCATGTCATCCGTGTAACGCTATCATCAAAACAAAATGTCACTGTGGACTGAACCAGGTTGTCTTTTTATGCTCCGAGTATAATAGTAAAGATGTAGAATCAGAAGATCTCGCCATAATCCAACAAAGCAAATTAAGTTGTGGTGGACGTTGTGTAAAAAATtatCCATGTGACCACCGTTGCACAATGATATGTCATGCTGGAGAATGTCTTAATTCTGATATGTGTCGTCGGAAGGTTCGCATTTTTTGTGCTTGCAAAcgtataaaaatagaaatttcctGTGATAAATATCGTACCGGTTTACAAACACTTCCATGCGACGAATATTGTGAGCAAACACGTGCTATCACCGAAGAACTTAAAAAACAAGAGtccgaaaaacaaaaacttttagaagaagaaaaaaatcgtaTTGAACTTGAACAataccaaaaattatttgtcaaaAAGAAATACAAAGAACGTAAAGTTGCTACAGAAAAgactaagaaaaatttaaattggaaaTTAATTTCGATTTATATTGGAATCGCATTTGCTATGCTAATAGCTGTTGGTGTGGCGCTTTATGaaggaaattaa
- Protein Sequence
- MPIENTSFSKKSDNNEKSKVKLKNNTTASSTTAQKFEEIKNRNTENVKKLVEGYESSSDEEELDEGNILDKLYKNYKDNASGQQQQLLSKTSSFLENVLQSGAATCLICIASVKRADEIWSCRHCFCFFHLNCIKRWANDSIAQVKSNQQHSDQGYYNNLGTYISPKKPKSPKWCCPQCRTDYEPNQRPLNYECFCGKELNPAQQAWLVPHSCGEICGKNLLPECGHKCLLLCHPGPCPPCPQNVYTSCKCGKSKAKSVRCFTKTWMCEEKCTKFLSCQKHRCGQICHEQEKCPPCNRTSKQKCICKSEESIRKCSELLWQCKNICNRKYACGIHFCKRICHSDPCGECPLGFQRSCPCGKTKKIGPCDETIDSCGDTCQKKLPCGLHFCTQRCHKGDCSLCLVITKKMCRCGKHEKELPCWKSFTCETKCKQIRECGKHTCNKKCCDSLCPPCDKVCGKSLSCKKHKCKSICHDGPCYPCDLTSQINCRCGSTTKKVPCGSERTARVTCNEPCRIPSRCHHTNRHRCHKNECPPCNQKCGLVNDFSGCGHLCEAKCHAAVKTFLNATAQKKTVWDYNSKNFEFKKLQHPQCEVKVMVICIGGHEKAEWPCWNSKPTSCQRPCNRFLKCGWHKCEKVCHSVLDKSNMQEQDGCASCTKGCTIPRPQGCTHPCQRACHTPPCHPCNAIIKTKCHCGLNQVVFLCSEYNSKDVESEDLAIIQQSKLSCGGRCVKNYPCDHRCTMICHAGECLNSDMCRRKVRIFCACKRIKIEISCDKYRTGLQTLPCDEYCEQTRAITEELKKQESEKQKLLEEEKNRIELEQYQKLFVKKKYKERKVATEKTKKNLNWKLISIYIGIAFAMLIAVGVALYEGN
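The Protein Sequence should be the translation of the Coding Sequence, and that relationship can be checked directly. The following is a minimal sketch that assumes the standard genetic code (the record does not state which translation table was used) and treats the mixed upper and lower case of the coding sequence as equivalent. The `translate` function is a hypothetical helper. It removes whitespace first, so a coding sequence that has been wrapped across lines is handled as well.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
# ASSUMPTION: the record's protein was derived with this table.
_BASES = "TCAG"
_AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence in frame 1.

    Whitespace is removed and case is ignored. Codons that are not in
    the table (for example, ones containing N) become 'X'. With
    to_stop=True, translation ends before the first stop codon.
    """
    sequence = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(sequence) - len(sequence) % 3, 3):
        amino_acid = CODON_TABLE.get(sequence[i:i + 3], "X")
        if amino_acid == "*" and to_stop:
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 18 bases and last 12 bases of the Coding Sequence field.
    print(translate("atgccgaTTGAGAATACT"))
    print(translate("gaaggaaattaa"))
```

Applied to the full Coding Sequence, the result can be compared with the Protein Sequence field using a plain string equality check.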
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- iTF_01486830
- 90% Identity
- -
- 80% Identity
- -