Eara017488.1
Basic Information
- Insect
- Ecitomorpha arachnoides
- Gene Symbol
- nfx1
- Assembly
- GCA_027574945.2
- Location
- JAODGD020031597.1:2470-5065[-]
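The Location field packs the contig accession, the coordinate range and the strand into one string. A minimal sketch of splitting it into parts follows. The function name `parse_location` and the `span` key are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

# Assumed layout: <contig>:<start>-<end>[<strand>], as in the Location field above.
LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text):
    """Split a location string into contig, start, end and strand."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    return {
        "contig": match["contig"],
        "start": start,
        "end": end,
        "strand": match["strand"],
        # Assumption: coordinates are 1-based and inclusive.
        "span": end - start + 1,
    }


loc = parse_location("JAODGD020031597.1:2470-5065[-]")
print(loc)
```

The `[-]` suffix marks the minus strand, so a sequence extracted from the contig at these coordinates would need to be reverse-complemented before comparison with the coding sequence.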
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc-binding domain. The zinc finger is described by the following pattern: C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C, where X can be any amino acid and the numbers in brackets indicate the number of residues. Two positions can be either His or Cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
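| # | of | c-Evalue | i-Evalue | score | bias | hmm coord from | hmm coord to | ali coord from | ali coord to | env coord from | env coord to | acc |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 16 | 3 | 1.3e+04 | -3.2 | 1.6 | 15 | 19 | 103 | 107 | 102 | 107 | 0.82 |
| 2 | 16 | 0.00018 | 0.75 | 10.4 | 12.7 | 4 | 19 | 150 | 165 | 141 | 165 | 0.89 |
| 3 | 16 | 3 | 1.3e+04 | -3.2 | 1.0 | 5 | 10 | 192 | 197 | 192 | 197 | 0.92 |
| 4 | 16 | 4.5e-08 | 0.00019 | 21.9 | 10.8 | 1 | 19 | 203 | 221 | 203 | 221 | 0.98 |
| 5 | 16 | 5.6 | 2.4e+04 | -4.0 | 1.0 | 6 | 10 | 245 | 249 | 245 | 249 | 0.95 |
| 6 | 16 | 1.2e-07 | 0.00052 | 20.5 | 17.9 | 1 | 19 | 255 | 273 | 255 | 273 | 0.99 |
| 7 | 16 | 0.17 | 7e+02 | 0.9 | 0.6 | 5 | 10 | 302 | 307 | 302 | 308 | 0.96 |
| 8 | 16 | 6.5e-07 | 0.0028 | 18.1 | 8.0 | 4 | 18 | 321 | 335 | 320 | 336 | 0.95 |
| 9 | 16 | 2.5 | 1.1e+04 | -2.9 | 1.4 | 5 | 10 | 361 | 366 | 361 | 366 | 0.91 |
| 10 | 16 | 0.00059 | 2.5 | 8.7 | 4.8 | 1 | 12 | 372 | 383 | 372 | 384 | 0.95 |
| 11 | 16 | 5.6 | 2.4e+04 | -4.0 | 1.1 | 4 | 10 | 393 | 399 | 392 | 399 | 0.83 |
| 12 | 16 | 5.1e-07 | 0.0021 | 18.5 | 14.3 | 1 | 18 | 405 | 421 | 405 | 422 | 0.99 |
| 13 | 16 | 6 | 2.5e+04 | -4.9 | 1.5 | 6 | 10 | 451 | 455 | 450 | 455 | 0.79 |
| 14 | 16 | 0.0013 | 5.5 | 7.6 | 4.4 | 10 | 19 | 470 | 479 | 467 | 479 | 0.91 |
| 15 | 16 | 0.14 | 6e+02 | 1.1 | 12.8 | 1 | 15 | 515 | 528 | 515 | 539 | 0.88 |
| 16 | 16 | 6 | 2.5e+04 | -5.6 | 15.5 | 1 | 18 | 546 | 568 | 546 | 569 | 0.78 |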
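The zinc finger pattern in the Description can be written as a regular expression for a quick scan of a protein sequence. The sketch below is illustrative only: the hits in this record come from a profile HMM search, not from a regex, so the two will not agree everywhere. It assumes that `C(H/C)` means a Cys followed by either His or Cys, and the name `find_zinc_fingers` is hypothetical. The test fragment is taken from the protein sequence of this entry.

```python
import re

# C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C
# Assumption: "C(H/C)" is read as Cys followed by His-or-Cys.
ZF_NFX1_RE = re.compile(r"C.{1,6}H.C.{3}C[HC].{3,4}[HC].{1,10}C")


def find_zinc_fingers(protein):
    """Return (start, end, match) tuples with 1-based inclusive coordinates.

    Uses non-overlapping matching, so closely packed repeats may be missed.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in ZF_NFX1_RE.finditer(protein.upper())
    ]


# Fragment copied from the protein sequence of this entry.
fragment = "CGVHRCASQCHAGACQSCS"
hits = find_zinc_fingers(fragment)
print(hits)
```

A regex treats every position as all-or-nothing, whereas the HMM scores partial matches, which is why a profile search reports many weak domain hits alongside the strong ones.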
Sequence Information
- Coding Sequence
- ATGCTTCTCATTAGTCAAGAGTCCTTAAACGTAAAAGCACAAAGGAAACATCACCTGGAAAACTACCCGACGGCATTAGCCTTCCCGTCCACTGAAGTGGTAGACAATCTTCATCTCCTCACAGAACGCGCGCGTGATCTGGCGAAGCAGCTCATCAACAGTGAGTATGAATGCTCTAGCTGCCTGGAGGTGATCCATCTGGACTCACCTACGTGGTCGTGCAGGTGCTgctataatatatttcaTCGTAGCTGTATAAGTCGCTGGAGTCGTGAGTGCTGCACATCCGAGGGCGGGGCCTTTGCGTGCCCGCAGTGCCGCGCCGAACAGAAGGGTGCAATCGAGTATGTCTGTTTCTGTGGCAAAGTCTCAATGCCTCAATTTTCGCCTCTCATTATCCCTCACTCATGCGGTGGTTCGTGTGGGCGCTCGAGGAGCGACTGCCCACACAAGTGTTCTAATCAATGCCACCCAGGGCCATGCATCGCATGCTCTGCCATCGCGGGCCCCGTGCACTGTCCGTGTGGAGCGTCCACCTACACGTACCCATGCGGTACTCCAGATCCGATGACTACGTGTGAAAATGCCTGCAGCAAGCTCCTCAACTGCGGCGTGCACAGGTGTGCGTCCCAGTGCCACGCTGGGGCTTGCCAGTCCTGCAGTGTCAGCGTTCTGGCCACTTGTGGGTGCGGCAGAGTAGTTGAAGAAGCGGTTTGTGGAAGCACGCTGCTGTGCGATGAGGTGTGTGGAAAGCGGTTACGCTGCGGCAGCCACAGCTGCCCTTTGCCGTGCCACACTGGTGAATGCCCACCGTGCCCGACAGACCCAGGCAGTATCAACACATGCCCCTGTGGGCGCAAGCCTTTGACCACGCAGCGAACCAGCTGCATGGATCCCATACCCACATGCCAGCAACTGTGTGGCAAGCGATTGCACCCACCGCTACCGGGCGGTGAGGTGCACACGTGCAGATCCACGTGCCACAGCGGGGCGTGCCCTGACTGCGAGGCCACGGTTACTGTGATTTGCCCATGTGGCTACACAAAACGGAAACTGCAGTGTCGTGATCGTGACTCTGTGAAGTGCACCCGTCCGTGCGGGACCAAGCTCTCCTGTGGGCGTCACTTCTGCAAGGAAGTTTGCTGTCCGGCGCGTGGGCGTGTGGAGGGCCCCGAGCATCAATGCTTCGTTACGTGCGGTCGCGCGCTCAAGTGCGGCCATGCGTGCACTGAGCCATGCCATCTAGGGTCGTGCCCGCCGTGTATTCACTTTACCACGAGCCCGCTCTACTGCCGGTGCGGAGCCACCATGCTGATGCCACCGCAACCGTGCGGGACTCAGCCGCCCAGCTGCCAGCGTCAGTGCGAAGTGGAGCGACCTTGCGGTCACCGGCAGTCTGTGCACTCGTGTCACATGGGTGAGTGTCCTGTGTGCACGTTTCTGGTGACACGCCTGTGTGTCGGCGGACACGAGATGCTCAAAAATGTCCCCTGCTGGAACAGCAACGTGGTGTGTGCCCGCAAGTGCATGAAGCCACTCCCATGTGGACATGAATGCGGTCGCTCGTGCCACAGAGATGATTGTGTTGGACCTGCGAACTCGTGCTCTCAACCTTGCGTTGTTGTCCACGACGACTGTGGCCACGTGTGTGGTAGAAACTGCCACGCCACTCCCGATGGAATCCCTACACCGTGCCCTCCGTGTGGTGCAAAAGTTGAGGTTTCCTGCCAATGCGGCAAGTCAACGACGACCATGCAGTGTCTCAAGCATACCCAGAAACTACGCGAGCACAACATCGCGCATCCTGGAACCCCATTTGAAGTCCCATGCAAGCCTTCCTGCACCCCGCGCTTCGGCATTTTGGCAGAGAGGGTGGTCAGCGCGACCAAGTCGCACATATACTTTTACTCCAAGTACCTATGGAAAACTGCGACCGAGTCGATTGAGGTGGTGGTGGACCTGGAGAATAAGCTAGATGCATTCCTGAAAGCCAGTGAT
ACAATGTTGGCCATGCCACCTTGCCGAGGAGAGCGGCGGCATATCATTCATGCTCTCTGTATCCACTACGACATCCGAACGGAAAGCGTGGATGCTGAACCAAAGCGCTCGTGCATTCTGACTAAAACCGCCAACAGTAAAGCGCCACTTGTGCTGCTCAGTGAGTCCGTGAAGGTACGAAAGAACGACCCGCATGCATACTACACGTCTATCATGTTGAGTAACTCTGCCACAAGAGCGCGCTTGGTGATTCATCTGCTCGGAGACGAGATTAGCGAAGCCTACATATCGAACTTCCTCAAGGAGGTTACAGGGTCATACATTTTTGGTGACCTTGAGAACGTCGGCGATAACTGGAAGGCATCCCTTGTGTTTCACTCCCCAAATGCTCAATCACAAGCAATTAAACTCATGAGGCAGCGCCAAACGTACTTCAAGTTTTGGGTGGAGACAAAGTGA
- Protein Sequence
- MLLISQESLNVKAQRKHHLENYPTALAFPSTEVVDNLHLLTERARDLAKQLINSEYECSSCLEVIHLDSPTWSCRCCYNIFHRSCISRWSRECCTSEGGAFACPQCRAEQKGAIEYVCFCGKVSMPQFSPLIIPHSCGGSCGRSRSDCPHKCSNQCHPGPCIACSAIAGPVHCPCGASTYTYPCGTPDPMTTCENACSKLLNCGVHRCASQCHAGACQSCSVSVLATCGCGRVVEEAVCGSTLLCDEVCGKRLRCGSHSCPLPCHTGECPPCPTDPGSINTCPCGRKPLTTQRTSCMDPIPTCQQLCGKRLHPPLPGGEVHTCRSTCHSGACPDCEATVTVICPCGYTKRKLQCRDRDSVKCTRPCGTKLSCGRHFCKEVCCPARGRVEGPEHQCFVTCGRALKCGHACTEPCHLGSCPPCIHFTTSPLYCRCGATMLMPPQPCGTQPPSCQRQCEVERPCGHRQSVHSCHMGECPVCTFLVTRLCVGGHEMLKNVPCWNSNVVCARKCMKPLPCGHECGRSCHRDDCVGPANSCSQPCVVVHDDCGHVCGRNCHATPDGIPTPCPPCGAKVEVSCQCGKSTTTMQCLKHTQKLREHNIAHPGTPFEVPCKPSCTPRFGILAERVVSATKSHIYFYSKYLWKTATESIEVVVDLENKLDAFLKASDTMLAMPPCRGERRHIIHALCIHYDIRTESVDAEPKRSCILTKTANSKAPLVLLSESVKVRKNDPHAYYTSIMLSNSATRARLVIHLLGDEISEAYISNFLKEVTGSYIFGDLENVGDNWKASLVFHSPNAQSQAIKLMRQRQTYFKFWVETK
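The coding and protein sequences can be checked against each other by translating the former. The sketch below is dependency-free and assumes the standard genetic code (NCBI translation table 1), which this record does not state. The name `translate` is hypothetical. Whitespace is removed and the input is upper-cased first, so line breaks and lower-case stretches in the coding sequence do not affect the result.

```python
from itertools import product

# Standard genetic code (assumption: NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, ignore soft-masking
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons and last six codons of the coding sequence above.
print(translate("ATGCTTCTCATTAGTCAAGAGTCC"))
print(translate("TGGGTGGAGACAAAGTGA"))
```

Passing the full coding sequence to `translate` and comparing the result with the Protein Sequence field is a simple consistency check for the record.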
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -