Mang008576.1
Basic Information
- Insect
- Molanna angustata
- Gene Symbol
- nfxl1
- Assembly
- GCA_963576475.1
- Location
- OY754966.1:39140886-39161687[+]
Transcription Factor Domain
- TF Family
- zf-NF-X1
- Domain
- zf-NF-X1 domain
- PFAM
- PF01422
- TF Group
- Zinc-Coordinating Group
- Description
- This domain is presumed to be a zinc binding domain. The following pattern describes the zinc finger. C-X(1-6)-H-X-C-X3-C(H/C)-X(3-4)-(H/C)-X(1-10)-C Where X can be any amino acid, and numbers in brackets indicate the number of residues. Two position can be either his or cys. This family includes Swiss:P40798, Swiss:Q12986 and Swiss:P53971. The zinc fingers in Swiss:Q12986 bind to DNA [1].
- Hmmscan Out
-
# of c-Evalue i-Evalue score bias hmm coord from hmm coord to ali coord from ali coord to env coord from env coord to acc 1 12 1.1 3.5e+04 -3.3 0.9 15 18 176 179 175 181 0.50 2 12 1.1 3.4e+04 -3.3 1.5 15 19 222 226 221 226 0.81 3 12 0.27 8.2e+03 -1.3 0.2 4 10 257 263 257 263 0.93 4 12 6.4e-08 0.0019 19.8 16.7 1 19 271 288 271 288 0.99 5 12 2 6.1e+04 -4.3 1.6 5 10 313 318 313 318 0.90 6 12 4.2e-05 1.3 10.8 17.8 1 18 324 341 324 342 0.93 7 12 1 3.2e+04 -3.2 1.0 6 10 367 371 366 371 0.88 8 12 4.8e-07 0.015 17.0 8.2 1 18 377 394 377 395 0.97 9 12 0.00091 28 6.6 10.7 3 18 432 447 430 448 0.91 10 12 8.7e-07 0.027 16.2 17.1 1 18 483 500 483 501 0.98 11 12 2.4e-06 0.073 14.8 17.9 1 19 510 528 504 528 0.95 12 12 1.1 3.5e+04 -3.3 1.4 5 10 556 561 556 561 0.93
Sequence Information
- Coding Sequence
- ATGAATCGAGATGCGAGAGCAACTGCTTCGACGTCGGCTGATGCCAGGTTTAATCAGTCTAGTGCTGAAATGCAGGCAACTGCTTCGACGTCGACTGATGCCAGGTTTAATCAGTCTAGTGCTGAAATGCAGGCAACTGCTTCGACGTCGACTGATGCCAGGTTTAATCAGTCTAGTGCTGAAATGCAGGCAACTGCTTCGACGTCGACTGATGCCAGGTTTAATCAGACTAGTACTGAAATGCAGGCAACTTCTTCGACGTCGGCTGATGCCAAGTTTAAGCAGGCTAGTGCTAAAATGCAGGCAGCTGTTCAGCAGCATATGCCAGACTATGAATCGTCCTCAGACGAGGATGAGATCCAGAGTTCAAGTATAATTGaaTCTATATTTGTCAACTTTGGTAGAGATAGCGGTAAAGCATACCTAGGCAGAACGAGGGAATTCTTGGAGCAGCTGTTTCAGTCGGGGGGTATTACTTGTGCTGTATGTCTTGTGTCCATTAAACGAGCACAACCCATATGGAACTGCTCATCATGTCACTCTGCCTTTCATCTGCCTTGCATCCAAAGATGGAGCAAGCAGTCTATTAATATACAAGTGGCTAGTCCTGTTGTTGGAGATATACCAGTCATCGACAGGAGCTTTCGTTTGCCTCAGTGGTCGTGTCCTAAATGCAGGGCAACCTATAATCAGGACGAAATTCCCCGTTACTATGAGTGTTTCTGCAACAAGAAACGTGACCCTGAGTTTCATGCATGGGTTATTCCGCACTCTTGTGGTGATATTTGTGGGAAACTGCTACAGCCCAACTGTGGACACTCTTGTCTTCTGCTATGCCATCcagGTCCATGTCCACCTTGCCCAAAGATGGTTCAAGCCAAGTGTGGATGTGGAAAGAGCAATGCCGTGCCAAGAAGGTGCACCCAGTCTGAGTGGACGTGCAATGAAGAGTGTCGCCGCTTGTTGGACTGTGATCAACACCACTGCAAGAAAGGCTGTCACTCTGGATCTTGTCCACCGTGTGACAAGACCAAGAGCCAAGCGTGCATTTGCCAAGCACAGACCATGCTAGTGCCTTGCAATCAGACGCAGTGGCAATGCGATAAGGTGTGCAATAAAGAGTTGTCTTGTGGCTACCACAGATGTGTGCGTGTTTGTCACATTGATGAGTGCGGTCCTTGTCTCGGTGGTGGAGAACAATCCTGTCCATGTGGATCTACGAGAGCCATACTTGAATGCCCAGAGGAGATACCCCCTTGTGCCAATACATGTAACAAACTGCTACCGTGTTTCATGCACCGTTGCCCACATAAGTGTCATAAGAACGCTTGCCCActgTGTTTGCAACTGGTCGAGAAGAACTGTCGCTGCGGCGCACACATCCGATCCAGTGCATGTGGGCGTCCCGTGCCCTTGTGCGACAGCAAGTGCCGTGCCATGCGTGTGTGCGGACAGCACCCATGCAACAGGAGGTGCTGCGATGGCGCATGCCCACCGTGCGATCGCTTCTGCGGTCGCATGCTGAGCTGTGCGCGCCACCGCTGTACAGCACCGTGCCACTGCGGCCCTTGCTATCCATGCCCACTCACGTCTGATGTGCACTGCCGCTGTCAGGCCACCAAGATCACGGTGATGTGCGGCATGCAGAGAAAGTACCGGCCTCCTAAATGCACAAAAATATGCAgCCAGCGAGGTCCTGCTGATCACTCCTTGTTCAGCATTTATCTTCGGATAGGGGGACTAAATGCCCGCTGGGTCGGGAGCCGGTGGCACTCTTGTGGTGGTGCGGGTCGAAGCGAAAGACAGAGGAAGACCACCCATCCTACCCATTTCGTGAGGACGTCGGGAGACGGGGAGGCTGGGCGTGGAACGGCCAACTGCGCCAGGGTCTTTCTTGCTCTCGATATTTCTCATGTCCCTAAATTAACACGCTTTCAGTACACTGTGGGAAGTCCCAATTACACACACTCACAATACACGTACAGCACGCTCTAG
- Protein Sequence
- MNRDARATASTSADARFNQSSAEMQATASTSTDARFNQSSAEMQATASTSTDARFNQSSAEMQATASTSTDARFNQTSTEMQATSSTSADAKFKQASAKMQAAVQQHMPDYESSSDEDEIQSSSIIESIFVNFGRDSGKAYLGRTREFLEQLFQSGGITCAVCLVSIKRAQPIWNCSSCHSAFHLPCIQRWSKQSINIQVASPVVGDIPVIDRSFRLPQWSCPKCRATYNQDEIPRYYECFCNKKRDPEFHAWVIPHSCGDICGKLLQPNCGHSCLLLCHPGPCPPCPKMVQAKCGCGKSNAVPRRCTQSEWTCNEECRRLLDCDQHHCKKGCHSGSCPPCDKTKSQACICQAQTMLVPCNQTQWQCDKVCNKELSCGYHRCVRVCHIDECGPCLGGGEQSCPCGSTRAILECPEEIPPCANTCNKLLPCFMHRCPHKCHKNACPLCLQLVEKNCRCGAHIRSSACGRPVPLCDSKCRAMRVCGQHPCNRRCCDGACPPCDRFCGRMLSCARHRCTAPCHCGPCYPCPLTSDVHCRCQATKITVMCGMQRKYRPPKCTKICSQRGPADHSLFSIYLRIGGLNARWVGSRWHSCGGAGRSERQRKTTHPTHFVRTSGDGEAGRGTANCARVFLALDISHVPKLTRFQYTVGSPNYTHSQYTYSTL
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -