Ffus023842.1
Basic Information
- Insect
- Frankliniella fusca
- Gene Symbol
- -
- Assembly
- GCA_029379305.1
- Location
- JAHWGI010001390.1:79250-87768[-]
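The Location value above packs the contig, coordinates, and strand into one string. The following is a minimal sketch of how it could be split into fields. The names `Location`, `parse_location`, and `span_length` are hypothetical helpers introduced for illustration, and the span calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    contig: str
    start: int
    end: int
    strand: str


# Layout assumed from the record: <contig>:<start>-<end>[<strand>]
_LOCATION_RE = re.compile(
    r"^(?P<contig>[^:\s]+):(?P<start>\d+)-(?P<end>\d+)\[(?P<strand>[+-])\]$"
)


def parse_location(text: str) -> Location:
    """Split a location string into contig, start, end, and strand."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(
        match["contig"], int(match["start"]), int(match["end"]), match["strand"]
    )


def span_length(loc: Location) -> int:
    """Genomic span in bases, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("JAHWGI010001390.1:79250-87768[-]")
print(loc.contig, loc.start, loc.end, loc.strand, span_length(loc))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.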
Transcription Factor Domain
- TF Family
- zf-C2H2
- Domain
- zf-C2H2 domain
- PFAM
- PF00096
- TF Group
- Zinc-Coordinating Group
- Description
- The C2H2 zinc finger is the classical zinc finger domain. Two conserved cysteines and two conserved histidines coordinate a zinc ion. The zinc finger is described by the following pattern: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid and the numbers in brackets give the number of residues. The positions marked # are important for the stable fold of the zinc finger. The final position can be either His or Cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. In DNA-binding zinc fingers, the amino-terminal part of the helix binds the major groove. The accepted consensus binding sequence for Sp1 is usually defined as the asymmetric hexanucleotide core GGGCGG, but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter [1].
- Hmmscan Out
-
#   of  c-Evalue  i-Evalue  score  bias  hmm coord from  hmm coord to  ali coord from  ali coord to  env coord from  env coord to  acc
1   34  0.035     3         9.2    0.2   2               23            233             253           233             253           0.96
2   34  3.9e-05   0.0033    18.6   0.1   3               23            261             281           259             281           0.96
3   34  1.1e-06   9.2e-05   23.5   1.3   1               23            287             309           287             309           0.98
4   34  0.0001    0.0088    17.2   2.9   1               23            315             337           315             337           0.99
5   34  0.00071   0.061     14.6   2.9   1               23            343             365           343             365           0.98
6   34  0.0017    0.15      13.4   7.1   1               23            370             392           370             392           0.98
7   34  3.8e-06   0.00032   21.7   0.7   1               23            403             425           403             425           0.99
8   34  0.00033   0.028     15.6   0.1   1               23            431             453           431             453           0.99
9   34  5.6e-07   4.8e-05   24.3   1.9   1               23            459             481           459             481           0.97
10  34  3.4e-05   0.0029    18.7   6.3   1               23            487             509           487             509           0.98
11  34  4.6e-05   0.004     18.3   1.2   1               23            515             539           515             539           0.93
12  34  0.0036    0.31      12.3   0.6   1               21            603             623           603             624           0.95
13  34  0.19      16        6.9    1.1   3               23            697             717           696             717           0.97
14  34  0.00027   0.023     15.9   0.1   2               23            723             744           722             744           0.95
15  34  0.004     0.34      12.2   0.9   2               23            781             802           780             802           0.97
16  34  2.8e-05   0.0024    19.0   1.4   1               23            808             830           808             830           0.97
17  34  0.00017   0.015     16.5   0.1   1               23            836             858           836             858           0.97
18  34  1.2e-06   0.0001    23.3   2.1   1               23            864             886           864             886           0.98
19  34  2.4       2.1e+02   3.5    4.6   1               23            897             919           897             919           0.97
20  34  0.023     1.9       9.8    2.7   1               23            925             947           925             947           0.98
21  34  7.4e-07   6.3e-05   24.0   0.2   1               23            953             975           953             975           0.98
22  34  3.5e-06   0.0003    21.8   3.1   1               21            981             1001          981             1002          0.95
23  34  0.0033    0.29      12.5   0.3   2               22            1010            1030          1009            1033          0.89
24  34  0.0084    0.72      11.2   0.2   1               23            1085            1107          1085            1108          0.95
25  34  0.00014   0.012     16.8   1.3   1               23            1162            1184          1162            1184          0.97
26  34  3e-05     0.0025    18.9   3.2   1               23            1190            1212          1190            1212          0.99
27  34  0.00016   0.014     16.6   1.2   2               23            1219            1240          1218            1240          0.97
28  34  0.035     3         9.2    2.9   1               23            1246            1268          1246            1268          0.95
29  34  3.4e-05   0.0029    18.7   0.3   1               23            1274            1297          1274            1297          0.98
30  34  4e-05     0.0035    18.5   1.7   2               23            1308            1329          1307            1329          0.98
31  34  1.5e-06   0.00013   23.0   1.2   1               23            1335            1357          1335            1357          0.98
32  34  6e-07     5.2e-05   24.2   1.1   1               23            1363            1385          1363            1385          0.98
33  34  2e-06     0.00017   22.6   1.6   1               23            1391            1413          1391            1413          0.98
34  34  0.011     0.98      10.8   1.5   1               21            1419            1439          1419            1440          0.95
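The pattern quoted in the Description can be approximated with a regular expression. The sketch below is illustrative only and is not how the hits in the Hmmscan table were produced: those come from a profile HMM, which tolerates deviations a fixed pattern does not. The positions marked # are treated here as "any residue", because the Description does not say which residues are allowed there; that is an assumption. The function name `find_c2h2_candidates` is hypothetical.

```python
import re

# #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C]
# ASSUMPTION: '#' (fold-stabilising positions) is matched as any residue.
C2H2_PATTERN = re.compile(
    r"."        # '#'
    r"."        # X
    r"C"
    r".{1,5}"   # X(1-5)
    r"C"
    r".{3}"     # X3
    r"."        # '#'
    r".{5}"     # X5
    r"."        # '#'
    r".{2}"     # X2
    r"H"
    r".{3,6}"   # X(3-6)
    r"[HC]"     # final His or Cys
)


def find_c2h2_candidates(protein: str):
    """Return (start, end, match) tuples with 1-based inclusive coordinates.

    Matches are non-overlapping and found left to right, so this is a
    rough candidate scan rather than a domain caller.
    """
    return [
        (m.start() + 1, m.end(), m.group())
        for m in C2H2_PATTERN.finditer(protein.upper())
    ]


# A 23-residue stretch taken from the protein sequence of this record.
print(find_c2h2_candidates("YRCDLCDYSCVQKSNLVIHKKRH"))
# [(1, 23, 'YRCDLCDYSCVQKSNLVIHKKRH')]
```

Running the function on the full protein sequence gives candidate positions that can be compared with the "ali coord" columns above. Exact agreement should not be expected.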
Sequence Information
- Coding Sequence
- ATGGATCCCTCAAACATAATGGAAGTTTGTCGACTGTGCCTCGATAAAGAACGGGTTACAGTTCCTATATTCGAAGGAGAGGGTGCTGATCGGGAGATATGTTTGAAAATTGGGTCCTGTCTTCCTGTGAAGgTGTCTAGCGATGATCCTCTCCCAAAGAAAATTTGTGATGAATGTACTCATAAAGTGGAGCTTGTCTATTCCTTTTGGAACACAACCACTAATGCAGAAAAACAGTTATTGGAATGGTTGGGCATGTGTGAACCCAACCATGACACCCCTATGGATTTGTTAAAGACTGAAATGATTAATACAGATCTAAGTCAAATGGACTTACCAGAGGATTCACAGGACATGCTCGATCAAAGTGGTGTTACAGCTGACGGTAGTGATGAGGAGGGTGGTAGGGAAGAAGACTCTGGTGGAGGGGACTATCGTTATGAAGATGGGACATCCAATGACCCCTCTTCAATGGATATTTCACATCCTGACTCATTACCTGAAGCTGGCCCATCAGGAATACAGCAACAATCCCAAACTCAAACTCAACCTCCTGCCCAGACTGTTCAAGCATCACAACCACAATCAGCACCAGCAGAGCAGCAAGAAACTCCACCTATAAATCCTATTAGATACTGTGACTCCAATGAGAATGGTGGGAAGAGAGAGCAAGTGCCTTCCAAGGCTCAAAAAGAAGTTTGTGATATATGTGGGAAAAAGTATGTAGCAAGTTTCATGATTGTGCATAAGCGAACTCATACAAGTGAGAAACCTCTTGATTGTCCTGTTTGTGGCACatcttttgtaaattacaaaGCTTTGCGTAGTCATATGGTGAGCCACTCAGATTATCGTCCTTTTTCATGCGAAATTTGTGATCGAAAATTCAAGCGAAATAATGAACTAAACATGCATATGAAAATTCACTCTGGTGAGAAGGCTTACCGATGTGATTTATGTGATTATTCTTGTGTACAGAAATCTAATCTAGTTATCCATAAGAAACGCCATAGTAATGAATATAGGTTCATGTGTGACATTTGTGACAAGGGTTGCTACACTACCCAAGAACTTCAAAAACATAAGATGACTCACATGGATATTGCTTTTAAATGCAAGATTTGCAACAAAGAGTTCCACCATAAATATGGCTTAACAATGCATGAAAAAAAGCATGTTCCCAATTATGTACCTCCTGTTGGACAATATAAATGCGAACTCTGTGGGAAGAGCTATGTCAAGTTTGGTGGCTTGAAGGTTCATATGAGAAAACATACTGGTGTGAAACCTTATACTTGTGATGTATGTGGTAAATCTGTTTCAAGTAGGGCTATTCTGGAAGTCCATAAGCGTACTCATACTGGAGAGAAACCCCACATATGTGAAGTTTGTGGGAAAATGTTTACTTCGAAGAGGTATCTTGAGACCCACATTAGAACTCATACTGGAGAGAAACCTTTTAAGTGCAATGAATGTGGAAAATGTTTCACTCAGCACTCAACTCTTCTTGTACATAACAGATACCATACTGGTGATCGTCCTTATGAATGTCCTATATGTGGTAAAGGTTTTGTGTTACGCTGTCAACTGAAATCCCATCAGAAAATGGTGAAGCACCATTATTCAGAGTCACACATGGAAGTTGAGGAAATTCTTGAAGAGCAGGAAGATCTTAATGAAGTTTCCGAGGCTTCATTGGGAgatgctgatgctgctgatggAGATGTGATAATAGCTTCAACTAATGACACTTCAAGTATTGTTATTTCAAGTTACGAGGATATAGAAATAGATGATAAGATCATGTTTTCTTGTTCTACTTGTAAAACAATGTTTGCAGACAAGAACCTTTTGCTTGATCATCAGAAAAGTTCATGTAGGCTTAATGATATCCTAAAAGCTGAACCAACTGATTCTGATAATGTTTTCCAGACTGAGCTTAAGCTGCTTGCATTTCTTTCCAGACTTACAGCTGGCTCTGGAGCTTCTGGCGAAAGA
GTTATGGGTGGAATCTGTCAAATCCATATCACAACTGGTGAAATGGAAGATCATCTTCAGGAGTTGCATTCAGAACCAAATAATAAGTTTTGCAACATttgtaaaattaatttaaaaagcCAAATTGCTCTTGTCCATCACAAAAATATACATAGATCAGACAATTTGAATTGTGAACTGTGTGGAGTCCGCTTTGCATACCCATATCAGTTGCAAAGACATGCCAAGGAACATAGGACAACACAAGCTGAGTTGCGTAAGCTTTGTGATGAAAGTTTAACATTAAAAGATAGTGTGATGCAGCAATGTGTAGCTGAAAAAAATTTTGAAGGCAGGCAGTTAAAATGTGAGGTATGCCAAAAATCTTTCAAGATTTATTTAGATTTTAAGGATCATTTAAAGATCCATCAGAACACACTAAAGCATAATTGTGAAGTTTGTGGACTGGTTTTCCGGAAGAAAAGTTCTCTGGTGTCCCATCTCAATAGACATAAGAAAACTTTCAAGTTTGTTTGTGACATTTGCGGGAAAGGGTTTTATGTTGCATCTGAATTTTCTGCTCATAAATTGATTCATTCTGGCATTCGCTCTCATAAATGTATGACTTGTGGAAAGTCATACAGCCGCAAAAGTGCTTTAGTTGCTCACATGAAATCTCATTTACCAAACTATAAAGCCAGGCAGCCTACTCATCAATGTGAATCATGTGGAAAACTTTGTCATAGTACCACTGCTCTTTTAGAGCACTCACGAGTTCATACTGGAATAAGCAATTACAATTGCTCAGTTTGTCAAAAATCTCTGAGCAACAAAGTTTCTCTTGACTGTCATTTAAGAACACACAGTGGTGAGAAACCATTTATTTGCAAAGTATGTGGTAAAGCATTTATATCCAGGAAAAATCTAACTATCCATGTTAGAGTTCACACAGGTGAAAAGCCTTATCGCTGCTCACATTGTGGGAAATCTTTCTCTCAGCGAAGTTCAATGACTAAACATGTAAAGAAATCATTTTGCAGTACTGAACTGTCCTGCTCAATATGCAACATTACTTTTGAATCTACGGGTGAACTTGAAAACCATCAGACAAAGGAAAAGCATTTTCCCCTGTCAAACGCAAAGATATCCTCTGTTCTGTTTAGTCAAACTGAAGGTTTGTCTTCTGTGTCAATCCTCACTGAGAAGGCCAACCGCATCGCCGCCATGCTCAAGGCGGCCGCCGCGTCGGGCTCGCTGAGCCGCGCCACCCGGACATTCCCCTGCCCCGACTGCGACCTGATCCTGCCCAGCCGCCAGCGGCTCAACCAGCACTGGAAGGTGCACCACGAGACGCCGCAGCTGAACCCGGTCTCGCCGCGCCCCAAGCCCCAGCCCATCGACGAGGAGGAGCTGGAgggcgagcgcgccgccgccgtcgcggccgccaTCAAGGTGGAGGTGCTCTCCGGCGACGAGGACGAGATCGTCAACGTGCCGCAGGCCGTGTTCGAGTGCGCGCAGTGCCACAAGGAGTTCAGCAGCGAGAAGTCGCTCAAGGTGCACCAGGTGGTGCACTCGGAGGAGCGGCCCTACACGTGCAACGTGTGCTTCAAGTCGTACAAGCGCCCGTACGAGATGAAGATGCACCAGCGCAAGCACACGGAGCAGAAGAAGCTGCAGTGCAACGAGTGCGACTACACGACCATCTACAAGTCGGCCCTCAACACGCACAAGCGGCGCCACAGTCAGGAGTTCAAGTACCACTGCGAGATTTGCGGGAAGGGCTTCTTTGTCCACACGTGGCTGCTGGAGCACAAGAACTTCCACACGGGCGAGAAGCCCTTTCAGTGCGACGTGTGCGGCAAGGCCTTCCCCTACACGCGCTACCTGGCCGCCCACAAGAAAACTATCCACCCGGACAGCAACCAGCCCCCAAAGATCAACCAGTGCGCCACGTGCGGGAAAGTGTTCGCGCACCGAAACTCGCTCTCCCTCCACATGAAGTCCCACACAGGCGAGAA
CGTGTACGTCTGTGATATCTGCGGCAAGACGCTGACCAACAAGGAGCACCTGAACCTGCACCGGCGCATCCACACGGGCGAGAAGCCCCACAAGTGCTCGTACTGCGGCAAGGCCTTTAGCAAACCCGGCAACCTGACGCTCCACGAGCGTGTGCACACAGGGGAGCGCCCCTACTCGTGTGACATTTGTAACAAGTCGTTCTCGCAGCGTTCTACGCTCGTGATCCACAAGCGATATCACACCGGGCAGCGGCCGTATGTGTGCCCGCACTGCCAGAAGGGCTTTGTCTGTAAAGCTCTGCTCAATTCTCACCTCAAGAACAGTTGTTATACGACGGGTTAA
- Protein Sequence
- MDPSNIMEVCRLCLDKERVTVPIFEGEGADREICLKIGSCLPVKVSSDDPLPKKICDECTHKVELVYSFWNTTTNAEKQLLEWLGMCEPNHDTPMDLLKTEMINTDLSQMDLPEDSQDMLDQSGVTADGSDEEGGREEDSGGGDYRYEDGTSNDPSSMDISHPDSLPEAGPSGIQQQSQTQTQPPAQTVQASQPQSAPAEQQETPPINPIRYCDSNENGGKREQVPSKAQKEVCDICGKKYVASFMIVHKRTHTSEKPLDCPVCGTSFVNYKALRSHMVSHSDYRPFSCEICDRKFKRNNELNMHMKIHSGEKAYRCDLCDYSCVQKSNLVIHKKRHSNEYRFMCDICDKGCYTTQELQKHKMTHMDIAFKCKICNKEFHHKYGLTMHEKKHVPNYVPPVGQYKCELCGKSYVKFGGLKVHMRKHTGVKPYTCDVCGKSVSSRAILEVHKRTHTGEKPHICEVCGKMFTSKRYLETHIRTHTGEKPFKCNECGKCFTQHSTLLVHNRYHTGDRPYECPICGKGFVLRCQLKSHQKMVKHHYSESHMEVEEILEEQEDLNEVSEASLGDADAADGDVIIASTNDTSSIVISSYEDIEIDDKIMFSCSTCKTMFADKNLLLDHQKSSCRLNDILKAEPTDSDNVFQTELKLLAFLSRLTAGSGASGERVMGGICQIHITTGEMEDHLQELHSEPNNKFCNICKINLKSQIALVHHKNIHRSDNLNCELCGVRFAYPYQLQRHAKEHRTTQAELRKLCDESLTLKDSVMQQCVAEKNFEGRQLKCEVCQKSFKIYLDFKDHLKIHQNTLKHNCEVCGLVFRKKSSLVSHLNRHKKTFKFVCDICGKGFYVASEFSAHKLIHSGIRSHKCMTCGKSYSRKSALVAHMKSHLPNYKARQPTHQCESCGKLCHSTTALLEHSRVHTGISNYNCSVCQKSLSNKVSLDCHLRTHSGEKPFICKVCGKAFISRKNLTIHVRVHTGEKPYRCSHCGKSFSQRSSMTKHVKKSFCSTELSCSICNITFESTGELENHQTKEKHFPLSNAKISSVLFSQTEGLSSVSILTEKANRIAAMLKAAAASGSLSRATRTFPCPDCDLILPSRQRLNQHWKVHHETPQLNPVSPRPKPQPIDEEELEGERAAAVAAAIKVEVLSGDEDEIVNVPQAVFECAQCHKEFSSEKSLKVHQVVHSEERPYTCNVCFKSYKRPYEMKMHQRKHTEQKKLQCNECDYTTIYKSALNTHKRRHSQEFKYHCEICGKGFFVHTWLLEHKNFHTGEKPFQCDVCGKAFPYTRYLAAHKKTIHPDSNQPPKINQCATCGKVFAHRNSLSLHMKSHTGENVYVCDICGKTLTNKEHLNLHRRIHTGEKPHKCSYCGKAFSKPGNLTLHERVHTGERPYSCDICNKSFSQRSTLVIHKRYHTGQRPYVCPHCQKGFVCKALLNSHLKNSCYTTG*
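The coding and protein sequences above can be checked against each other by translating the former. The sketch below assumes the standard genetic code (NCBI translation table 1), which the record does not state explicitly. `translate` is a hypothetical helper name. Whitespace is stripped first so that a coding sequence wrapped over several lines can be pasted in directly, and lowercase bases are upper-cased.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (ASSUMPTION: NCBI translation table 1 applies to this gene).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence; '*' marks a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks, normalise case
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )


# First nine codons and last three codons of the coding sequence above.
print(translate("ATGGATCCCTCAAACATAATGGAAGTT"))  # MDPSNIMEV
print(translate("ACGGGTTAA"))                    # TG*
```

Both outputs match the corresponding ends of the protein sequence in this record, including the terminal `*` for the stop codon.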
Similar Transcription Factors
Sequence clustering based on sequence similarity using MMseqs2
- 100% Identity
- -
- 90% Identity
- -
- 80% Identity
- -