NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1924573064|ref|NP_001375317|]
View 

sterol regulatory element-binding protein 1 isoform 7 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
bHLH_SF super family cl00081
basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators ...
319-378 1.36e-33

basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. Members of the bHLH superfamily have two highly conserved and functionally distinct regions. The basic part is at the amino end of the bHLH that may bind DNA to a consensus hexanucleotide sequence known as the E box (CANNTG). Different families of bHLH proteins recognize different E-box consensus sequences. At the carboxyl-terminal end of the region is the HLH region that interacts with other proteins to form homo- and heterodimers. bHLH proteins function as a diverse set of regulatory factors because they recognize different DNA sequences and dimerize with different proteins. The bHLH proteins can be divided to cell-type specific and widely expressed proteins. The cell-type specific members of bHLH superfamily are involved in cell-fate determination and act in neurogenesis, cardiogenesis, myogenesis, and hematopoiesis.


The actual alignment was detected with superfamily member cd18921:

Pssm-ID: 469605  Cd Length: 75  Bit Score: 123.85  E-value: 1.36e-33
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  319 QSRGEKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQENLSLRTAVHK 378
Cdd:cd18921      1 PSKGEKRTAHNAIEKRYRSSINDKIIELKDLVvgteaklnksavlrkAIDYIRFLQQSNQKLKQENMALKMAVQK 75
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
87-210 3.11e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 3.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPlsPPQPAPTPLKMYPSMPafsPGPGIKeesvPLSILQT---PTPQPLPGALLPQSFPAPAPP-------Q 156
Cdd:pfam03154  366 LPNPQSHKHP--PHLSGPSPFQMNSNLP---PPPALK----PLSSLSThhpPSAHPPPLQLMPQSQQLPPPPaqppvltQ 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1924573064  157 FSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVP 210
Cdd:pfam03154  437 SQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMP 490
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
49-320 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   49 PPYAGSGAGGTdPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIkeeSVP 128
Cdd:pfam05109  476 PTPAGTTSGAS-PVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV---TTP 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  129 LSILQTPTP---QPLPGALLP---QSFPAPA---PPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLpglpLASPPGVPPV 199
Cdd:pfam05109  552 TPNATSPTPavtTPTPNATIPtlgKTSPTSAvttPTPNATSPTVGETSPQANTTNHTLGGTSSTPV----VTSPPKNATS 627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  200 SLHT---QVQSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVPVLLQPHFIKADSLL-LTAMKTDGATVKAAGLSPLVSGTT 275
Cdd:pfam05109  628 AVTTgqhNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITqVTPASTSTHHVSTSSPAPRPGTTS 707
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1924573064  276 VQTGPlptlvsGGTILATVPLVVDAEKLPINRLAAGSKAPASAQS 320
Cdd:pfam05109  708 QASGP------GNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKT 746
 
Name Accession Description Interval E-value
bHLHzip_SREBP1 cd18921
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding ...
319-378 1.36e-33

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding protein 1 (SREBP1) and similar proteins; SREBP1, also termed Class D basic helix-loop-helix protein 1 (bHLHd1), or sterol regulatory element-binding transcription factor 1 (SREBF1), is a member of a family of bHLHzip transcription factors that recognize sterol regulatory element 1 (SRE-1). It acts as a transcriptional activator required for lipid homeostasis. It may control transcription of the low-density lipoprotein receptor gene as well as the fatty acid. SREBP1 has dual sequence specificity binding to both an E-box motif (5'-ATCACGTGA-3') and to SRE-1 (5'-ATCACCCCAC-3').


Pssm-ID: 381491  Cd Length: 75  Bit Score: 123.85  E-value: 1.36e-33
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  319 QSRGEKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQENLSLRTAVHK 378
Cdd:cd18921      1 PSKGEKRTAHNAIEKRYRSSINDKIIELKDLVvgteaklnksavlrkAIDYIRFLQQSNQKLKQENMALKMAVQK 75
HLH pfam00010
Helix-loop-helix DNA-binding domain;
324-359 1.11e-08

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 52.08  E-value: 1.11e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1924573064  324 KRTAHNAIEKRYRSSINDKIIELKDLV-----------------AIDYIRFLQ 359
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLptlppdkklskaeilrlAIEYIKHLQ 53
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
87-210 3.11e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 3.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPlsPPQPAPTPLKMYPSMPafsPGPGIKeesvPLSILQT---PTPQPLPGALLPQSFPAPAPP-------Q 156
Cdd:pfam03154  366 LPNPQSHKHP--PHLSGPSPFQMNSNLP---PPPALK----PLSSLSThhpPSAHPPPLQLMPQSQQLPPPPaqppvltQ 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1924573064  157 FSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVP 210
Cdd:pfam03154  437 SQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMP 490
HLH smart00353
helix loop helix domain;
329-364 1.97e-06

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 45.67  E-value: 1.97e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1924573064   329 NAIEKRYRSSINDKIIELKDLV-----------------AIDYIRFLQHSNQK 364
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLptlpknkklskaeilrlAIEYIKSLQEELQK 53
PHA03378 PHA03378
EBNA-3B; Provisional
87-212 3.73e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.22  E-value: 3.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPLSPPQPAPTPLKmypsMPAFSPGPGIKEESVPlsiLQTPTPQPLPGALL-PQSFPAPAPPQfSSTPVLGY 165
Cdd:PHA03378   704 MRPPAAPPGRAQRPAAATGRAR----PPAAAPGRARPPAAAP---GRARPPAAAPGRARpPAAAPGRARPP-AAAPGAPT 775
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1924573064  166 PSPPggfsTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVPQQ 212
Cdd:PHA03378   776 PQPP----PQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQ 818
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
49-320 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   49 PPYAGSGAGGTdPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIkeeSVP 128
Cdd:pfam05109  476 PTPAGTTSGAS-PVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV---TTP 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  129 LSILQTPTP---QPLPGALLP---QSFPAPA---PPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLpglpLASPPGVPPV 199
Cdd:pfam05109  552 TPNATSPTPavtTPTPNATIPtlgKTSPTSAvttPTPNATSPTVGETSPQANTTNHTLGGTSSTPV----VTSPPKNATS 627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  200 SLHT---QVQSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVPVLLQPHFIKADSLL-LTAMKTDGATVKAAGLSPLVSGTT 275
Cdd:pfam05109  628 AVTTgqhNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITqVTPASTSTHHVSTSSPAPRPGTTS 707
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1924573064  276 VQTGPlptlvsGGTILATVPLVVDAEKLPINRLAAGSKAPASAQS 320
Cdd:pfam05109  708 QASGP------GNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKT 746
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
95-199 3.93e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 39.39  E-value: 3.93e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064    95 SPLSPPQPAPTPLKMYPSMPAFSPGPGiKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFST 174
Cdd:smart00818   43 SQQHPPTHTLQPHHHIPVLPAQQPVVP-QQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVH 121
                            90       100
                    ....*....|....*....|....*
gi 1924573064   175 GSPPGNTQQPLPGLPLASPPgvPPV 199
Cdd:smart00818  122 PIPPLPPQPPLPPMFPMQPL--PPL 144
 
Name Accession Description Interval E-value
bHLHzip_SREBP1 cd18921
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding ...
319-378 1.36e-33

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding protein 1 (SREBP1) and similar proteins; SREBP1, also termed Class D basic helix-loop-helix protein 1 (bHLHd1), or sterol regulatory element-binding transcription factor 1 (SREBF1), is a member of a family of bHLHzip transcription factors that recognize sterol regulatory element 1 (SRE-1). It acts as a transcriptional activator required for lipid homeostasis. It may control transcription of the low-density lipoprotein receptor gene as well as the fatty acid. SREBP1 has dual sequence specificity binding to both an E-box motif (5'-ATCACGTGA-3') and to SRE-1 (5'-ATCACCCCAC-3').


Pssm-ID: 381491  Cd Length: 75  Bit Score: 123.85  E-value: 1.36e-33
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  319 QSRGEKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQENLSLRTAVHK 378
Cdd:cd18921      1 PSKGEKRTAHNAIEKRYRSSINDKIIELKDLVvgteaklnksavlrkAIDYIRFLQQSNQKLKQENMALKMAVQK 75
bHLHzip_SREBP cd11394
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding ...
319-375 4.06e-29

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding protein (SREBP) family; The SREBP family includes SREBP1 and SREBP2, which are bHLHzip transcriptional activator of genes encoding proteins essential for cholesterol biosynthesis/uptake and fatty acid biosynthesis. SREBP1 and SREBP2 are principally found in the liver and in adipocytes and made up of an N-terminal transcription factor portion (composed of an activation domain, a bHLHzip domain, and a nuclear localization signal), a hydrophobic region containing two membrane spanning regions, and a C-terminal regulatory segment. They recognize a symmetric sterol regulatory element (TCACNCCAC) instead of E-box.


Pssm-ID: 381400 [Multi-domain]  Cd Length: 73  Bit Score: 110.83  E-value: 4.06e-29
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1924573064  319 QSRGEKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQENLSLRTA 375
Cdd:cd11394      1 PPKVEKRSAHNAIEKRYRSSINDRIIELKDLVvgpdakmnksavlrkAIDYIRYLQKVNQKLKQENMALKKA 72
bHLHzip_SREBP2 cd18922
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding ...
321-380 1.84e-26

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding protein 2 (SREBP2) and similar proteins; SREBP2, also termed Class D basic helix-loop-helix protein 2 (bHLHd2), or sterol regulatory element-binding transcription factor 2 (SREBF2), is a member of a family of bHLHzip transcription factors that recognize sterol regulatory element 1 (SRE-1). It acts as a transcription activator of cholesterol biosynthesis.


Pssm-ID: 381492 [Multi-domain]  Cd Length: 77  Bit Score: 103.50  E-value: 1.84e-26
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  321 RGEKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQENLSLRTAVHKSK 380
Cdd:cd18922      3 EGERRTTHNIIEKRYRSSINDKIIELKDLVmgtdakmhksgvlrkAIDYIKYLQQVNHKLRQENMALKLANQKNK 77
bHLHzip_scHMS1_like cd11399
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Saccharomyces cerevisiae transcription ...
323-376 9.18e-16

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Saccharomyces cerevisiae transcription factor HMS1 and similar proteins; HMS1, also termed high-copy MEP suppressor protein 1, is a putative bHLHzip transcription factor involved in exit from mitosis and pseudohyphal differentiation.


Pssm-ID: 381405 [Multi-domain]  Cd Length: 96  Bit Score: 73.66  E-value: 9.18e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  323 EKRTAHNAIEKRYRSSINDKIIELKDLV-------------------------------------AIDYIRFLQHSNQKL 365
Cdd:cd11399      2 SKKTAHNMIEKRYRSNINDRIAELRDSVpalreayksargededeedlggltpatklnkatilskATEYIRHLEKKNKRL 81
                           90
                   ....*....|.
gi 1924573064  366 KQENLSLRTAV 376
Cdd:cd11399     82 SRENASLRERL 92
bHLHzip_SREBP_like cd11395
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding ...
323-376 2.13e-14

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in sterol regulatory element-binding protein (SREBP) family and similar proteins; The SREBP family includes SREBP1 and SREBP2, which are bHLHzip transcriptional activator of genes encoding proteins essential for cholesterol biosynthesis/uptake and fatty acid biosynthesis. SREBP1 and SREBP2 are principally found in the liver and in adipocytes and made up of an N-terminal transcription factor portion (composed of an activation domain, a bHLHzip domain, and a nuclear localization signal), a hydrophobic region containing two membrane spanning regions, and a C-terminal regulatory segment. They recognize a symmetric sterol regulatory element (TCACNCCAC) instead of E-box. The family also includes Saccharomyces cerevisiae transcription factor HMS1 (also termed high-copy MEP suppressor protein 1) and serine-rich protein TYE7. HMS1 is a putative bHLHzip transcription factor involved in exit from mitosis and pseudohyphal differentiation. TYE7, also termed basic-helix-loop-helix protein SGC1, is a putative bHLHzip transcription activator required for Ty1-mediated glycolytic gene expression. TYE7 N-terminal is extremely rich in serine residues. It binds DNA on E-box motifs, 5'-CANNTG-3'. TYE7 is not essential for growth.


Pssm-ID: 381401 [Multi-domain]  Cd Length: 87  Bit Score: 69.67  E-value: 2.13e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  323 EKRTAHNAIEKRYRSSINDKIIELKDLV--------------------------------AIDYIRFLQHSNQKLKQENL 370
Cdd:cd11395      2 RKRLPHNAIEKRYRSNLNTKIERLRDAIpslrspegksddgglgglapttklskatiltkAIEYIRHLEQENERLEEENE 81

                   ....*.
gi 1924573064  371 SLRTAV 376
Cdd:cd11395     82 ELRQQV 87
HLH pfam00010
Helix-loop-helix DNA-binding domain;
324-359 1.11e-08

Helix-loop-helix DNA-binding domain;


Pssm-ID: 459628 [Multi-domain]  Cd Length: 53  Bit Score: 52.08  E-value: 1.11e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1924573064  324 KRTAHNAIEKRYRSSINDKIIELKDLV-----------------AIDYIRFLQ 359
Cdd:pfam00010    1 RREAHNERERRRRDRINDAFDELRELLptlppdkklskaeilrlAIEYIKHLQ 53
bHLHzip_MITF_like cd11397
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the microphthalmia-associated ...
324-367 2.33e-07

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in the microphthalmia-associated transcription factor family (MITF) family; The MITF (also known as microphthalmia-TFE, or MiT) family is a small family that contain a basic helix loop helix domain associated with a leucine zipper (bHLHZip). The MITF family comprises four genes in mammals (MITF, TFE3, TFEB, and TFEC); each gene has different functions. MITF is involved in neural crest melanocytes development as well as the pigmented retinal epithelium. TFEB is required for vascularization of the mouse placenta. TFE3 is involved in B cell function. TFEC regulates gene expression in macrophages. The MITF family can form homodimers or heterodimers with each other but not with other bHLH or bHLHzip proteins.


Pssm-ID: 381403  Cd Length: 69  Bit Score: 48.83  E-value: 2.33e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1924573064  324 KRTAHNAIEKRYRSSINDKIIELKDLV------------------AIDYIRFLQHSNQKLKQ 367
Cdd:cd11397      5 KKDNHNMIERRRRFNINDRIKELGTLLpksndpdmrwnkgtilkaSVDYIRKLQKEQERLRQ 66
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
87-210 3.11e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.77  E-value: 3.11e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPlsPPQPAPTPLKMYPSMPafsPGPGIKeesvPLSILQT---PTPQPLPGALLPQSFPAPAPP-------Q 156
Cdd:pfam03154  366 LPNPQSHKHP--PHLSGPSPFQMNSNLP---PPPALK----PLSSLSThhpPSAHPPPLQLMPQSQQLPPPPaqppvltQ 436
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 1924573064  157 FSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVP 210
Cdd:pfam03154  437 SQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMP 490
bHLHzip_USF_MITF cd11387
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in USF/MITF family; The USF (upstream ...
328-363 1.37e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in USF/MITF family; The USF (upstream stimulatory factor)/MITF (microphthalmia-associated transcription factor) family includes two bHLHzip transcription factor subfamilies. USFs are ubiquitously expressed and key regulators of a wide number of gene regulation networks, including the stress and immune responses, cell cycle and proliferation, lipid and glucid metabolism. USFs recruit chromatin remodeling enzymes and interact with co-activators and the members of the transcription pre-initiation complex. USFs interact with high affinity to E-box regulatory elements. The MITF (also known as microphthalmia-TFE, or MiT) subfamily comprises four genes in mammals (MITF, TFE3, TFEB, and TFEC); each gene has different functions. MITF is involved in neural crest melanocytes development as well as the pigmented retinal epithelium. TFEB is required for vascularization of the mouse placenta. TFE3 is involved in B cell function. TFEC regulates gene expression in macrophages. The MITF subfamily proteins can form homodimers or heterodimers with each other but not with other bHLH or bHLHzip proteins.


Pssm-ID: 381393 [Multi-domain]  Cd Length: 58  Bit Score: 46.48  E-value: 1.37e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1924573064  328 HNAIEKRYRSSINDKIIELKDLV--------------------AIDYIRFLQHSNQ 363
Cdd:cd11387      2 HNAVERRRRDNINEKIQELGSLVppsrletkdlkpnkgsilskAVEYIRELQNQNQ 57
bHLHzip_Max cd11406
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, ...
324-374 1.95e-06

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in protein Max and similar proteins; Max, also termed Class D basic helix-loop-helix protein 4 (bHLHd4), or Myc-associated factor X, is a bHLHZip transcription regulator that forms a sequence-specific DNA-binding protein complex with MYC or MAD which recognizes the core sequence 5'-CAC[GA]TG-3'. The MYC:MAX complex is a transcriptional activator, whereas the MAD:MAX complex is a transcriptional repressor. Max homodimer bind DNA but is transcriptionally inactive. Targeted deletion of max results in early embryonic lethality in mice.


Pssm-ID: 381412  Cd Length: 69  Bit Score: 46.19  E-value: 1.95e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1924573064  324 KRTAHNAIEKRYRSSINDKIIELKDLV----------------AIDYIRFLQHSNQKLKQENLSLRT 374
Cdd:cd11406      1 KRAHHNALERKRRDHIKDSFHSLRDSVpslqgekasraqilkkATEYIQYMRRKNHTHQQDIDDLKR 67
HLH smart00353
helix loop helix domain;
329-364 1.97e-06

helix loop helix domain;


Pssm-ID: 197674 [Multi-domain]  Cd Length: 53  Bit Score: 45.67  E-value: 1.97e-06
                            10        20        30        40        50
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1924573064   329 NAIEKRYRSSINDKIIELKDLV-----------------AIDYIRFLQHSNQK 364
Cdd:smart00353    1 NARERRRRRKINEAFDELRSLLptlpknkklskaeilrlAIEYIKSLQEELQK 53
PHA03378 PHA03378
EBNA-3B; Provisional
87-212 3.73e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 51.22  E-value: 3.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPLSPPQPAPTPLKmypsMPAFSPGPGIKEESVPlsiLQTPTPQPLPGALL-PQSFPAPAPPQfSSTPVLGY 165
Cdd:PHA03378   704 MRPPAAPPGRAQRPAAATGRAR----PPAAAPGRARPPAAAP---GRARPPAAAPGRARpPAAAPGRARPP-AAAPGAPT 775
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1924573064  166 PSPPggfsTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVVPQQ 212
Cdd:PHA03378   776 PQPP----PQAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQ 818
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
90-214 5.41e-06

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 50.80  E-value: 5.41e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   90 PQAAPSPLSPPQPAPT----------PLKMYPSMPAFSP--GPGIKEESVPLSILQTPTPQPLPGallPQSFPAPAPPQF 157
Cdd:pfam09770  212 QQPAPAPAQPPAAPPAqqaqqqqqfpPQIQQQQQPQQQPqqPQQHPGQGHPVTILQRPQSPQPDP---AQPSIQPQAQQF 288
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  158 SSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQSVV--------PQQLL 214
Cdd:pfam09770  289 HQQPPPVPVQPTQILQNPNRLSAARVGYPQNPQPGVQPAPAHQAHRQQGSFGrqapiithPQQLA 353
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
50-198 1.22e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.77  E-value: 1.22e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   50 PYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPL 129
Cdd:pfam03154  199 PTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPM 278
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  130 SILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFSTGS--PPGNTQ---------QPLPGLPLASPPGVPP 198
Cdd:pfam03154  279 PHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIhtPPSQSQlqsqqppreQPLPPAPLSMPHIKPP 358
PHA03378 PHA03378
EBNA-3B; Provisional
44-170 1.73e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 49.30  E-value: 1.73e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   44 PGLFDPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKmypsMPAFSPGPGIK 123
Cdd:PHA03378   691 PGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR----PPAAAPGRARP 766
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1924573064  124 EESVPLSILQTPTPQPLPGALL-PQSFPAPA-PPQFSSTPV-LGYPSPPG 170
Cdd:PHA03378   767 PAAAPGAPTPQPPPQAPPAPQQrPRGAPTPQpPPQAGPTSMqLMPRAAPG 816
bHLHzip_TFE3 cd18928
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in transcription factor E3 (TFE3) and ...
323-383 1.91e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in transcription factor E3 (TFE3) and similar proteins; TFE3, also termed Class E basic helix-loop-helix protein 33 (bHLHe33), is a bHLHzip transcription factor that is involved in B cell function. It specifically recognizes and binds E-box sequences (5'-CANNTG-3'). Its efficient DNA-binding requires dimerization with itself or with another MiT/TFE family member such as TFEB or MITF.


Pssm-ID: 381498 [Multi-domain]  Cd Length: 91  Bit Score: 44.27  E-value: 1.91e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1924573064  323 EKRTAHNAIEKRYRSSINDKIIELKDLV------------------AIDYIRFLQHSNQKLKQENLSLRTAVHKSKSLK 383
Cdd:cd18928     10 QKKDNHNLIERRRRFNINDRIKELGTLIpkstdpemrwnkgtilkaSVDYIRKLQKEQQRSKEIEMRQRKLEQANRSLQ 88
PHA03247 PHA03247
large tegument protein UL36; Provisional
93-212 2.93e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 48.78  E-value: 2.93e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   93 APSPLSPPQPAPTPLKMYPSMPAfSPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPvlGYPSPPGGF 172
Cdd:PHA03247  2699 ADPPPPPPTPEPAPHALVSATPL-PPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGP--PAPAPPAAP 2775
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 1924573064  173 STGSPPGNTQQP-------LPGLPLASPPGVPPVSLHTQVQSVVPQQ 212
Cdd:PHA03247  2776 AAGPPRRLTRPAvaslsesRESLPSPWDPADPPAAVLAPAAALPPAA 2822
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
88-209 3.76e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.99  E-value: 3.76e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   88 SGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEE-----SVPLSILQTPTPQplpgallpQSFPAPA---PPQFSS 159
Cdd:pfam05109  464 TGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESkapdmTSPTSAVTTPTPN--------ATSPTPAvttPTPNAT 535
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|
gi 1924573064  160 TPVLGYPSPPGGFSTGSPpgNTQQPLPGLPLASPPGVPPVSLHTQVQSVV 209
Cdd:pfam05109  536 SPTLGKTSPTSAVTTPTP--NATSPTPAVTTPTPNATIPTLGKTSPTSAV 583
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
90-237 4.40e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.84  E-value: 4.40e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   90 PQAAPSPLSPPQPAPTPLKMYPSMPAfsPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPP 169
Cdd:pfam03154  200 TPSAPSVPPQGSPATSQPPNQTQSTA--APHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPP 277
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1924573064  170 GGFSTGSPPGNTQQPLPGLPLasppGVPPVSLHTQVqSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVP 237
Cdd:pfam03154  278 MPHSLQTGPSHMQHPVPPQPF----PLTPQSSQSQV-PPGPSPAAPGQSQQRIHTPPSQSQLQSQQPP 340
bHLHzip_MITF cd18926
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in microphthalmia-associated ...
312-382 6.88e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in microphthalmia-associated transcription factor (MITF) and similar proteins; MITF, also termed Class E basic helix-loop-helix protein 32 (bHLHe32), is a bHLHzip transcription factor that is involved in neural crest melanocytes development as well as the pigmented retinal epithelium. It regulates the expression of genes with essential roles in cell differentiation, proliferation and survival. It binds to M-boxes (5'-TCATGTG-3') and symmetrical DNA sequences (E-boxes) (5'-CACGTG-3') found in the promoters of target genes, such as BCL2 and tyrosinase (TYR).


Pssm-ID: 381496  Cd Length: 104  Bit Score: 43.15  E-value: 6.88e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  312 SKAPASAQSRgEKRTAHNAIEKRYRSSINDKIIELKDLV------------------AIDYIRFLQHSNQKLKQENLSLR 373
Cdd:cd18926      2 SEARALAKER-QKKDNHNLIERRRRFNINDRIKELGTLIpksndpdmrwnkgtilkaSVDYIRKLQREQQRAKELENRQK 80

                   ....*....
gi 1924573064  374 TAVHKSKSL 382
Cdd:cd18926     81 KLEHANRHL 89
bHLHzip_scCBP1 cd11398
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Saccharomyces cerevisiae ...
323-368 8.32e-05

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Saccharomyces cerevisiae centromere-binding protein 1 (CBP-1) and similar proteins; CBP-1, also termed centromere promoter factor 1 (CPF1), or centromere-binding factor 1 (CBF1), is a bHLHzip protein that is required for chromosome stability and methionine prototrophy. It binds as a homodimer to the centromere DNA elements I (CDEI, GTCACATG) region of the centromere that is required for optimal centromere function.


Pssm-ID: 381404 [Multi-domain]  Cd Length: 89  Bit Score: 42.33  E-value: 8.32e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1924573064  323 EKRTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQE 368
Cdd:cd11398      6 QRRDNHKEVERRRRENINEGINELAALVpgnareknkgailarAVEYIQELQETEAKNIEK 66
bHLH_AtBIM_like cd11453
basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana BES1-interacting Myc-like ...
325-365 1.04e-04

basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana BES1-interacting Myc-like proteins (BIMs) and similar proteins; The family includes Arabidopsis thaliana BIM1 and its homologs (BIM2 and BIM3), which are bHLH transcription factors that interact with BES1 to regulate transcription of Brassinosteroid (BR)-induced gene. BR regulates many growth and developmental processes such as cell elongation, vascular development, senescence stress responses, and photomorphogenesis. BIM1 heterodimerize with BES1 and bind to E-box sequences present in many BR-induced promoters to regulated BR-induced genes.


Pssm-ID: 381459  Cd Length: 77  Bit Score: 41.69  E-value: 1.04e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1924573064  325 RTAHNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKL 365
Cdd:cd11453      4 RSKHSATEQRRRSKINERLQALRDLIphsdqkrdkasflleVIEYIQALQEKVAKL 59
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
48-199 1.24e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 46.41  E-value: 1.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   48 DPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESV 127
Cdd:PRK12323   402 PPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPA 481
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  128 PLSILQTPTPQP--------LPGAL----LPQSFPAPAPPQFSSTPVLGYPSPPGGFSTgSPPGNTQQPLPGLPLASPPG 195
Cdd:PRK12323   482 RAAPAAAPAPADddpppweeLPPEFaspaPAQPDAAPAGWVAESIPDPATADPDDAFET-LAPAPAAAPAPRAAAATEPV 560

                   ....
gi 1924573064  196 VPPV 199
Cdd:PRK12323   561 VAPR 564
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
49-320 1.39e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.39e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   49 PPYAGSGAGGTdPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIkeeSVP 128
Cdd:pfam05109  476 PTPAGTTSGAS-PVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSAV---TTP 551
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  129 LSILQTPTP---QPLPGALLP---QSFPAPA---PPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLpglpLASPPGVPPV 199
Cdd:pfam05109  552 TPNATSPTPavtTPTPNATIPtlgKTSPTSAvttPTPNATSPTVGETSPQANTTNHTLGGTSSTPV----VTSPPKNATS 627
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  200 SLHT---QVQSVVPQQLLTVTAAPTAAPVTTTVTSQIQQVPVLLQPHFIKADSLL-LTAMKTDGATVKAAGLSPLVSGTT 275
Cdd:pfam05109  628 AVTTgqhNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGENITqVTPASTSTHHVSTSSPAPRPGTTS 707
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1924573064  276 VQTGPlptlvsGGTILATVPLVVDAEKLPINRLAAGSKAPASAQS 320
Cdd:pfam05109  708 QASGP------GNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQKT 746
bHLHzip_USF2 cd18923
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in upstream stimulatory factor 2 (USF2) ...
321-369 1.50e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in upstream stimulatory factor 2 (USF2) and similar proteins; USF2, also termed Class B basic helix-loop-helix protein 12 (bHLHb12), or major late transcription factor 2, or FOS-interacting protein (FIP), or upstream transcription factor 2, is a bHLHzip transcription factor that binds to a symmetrical DNA sequence (E-boxes) (5'-CACGTG-3') that is found in a variety of viral and cellular promoters.


Pssm-ID: 381493  Cd Length: 80  Bit Score: 41.61  E-value: 1.50e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  321 RGEKRTA-HNAIEKRYRSSINDKIIELKDLV--------------------AIDYIRFLQHSNQKLkQEN 369
Cdd:cd18923      4 RDERRRAqHNEVERRRRDKINNWIVQLSKIIpdcntdnsktgaskggilskACDYIRELRQTNQRM-QET 72
bHLH_SF cd00083
basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators ...
332-359 2.41e-04

basic Helix Loop Helix (bHLH) domain superfamily; bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. Members of the bHLH superfamily have two highly conserved and functionally distinct regions. The basic part is at the amino end of the bHLH that may bind DNA to a consensus hexanucleotide sequence known as the E box (CANNTG). Different families of bHLH proteins recognize different E-box consensus sequences. At the carboxyl-terminal end of the region is the HLH region that interacts with other proteins to form homo- and heterodimers. bHLH proteins function as a diverse set of regulatory factors because they recognize different DNA sequences and dimerize with different proteins. The bHLH proteins can be divided to cell-type specific and widely expressed proteins. The cell-type specific members of bHLH superfamily are involved in cell-fate determination and act in neurogenesis, cardiogenesis, myogenesis, and hematopoiesis.


Pssm-ID: 381392 [Multi-domain]  Cd Length: 46  Bit Score: 39.81  E-value: 2.41e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*
gi 1924573064  332 EKRYRSSINDKIIELKDLV-----------------AIDYIRFLQ 359
Cdd:cd00083      1 ERRRRDKINDAFEELKRLLpelpdskklskasilqkAVEYIRELQ 45
bHLHzip_Mlx_like cd11404
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, ...
324-376 2.59e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in Max-like protein X (Mlx) family; Mlx, also termed Class D basic helix-loop-helix protein 13 (bHLHd13), or Max-like bHLHZip protein, or protein BigMax, or transcription factor-like protein 4, is a Max-like bHLHZip transcription regulator that interacts with the Max network of transcription factors. It forms a sequence-specific DNA-binding protein complex with some member of Mad family (Mad1 and Mad4) and Mondo family but not the Myc family and bind the E-box DNA to control transcription. The family also includes Saccharomyces cerevisiae INO4, which is a bHLH transcriptional activator of phospholipid synthetic genes (such as INO1, CHO1/PSS, CHO2/PEM1, OPI3/PEM2, etc.). It is required for de-repression of phospholipid biosynthetic gene expression in response to inositol deprivation in yeast.


Pssm-ID: 381410 [Multi-domain]  Cd Length: 70  Bit Score: 40.36  E-value: 2.59e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1924573064  324 KRTAHNAIEKRYRSSINDKIIELKDLV----------------AIDYIRFLQHSNQKLKQENLSLRTAV 376
Cdd:cd11404      2 RRLNHVRSEKKRRELIKKGYDELCALVpgldpqkrtkadilqkAADWIQELKEENEKLEEQLDELKEAA 70
bHLH_AtbHLH_like cd11393
basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana genes coding transcription ...
328-365 4.37e-04

basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana genes coding transcription factors and similar proteins; bHLH proteins are the second largest class of plant transcription factors that regulate transcription of genes that are involve in many essential physiological and developmental process. bHLH proteins are transcriptional regulators that are found in organisms from yeast to humans. The Arabidopsis bHLH proteins that have been characterized so far have roles in regulation of fruit dehiscence, cell development (carpel, anther and epidermal), phytochrome signaling, flavonoid biosynthesis, hormone signaling and stress responses.


Pssm-ID: 381399 [Multi-domain]  Cd Length: 53  Bit Score: 39.09  E-value: 4.37e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1924573064  328 HNAIEKRYRSSINDKIIELKDLV--------------AIDYIRFLQHSNQKL 365
Cdd:cd11393      1 HSIAERKRREKINERIRALRSLVpnggktdkasildeAIEYIKFLQEQVKVL 52
bHLHzip_TFEB cd18927
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in transcription factor EB (TFEB) and ...
313-373 5.78e-04

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in transcription factor EB (TFEB) and similar proteins; TFEB, also termed Class E basic helix-loop-helix protein 35 (bHLHe35), is a bHLHzip transcription factor that is required for vascularization of the mouse placenta. It specifically recognizes and binds E-box sequences (5'-CANNTG-3'). Its efficient DNA-binding requires dimerization with itself or with another MiT/TFE family member such as TFE3 or MITF.


Pssm-ID: 381497  Cd Length: 91  Bit Score: 39.96  E-value: 5.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  313 KAPASAQSRgEKRTAHNAIEKRYRSSINDKIIELKDLV------------------AIDYIRFLQHSNQKLKQ-ENLSLR 373
Cdd:cd18927      1 EARALAKER-QKKDNHNLIERRRRFNINDRIKELGTLIpktndldvrwnkgtilkaSVDYIKRMQKDLQRSRElENHSRR 79
PHA01929 PHA01929
putative scaffolding protein
87-200 7.51e-04

putative scaffolding protein


Pssm-ID: 177328  Cd Length: 306  Bit Score: 43.12  E-value: 7.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   87 LSGPQAAPSPLSPPQPAPTPLkMYPSMPAFSPGPGIKEEsvplsiLQTPTPQPLP---GALLPQSFP-APAPPQFSSTPV 162
Cdd:PHA01929    12 LAGLVANVPPAAAPTPQPNPV-IQPQAPVQPGQPGAPQQ------LAIPTQQPQPvptSAMTPHVVQqAPAQPAPAAPPA 84
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 1924573064  163 LGYPSP----PGGFSTGSPPGNTQQPLPGLPLASPPGVPPVS 200
Cdd:PHA01929    85 AGAALPealeVPPPPAFTPNGEIVGTLAGNLEGDPQLAPSVS 126
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
88-282 7.60e-04

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 43.60  E-value: 7.60e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   88 SGPQAAPSPLSPPQPAPTPLKMYPSMPAF---SP-------------GPGIKEESVPLSILQTP--TPQPLPGALLPQSF 149
Cdd:pfam15685  326 SGGPAAPASHARALPPPAYTTFPGSKPKFdwvSPpdgperhfrfngaGGGIGAPRRRAAALSGPwgSPPPPPGKAHPIPG 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  150 PAPAPPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPP-GVPPVSLHTQvQSVVPQQL-LTVTAAPTAAPVTT 227
Cdd:pfam15685  406 PRRPAPALLAPPMFIFPAPTNGEPVRPGPPAPQALLPRPPPPTPPaTPPPVPPPIP-QLPALQPMpLAAARPPTPRPCPG 484
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1924573064  228 TVTSQIQQVPVLLQPHFIKADSllltamktDGATVKAAGLSPLVSGTTVQTGPLP 282
Cdd:pfam15685  485 HGESALAPAPTAPLPPALAADQ--------APAPALAAAPAPSPAPAPATADPLP 531
PHA03247 PHA03247
large tegument protein UL36; Provisional
88-201 9.19e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 9.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   88 SGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPS 167
Cdd:PHA03247  2746 AGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPA 2825
                           90       100       110
                   ....*....|....*....|....*....|....
gi 1924573064  168 PPggfstgSPPGNTQQPLPGlPLASPPGVPPVSL 201
Cdd:PHA03247  2826 GP------LPPPTSAQPTAP-PPPPGPPPPSLPL 2852
bHLHzip_USF3 cd18910
basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix ...
320-365 1.43e-03

basic Helix-Loop-Helix-zipper (bHLHzip) domain found in basic helix-loop-helix domain-containing protein USF3 and similar proteins; USF3, also termed upstream transcription factor 3, is a bHLHzip protein that is involved in the negative regulation of epithelial-mesenchymal transition, the process by which epithelial cells lose their polarity and adhesion properties to become mesenchymal cells with enhanced migration and invasive properties.


Pssm-ID: 381480  Cd Length: 65  Bit Score: 38.05  E-value: 1.43e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1924573064  320 SRGEKRTAHNAIEKRYRSSINDKIIELKDLV----------------AIDYIRFLQHSNQKL 365
Cdd:cd18910      1 RREKKRESHNEVERRRKDKINAGINKIGELLpdrdakkqsknmileqAYKYIVELKKKNDKL 62
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
94-200 1.73e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteriztic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. the family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 42.11  E-value: 1.73e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   94 PSPLSPPQPAPTPLKMYPSMpaFSPGPgikeESVPLSILQTPTPQPLPgaLLPQSFPAPAPPQF---SSTPVLGYP---- 166
Cdd:pfam15279  175 GKPQQHPPPSPLPAFMEPSS--MPPPF----LRPPPSIPQPNSPLSNP--MLPGIGPPPKPPRNlgpPSNPMHRPPfsph 246
                           90       100       110
                   ....*....|....*....|....*....|....*
gi 1924573064  167 -SPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVS 200
Cdd:pfam15279  247 hPPPPPTPPGPPPGLPPPPPRGFTPPFGPPFPPVN 281
PHA03247 PHA03247
large tegument protein UL36; Provisional
50-214 2.30e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.23  E-value: 2.30e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   50 PYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPL 129
Cdd:PHA03247  2790 SLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSR 2869
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  130 SILQTPTPQPLPGALL---------PQSFPAPaPPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPL--PGLPLASPPGVPP 198
Cdd:PHA03247  2870 SPAAKPAAPARPPVRRlarpavsrsTESFALP-PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPppPPRPQPPLAPTTD 2948
                          170
                   ....*....|....*.
gi 1924573064  199 VSLHTQVQSVVPQQLL 214
Cdd:PHA03247  2949 PAGAGEPSGAVPQPWL 2964
PHA03247 PHA03247
large tegument protein UL36; Provisional
49-212 3.23e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 41.85  E-value: 3.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   49 PPYAGSGAGGTDPASPDTS-SPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESV 127
Cdd:PHA03247  2760 PPTTAGPPAPAPPAAPAAGpPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAP 2839
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  128 PLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFSTGSPPGNTQ---QPLPGLPLASPPgVPPVSLHTQ 204
Cdd:PHA03247  2840 PPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRStesFALPPDQPERPP-QPQAPPPPQ 2918

                   ....*...
gi 1924573064  205 VQSVVPQQ 212
Cdd:PHA03247  2919 PQPQPPPP 2926
Amelogenin smart00818
Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem ...
95-199 3.93e-03

Amelogenins, cell adhesion proteins, play a role in the biomineralisation of teeth; They seem to regulate formation of crystallites during the secretory stage of tooth enamel development and are thought to play a major role in the structural organisation and mineralisation of developing enamel. The extracellular matrix of the developing enamel comprises two major classes of protein: the hydrophobic amelogenins and the acidic enamelins. Circular dichroism studies of porcine amelogenin have shown that the protein consists of 3 discrete folding units: the N-terminal region appears to contain beta-strand structures, while the C-terminal region displays characteristics of a random coil conformation. Subsequent studies on the bovine protein have indicated the amelogenin structure to contain a repetitive beta-turn segment and a "beta-spiral" between Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich region. The beta-spiral offers a probable site for interactions with Ca2+ ions. Muatations in the human amelogenin gene (AMGX) cause X-linked hypoplastic amelogenesis imperfecta, a disease characterised by defective enamel. A 9bp deletion in exon 2 of AMGX results in the loss of codons for Ile5, Leu6, Phe7 and Ala8, and replacement by a new threonine codon, disrupting the 16-residue (Met1-Ala16) amelogenin signal peptide.


Pssm-ID: 197891 [Multi-domain]  Cd Length: 165  Bit Score: 39.39  E-value: 3.93e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064    95 SPLSPPQPAPTPLKMYPSMPAFSPGPGiKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFST 174
Cdd:smart00818   43 SQQHPPTHTLQPHHHIPVLPAQQPVVP-QQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVH 121
                            90       100
                    ....*....|....*....|....*
gi 1924573064   175 GSPPGNTQQPLPGLPLASPPgvPPV 199
Cdd:smart00818  122 PIPPLPPQPPLPPMFPMQPL--PPL 144
PHA02682 PHA02682
ORF080 virion core protein; Provisional
95-205 5.50e-03

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 40.23  E-value: 5.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   95 SPLSPPQPAPTPlkmYPSMPAFSPgpgikeesvplsilqtptPQPLPGALLPQSFPAPAPPQFSSTPVLGYPSPPGGFST 174
Cdd:PHA02682    80 SPLAPSPACAAP---APACPACAP------------------AAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAP 138
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1924573064  175 GSPPGNTQ-QPLPGLPLASP-PGVPPVSLHTQV 205
Cdd:PHA02682   139 ACPPSTRQcPPAPPLPTPKPaPAAKPIFLHNQL 171
PHA03369 PHA03369
capsid maturational protease; Provisional
92-211 6.13e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 40.75  E-value: 6.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   92 AAPSPLSPPQPAPTPLKMYPSMPAFSPgpgiKEESVPLSILQTPTPQPLPGALlpqsFPAPAPPQFSSTPVLGYPSPPGG 171
Cdd:PHA03369   352 SLTAPSRVLAAAAKVAVIAAPQTHTGP----ADRQRPQRPDGIPYSVPARSPM----TAYPPVPQFCGDPGLVSPYNPQS 423
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 1924573064  172 FSTGSPPgntqQPLPGLPLASP-PGVPPVSLHTQVQSVVPQ 211
Cdd:PHA03369   424 PGTSYGP----EPVGPVPPQPTnPYVMPISMANMVYPGHPQ 460
bHLH_AtMEE8_like cd19698
basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana protein maternal effect ...
328-368 9.12e-03

basic helix-loop-helix (bHLH) domain found in Arabidopsis thaliana protein maternal effect embryo arrest 8 (AtMEE8) and similar proteins; AtMEE8, also termed AtbHLH108, or EN 132, is a bHLH transcription factor required during early embryo development, for the endosperm formation.


Pssm-ID: 381541  Cd Length: 71  Bit Score: 36.27  E-value: 9.12e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 1924573064  328 HNAIEKRYRSSINDKIIELKDLV---------------AIDYIRFLQHSNQKLKQE 368
Cdd:cd19698      1 HNLLEKKRRERIKDKIEILKGLTpnctpksdiasilscVIDYIKSLLLQVEKLSMG 56
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
48-212 9.14e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.35  E-value: 9.14e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   48 DPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPlkmyPSMPAFSPGPGIKeesv 127
Cdd:PRK07764   625 AAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPP----APAPAAPAAPAGA---- 696
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  128 plsilqtPTPQPLPGAllpqsfPAPAPPQFSSTPVLGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPPVSLHTQVQS 207
Cdd:PRK07764   697 -------APAQPAPAP------AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPA 763

                   ....*
gi 1924573064  208 VVPQQ 212
Cdd:PRK07764   764 PAPAA 768
PHA03247 PHA03247
large tegument protein UL36; Provisional
90-201 9.24e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 40.31  E-value: 9.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   90 PQAAPSPLSPPQPAPTPLKMYPSMPAFSPGPGIKEESVPlsilqTPTPQPLPGALL---PQSFPAPAPPQFSSTPVLGY- 165
Cdd:PHA03247  2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-----PPPPPPRPQPPLaptTDPAGAGEPSGAVPQPWLGAl 2967
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 1924573064  166 --------------PSPPGGFSTGSPPGNTQQPLPG-------LPLASPPGVPPVSL 201
Cdd:PHA03247  2968 vpgrvavprfrvpqPAPSREAPASSTPPLTGHSLSRvsswassLALHEETDPPPVSL 3024
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
39-198 9.62e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.97  E-value: 9.62e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064   39 QDSDFPGLFDPPYAGSGAGGTDPASPDTSSPGSLSPPPATLSSSLEAFLSGPQAAPSPLSPPQPAPTPLKMYPSMPAFSP 118
Cdd:PRK07764   638 EASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDP 717
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1924573064  119 GPGIKEESVPLSILQTPTPQPLPGALLPQSFPAPAPPQFSSTPVlGYPSPPGGFSTGSPPGNTQQPLPGLPLASPPGVPP 198
Cdd:PRK07764   718 AAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPP-PAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDE 796
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH